Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: adding tvmaze for accurate release time detection #923

Open
wants to merge 13 commits into
base: main
Choose a base branch
from

Conversation

wolfemir
Copy link
Collaborator

@wolfemir wolfemir commented Nov 30, 2024

TVMaze API and Timezone Enhancements
Added support for TVMaze's web schedule endpoint
Now checks the next 30 days of schedule data
Helps catch episodes that might be missing from the main episode list
Particularly useful for streaming services that release episodes early
Simplified timezone detection by comparing local time with UTC
Release times are now displayed in a clearer format (e.g., "2024-11-29 23:00:00 UTC+11")
The application correctly detects when episodes are released in the user's timezone
All components (TVMaze, Trakt, Indexer) use the same timezone detection method

Summary by CodeRabbit

Release Notes

  • New Features

    • Introduced support for the TVMaze API, allowing users to fetch show and episode details.
    • Added a new event listener for PostgreSQL connections to handle case-sensitive identifiers.
  • Improvements

    • Enhanced timezone handling for accurate release date processing in media items.
    • Improved error handling and logging during indexing operations.
    • Added a string representation for media items to aid in debugging.
    • Enhanced logging configuration for better readability and control.
    • Optimized database queries for ongoing media management.
    • Integrated TVMaze data into the TraktIndexer for more accurate release times.
  • Bug Fixes

    • Refined logic for determining the release status of media items to account for timezones.

Copy link
Contributor

coderabbitai bot commented Nov 30, 2024

Walkthrough

The changes introduce new functionalities and enhancements across several modules. A PostgreSQL connection event listener is added to handle case-sensitive identifiers. The API integration for TVMaze is implemented, including a new client module with error handling. The TraktAPI class is enhanced for better date and timezone management, while the MediaItem class improves its release status determination. Additionally, the TraktIndexer class is updated to integrate with TVMaze and enhance error handling. Logging configurations are improved for better readability and debugging.

Changes

File Change Summary
src/alembic/env.py Added function set_postgresql_case_sensitive for case-sensitive identifiers; enhanced logging.
src/program/apis/__init__.py Added TVMazeAPI integration; created __setup_tvmaze function to initialize API instance.
src/program/apis/trakt_api.py Enhanced TraktAPI for timezone handling; added local_tz attribute; improved date parsing.
src/program/apis/tvmaze_api.py Introduced TVMazeAPI, TVMazeRequestHandler, and TVMazeAPIError; added methods for show/episode retrieval.
src/program/media/item.py Refactored is_released property for timezone awareness; added __repr__ method for better representation.
src/program/services/indexers/trakt.py Updated TraktIndexer to include TVMazeAPI; improved error handling and release time comparisons.
src/program/program.py Enhanced _retry_library and _update_ongoing methods for improved media item state management.

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant TVMazeAPI
    participant TraktIndexer
    participant MediaItem

    Client->>TraktIndexer: Request to index media item
    TraktIndexer->>TVMazeAPI: Fetch episode release time
    TVMazeAPI->>TraktIndexer: Return release time
    TraktIndexer->>MediaItem: Determine if released
    MediaItem-->>TraktIndexer: Return release status
    TraktIndexer-->>Client: Return indexed media item
Loading

"In the code, new paths we weave,
With TVMaze, we now believe.
From time zones bright to logs so clear,
Our media's tale is now sincere.
Hopping through changes, we cheer with glee,
For every release, there's more to see! 🐰✨"

Possibly related PRs

  • fix: re-check ongoing/unreleased items #880: The changes in the Program class related to updating ongoing and unreleased items may interact with the logging and error handling improvements in the main PR, particularly in how state updates are managed during database interactions.

Suggested reviewers

  • Gaisberg

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 10

🧹 Outside diff range and nitpick comments (9)
src/program/apis/__init__.py (1)

10-10: LGTM! Consider documenting the error handling pattern.

The import follows the established pattern of other API modules. While TVMazeAPIError is currently unused in this file, it maintains consistency and might be needed for error handling in dependent modules.

Consider adding a module-level docstring explaining the error handling pattern across API modules.

🧰 Tools
🪛 Ruff (0.8.0)

10-10: .tvmaze_api.TVMazeAPIError imported but unused; consider removing, adding to __all__, or using a redundant alias

(F401)

src/program/media/item.py (1)

177-188: Consider using UTC for consistent timezone handling

While the current implementation handles timezone-aware datetime comparison correctly, it could be more robust by explicitly using UTC throughout the application.

Consider this alternative implementation:

     @property
     def is_released(self) -> bool:
         """Check if an item has been released."""
         if not self.aired_at:
             return False
             
-        # Ensure both datetimes are timezone-aware for comparison
-        now = datetime.now().astimezone()
-        aired_at = self.aired_at
-        
-        # Make aired_at timezone-aware if it isn't already
-        if aired_at.tzinfo is None:
-            aired_at = aired_at.replace(tzinfo=now.tzinfo)
+        # Convert to UTC for consistent comparison
+        now = datetime.now().astimezone().astimezone(timezone.utc)
+        aired_at = self.aired_at.astimezone(timezone.utc) if self.aired_at.tzinfo else self.aired_at.replace(tzinfo=timezone.utc)
             
         return aired_at <= now

This approach:

  1. Explicitly uses UTC for all comparisons
  2. Handles timezone conversion in a single line
  3. Aligns with the PR's goal of simplifying timezone detection
src/program/services/indexers/trakt.py (1)

4-5: Remove unused imports time and Optional

The modules time and Optional are imported but not used in the code. Removing unused imports helps keep the code clean and improves readability.

Apply this diff to remove the unused imports:

-import time
-from typing import Generator, Optional, Union
+from typing import Generator, Union
🧰 Tools
🪛 Ruff (0.8.0)

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)

src/program/apis/tvmaze_api.py (5)

66-67: Add Logging for Failed API Call in get_show_by_imdb_id

When the API call to /lookup/shows fails (not response.is_ok), the method returns None without logging the error. Adding a warning log will aid in debugging and monitoring API failures.

Apply this diff to add logging:

             if not response.is_ok:
+                logger.warning(f"Failed to get show by IMDb ID: {imdb_id}")
                 return None

88-89: Add Logging for Failed API Call in get_episode_by_number

When the API call fails or returns no data (not response.is_ok or not response.data), the method returns None without logging the error. Adding a warning log will assist with debugging.

Apply this diff to add logging:

             if not response.is_ok or not response.data:
+                logger.warning(f"Failed to get episode by number: show_id={show_id}, season={season}, episode={episode}")
                 return None

93-129: Simplify and Improve Date Parsing Logic

The _parse_air_date method contains complex logic to handle different date formats, which may be error-prone and difficult to maintain. Consider utilizing the dateutil library's parser.parse, which can handle ISO 8601 date strings with varying formats and timezones. This can simplify the code and improve reliability.

If adding an external dependency is not preferred, ensure that all possible date formats from the TVMaze API are properly handled, and consider adding unit tests for date parsing to cover edge cases.


145-146: Adjust Logging Level for Failed Show Retrieval

When the show is not retrieved successfully (not show or not hasattr(show, "id")), there is a debug log message. Consider upgrading this to a warning to highlight potential issues in data retrieval.

Apply this diff to adjust the logging level:

             if not show or not hasattr(show, "id"):
-                logger.debug(f"Failed to get TVMaze show for IMDb ID: {item.parent.parent.imdb_id}")
+                logger.warning(f"Failed to get TVMaze show for IMDb ID: {item.parent.parent.imdb_id}")
                 return None

158-173: Optimize API Calls for Streaming Schedule

The current implementation makes 30 separate API calls, one for each day in the next 30 days, which could be inefficient and may hit API rate limits. Consider modifying the approach to minimize the number of API calls. For example, the TVMaze API provides an endpoint /schedule/web, which can be queried without a date parameter to retrieve upcoming episodes. Fetching a broader schedule in fewer API calls or implementing caching mechanisms can improve performance.

src/program/apis/trakt_api.py (1)

3-4: Remove unused imports to clean up the codebase.

The imports time and ZoneInfo are not used and can be safely removed to tidy up the code.

Apply this diff to remove the unused imports:

-import time
-from zoneinfo import ZoneInfo
🧰 Tools
🪛 Ruff (0.8.0)

3-3: time imported but unused

Remove unused import: time

(F401)


4-4: zoneinfo.ZoneInfo imported but unused

Remove unused import: zoneinfo.ZoneInfo

(F401)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between d2a6b5b and fbce61e.

📒 Files selected for processing (6)
  • src/alembic/env.py (2 hunks)
  • src/program/apis/__init__.py (3 hunks)
  • src/program/apis/trakt_api.py (3 hunks)
  • src/program/apis/tvmaze_api.py (1 hunks)
  • src/program/media/item.py (3 hunks)
  • src/program/services/indexers/trakt.py (3 hunks)
🧰 Additional context used
🪛 Ruff (0.8.0)
src/program/apis/__init__.py

10-10: .tvmaze_api.TVMazeAPIError imported but unused; consider removing, adding to __all__, or using a redundant alias

(F401)

src/program/apis/trakt_api.py

3-3: time imported but unused

Remove unused import: time

(F401)


4-4: zoneinfo.ZoneInfo imported but unused

Remove unused import: zoneinfo.ZoneInfo

(F401)


5-5: tzlocal.get_localzone imported but unused

Remove unused import: tzlocal.get_localzone

(F401)

src/program/services/indexers/trakt.py

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)

🔇 Additional comments (8)
src/program/apis/__init__.py (1)

49-51: Consider adding configuration parameters for TVMaze API.

The TVMazeAPI is instantiated without any configuration parameters, unlike other API clients that receive settings. Given that TVMaze is used for timezone-aware release time detection, consider adding configuration for:

  • Rate limiting
  • Request timeouts
  • Base URL (for testing/staging environments)

Let's verify if TVMazeAPI needs configuration by checking its implementation:

src/alembic/env.py (2)

4-4: LGTM: Import addition is appropriate.

The event import from SQLAlchemy is correctly added to support the new PostgreSQL connection event listener.


77-83: Verify the impact of case-sensitive identifiers.

Setting quote_all_identifiers at the session level could affect existing queries that rely on case-insensitive behavior.

Let's scan the codebase for potential issues:

✅ Verification successful

Based on the analysis of the codebase, I can now provide a final response:

Setting case-sensitive identifiers appears safe to implement.

The codebase shows that:

  1. All SQL queries are using SQLAlchemy's ORM with proper model definitions, not raw SQL queries with unquoted identifiers
  2. The database schema is managed through Alembic migrations with explicit column definitions
  3. The only raw SQL queries found are system-level operations (VACUUM, CREATE/DROP DATABASE, etc.) which are not affected by identifier quoting
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Look for SQL queries that might be affected by case-sensitive identifiers

# Search for raw SQL queries
rg -t python "text\([\"'].*SELECT|text\([\"'].*UPDATE|text\([\"'].*INSERT|text\([\"'].*DELETE" -A 3

# Search for SQLAlchemy model definitions to check table/column names
ast-grep --pattern 'class $_(db.Model):'

Length of output: 183


Script:

#!/bin/bash
# Search for raw SQL queries with different patterns
rg "text\(" -t py -A 3

# Search for SQLAlchemy model definitions
rg "class.*\(.*Model\)" -t py -A 3

# Search for table names and column definitions
rg "Table\(|Column\(" -t py -A 3

# Look for any database-related configuration
rg "SQLALCHEMY_|DB_|DATABASE" -t py

Length of output: 36610

src/program/media/item.py (1)

403-405: LGTM! Clear and helpful string representation

The __repr__ implementation follows Python conventions and provides useful debugging information by including both the log string and state.

src/program/apis/tvmaze_api.py (2)

19-21: LGTM: TVMazeAPIError Class Definition

The custom exception class TVMazeAPIError is well-defined and appropriately documents API-related errors.


202-206: LGTM: Final Release Time Determination

The final release time determination and logging are correctly implemented, ensuring that the earliest air date is used.

src/program/apis/trakt_api.py (2)

378-389: Date parsing logic handles different formats effectively.

The method correctly handles parsing dates with and without milliseconds.


393-398: Date parsing for movie release dates is correctly handled.

The code properly sets the movie release date to midnight in the user's local timezone.

src/program/apis/__init__.py Show resolved Hide resolved
Comment on lines 77 to 83
# Set PostgreSQL to handle case-sensitive identifiers
@event.listens_for(connectable, "connect")
def set_postgresql_case_sensitive(dbapi_connection, connection_record):
cursor = dbapi_connection.cursor()
cursor.execute("SET quote_all_identifiers = true;")
cursor.close()

Copy link
Contributor

@coderabbitai coderabbitai bot Nov 30, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Add error handling to the PostgreSQL event listener.

The event listener should handle potential database errors to prevent silent failures.

Consider wrapping the cursor operations in a try-finally block:

 @event.listens_for(connectable, "connect")
 def set_postgresql_case_sensitive(dbapi_connection, connection_record):
     cursor = dbapi_connection.cursor()
-    cursor.execute("SET quote_all_identifiers = true;")
-    cursor.close()
+    try:
+        cursor.execute("SET quote_all_identifiers = true;")
+    except Exception as e:
+        logger.error(f"Failed to set PostgreSQL case sensitivity: {e}")
+        raise
+    finally:
+        cursor.close()
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
# Set PostgreSQL to handle case-sensitive identifiers
@event.listens_for(connectable, "connect")
def set_postgresql_case_sensitive(dbapi_connection, connection_record):
cursor = dbapi_connection.cursor()
cursor.execute("SET quote_all_identifiers = true;")
cursor.close()
# Set PostgreSQL to handle case-sensitive identifiers
@event.listens_for(connectable, "connect")
def set_postgresql_case_sensitive(dbapi_connection, connection_record):
cursor = dbapi_connection.cursor()
try:
cursor.execute("SET quote_all_identifiers = true;")
except Exception as e:
logger.error(f"Failed to set PostgreSQL case sensitivity: {e}")
raise
finally:
cursor.close()

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please remove all the env.py changes

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@dreulavelle, understood. I'll disregard the changes in env.py.


✏️ Learnings added
Learnt from: dreulavelle
PR: rivenmedia/riven#923
File: src/alembic/env.py:77-83
Timestamp: 2024-12-06T01:28:31.170Z
Learning: Avoid suggesting changes to `src/alembic/env.py` in future code reviews unless explicitly required.

Note: Learnings are effective only in the context of similar code segments. To apply general coding guidelines, please configure review instructions. You can manage existing learnings in the app.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

removed

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems like the humans are having a chat. I'll hop back into my burrow for now. If you need me again, just tag @coderabbitai in a new comment, and I'll come hopping out!

Comment on lines +145 to +147
if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
episode_item.aired_at = episode_item.aired_at.replace(tzinfo=self.local_tz)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Potential incorrect handling of naive aired_at datetimes in episodes

Similar to the seasons, directly setting the local timezone on a naive aired_at datetime for episodes may lead to incorrect times if the original datetime is not in the local timezone.

Suggested Fix: Properly localize the aired_at datetime

Assuming aired_at is in UTC, adjust accordingly.

-if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
-    episode_item.aired_at = episode_item.aired_at.replace(tzinfo=self.local_tz)
+if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
+    episode_item.aired_at = episode_item.aired_at.replace(tzinfo=timezone.utc).astimezone(self.local_tz)
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
episode_item.aired_at = episode_item.aired_at.replace(tzinfo=self.local_tz)
if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
episode_item.aired_at = episode_item.aired_at.replace(tzinfo=timezone.utc).astimezone(self.local_tz)

Comment on lines 28 to 38
# Get timezone by comparing local time with UTC
local_time = datetime.now()
utc_time = datetime.now(timezone.utc)
offset = local_time.hour - utc_time.hour

# Create timezone with the calculated offset
try:
self.local_tz = timezone(timedelta(hours=offset))
except Exception:
self.local_tz = timezone.utc
logger.warning("Could not determine system timezone, using UTC")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Incorrect method for determining local timezone

Calculating the local timezone by subtracting UTC time from local time using hours may lead to incorrect results, especially in regions with non-integer hour offsets or during daylight saving time changes. This approach is unreliable.

Suggested Fix: Use the system's local timezone directly

Replace the timezone calculation with a method that reliably obtains the local timezone.

Apply this diff to fix the issue:

-# Get timezone by comparing local time with UTC
-local_time = datetime.now()
-utc_time = datetime.now(timezone.utc)
-offset = local_time.hour - utc_time.hour
-
-# Create timezone with the calculated offset
-try:
-    self.local_tz = timezone(timedelta(hours=offset))
-except Exception:
-    self.local_tz = timezone.utc
-    logger.warning("Could not determine system timezone, using UTC")
+self.local_tz = datetime.now().astimezone().tzinfo
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
# Get timezone by comparing local time with UTC
local_time = datetime.now()
utc_time = datetime.now(timezone.utc)
offset = local_time.hour - utc_time.hour
# Create timezone with the calculated offset
try:
self.local_tz = timezone(timedelta(hours=offset))
except Exception:
self.local_tz = timezone.utc
logger.warning("Could not determine system timezone, using UTC")
self.local_tz = datetime.now().astimezone().tzinfo

Comment on lines 164 to 165
if tvmaze_time.tzinfo is None:
tvmaze_time = tvmaze_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Potential incorrect handling of tvmaze_time when assigning timezone

Directly setting the local timezone on a naive tvmaze_time datetime may lead to incorrect times if the original datetime is not in the local timezone.

Suggested Fix: Properly localize tvmaze_time

Assuming tvmaze_time is in UTC, adjust accordingly.

-if tvmaze_time.tzinfo is None:
-    tvmaze_time = tvmaze_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
+if tvmaze_time.tzinfo is None:
+    tvmaze_time = tvmaze_time.replace(tzinfo=timezone.utc).astimezone(self.local_tz)

Committable suggestion skipped: line range outside the PR's diff.

Comment on lines +135 to +137
if season_item.aired_at and season_item.aired_at.tzinfo is None:
season_item.aired_at = season_item.aired_at.replace(tzinfo=self.local_tz)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Potential incorrect handling of naive aired_at datetimes in seasons

Assigning self.local_tz to a naive aired_at datetime using replace(tzinfo=...) assumes the original datetime is in the local timezone, which may not be accurate and could lead to incorrect times. If aired_at represents UTC time, this method may produce wrong results.

Suggested Fix: Properly localize the aired_at datetime

Assuming aired_at is in UTC, set the timezone to UTC and convert to the local timezone if necessary.

-if season_item.aired_at and season_item.aired_at.tzinfo is None:
-    season_item.aired_at = season_item.aired_at.replace(tzinfo=self.local_tz)
+if season_item.aired_at and season_item.aired_at.tzinfo is None:
+    season_item.aired_at = season_item.aired_at.replace(tzinfo=timezone.utc).astimezone(self.local_tz)
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
if season_item.aired_at and season_item.aired_at.tzinfo is None:
season_item.aired_at = season_item.aired_at.replace(tzinfo=self.local_tz)
if season_item.aired_at and season_item.aired_at.tzinfo is None:
season_item.aired_at = season_item.aired_at.replace(tzinfo=timezone.utc).astimezone(self.local_tz)

Comment on lines 154 to 156
if trakt_time and trakt_time.tzinfo is None:
trakt_time = trakt_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
episode_item.aired_at = trakt_time
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Potential incorrect handling of trakt_time when assigning timezone

Assigning the local timezone to a naive trakt_time datetime using replace(tzinfo=...) may result in incorrect time values if trakt_time is actually in UTC or another timezone.

Suggested Fix: Properly localize trakt_time

If trakt_time is in UTC, set the timezone to UTC and convert to the local timezone.

-if trakt_time and trakt_time.tzinfo is None:
-    trakt_time = trakt_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
-    episode_item.aired_at = trakt_time
+if trakt_time and trakt_time.tzinfo is None:
+    trakt_time = trakt_time.replace(tzinfo=timezone.utc).astimezone(self.local_tz)
+    episode_item.aired_at = trakt_time

Committable suggestion skipped: line range outside the PR's diff.

Comment on lines 69 to 71
show_url = f"{self.BASE_URL}/shows/{response.data.id}"
show_response = self.request_handler.execute(HttpMethod.GET, show_url)
return show_response.data if show_response.is_ok else None
Copy link
Contributor

@coderabbitai coderabbitai bot Nov 30, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Avoid Redundant API Call in get_show_by_imdb_id

The method makes two API calls: one to /lookup/shows and then another to /shows/{id}. If the initial response from /lookup/shows contains all the necessary show information, the second API call may be unnecessary. Verify if the data returned by /lookup/shows suffices, and eliminate the redundant API call to improve performance.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@wolfemir you should take a look at this one, he's got a point ;)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems like the humans are having a chat. I'll hop back into my burrow for now. If you need me again, just tag @coderabbitai in a new comment, and I'll come hopping out!

src/program/apis/tvmaze_api.py Outdated Show resolved Hide resolved
src/program/apis/trakt_api.py Outdated Show resolved Hide resolved
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (3)
src/program/apis/tvmaze_api.py (3)

3-3: Remove unused import

The timezone import from datetime is not used in the code as timezone handling is done using datetime.now().astimezone().tzinfo.

-from datetime import datetime, timedelta, timezone
+from datetime import datetime, timedelta
🧰 Tools
🪛 Ruff (0.8.0)

3-3: datetime.timezone imported but unused

Remove unused import: datetime.timezone

(F401)


26-27: Add error handling documentation

Consider adding docstring to the execute method to document the possible exceptions that could be raised and their conditions.

     def execute(self, method: HttpMethod, endpoint: str, **kwargs):
+        """Execute a request to the TVMaze API
+        
+        Args:
+            method: HTTP method to use
+            endpoint: API endpoint to call
+            **kwargs: Additional arguments to pass to the request
+            
+        Raises:
+            TVMazeAPIError: When the API request fails
+        """
         return super()._request(method, endpoint, **kwargs)

120-196: Refactor get_episode_release_time for better maintainability

The method is quite long and handles multiple responsibilities. Consider breaking it down into smaller, focused methods:

  1. Validation logic (lines 122-132)
  2. Regular schedule check (lines 139-143)
  3. Streaming schedule check (lines 144-191)
+    def _validate_episode(self, item: MediaItem) -> bool:
+        """Validate if the item is a valid episode for release time lookup"""
+        if not isinstance(item, Episode):
+            logger.debug(f"Skipping release time lookup - not an episode: {item.log_string if item else None}")
+            return False
+            
+        if not item.parent or not item.parent.parent:
+            logger.debug(f"Skipping release time lookup - missing parent/show: {item.log_string}")
+            return False
+            
+        if not item.parent.parent.imdb_id:
+            logger.debug(f"Skipping release time lookup - no IMDb ID: {item.log_string}")
+            return False
+        
+        return True
+
+    def _check_streaming_schedule(self, show_id: int, item: Episode, start_date: datetime) -> Optional[datetime]:
+        """Check streaming schedule for the next 30 days"""
+        for i in range(30):
+            check_date = start_date + timedelta(days=i)
+            releases = self._get_streaming_releases(check_date)
+            if not releases:
+                continue
+                
+            for release in releases:
+                if self._is_matching_release(release, item, show_id):
+                    streaming_date = self._parse_air_date(release)
+                    if streaming_date:
+                        logger.debug(f"Found streaming release for {item.log_string}: {streaming_date}")
+                        return streaming_date
+        return None
+
     def get_episode_release_time(self, item: MediaItem) -> Optional[datetime]:
         """Get episode release time for a media item"""
-        if not isinstance(item, Episode):
-            logger.debug(f"Skipping release time lookup - not an episode: {item.log_string if item else None}")
-            return None
-            
-        if not item.parent or not item.parent.parent:
-            logger.debug(f"Skipping release time lookup - missing parent/show: {item.log_string}")
-            return None
-            
-        if not item.parent.parent.imdb_id:
-            logger.debug(f"Skipping release time lookup - no IMDb ID: {item.log_string}")
+        if not self._validate_episode(item):
             return None
 
         show = self.get_show_by_imdb_id(item.parent.parent.imdb_id)
         if not show or not hasattr(show, "id"):
             logger.debug(f"Failed to get TVMaze show for IMDb ID: {item.parent.parent.imdb_id}")
             return None
 
         # Get episode air date from regular schedule
         air_date = self.get_episode_by_number(show.id, item.parent.number, item.number)
         if air_date:
             logger.debug(f"Found regular schedule time for {item.log_string}: {air_date}")
 
         # Check streaming releases for next 30 days
         today = datetime.now(self.local_tz).replace(hour=0, minute=0, second=0, microsecond=0)
         logger.debug(f"Checking streaming schedule for {item.log_string} (Show ID: {show.id})")
         
-        for i in range(30):
-            check_date = today + timedelta(days=i)
-            url = f"{self.BASE_URL}/schedule/web"
-            response = self.request_handler.execute(
-                HttpMethod.GET, 
-                url, 
-                params={
-                    "date": check_date.strftime("%Y-%m-%d"),
-                    "country": None
-                }
-            )
-            
-            if not response.is_ok:
-                logger.debug(f"Failed to get streaming schedule for {check_date.strftime('%Y-%m-%d')}")
-                continue
-                
-            for release in response.data:
-                if not release:
-                    continue
-                    
-                show_data = getattr(release, "show", None)
-                if not show_data:
-                    continue
-                    
-                # Get externals safely
-                externals = getattr(show_data, "externals", {}) or {}
-                show_imdb = externals.get("imdb")
-                show_id = getattr(show_data, "id", None)
-                
-                # Check both IMDb ID and TVMaze show ID
-                is_match = (
-                    (show_imdb == item.parent.parent.imdb_id or show_id == show.id) and
-                    getattr(release, "season", 0) == item.parent.number and
-                    getattr(release, "number", 0) == item.number
-                )
-                
-                if is_match:
-                    streaming_date = self._parse_air_date(release)
-                    if streaming_date:
-                        logger.debug(f"Found streaming release for {item.log_string}: {streaming_date}")
-                        if not air_date or streaming_date < air_date:
-                            air_date = streaming_date
-                            logger.debug(f"Using earlier streaming release time for {item.log_string}: {streaming_date}")
+        streaming_date = self._check_streaming_schedule(show.id, item, today)
+        if streaming_date and (not air_date or streaming_date < air_date):
+            air_date = streaming_date
+            logger.debug(f"Using earlier streaming release time for {item.log_string}: {streaming_date}")
 
         if air_date:
             logger.debug(f"Final release time for {item.log_string}: {air_date}")
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between fbce61e and 46f4e45.

📒 Files selected for processing (1)
  • src/program/apis/tvmaze_api.py (1 hunks)
🧰 Additional context used
🪛 Ruff (0.8.0)
src/program/apis/tvmaze_api.py

3-3: datetime.timezone imported but unused

Remove unused import: datetime.timezone

(F401)

🔇 Additional comments (3)
src/program/apis/tvmaze_api.py (3)

19-21: LGTM!

The exception class is well-defined and properly documented.


83-118: LGTM! Robust error handling in date parsing

The _parse_air_date method implements comprehensive error handling with:

  • Proper null checks using getattr
  • Specific exception handling for ValueError and AttributeError
  • Detailed error logging
  • Graceful fallbacks for different date formats

45-61: 🛠️ Refactor suggestion

Optimize API calls in get_show_by_imdb_id

As noted in previous reviews, this method makes two API calls where one might suffice. The /lookup/shows endpoint likely returns sufficient show information.

     def get_show_by_imdb_id(self, imdb_id: str) -> Optional[dict]:
         """Get show information by IMDb ID"""
         if not imdb_id:
             return None
         
         url = f"{self.BASE_URL}/lookup/shows"
         response = self.request_handler.execute(
             HttpMethod.GET, 
             url,
             params={"imdb": imdb_id}
         )
         if not response.is_ok:
             return None
 
-        show_url = f"{self.BASE_URL}/shows/{response.data.id}"
-        show_response = self.request_handler.execute(HttpMethod.GET, show_url)
-        return show_response.data if show_response.is_ok else None
+        return response.data

@@ -391,6 +400,9 @@ def _reset(self):
def log_string(self):
return self.title or self.id

def __repr__(self):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove this, its not needed as item.log_string already does it for us

logger.debug(f"Skipping release time lookup - missing parent/show: {item.log_string}")
return None

if not item.parent.parent.imdb_id:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

instead of all the if not item.parent.parent stuff, you can check the type by doing if item.type == "episode"

doing it like it is now can cause errors to popup when an item doesn't have the nested parent attributes

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (5)
src/program/apis/tvmaze_api.py (2)

78-98: Consider adding type hints to the _parse_air_date method parameters.

The method lacks type hints which could improve code maintainability and IDE support.

-    def _parse_air_date(self, episode_data) -> Optional[datetime]:
+    def _parse_air_date(self, episode_data: SimpleNamespace) -> Optional[datetime]:

115-192: Consider breaking down the get_episode_release_time method.

The method is quite long (77 lines) and handles multiple responsibilities. Consider extracting the streaming schedule check into a separate method for better maintainability.

+    def _check_streaming_schedule(self, show_data: SimpleNamespace, episode_item: Episode, today: datetime) -> Optional[datetime]:
+        """Check streaming releases for the next 30 days."""
+        for i in range(30):
+            check_date = today + timedelta(days=i)
+            response = self.request_handler.execute(
+                HttpMethod.GET,
+                f"{self.BASE_URL}/schedule/web",
+                params={
+                    "date": check_date.strftime("%Y-%m-%d"),
+                    "country": None
+                }
+            )
+            
+            if not response.is_ok:
+                logger.debug(f"Failed to get streaming schedule for {check_date.strftime('%Y-%m-%d')}")
+                continue
+                
+            for release in response.data:
+                if self._is_matching_episode(release, show_data, episode_item):
+                    streaming_date = self._parse_air_date(release)
+                    if streaming_date:
+                        logger.debug(f"Found streaming release: {streaming_date}")
+                        return streaming_date
+        return None

+    def _is_matching_episode(self, release: SimpleNamespace, show_data: SimpleNamespace, episode_item: Episode) -> bool:
+        """Check if the release matches the episode."""
+        if not release:
+            return False
+            
+        show_data = getattr(release, "show", None)
+        if not show_data:
+            return False
+            
+        externals = getattr(show_data, "externals", {}) or {}
+        show_imdb = externals.get("imdb")
+        show_id = getattr(show_data, "id", None)
+        
+        return (
+            (show_imdb == show.imdb_id or show_id == show_data.id) and
+            getattr(release, "season", 0) == episode_item.parent.number and
+            getattr(release, "number", 0) == episode_item.number
+        )
src/program/services/indexers/trakt.py (1)

127-133: Consider using UTC for internal storage of dates.

While setting the timezone is necessary, it's generally better to store dates in UTC internally and only convert to local timezone when displaying to users.

src/program/apis/trakt_api.py (1)

2-5: Remove unused imports.

Several imports are not used in the code:

  • datetime.timedelta
  • time
  • zoneinfo.ZoneInfo
-from datetime import datetime, timedelta, timezone
-import time
-from zoneinfo import ZoneInfo
+from datetime import datetime, timezone
🧰 Tools
🪛 Ruff (0.8.0)

2-2: datetime.timedelta imported but unused

Remove unused import: datetime.timedelta

(F401)


3-3: time imported but unused

Remove unused import: time

(F401)


4-4: zoneinfo.ZoneInfo imported but unused

Remove unused import: zoneinfo.ZoneInfo

(F401)

src/program/media/item.py (1)

177-188: Consider handling additional timezone edge cases.

The timezone handling is improved, but consider these edge cases:

  1. Invalid timezone information in aired_at
  2. Cases where the local timezone might not be appropriate (e.g., international content)
 def is_released(self) -> bool:
     """Check if an item has been released."""
     if not self.aired_at:
         return False
         
     # Ensure both datetimes are timezone-aware for comparison
     now = datetime.now().astimezone()
     aired_at = self.aired_at
     
     # Make aired_at timezone-aware if it isn't already
     if aired_at.tzinfo is None:
+        try:
             aired_at = aired_at.replace(tzinfo=now.tzinfo)
+        except (ValueError, OSError) as e:
+            logger.warning(f"Invalid timezone for {self.log_string}: {e}")
+            # Fallback to UTC
+            aired_at = aired_at.replace(tzinfo=timezone.utc)
         
     return aired_at <= now
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 46f4e45 and 049ecdc.

📒 Files selected for processing (4)
  • src/program/apis/trakt_api.py (3 hunks)
  • src/program/apis/tvmaze_api.py (1 hunks)
  • src/program/media/item.py (3 hunks)
  • src/program/services/indexers/trakt.py (3 hunks)
🧰 Additional context used
🪛 Ruff (0.8.0)
src/program/apis/trakt_api.py

2-2: datetime.timedelta imported but unused

Remove unused import: datetime.timedelta

(F401)


3-3: time imported but unused

Remove unused import: time

(F401)


4-4: zoneinfo.ZoneInfo imported but unused

Remove unused import: zoneinfo.ZoneInfo

(F401)

src/program/services/indexers/trakt.py

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)

src/program/apis/tvmaze_api.py

3-3: datetime.timezone imported but unused

Remove unused import: datetime.timezone

(F401)


9-9: program.media.item.Episode imported but unused

Remove unused import: program.media.item.Episode

(F401)

🔇 Additional comments (7)
src/program/apis/tvmaze_api.py (2)

1-17: LGTM: Well-structured imports and request handling setup.

The implementation follows good practices with clear separation of concerns and proper error handling through the request handler.

🧰 Tools
🪛 Ruff (0.8.0)

3-3: datetime.timezone imported but unused

Remove unused import: datetime.timezone

(F401)


9-9: program.media.item.Episode imported but unused

Remove unused import: program.media.item.Episode

(F401)


19-21: LGTM: Clean exception and request handler implementation.

Good practice to have a custom exception class and proper inheritance from BaseRequestHandler.

Also applies to: 22-28

src/program/services/indexers/trakt.py (2)

29-34: LGTM: Improved timezone handling using tzlocal.

Good improvement using get_localzone() for reliable timezone detection, addressing the issues raised in previous reviews.


144-181: LGTM: Comprehensive release time handling.

The implementation properly compares release times from both Trakt and TVMaze, using the earlier time when available. The logging is thorough and helpful for debugging.

src/program/apis/trakt_api.py (1)

373-393: LGTM: Robust date parsing implementation.

The implementation properly handles different date formats and timezone conversions, with good error handling and logging.

src/program/media/item.py (2)

403-404: LGTM! Clear and informative string representation.

The __repr__ implementation provides useful debugging information by including both the item's log string and state.


432-432: LGTM! Good cleanup of duplicate __repr__ implementations.

Removing the __repr__ methods from child classes reduces code duplication and ensures consistent string representation across all media types. This change aligns with the DRY principle and makes the codebase more maintainable.

@wolfemir wolfemir requested a review from dreulavelle December 1, 2024 04:10
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (2)
src/program/services/indexers/trakt.py (2)

4-4: Remove unused import time

The time module is imported but not used in this file. Removing it cleans up the code and eliminates unnecessary imports.

Apply this diff:

-import time
🧰 Tools
🪛 Ruff (0.8.0)

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: Remove unused import Optional from typing module

The Optional type is imported but not used in this file. Removing it cleans up the code and eliminates unnecessary imports.

Apply this diff:

-from typing import Generator, Optional, Union
+from typing import Generator, Union
🧰 Tools
🪛 Ruff (0.8.0)

5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 049ecdc and 1ad3dc9.

📒 Files selected for processing (2)
  • src/program/program.py (2 hunks)
  • src/program/services/indexers/trakt.py (3 hunks)
🧰 Additional context used
🪛 Ruff (0.8.0)
src/program/services/indexers/trakt.py

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)

🔇 Additional comments (4)
src/program/services/indexers/trakt.py (4)

131-133: ⚠️ Potential issue

Potential incorrect handling of naive aired_at datetime in seasons

Assigning self.local_tz to a naive aired_at datetime using replace(tzinfo=...) assumes the original datetime is in the local timezone. If aired_at is actually in UTC, this can lead to incorrect times.

Suggested Fix: Properly localize the aired_at datetime

Assuming aired_at is in UTC, set the timezone to UTC and convert it to the local timezone.

Apply this diff:

-if season_item.aired_at and season_item.aired_at.tzinfo is None:
-    season_item.aired_at = season_item.aired_at.replace(tzinfo=self.local_tz)
+if season_item.aired_at and season_item.aired_at.tzinfo is None:
+    season_item.aired_at = season_item.aired_at.replace(tzinfo=timezone.utc).astimezone(self.local_tz)

141-143: ⚠️ Potential issue

Potential incorrect handling of naive aired_at datetime in episodes

Directly assigning self.local_tz to a naive aired_at datetime may result in incorrect times if the original datetime is in UTC or another timezone.

Suggested Fix: Properly localize the aired_at datetime

Assuming aired_at is in UTC, set the timezone to UTC and convert it to the local timezone.

Apply this diff:

-if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
-    episode_item.aired_at = episode_item.aired_at.replace(tzinfo=self.local_tz)
+if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
+    episode_item.aired_at = episode_item.aired_at.replace(tzinfo=timezone.utc).astimezone(self.local_tz)

157-159: ⚠️ Potential issue

Potential incorrect handling of trakt_time when assigning timezone

Assigning the local timezone to a naive trakt_time datetime using replace(tzinfo=...) may result in incorrect times if trakt_time is actually in UTC or another timezone.

Suggested Fix: Properly localize trakt_time

Assuming trakt_time is in UTC, set the timezone to UTC and convert it to the local timezone.

Apply this diff:

-if trakt_time and trakt_time.tzinfo is None:
-    trakt_time = trakt_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
-    episode.aired_at = trakt_time
+if trakt_time and trakt_time.tzinfo is None:
+    trakt_time = trakt_time.replace(tzinfo=timezone.utc).astimezone(self.local_tz)
+    episode.aired_at = trakt_time

167-169: ⚠️ Potential issue

Potential incorrect handling of tvmaze_time when assigning timezone

Directly setting the local timezone on a naive tvmaze_time datetime may lead to incorrect times if the original datetime is in UTC.

Suggested Fix: Properly localize tvmaze_time

Assuming tvmaze_time is in UTC, set the timezone to UTC and convert it to the local timezone.

Apply this diff:

-if tvmaze_time.tzinfo is None:
-    tvmaze_time = tvmaze_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
+if tvmaze_time.tzinfo is None:
+    tvmaze_time = tvmaze_time.replace(tzinfo=timezone.utc).astimezone(self.local_tz)

Comment on lines +248 to +250
show = item if item.type == "show" else (
item.parent.parent if item.type == "episode" else item.parent
)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Handle potential NoneType errors when accessing parent attributes

When determining the show object, accessing item.parent or item.parent.parent without checking for None may raise an AttributeError if any parent is missing. To prevent potential runtime errors, add checks to ensure that parent attributes exist before accessing them.

Apply this diff to handle potential None values:

-show = item if item.type == "show" else (
-    item.parent.parent if item.type == "episode" else item.parent
-)
+if item.type == "show":
+    show = item
+elif item.type == "episode" and item.parent and item.parent.parent:
+    show = item.parent.parent
+elif item.type == "season" and item.parent:
+    show = item.parent
+else:
+    logger.error(f"Cannot determine show for item {item.log_string} due to missing parent information.")
+    continue
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
show = item if item.type == "show" else (
item.parent.parent if item.type == "episode" else item.parent
)
if item.type == "show":
show = item
elif item.type == "episode" and item.parent and item.parent.parent:
show = item.parent.parent
elif item.type == "season" and item.parent:
show = item.parent
else:
logger.error(f"Cannot determine show for item {item.log_string} due to missing parent information.")
continue

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (5)
src/program/apis/tvmaze_api.py (3)

3-17: Clean up unused imports

Remove the following unused imports to improve code maintainability:

  • datetime.timedelta
  • datetime.timezone
  • program.media.item.MediaItem
-from datetime import datetime, timedelta, timezone
+from datetime import datetime
 from typing import Optional
 
 from loguru import logger
 from requests import Session
 
-from program.media.item import Episode, MediaItem
+from program.media.item import Episode
🧰 Tools
🪛 Ruff (0.8.0)

3-3: datetime.timedelta imported but unused

Remove unused import

(F401)


3-3: datetime.timezone imported but unused

Remove unused import

(F401)


9-9: program.media.item.MediaItem imported but unused

Remove unused import: program.media.item.MediaItem

(F401)


88-123: Enhance error logging for timestamp parsing

The error logging could be more detailed to help diagnose parsing issues.

-            logger.debug(f"Failed to parse TVMaze airstamp: {airstamp} - {e}")
+            logger.debug(f"Failed to parse TVMaze airstamp '{airstamp}' - Error: {e}. Format should be ISO 8601.")

127-128: Simplify parent relationship validation

Replace nested attribute checks with a more robust validation method.

-if not episode or not episode.parent or not episode.parent.parent:
+if not isinstance(episode, Episode) or not hasattr(episode, 'parent') or not isinstance(episode.parent, Season) or not hasattr(episode.parent, 'parent') or not isinstance(episode.parent.parent, Show):
     return None
src/program/services/indexers/trakt.py (2)

3-5: Remove unused imports

The following imports are not used:

  • time
  • typing.Optional
-from datetime import datetime, timedelta, timezone
-import time
-from typing import Generator, Optional, Union
+from datetime import datetime, timedelta, timezone
+from typing import Generator, Union
🧰 Tools
🪛 Ruff (0.8.0)

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)


148-211: Consider breaking down the complex release time comparison logic

The method handles multiple responsibilities and could be split into smaller, focused methods:

  • Release time comparison logic
  • Skip condition evaluation
  • Time difference calculation

Example refactor:

def _compare_release_times(self, episode, trakt_time, tvmaze_time) -> datetime:
    """Compare and return the earlier release time."""
    if not trakt_time:
        return tvmaze_time
    if not tvmaze_time:
        return trakt_time
    return min(trakt_time, tvmaze_time)

def _should_skip_remaining(self, episode) -> bool:
    """Determine if remaining episodes should be skipped."""
    if not episode.aired_at:
        return True
    now = datetime.now(episode.aired_at.tzinfo)
    time_until_release = episode.aired_at - now
    return time_until_release.days > 7
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 1ad3dc9 and 1cb35ca.

📒 Files selected for processing (2)
  • src/program/apis/tvmaze_api.py (1 hunks)
  • src/program/services/indexers/trakt.py (3 hunks)
🧰 Additional context used
🪛 Ruff (0.8.0)
src/program/services/indexers/trakt.py

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)

src/program/apis/tvmaze_api.py

3-3: datetime.timedelta imported but unused

Remove unused import

(F401)


3-3: datetime.timezone imported but unused

Remove unused import

(F401)


9-9: program.media.item.MediaItem imported but unused

Remove unused import: program.media.item.MediaItem

(F401)

🔇 Additional comments (4)
src/program/apis/tvmaze_api.py (2)

33-44: LGTM: Well-structured initialization with proper configuration

The initialization includes:

  • Appropriate rate limiting (20 calls per 10 seconds)
  • Caching configuration with 24-hour TTL
  • Proper timezone handling

46-66: LGTM: Optimized API call implementation

The method efficiently retrieves show information with a single API call, addressing the previous feedback about redundant API calls.

src/program/services/indexers/trakt.py (2)

128-146: LGTM: Proper parent relationship and timezone handling

The implementation correctly:

  • Sets parent relationships for seasons and episodes
  • Ensures timezone-aware dates using the system's local timezone

169-182: 🛠️ Refactor suggestion

Optimize timezone handling

Use the already available self.local_tz instead of repeatedly getting the current timezone.

 trakt_time = episode.aired_at
 if trakt_time and trakt_time.tzinfo is None:
-    trakt_time = trakt_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
+    trakt_time = trakt_time.replace(tzinfo=self.local_tz)
     episode.aired_at = trakt_time

 # Get release time from TVMaze and use it if it's earlier or if Trakt has no time
 tvmaze_time = self.tvmaze_api.get_episode_release_time(episode)
 if tvmaze_time:
     # Ensure TVMaze time is timezone-aware
     if tvmaze_time.tzinfo is None:
-        tvmaze_time = tvmaze_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
+        tvmaze_time = tvmaze_time.replace(tzinfo=self.local_tz)

Likely invalid or redundant comment.

Comment on lines 68 to 86
def get_episode_by_number(self, show_id: int, season: int, episode: int) -> Optional[datetime]:
"""Get episode information by show ID and episode number"""
if not show_id or not season or not episode:
return None

url = f"{self.BASE_URL}/shows/{show_id}/episodebynumber"
response = self.request_handler.execute(
HttpMethod.GET,
url,
params={
"season": season,
"number": episode
}
)

if not response.is_ok or not response.data:
return None

return self._parse_air_date(response.data)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Enhance error handling for API calls

The method should handle specific API exceptions similar to get_show_by_imdb_id.

 def get_episode_by_number(self, show_id: int, season: int, episode: int) -> Optional[datetime]:
     if not show_id or not season or not episode:
         return None
 
     url = f"{self.BASE_URL}/shows/{show_id}/episodebynumber"
-    response = self.request_handler.execute(
-        HttpMethod.GET,
-        url,
-        params={
-            "season": season,
-            "number": episode
-        }
-    )
-    
-    if not response.is_ok or not response.data:
+    try:
+        response = self.request_handler.execute(
+            HttpMethod.GET,
+            url,
+            params={
+                "season": season,
+                "number": episode
+            }
+        )
+        
+        if not response.is_ok or not response.data:
+            return None
+
+        return self._parse_air_date(response.data)
+    except Exception as e:
+        logger.debug(f"Error getting episode by number - show_id={show_id}, S{season:02d}E{episode:02d}: {e}")
         return None
-
-    return self._parse_air_date(response.data)
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
def get_episode_by_number(self, show_id: int, season: int, episode: int) -> Optional[datetime]:
"""Get episode information by show ID and episode number"""
if not show_id or not season or not episode:
return None
url = f"{self.BASE_URL}/shows/{show_id}/episodebynumber"
response = self.request_handler.execute(
HttpMethod.GET,
url,
params={
"season": season,
"number": episode
}
)
if not response.is_ok or not response.data:
return None
return self._parse_air_date(response.data)
def get_episode_by_number(self, show_id: int, season: int, episode: int) -> Optional[datetime]:
"""Get episode information by show ID and episode number"""
if not show_id or not season or not episode:
return None
url = f"{self.BASE_URL}/shows/{show_id}/episodebynumber"
try:
response = self.request_handler.execute(
HttpMethod.GET,
url,
params={
"season": season,
"number": episode
}
)
if not response.is_ok or not response.data:
return None
return self._parse_air_date(response.data)
except Exception as e:
logger.debug(f"Error getting episode by number - show_id={show_id}, S{season:02d}E{episode:02d}: {e}")
return None

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (3)
src/program/apis/tvmaze_api.py (3)

3-17: Clean up unused imports

The following imports are not used in the code:

  • timedelta and timezone from datetime
  • MediaItem from program.media.item
-from datetime import datetime, timedelta, timezone
+from datetime import datetime
-from program.media.item import Episode, MediaItem
+from program.media.item import Episode
🧰 Tools
🪛 Ruff (0.8.0)

3-3: datetime.timedelta imported but unused

Remove unused import

(F401)


3-3: datetime.timezone imported but unused

Remove unused import

(F401)


9-9: program.media.item.MediaItem imported but unused

Remove unused import: program.media.item.MediaItem

(F401)


22-28: Consider simplifying TVMazeRequestHandler

The execute method is just a pass-through to the parent's _request method. Consider removing it and using _request directly, or document why this wrapper is necessary.


92-128: Simplify timestamp parsing logic

The timestamp parsing logic could be simplified by using a single try-except block and standardizing the input format first.

 def _parse_air_date(self, episode_data) -> Optional[datetime]:
     """Parse episode air date from TVMaze response"""
-    if airstamp := getattr(episode_data, "airstamp", None):
-        try:
-            # Handle both 'Z' suffix and explicit timezone
-            timestamp = airstamp.replace('Z', '+00:00')
-            if '.' in timestamp:
-                # Strip milliseconds but preserve timezone
-                parts = timestamp.split('.')
-                base = parts[0]
-                tz = parts[1][parts[1].find('+'):]
-                timestamp = base + tz if '+' in parts[1] else base + '+00:00'
-            elif not ('+' in timestamp or '-' in timestamp):
-                # Add UTC timezone if none specified
-                timestamp = timestamp + '+00:00'
-            # Convert to user's timezone
-            utc_dt = datetime.fromisoformat(timestamp)
-            return utc_dt.astimezone(self.local_tz)
-        except (ValueError, AttributeError) as e:
-            logger.debug(f"Failed to parse TVMaze airstamp: {airstamp} - {e}")
+    try:
+        # Try parsing airstamp first
+        if airstamp := getattr(episode_data, "airstamp", None):
+            # Standardize timestamp format
+            timestamp = airstamp.replace('Z', '+00:00')
+            if '.' in timestamp:
+                timestamp = timestamp.split('.')[0] + '+00:00'
+            elif not ('+' in timestamp or '-' in timestamp):
+                timestamp = timestamp + '+00:00'
+            return datetime.fromisoformat(timestamp).astimezone(self.local_tz)
 
-    try:
-        if airdate := getattr(episode_data, "airdate", None):
-            if airtime := getattr(episode_data, "airtime", None):
-                # Combine date and time with UTC timezone first
-                dt_str = f"{airdate}T{airtime}+00:00"
-                utc_dt = datetime.fromisoformat(dt_str)
-                # Convert to user's timezone
-                return utc_dt.astimezone(self.local_tz)
-            # If we only have a date, set time to midnight in user's timezone
-            local_midnight = datetime.fromisoformat(f"{airdate}T00:00:00").replace(tzinfo=self.local_tz)
-            return local_midnight
-    except (ValueError, AttributeError) as e:
-        logger.error(f"Failed to parse TVMaze air date/time: {airdate}/{getattr(episode_data, 'airtime', None)} - {e}")
-    
-    return None
+        # Fallback to separate date/time fields
+        if airdate := getattr(episode_data, "airdate", None):
+            time_str = getattr(episode_data, "airtime", "00:00:00")
+            dt_str = f"{airdate}T{time_str}+00:00"
+            return datetime.fromisoformat(dt_str).astimezone(self.local_tz)
+            
+        return None
+    except (ValueError, AttributeError) as e:
+        logger.debug(f"Failed to parse TVMaze air date: {e}")
+        return None
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 1cb35ca and ad058d8.

📒 Files selected for processing (1)
  • src/program/apis/tvmaze_api.py (1 hunks)
🧰 Additional context used
🪛 Ruff (0.8.0)
src/program/apis/tvmaze_api.py

3-3: datetime.timedelta imported but unused

Remove unused import

(F401)


3-3: datetime.timezone imported but unused

Remove unused import

(F401)


9-9: program.media.item.MediaItem imported but unused

Remove unused import: program.media.item.MediaItem

(F401)

🔇 Additional comments (3)
src/program/apis/tvmaze_api.py (3)

33-44: LGTM! Well-structured initialization with proper rate limiting and caching

The implementation correctly sets up rate limiting, caching, and timezone handling.


46-67: LGTM! Clean implementation with proper error handling

The method makes a single API call and includes appropriate error handling and logging.


129-132: Use type checking instead of parent attribute checks

As suggested in the previous review, replace parent attribute checks with type checking for better reliability.

-        if not episode or not episode.parent or not episode.parent.parent:
+        if not isinstance(episode, Episode):
             return None

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

🧹 Outside diff range and nitpick comments (5)
src/program/apis/tvmaze_api.py (2)

3-3: Remove unused imports timedelta and timezone

The imports timedelta and timezone from datetime are not used in the code.

Apply this diff to remove the unused imports:

- from datetime import datetime, timedelta, timezone
+ from datetime import datetime
🧰 Tools
🪛 Ruff (0.8.0)

3-3: datetime.timedelta imported but unused

Remove unused import

(F401)


3-3: datetime.timezone imported but unused

Remove unused import

(F401)


9-9: Remove unused import MediaItem

The MediaItem class is imported but not used in the code.

Apply this diff to remove the unused import:

- from program.media.item import Episode, MediaItem
+ from program.media.item import Episode
🧰 Tools
🪛 Ruff (0.8.0)

9-9: program.media.item.MediaItem imported but unused

Remove unused import: program.media.item.MediaItem

(F401)

src/program/services/indexers/trakt.py (3)

4-4: Remove unused import time

The time module is imported but not used in the code.

Apply this diff to remove the unused import:

- import time
🧰 Tools
🪛 Ruff (0.8.0)

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: Remove unused import Optional

The Optional type is imported but not used in the code.

Apply this diff to remove the unused import:

- from typing import Generator, Optional, Union
+ from typing import Generator, Union
🧰 Tools
🪛 Ruff (0.8.0)

5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)


199-199: Remove unnecessary f-string prefix in log message

The log message is an f-string without any placeholders. Removing the f prefix enhances code clarity.

Apply this diff:

-    logger.debug(f"Using TVMaze time (no Trakt time available)")
+    logger.debug("Using TVMaze time (no Trakt time available)")
🧰 Tools
🪛 Ruff (0.8.0)

199-199: f-string without any placeholders

Remove extraneous f prefix

(F541)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between ad058d8 and a2b3290.

📒 Files selected for processing (4)
  • src/program/apis/tvmaze_api.py (1 hunks)
  • src/program/media/item.py (4 hunks)
  • src/program/program.py (2 hunks)
  • src/program/services/indexers/trakt.py (3 hunks)
🧰 Additional context used
🪛 Ruff (0.8.0)
src/program/services/indexers/trakt.py

4-4: time imported but unused

Remove unused import: time

(F401)


5-5: typing.Optional imported but unused

Remove unused import: typing.Optional

(F401)


199-199: f-string without any placeholders

Remove extraneous f prefix

(F541)

src/program/apis/tvmaze_api.py

3-3: datetime.timedelta imported but unused

Remove unused import

(F401)


3-3: datetime.timezone imported but unused

Remove unused import

(F401)


9-9: program.media.item.MediaItem imported but unused

Remove unused import: program.media.item.MediaItem

(F401)

🔇 Additional comments (3)
src/program/media/item.py (1)

406-408: Remove unnecessary __repr__ method

The __repr__ method in the MediaItem class may not be needed as item.log_string already provides a suitable string representation.

src/program/services/indexers/trakt.py (1)

132-134: ⚠️ Potential issue

Properly handle naive datetime objects when setting timezones

Directly setting the timezone on naive datetime objects using replace(tzinfo=...) assumes the datetime is in the local timezone, which may not be accurate and can lead to incorrect times if the original datetime is in UTC or another timezone.

Apply this diff to correctly localize aired_at assuming it is in UTC:

-    if season_item.aired_at and season_item.aired_at.tzinfo is None:
-        season_item.aired_at = season_item.aired_at.replace(tzinfo=self.local_tz)
+    if season_item.aired_at and season_item.aired_at.tzinfo is None:
+        season_item.aired_at = season_item.aired_at.replace(tzinfo=timezone.utc).astimezone(self.local_tz)

Similarly, for episode_item.aired_at and trakt_time, replace the timezone with UTC and convert to local timezone:

-    if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
-        episode_item.aired_at = episode_item.aired_at.replace(tzinfo=self.local_tz)
+    if episode_item.aired_at and episode_item.aired_at.tzinfo is None:
+        episode_item.aired_at = episode_item.aired_at.replace(tzinfo=timezone.utc).astimezone(self.local_tz)
-    if trakt_time and trakt_time.tzinfo is None:
-        trakt_time = trakt_time.replace(tzinfo=datetime.now().astimezone().tzinfo)
+    if trakt_time and trakt_time.tzinfo is None:
+        trakt_time = trakt_time.replace(tzinfo=timezone.utc).astimezone(self.local_tz)

Also applies to: 142-144, 177-181

src/program/program.py (1)

280-282: ⚠️ Potential issue

Handle potential NoneType errors when accessing parent attributes

Accessing item.parent or item.parent.parent without checking for None may raise an AttributeError if any parent is missing. To prevent runtime errors, ensure that parent attributes exist before accessing them.

Apply this diff to add necessary checks:

-        show = item if item.type == "show" else (
-            item.parent.parent if item.type == "episode" else item.parent
-        )
+        if item.type == "show":
+            show = item
+        elif item.type == "episode" and item.parent and item.parent.parent:
+            show = item.parent.parent
+        elif item.type == "season" and item.parent:
+            show = item.parent
+        else:
+            logger.error(f"Cannot determine show for item {item.log_string} due to missing parent information.")
+            continue

Comment on lines +145 to +149
def store_state(self, given_state: States = None) -> tuple[States, States]:
"""Store the state of the item."""
if self.last_state == States.Completed:
return

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Reconsider early return in store_state to allow necessary state updates

The early return in store_state when self.last_state == States.Completed prevents any further state updates. This might be problematic if you need to update the state after completion, such as in cases of reprocessing or error corrections.

Consider adjusting the logic to allow state updates when appropriate.

Comment on lines +180 to +191
if not self.aired_at:
return False

# Ensure both datetimes are timezone-aware for comparison
now = datetime.now().astimezone()
aired_at = self.aired_at

# Make aired_at timezone-aware if it isn't already
if aired_at.tzinfo is None:
aired_at = aired_at.replace(tzinfo=now.tzinfo)

return aired_at <= now
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Properly handle naive aired_at datetime objects to ensure accurate timezone conversions

Directly setting the timezone on a naive datetime object using replace(tzinfo=...) assumes that the original datetime is in the local timezone, which may not be accurate. This can lead to incorrect release status determination.

Apply this diff to correctly localize aired_at assuming it is in UTC:

+    from datetime import timezone
     ...
     if aired_at.tzinfo is None:
-        aired_at = aired_at.replace(tzinfo=now.tzinfo)
+        aired_at = aired_at.replace(tzinfo=timezone.utc).astimezone(now.tzinfo)

Alternatively, if aired_at is intended to be in the local timezone, consider using a method that properly localizes naive datetime objects without assuming they are in UTC.

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
if not self.aired_at:
return False
# Ensure both datetimes are timezone-aware for comparison
now = datetime.now().astimezone()
aired_at = self.aired_at
# Make aired_at timezone-aware if it isn't already
if aired_at.tzinfo is None:
aired_at = aired_at.replace(tzinfo=now.tzinfo)
return aired_at <= now
if not self.aired_at:
return False
# Ensure both datetimes are timezone-aware for comparison
from datetime import timezone
now = datetime.now().astimezone()
aired_at = self.aired_at
# Make aired_at timezone-aware if it isn't already
if aired_at.tzinfo is None:
aired_at = aired_at.replace(tzinfo=timezone.utc).astimezone(now.tzinfo)
return aired_at <= now

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants