Make --archive save the links to the downloaded songs as soon as the songs are downloaded. Closes #2196 #2220

Shajal-Kumar · 2024-11-02T13:31:28Z

Title

Write downloaded songs to the archive file incrementally to prevent data loss on interruption
or
Make --archive save the links to the downloaded songs as soon as the songs are downloaded #2196

Description

This PR modifies the download/downloader.py and utils/archive.py modules to ensure that the archive file is created as soon as the script runs and updated incrementally. The initialize and add_entry methods have been added to archive.py to create the archive file if it does not already exist and add an entry to the archive file while flushing it immediately, respectively. Additionally, the download_multiple_songs method in the Downloader class has been updated to call add_entry as soon as each song is successfully downloaded. This change ensures that even if the download process is interrupted, the archive file will contain the URLs of all songs downloaded up to that point.

Related Issue

This change addresses issue #2196 where the archive file was not being updated during the download process, causing data loss if the process was interrupted.

Motivation and Context

This change is necessary to prevent data loss when using the --archive flag. Previously, the archive file was not created at the beginning of the script and if the download was interrupted, the archive file remained empty, even if most of the songs had been downloaded. By writing to the archive file incrementally, we ensure that users have an up-to-date record of all successfully downloaded songs, even in case of an unexpected interruption.

How Has This Been Tested?

Manually tested by downloading a playlist with songs that would not be present on the download source and interrupting the process at various points to ensure the archive file contains all URLs of downloaded songs.
Verified that the archive file is created at the start of the download process and updated immediately after each successful download.

Screenshots (if appropriate)

Types of Changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)

Checklist

My code follows the code style of this project
My change requires a change to the documentation
I have updated the documentation accordingly
I have read the CONTRIBUTING document
I have read the CORE VALUES document
I have added tests to cover my changes
All new and existing tests passed

…gs incrementally. Closes spotDL#2196

Silverarmor · 2024-11-03T02:46:58Z

Thanks @Shajal-Kumar - I've changed the base to dev branch, can you resolve conflicts in downloader.py?

Shajal-Kumar · 2024-11-03T11:49:18Z

@Silverarmor I've resolved the conflicts in downloader.py. Can you please check it?

Shajal-Kumar · 2024-11-06T04:59:47Z

Hi @Silverarmor, I wanted to follow up and ask if there are any other changes that I need to make.

Silverarmor · 2024-11-06T05:03:39Z

I've requested a code review from @xnetcat who will look into it when they are free :)

xnetcat · 2024-11-14T16:36:18Z

This won't save the data incrementally. You would have to move the add entry code to the search and download function, while making sure that only one thread has access to the file at the time.

xnetcat

as stated in the comment

Shajal-Kumar · 2024-11-20T10:48:12Z

Hello @xnetcat, I'll make the required changes to the search_and_download function. Could you please tell me how I'm supposed to ensure that only one thread has access to the file at a time? Should I separately create a thread lock or do I have to use a part of the existing code while implementing the incremental updates to the archive?

Shajal-Kumar added 2 commits November 2, 2024 17:13

Tried making changes to the download_multiple_songs() function

237bc56

Made changes to --archive to allow saving the links to downloaded son…

ad080be

…gs incrementally. Closes spotDL#2196

Silverarmor changed the base branch from master to dev November 3, 2024 02:46

Silverarmor requested a review from xnetcat November 3, 2024 02:46

Merge branch 'dev' into master

43bd4d0

xnetcat requested changes Nov 14, 2024

View reviewed changes

Merge branch 'dev' into master

f947dd0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make --archive save the links to the downloaded songs as soon as the songs are downloaded. Closes #2196 #2220

Make --archive save the links to the downloaded songs as soon as the songs are downloaded. Closes #2196 #2220

Shajal-Kumar commented Nov 2, 2024

Silverarmor commented Nov 3, 2024

Shajal-Kumar commented Nov 3, 2024

Shajal-Kumar commented Nov 6, 2024

Silverarmor commented Nov 6, 2024

xnetcat commented Nov 14, 2024

xnetcat left a comment

Shajal-Kumar commented Nov 20, 2024

Make --archive save the links to the downloaded songs as soon as the songs are downloaded. Closes #2196 #2220

Are you sure you want to change the base?

Make --archive save the links to the downloaded songs as soon as the songs are downloaded. Closes #2196 #2220

Conversation

Shajal-Kumar commented Nov 2, 2024

Title

Description

Related Issue

Motivation and Context

How Has This Been Tested?

Screenshots (if appropriate)

Types of Changes

Checklist

Silverarmor commented Nov 3, 2024

Shajal-Kumar commented Nov 3, 2024

Shajal-Kumar commented Nov 6, 2024

Silverarmor commented Nov 6, 2024

xnetcat commented Nov 14, 2024

xnetcat left a comment

Choose a reason for hiding this comment

Shajal-Kumar commented Nov 20, 2024