Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added debug flag #25

Merged
merged 1 commit into from
Aug 7, 2024
Merged

added debug flag #25

merged 1 commit into from
Aug 7, 2024

Conversation

davedavemckay
Copy link
Collaborator

No description provided.

@davedavemckay davedavemckay merged commit 5123b87 into 24-find-zipfiles Aug 7, 2024
2 checks passed
davedavemckay added a commit that referenced this pull request Aug 14, 2024
* added debug flag (#25)

* .

* debugging

* moved du.sh to csd3-side/scripts

* debugging

* .

* .

* .

* .

* .

* .

* .

* .

* names hashable

* timing

* .

* need to append zip contents paths to zip path stub

* .

* .

* .

* .

* .

* .

* .

* .

* .

* .

* .

* .

* .

* nightly - attempting to introduce pandas earlier on

* .

* .

* trying numpy.T on list of Series

* .

* .

* .

* .

* need to work with series of lists

* debug flag usage

* working on prepend_zipfile_path_to_contents func

* prepending paths

* for loop

* debug

* .

* debug

* .

* verify

* .

* .

* test extract and upload

* .

* debug

* .

* .

* .

* verify not finding existing files to skip

* .

* nightly - manual check all files are uploaded

* debugging key from contents is in all keys

* .

* .

* testing isin with list

* .

* .

* .

* .

* .

* .

* multiprocessing extract and upload

* .

* .

* .

* renamed find_collated_zips.py to process_collated_zips.py as "find" is only the basic option

* .

* du.sh

* corrected starmap args

* debug serial extract and upload

* debug to 10000

* .

* .

* .

* .

* nightly - debug version working for extract_and_upload

* using partials

* getting file sizes during search

* .

* calculate max pool size based on zipfile sizes and available RAM

* .

* debugging

* .

* .

* .

* .

* changed argument order in extract_and_upload_mp

* improved pbar

* .

* len instread of buffer

* .

* zf.file_size

* logging

* .

* only calculate pool_size if zips to extract

* user message clarified
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant