Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

54 change multiprocessing to dask #63

Merged
merged 153 commits into from
Sep 10, 2024

Conversation

davedavemckay
Copy link
Collaborator

DEPRECATED multiprocessing version: lsst-backup.py -> lsst-backup-mp-DEPRECATED.py
Replaced with Dask version: lsst-backup-dask.py -> lsst-backup.py

* Catch-up (#53)

* 45 zip file exclusion (#51)

* .

* existing file removal info

* .

* .

* .

* .

* .

* .

* fix mem_per_core to 1 GiB

* compare zip contents

* .

* boto3 Object get instead of load

* .

* decode

* handling metadata errors with nested try-excepts ?!!

* botocore.exceptions

* e

* .

* restored total_zips variable

* wait for results

* .

* .

* extra info

* elif

* .

* list of dicts for upload info

* less verbose

* update uploads dict

* .

* wrong i

* additional info in dicts

* typo

* .

* .

* exclude individual files

* .

* .

* .

* .

* .

* .

* .

* remove print e as error handled

* .

* .

* introduced upload timeouts

* .

* get instead of wait

* try removing result?

* .

* .

* try terminate

* .

* .

* .

* .

* .

* .

* remove duplicates

* clarified waiting loop

* .

* //

* .

* nightly - remove_duplicates not working

* use pandas to remove duplicates

* done

* type hints added to bucket_manager

* added metadata for multipart uploads of zip files

* filename exclusion

* .

* .

* fiex file exclusion

* done - testing (#52)

* removed boto3 type hints

* .

* use gc

* .

* .

* .

* catch broken pipe

* gc

* try metadata

* .

* .

* .

* .

* .

* df not series

* apply works

* workgin

* try existing zip contents

* sort isins

* ['CURRENT_OBJECTS']

* gc collect everywhere...

* Meatadata not metadata

* these_zip_contents

* brute froc

* less brute force

* .

* any

* check

* .

* .

* .

* .

* .

* .

* dropna

* .

* .

* debugging

* .

* .

* .

* .

* .

* pf replace

* .

* progress

* all

* any

* .

* .

* less verbose

* .

* deleting to_collate[parent_folder]

* change zip contents delimiter

* done.
* Catch-up (#53)

* 45 zip file exclusion (#51)

* .

* existing file removal info

* .

* .

* .

* .

* .

* .

* fix mem_per_core to 1 GiB

* compare zip contents

* .

* boto3 Object get instead of load

* .

* decode

* handling metadata errors with nested try-excepts ?!!

* botocore.exceptions

* e

* .

* restored total_zips variable

* wait for results

* .

* .

* extra info

* elif

* .

* list of dicts for upload info

* less verbose

* update uploads dict

* .

* wrong i

* additional info in dicts

* typo

* .

* .

* exclude individual files

* .

* .

* .

* .

* .

* .

* .

* remove print e as error handled

* .

* .

* introduced upload timeouts

* .

* get instead of wait

* try removing result?

* .

* .

* try terminate

* .

* .

* .

* .

* .

* .

* remove duplicates

* clarified waiting loop

* .

* //

* .

* nightly - remove_duplicates not working

* use pandas to remove duplicates

* done

* type hints added to bucket_manager

* added metadata for multipart uploads of zip files

* filename exclusion

* .

* .

* fiex file exclusion

* done - testing (#52)

* removed boto3 type hints

* .

* use gc

* .

* .

* .

* catch broken pipe

* gc

* try metadata

* .

* .

* .

* .

* .

* df not series

* apply works

* workgin

* try existing zip contents

* sort isins

* ['CURRENT_OBJECTS']

* gc collect everywhere...

* Meatadata not metadata

* these_zip_contents

* brute froc

* less brute force

* .

* any

* check

* .

* .

* .

* .

* .

* .

* dropna

* .

* .

* debugging

* .

* .

* .

* .

* .

* pf replace

* .

* progress

* all

* any

* .

* .

* less verbose

* .

* deleting to_collate[parent_folder]

* change zip contents delimiter

* done.
@davedavemckay davedavemckay merged commit c5dcf3a into main Sep 10, 2024
2 checks passed
@davedavemckay davedavemckay deleted the 54-change-multiprocessing-to-dask branch September 10, 2024 13:57
@davedavemckay davedavemckay restored the 54-change-multiprocessing-to-dask branch September 10, 2024 14:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant