Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

54 change multiprocessing to dask #62

Merged
merged 2 commits into from
Sep 3, 2024

Conversation

davedavemckay
Copy link
Collaborator

No description provided.

* Catch-up (#53)

* 45 zip file exclusion (#51)

* .

* existing file removal info

* .

* .

* .

* .

* .

* .

* fix mem_per_core to 1 GiB

* compare zip contents

* .

* boto3 Object get instead of load

* .

* decode

* handling metadata errors with nested try-excepts ?!!

* botocore.exceptions

* e

* .

* restored total_zips variable

* wait for results

* .

* .

* extra info

* elif

* .

* list of dicts for upload info

* less verbose

* update uploads dict

* .

* wrong i

* additional info in dicts

* typo

* .

* .

* exclude individual files

* .

* .

* .

* .

* .

* .

* .

* remove print e as error handled

* .

* .

* introduced upload timeouts

* .

* get instead of wait

* try removing result?

* .

* .

* try terminate

* .

* .

* .

* .

* .

* .

* remove duplicates

* clarified waiting loop

* .

* //

* .

* nightly - remove_duplicates not working

* use pandas to remove duplicates

* done

* type hints added to bucket_manager

* added metadata for multipart uploads of zip files

* filename exclusion

* .

* .

* fiex file exclusion

* done - testing (#52)

* removed boto3 type hints

* .

* use gc

* .

* .

* .

* catch broken pipe

* gc

* try metadata

* .

* .

* .

* .

* .

* df not series

* apply works

* workgin

* try existing zip contents

* sort isins

* ['CURRENT_OBJECTS']

* gc collect everywhere...

* Meatadata not metadata

* these_zip_contents

* brute froc

* less brute force

* .

* any

* check

* .

* .

* .

* .

* .

* .

* dropna

* .

* .

* debugging

* .

* .

* .

* .

* .

* pf replace

* .

* progress

* all

* any

* .

* .

* less verbose

* .

* deleting to_collate[parent_folder]

* change zip contents delimiter

* done.
* Catch-up (#53)

* 45 zip file exclusion (#51)

* .

* existing file removal info

* .

* .

* .

* .

* .

* .

* fix mem_per_core to 1 GiB

* compare zip contents

* .

* boto3 Object get instead of load

* .

* decode

* handling metadata errors with nested try-excepts ?!!

* botocore.exceptions

* e

* .

* restored total_zips variable

* wait for results

* .

* .

* extra info

* elif

* .

* list of dicts for upload info

* less verbose

* update uploads dict

* .

* wrong i

* additional info in dicts

* typo

* .

* .

* exclude individual files

* .

* .

* .

* .

* .

* .

* .

* remove print e as error handled

* .

* .

* introduced upload timeouts

* .

* get instead of wait

* try removing result?

* .

* .

* try terminate

* .

* .

* .

* .

* .

* .

* remove duplicates

* clarified waiting loop

* .

* //

* .

* nightly - remove_duplicates not working

* use pandas to remove duplicates

* done

* type hints added to bucket_manager

* added metadata for multipart uploads of zip files

* filename exclusion

* .

* .

* fiex file exclusion

* done - testing (#52)

* removed boto3 type hints

* .

* use gc

* .

* .

* .

* catch broken pipe

* gc

* try metadata

* .

* .

* .

* .

* .

* df not series

* apply works

* workgin

* try existing zip contents

* sort isins

* ['CURRENT_OBJECTS']

* gc collect everywhere...

* Meatadata not metadata

* these_zip_contents

* brute froc

* less brute force

* .

* any

* check

* .

* .

* .

* .

* .

* .

* dropna

* .

* .

* debugging

* .

* .

* .

* .

* .

* pf replace

* .

* progress

* all

* any

* .

* .

* less verbose

* .

* deleting to_collate[parent_folder]

* change zip contents delimiter

* done.
@davedavemckay davedavemckay linked an issue Sep 3, 2024 that may be closed by this pull request
@davedavemckay davedavemckay merged commit 9a7899c into main Sep 3, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Consider changing multiprocessing usage to Dask
1 participant