Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize sia download #68

Merged
merged 6 commits into from
Feb 3, 2022

Conversation

fccoelho
Copy link
Collaborator

@fccoelho fccoelho commented Feb 2, 2022

This PR closes issue #64 by not loading large SIA files into memory.

Instead, it save the dbf in the PySUS' cache directory and offers the user a chance to load the dataset in chunks.

This PR also adds some auxiliary functions to the readdbc module, mainly to stream converting dbc files to dbf, and dbf into csv.gz files.

@fccoelho
Copy link
Collaborator Author

fccoelho commented Feb 2, 2022

This PR also solves #27. Thanks to @heber-augusto for helping to detail this problem, and @alfonso777 for issue 27!

@xmnlab
Copy link
Member

xmnlab commented Feb 2, 2022

about the CI error, I guess you will need to add numba to https://github.com/AlertaDengue/PySUS/blob/master/environment.yaml

so conda/mamba can handle the dependency constraint for numba

Copy link

@heber-augusto heber-augusto left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I made two suggestions trying to make processing easier in multiple file scenarios.

pysus/online_data/SIA.py Outdated Show resolved Hide resolved
pysus/online_data/SIA.py Outdated Show resolved Hide resolved
pysus/online_data/SIA.py Outdated Show resolved Hide resolved
@fccoelho fccoelho merged commit bf86aca into AlertaDengue:master Feb 3, 2022
@fccoelho fccoelho deleted the optimize_SIA_Download branch July 13, 2022 13:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants