AirNow metadata options and other updates #146
base: stable
that can take date arg and defaults to s3 URI, which is faster (700 ms vs 400 ms ish on my laptop)
like done for timezone finder
with a simple two-day test, found able to get slightly more data (100 or so rows) by disabling today metadata, since fewer rows with bad UTC offset were dropped
so codes and such are string
mainly for my current testing/development, but could be useful
as semicolon-separated strings
Current test failure is openaq-fetches denying access ("PermissionError: Access Denied").
- merging of AirNow, with lat/lon if new site IDs
- correct airnow_flag
- drop site ID dupe rows
- express columns to keep instead of columns to drop
Need to conserve more info when duplicate-dropping for AQS. For example, currently only get some of the full set of unique
Should use the CSV parameter list to translate codes for the ones we don't have custom names for. And/or to construct a new in-code translation table.
else:
    raise ValueError("Could not determine if file is daily or hourly")

dtype = {"siteid": str, "obs": float}
Need to ensure MSA code and such are read as str, not int/float. cmsa_name and msa_code seem to need this, or they get converted to float64 (in order to represent missing values with NaN).
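A minimal sketch of the dtype issue described above, assuming pandas is used for reading. The sample text is made up for illustration (comma-separated and much narrower than a real AirNow file); only the column names `siteid`, `msa_code`, and `obs` come from this thread.

```python
import io

import pandas as pd

# Hypothetical two-row sample; the second row has no MSA code.
csv_text = "siteid,msa_code,obs\n000010102,10180,1.5\n000010103,,2.0\n"

# With inferred dtypes, the missing value forces msa_code to float64
# and siteid loses its leading zeros by becoming an integer.
inferred = pd.read_csv(io.StringIO(csv_text))

# With explicit dtypes, the codes stay as strings and the gap stays missing.
typed = pd.read_csv(
    io.StringIO(csv_text),
    dtype={"siteid": str, "msa_code": str, "obs": float},
)

print(inferred.dtypes)
print(typed.dtypes)
```

Extending the existing `dtype` dict with the code-like columns, as in `typed` above, is one way to keep them from being read as numbers.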
Note that this PR fixes UTC offset to be float (integer conversion floors the few that are non-integer, making them incorrect).
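To illustrate why the offset needs to stay float, here is a small standard-library sketch. The helper `to_local` and the 5.5-hour offset are assumptions for the example, not code from this PR.

```python
from datetime import datetime, timedelta


def to_local(utc_time, utc_offset_hours):
    """Shift a naive UTC time by an offset given in hours (hypothetical helper)."""
    return utc_time + timedelta(hours=utc_offset_hours)


utc = datetime(2023, 1, 1, 12, 0)
offset = 5.5  # a half-hour offset, as a few sites have

exact = to_local(utc, offset)  # keeps the half hour
truncated = to_local(utc, int(offset))  # int() drops the fractional part

print(exact, truncated, exact - truncated)
```

With the float offset the local time keeps its half hour; after integer conversion the result is off by 30 minutes.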
Resolves #129 and makes some progress towards #111.