Unnecessarily Large Data Request #74
Comments
As you can see, xarray is calling pydap, which then does the fetch. I don't think there's anything we can do about how that happens at the intake level. You might want to cross-post on xarray or pydap, though. Perhaps someone can chime in here on how siphon does this differently. Since the target file is, ultimately, just a netCDF file, you could pass its URL or an open fsspec file directly to xarray (presumably with engine "h5netcdf"), or kerchunk it.
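The "open fsspec file" route suggested above could look roughly like the sketch below. It is untested against the RDA server and rests on several assumptions: that the THREDDS server also exposes the plain HTTP `fileServer` service next to `dodsC`, that it honours byte-range requests, and that the files are netCDF-4/HDF5 (otherwise "h5netcdf" cannot read them). The helper names `dodsc_to_fileserver` and `open_over_http` are made up for illustration.

```python
def dodsc_to_fileserver(url: str) -> str:
    """Turn a THREDDS OPeNDAP URL into the matching plain-HTTP download URL.

    Assumes the server enables the HTTPServer service under /fileServer/.
    """
    if "/dodsC/" not in url:
        raise ValueError(f"not a THREDDS OPeNDAP URL: {url}")
    return url.replace("/dodsC/", "/fileServer/", 1)


def open_over_http(file_url: str, **xarray_kwargs):
    """Open a remote netCDF-4 file lazily through fsspec and h5netcdf."""
    # Imported here so the URL helper above works without these packages.
    import fsspec
    import xarray as xr

    # fsspec.open() returns an OpenFile; .open() gives the file-like object.
    remote_file = fsspec.open(file_url, mode="rb").open()
    return xr.open_dataset(remote_file, engine="h5netcdf", **xarray_kwargs)
```

With this, only the byte ranges that h5netcdf asks for are fetched, so a later `.isel(...)` followed by `.load()` should not pull a whole variable, provided the server supports range requests.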
Since it's DAP, you can also try to make sure xarray is using the netcdf4 engine, since netCDF-C is usually compiled with DAP support.
I tried the following variations:

Variation 1

    catalog_u = intake.open_thredds_merged(wrf_url, driver='netcdf',
                                           path=['*_U_2006060*'])
    catalog_v = intake.open_thredds_merged(wrf_url, driver='netcdf',
                                           path=['*_V_2006060*'])

Output
Variation 2

    catalog_u = intake.open_thredds_merged(wrf_url,
                                           xarray_kwargs={'engine': 'netcdf4'},
                                           path=['*_U_2006060*'])
    catalog_v = intake.open_thredds_merged(wrf_url,
                                           xarray_kwargs={'engine': 'netcdf4'},
                                           path=['*_V_2006060*'])

Output
Variation 3

    catalog_u = intake.open_thredds_merged(wrf_url,
                                           xarray_kwargs={'engine': 'h5netcdf'},
                                           path=['*_U_2006060*'])
    catalog_v = intake.open_thredds_merged(wrf_url,
                                           xarray_kwargs={'engine': 'h5netcdf'},
                                           path=['*_V_2006060*'])

Output
I'll have to look into some of the other methods that were suggested, but this is not turning out to be as straightforward as I had hoped.
I'm not sure if this is a bug report, feature request, or user error. I'm trying to access a giant dataset from the NCAR RDA in a smart way (only downloading what's necessary for the calculation), but a large data request is made anyway that exceeds the server's 500 MB limit.
Here's my code:
This fails with
because the data request is too large.
Folks at NCAR tell me the request comes across as
essentially pulling an entire variable.
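To see why pulling an entire variable trips the limit while a subset would not, here is a rough size estimate. The array shape is hypothetical (it is not taken from the actual RDA dataset), the values are assumed to be 4-byte floats, and the 500 MB limit is assumed to mean 500 * 10**6 bytes of uncompressed data.

```python
from math import prod

# Assumption: the server counts 500 MB as 500 * 10**6 uncompressed bytes.
SERVER_LIMIT_BYTES = 500 * 10**6


def request_bytes(shape, itemsize=4):
    """Uncompressed size of a hyperslab with the given shape (float32 by default)."""
    return prod(shape) * itemsize


def fits_limit(shape, itemsize=4, limit=SERVER_LIMIT_BYTES):
    """True if a request of this shape stays under the server limit."""
    return request_bytes(shape, itemsize) <= limit


# Hypothetical (time, level, y, x) shape for one whole variable.
full_variable = (240, 50, 300, 300)
# The same variable restricted to one time step and one level.
one_slice = (1, 1, 300, 300)

print(request_bytes(full_variable), fits_limit(full_variable))
print(request_bytes(one_slice), fits_limit(one_slice))
```

Under these assumptions the whole variable is several gigabytes while a single time/level slice is well under a megabyte, so the failure depends entirely on whether the subsetting is applied before or after the fetch.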
Is what I'm trying to do supposed to work?
I can use siphon directly w/o issue:
but in that case I don't have all the xarray niceties w/o extra work.