You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is this issue already tracked somewhere, or is this a new report?
I've reviewed existing issues and couldn't find a duplicate for this problem.
Current Behavior
I seem to be having an issue accessing LAADS data with earthaccess 0.12.0. The "Data" field of the granule returns only the "https" link but these data are in the Cloud according to earthdata search.
Expected Behavior
I would expect "Data" in the granule to return the S3 path link.
Steps To Reproduce
In Jupyter Notebook:
import earthaccess
from pprint import pprint
import boto3
auth = earthaccess.login(persist=True)
granules = earthaccess.search_data(concept_id = 'C2859273114-LAADS', temporal = ('2019-09-26','2019-09-27'))
if (boto3.client('s3').meta.region_name == 'us-west-2'):
print("found US-West-2")
else:
print("US-West-2 not found")
print(granules[0])
@meteodave Thanks for reporting this. I don't think this is unique to your LAADS example, as I see the same results when searching for an ICESat-2 collection in the cloud. I believe that the search_data results are only grabbing the first data access URL, which would be the HTTPS link in this case. @betolink does that sound right to you? Regardless, the s3 URL should still be found and utilized when using earthaccess.open().
So, I don't know if this is truly a bug versus an enhancement that we need to make to search_data() to provide all data access URLs that exist for the granule results, including s3.
Thanks for stopping by the poster @meteodave it was great meeting you in person! Not a full answer but some clarifications, the data_links() method to defaults to "out-of-region" for the representation, this means we'll always see the output you're seeing, which perhaps is a bug! Internally however if we use .download(granules) or .open(granules) it will check if we are in-region... which is also tricky as some instances and frameworks hide the required metadata to know if we are in us-west-2 or not.
We are having conversations around what should be the default, the best option so far is to assume that we are in the cloud and try the s3:linksif they are reachable. As for the representation, we may need to change the default to follow the same logic or even show both like:
Is this issue already tracked somewhere, or is this a new report?
Current Behavior
I seem to be having an issue accessing LAADS data with earthaccess 0.12.0. The "Data" field of the granule returns only the "https" link but these data are in the Cloud according to earthdata search.
Expected Behavior
I would expect "Data" in the granule to return the S3 path link.
Steps To Reproduce
In Jupyter Notebook:
Output:
Environment
Additional Context
No response
The text was updated successfully, but these errors were encountered: