Support STAC Data sources #316

Open
jacobbieker opened this issue May 28, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@jacobbieker
Member

STAC is increasingly used for datasets that could be useful to OCF and to datapipes in general. The Clay EO foundation model uses STAC to fetch its satellite imagery, and more and more STAC catalogs are being added to Source Cooperative, Planetary Computer, etc., which might be helpful for OCF.

Detailed Description

https://github.com/Clay-foundation/stacchip generates image chips from STAC catalogs. From what I can tell, they preprocess the data into batches first rather than streaming it in, but the flexibility of their batches might be worth trying. It's similar to NumpyBatch, but more flexible.
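To make the "preprocess into chips first" idea concrete, here is a minimal sketch of cutting one image array into fixed-size chips ahead of time. It is not stacchip's API: `make_chips`, the `pixels` and `chip_index` keys, and the `(bands, height, width)` layout are all assumptions made for illustration.

```python
import numpy as np


def make_chips(image, chip_size):
    """Split a (bands, height, width) array into non-overlapping square chips.

    Hypothetical helper, not part of stacchip. Edge pixels that do not
    fill a whole chip are dropped.
    """
    _, height, width = image.shape
    rows = height // chip_size
    cols = width // chip_size
    chips = []
    for r in range(rows):
        for c in range(cols):
            y, x = r * chip_size, c * chip_size
            chips.append(
                {
                    # Slice out one chip across all bands.
                    "pixels": image[:, y : y + chip_size, x : x + chip_size],
                    # Row/column position of the chip within the source image.
                    "chip_index": (r, c),
                }
            )
    return chips


# A toy 3-band, 5x6 image stands in for a raster read from a STAC asset.
image = np.arange(3 * 5 * 6).reshape(3, 5, 6)
chips = make_chips(image, chip_size=2)
```

Each chip is already a plain dictionary here, so chips from different sources could carry different metadata without changing the container type.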

Context

Clay is a foundation model being built to be flexible with a variety of inputs: currently mostly imagery, both satellite and aerial, but also topographic maps. The model can handle an arbitrary number of bands being available or unavailable, as well as different resolutions, using some cool tricks from https://github.com/zhu-xlab/DOFA and the flexibility of the transformer. Their data loading has many of the same problems that datapipes is supposed to handle, although mostly using STAC and GeoTIFF rather than Zarr, so there could be some interesting things to look at.

Possible Implementation

Rather than a NumpyBatch with Enums that we have to track, we could just use a normal dictionary, like Clay does.
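A rough sketch of what a plain-dictionary batch could look like, assuming per-sample dicts with string keys. The key names (`pixels`, `platform`, `gsd`) and the `collate` helper are hypothetical and are not taken from Clay or from the existing NumpyBatch code.

```python
import numpy as np


def collate(samples):
    """Combine a list of per-sample dicts into one batch dict.

    Hypothetical helper: array values are stacked along a new leading
    axis, everything else is gathered into a list. Keys are taken from
    the first sample, so all samples are assumed to share the same keys.
    """
    batch = {}
    for key in samples[0]:
        values = [sample[key] for sample in samples]
        if isinstance(values[0], np.ndarray):
            batch[key] = np.stack(values)
        else:
            batch[key] = values
    return batch


# Two toy samples; adding a new field only means adding a new key.
samples = [
    {"pixels": np.zeros((4, 8, 8), dtype=np.float32), "platform": "sentinel-2", "gsd": 10.0},
    {"pixels": np.ones((4, 8, 8), dtype=np.float32), "platform": "sentinel-2", "gsd": 10.0},
]
batch = collate(samples)
```

The trade-off is that string keys lose the typo protection and discoverability that Enum keys give, so a new data source can add fields freely but nothing checks that consumers spell them the same way.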

@jacobbieker jacobbieker added the enhancement New feature or request label May 28, 2024