GTC-2683 Upgrade to GDAL 3.8.3 and Miniconda3 #242
Merged
GTC-2683 Upgrade to GDAL 3.8.3.
This allows upgrading to Miniconda3 as well (GTC-2774), which is much more recent than the old Miniconda that we've been using and seems to run quite a bit faster. GDAL 3.8.3 is also compatible with EMR-serverless, if we want to use it for certain jobs.
Created a new Dockerfile, ci/Dockerfile, for the Docker image that runs the GitHub CI tests, since there is no quay.io image (what we were using previously) for GDAL 3.8.3. Changed .github/workflows/ci.yaml to use this new Docker image (which I built and uploaded separately).
Removed the top-level Dockerfile and entrypoint.sh, which are very old versions of what is needed to run the analyses in batch jobs. The current versions of these are in gfwpro-scheduler:src/docker.
Added some info in README.md about various files, including sbt, ci/Dockerfile, and scripts/gdal.sh.
Added new geotrellisGdalWarp dependency, needed for the upgraded GDAL.
Includes the new scripts/gdal.sh, which is the bootstrap script needed for EMR runs with GDAL 3.8.3. It uses Miniconda3 and avoids using the default anaconda repository. It seems to run in about 1 minute, whereas the old script took roughly 4 minutes.
Now prints all environment variables when starting up Geotrellis, as a way to debug various startup/configuration problems.
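For reference, the environment dump can be sketched roughly as below. This is a minimal illustration, not the code in this PR: the object name `EnvDump` and the helper `formatEnv` are hypothetical, and sorting the keys is an assumption made to keep the log output easy to scan. Only `sys.env` from the Scala standard library is relied on. Note that a dump like this may write secrets (tokens, credentials) to the logs.

```scala
object EnvDump {
  // Hypothetical helper: render environment variables as "KEY=value"
  // lines, sorted by key so successive log dumps are easy to compare.
  def formatEnv(env: Map[String, String]): Seq[String] =
    env.toSeq.sortBy(_._1).map { case (k, v) => s"$k=$v" }

  def main(args: Array[String]): Unit = {
    // sys.env is the standard-library, read-only view of the
    // process environment as an immutable Map[String, String].
    formatEnv(sys.env).foreach(println)
  }
}
```

Keeping the formatting in a pure function makes it possible to check the output without depending on the real environment of the machine running the job.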