Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

export_workspace: allow setting the region when configuring the workspaces #913

Open
VictorVerhaert opened this issue Oct 22, 2024 · 7 comments
Assignees

Comments

@VictorVerhaert
Copy link

Check what region is used for setting up the s3 client to a bucket and allow for setting the region specific for export_workspace.
Otherwise the same workspaces that work on CDSE won't work on OTC which has a different default s3 endpoint

@JeroenVerstraelen
Copy link
Contributor

region = endpoint

Option 1:

  • Use the same bucket on OTC and WAW3-1
  • Bucket credentials given in WAW3-1 should accessible in OTC

Option 2:

  • Create new bucket on OTC
  • Merge buckets over time

@pvbouwel
Copy link

pvbouwel commented Nov 14, 2024

In the draft standard for workspaces (https://github.com/Open-EO/openeo-api/blob/draft/extensions/workspaces/openapi.yaml#L41) each workspace has a workspace provider. This workspace provider could determine which region (endpoint) should be used.

@pvbouwel
Copy link

pvbouwel commented Nov 20, 2024

This relates to MVP1 of https://confluence.vito.be/pages/viewpage.action?spaceKey=EP&title=OpenEO+S3+access

It requires:

@pvbouwel
Copy link

It seems we do not yet have a way to configure config files for a job execution in geopyspark yet. Most of the time environment variables are used. There are already config maps that are created for a spark application (like prometheus config) but these seem to be managed by the spark operator. It seems we need one for ourselves as well to store:

  • the token file
  • the aws profiles
  • bucket to profile mappings (future)

@pvbouwel
Copy link

First add public and private keys to each of the environments (CDSE) as that will be a pre-requisite for the web identity tokens

@pvbouwel
Copy link

pvbouwel commented Nov 22, 2024

Public and private keys for IDP are staged into vault. For each environment I created a new version of the vault object we keep but I added the 2 fields that are generated using:

echo -e "y\n\n\n" | ssh-keygen -t rsa -b 2048 -m PEM -f rsa &&ssh-keygen -f rsa -e -m PEM > rsa.pub && echo -e "  \"idp_private_key\": \"$(cat rsa | tr '\n' '@' | sed 's/@/\\n/g' | sed 's/\\n$//'| sed 's/\\n/\n/'g  | base64 | tr -d '\n')\",\n  \"idp_public_key\": \"$(cat rsa.pub | tr '\n' '@' | sed 's/@/\\n/g' | sed 's/\\n$//' | sed 's/\\n/\n/'g | base64 | tr -d '\n')\","

@pvbouwel
Copy link

Deployed a build-version of the STS and s3 proxy with the token trust to cdse-staging but when I run the artifacts upload workflow it fails if I don´t specify the region explicitly. So that would be a regression must fix that first.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants