How to read delta lake table via an s3 bucket #1165
As seen from the documentation here, it is possible to use the … But I have not been able to figure out how to open a delta lake table on s3. The documentation for …
I tried to pass in a URL to the s3 bucket, since the documentation says the storage back-end will be inferred. But doing this …
I had a feeling this should be trivial, but unfortunately it has not been :( Does anyone know how to get open_table to open a table located on an s3 bucket?
Replies: 2 comments 6 replies
Hi @finlaydotb,

Working against an emulated service requires a bit more configuration, and we should update the docs to reflect that. Specifically, you can either set some configuration in the environment, or pass it explicitly by using open_table_with_storage_options. Looking at your example code, it would likely look something like this.

    let storage_options = HashMap::from([
        ("allow_http", "true"),
        ("endpoint_url", "http://127.0.0.1:9000"),
        ("access_key_id", "some id"),
        ("access_key_secret", "some secret")
    ]);

The actual error you are seeing is a different one though. Essentially, the underlying object store tries to use the metadata endpoint to fetch an access token, which is the default if no credential is provided. If the local service allows anonymous access, an empty key id / secret should still be provided, so that the metadata endpoint is not queried.

If you want to set the config in the environment, you can use the same keys as above, just capitalized and prefixed with … Last but not least, the URL should be in the form of …
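
For the anonymous-access case described above, a minimal sketch of building the options map might look like the following. It uses only the standard library so it runs on its own. `emulated_s3_options` is a hypothetical helper, not part of the deltalake crate, and the key names are copied verbatim from the reply, so they are worth checking against the configuration keys your installed version documents. The sketch also assumes the options are passed as a `HashMap<String, String>`.

```rust
use std::collections::HashMap;

/// Hypothetical helper: collect the storage options for an S3-compatible
/// emulator. Key names are taken from the reply above, not verified here.
fn emulated_s3_options(endpoint: &str, key_id: &str, secret: &str) -> HashMap<String, String> {
    HashMap::from([
        // The local endpoint in the reply is plain HTTP, so HTTP is allowed.
        ("allow_http".to_string(), "true".to_string()),
        ("endpoint_url".to_string(), endpoint.to_string()),
        // Empty strings are still inserted, so that a credential is present
        // and the metadata endpoint is not queried for a token.
        ("access_key_id".to_string(), key_id.to_string()),
        ("access_key_secret".to_string(), secret.to_string()),
    ])
}

fn main() {
    // Anonymous access: empty key id and secret, but both keys are set.
    let options = emulated_s3_options("http://127.0.0.1:9000", "", "");
    for (key, value) in &options {
        println!("{key} = {value:?}");
    }

    // With the deltalake crate and an async runtime available, the map would
    // then be handed over roughly like this (table_uri is a placeholder):
    // let table = deltalake::open_table_with_storage_options(table_uri, options).await?;
}
```

Keeping the map construction in one small function makes it easy to swap in real credentials later without touching the call that opens the table.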
In case anyone needs this: we also need to call …