Enhance the metadata reading during the Copy From scenarios #3196

WenyXu · 2024-01-19T04:06:57Z

What type of enhancement is this?

Performance

What does the enhancement do?

The current file readers are designed around synchronous readers, which typically support relatively cheap seek operations. However, seeking on streamed bytes (e.g., bytes read from S3) can be costly, because each seek may send a new GET request to S3.
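One possible way to cut the number of requests for Parquet is to prefetch the tail of the file in a single ranged read. A Parquet file ends with a 4-byte little-endian metadata length followed by the 4-byte magic `PAR1`, so if the prefetched tail already covers the metadata, no second request is needed. The following is a minimal sketch of that idea only; `CountingStore`, `get_suffix`, and `read_metadata_bytes` are hypothetical names for illustration, not the existing reader or any object-store API.

```rust
/// Hypothetical stand-in for an object store that counts ranged GETs.
struct CountingStore {
    data: Vec<u8>,
    requests: usize,
}

impl CountingStore {
    fn new(data: Vec<u8>) -> Self {
        Self { data, requests: 0 }
    }

    /// Return the last `n` bytes (or the whole object if it is shorter).
    /// Each call models one GET request with a suffix range.
    fn get_suffix(&mut self, n: usize) -> Vec<u8> {
        self.requests += 1;
        let start = self.data.len().saturating_sub(n);
        self.data[start..].to_vec()
    }
}

const MAGIC: &[u8; 4] = b"PAR1";
const FOOTER_LEN: usize = 8; // 4-byte metadata length + 4-byte magic

/// Fetch the serialized metadata bytes, prefetching `hint` bytes of tail.
/// Only issues a second request when the metadata is larger than the hint.
fn read_metadata_bytes(store: &mut CountingStore, hint: usize) -> Result<Vec<u8>, String> {
    let tail = store.get_suffix(hint.max(FOOTER_LEN));
    if tail.len() < FOOTER_LEN {
        return Err("file too small to be Parquet".to_string());
    }
    let footer = &tail[tail.len() - FOOTER_LEN..];
    if footer[4..] != MAGIC[..] {
        return Err("missing PAR1 magic".to_string());
    }
    let meta_len = u32::from_le_bytes([footer[0], footer[1], footer[2], footer[3]]) as usize;
    let needed = meta_len + FOOTER_LEN;

    // Re-fetch only if the prefetched tail does not cover the metadata.
    let tail = if needed > tail.len() {
        let bigger = store.get_suffix(needed);
        if bigger.len() < needed {
            return Err("metadata length exceeds file size".to_string());
        }
        bigger
    } else {
        tail
    };

    let end = tail.len() - FOOTER_LEN;
    Ok(tail[end - meta_len..end].to_vec())
}
```

With a large enough hint, the metadata is obtained in one request; with a hint of only 8 bytes, the sketch falls back to two. Choosing the hint size is a trade-off between over-fetching and extra round trips.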

I checked the CSV and JSON formats, and their readers only require the Read trait to infer a schema. Therefore, we might only need to care about the Parquet and ORC formats.
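To illustrate why a plain `Read` bound is enough for text formats, here is a small sketch that extracts CSV column names by consuming the stream front to back. `header_columns` is a hypothetical helper, not the actual schema inference code, and it deliberately ignores quoted fields.

```rust
use std::io::{BufRead, BufReader, Read};

/// Read only the header line of a CSV stream and return the column names.
/// The bound is `Read`, not `Read + Seek`: nothing here needs to jump around.
/// Simplification: splits on ',' and does not handle quoted fields.
fn header_columns<R: Read>(reader: R) -> std::io::Result<Vec<String>> {
    let mut line = String::new();
    BufReader::new(reader).read_line(&mut line)?;
    let line = line.trim_end_matches(|c| c == '\r' || c == '\n');
    if line.is_empty() {
        return Ok(Vec::new());
    }
    Ok(line.split(',').map(|s| s.trim().to_string()).collect())
}
```

Because the header sits at the start of the stream, a streamed S3 body can be passed in directly. Parquet and ORC differ because their metadata lives at the end of the file.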

Implementation challenges

No response
