You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To batch read files from file system(s3, gcs, etc.), RisingWave supports both select ... from source and table function file_scan, the difference between them is that select ... from source requires the user to define the schema in the create source statement, while file_scan is simpler for batch queries as it directly uses a table function without the need to define a schema.
Take batch query parquet file as an example
select ... from source
CREATE source s3_parquet(
id int,
age int,
)
WITH (
connector = 's3_v2',
match_pattern = '*.parquet',
s3.region_name = 'xxx',
s3.bucket_name = 'xxx',
s3.credentials.access = 'xxx',
s3.credentials.secret = 'xxx',
s3.endpoint_url = 'xxx',
) FORMAT PLAIN ENCODE PARQUET;
select * from s3_parquet;
SELECT
id,
age
FROM file_scan(
'parquet',
's3',
'region',
'endpoint',
'xxx',
)
Users can choose different query methods according to their needs.
Currently, file_scan only supports Parquet files on S3 and is not fully developed; there are areas that can be improved.
Support more encode type json, csv encode), and automatic schema mapping.
Support different object store engine(s3, gcs, azblob).
more user-friendly syntax.
The text was updated successfully, but these errors were encountered:
To batch read files from file system(s3, gcs, etc.), RisingWave supports both
select ... from source
and table functionfile_scan
, the difference between them is thatselect ... from source
requires the user to define the schema in the create source statement, whilefile_scan
is simpler for batch queries as it directly uses a table function without the need to define a schema.Take batch query parquet file as an example
select ... from source
select * from s3_parquet;
Users can choose different query methods according to their needs.
Currently,
file_scan
only supports Parquet files on S3 and is not fully developed; there are areas that can be improved.The text was updated successfully, but these errors were encountered: