Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VTX-3411: Sync master with upstream #49

Closed
wants to merge 1,415 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1415 commits
Select commit Hold shift + click to select a range
8102911
Take kernel dyn Array (#4705)
tustvold Aug 17, 2023
31c81c5
Fix nightly tests (#4709)
tustvold Aug 17, 2023
f0200db
Add AnyDictionary Abstraction and Take ArrayRef in DictionaryArray::w…
tustvold Aug 17, 2023
a19ff2b
Tweak docs (#4711)
tustvold Aug 17, 2023
b810e8f
Support Field ID in ArrowWriter (#4702) (#4710)
tustvold Aug 17, 2023
8bbb5c1
Datum based comparison kernels (#4596) (#4701)
tustvold Aug 18, 2023
1afc7c3
Update parquet object_store 0.7 (#4715)
tustvold Aug 18, 2023
2c487d0
Parquet doc tweaks (#4680)
tustvold Aug 18, 2023
23db567
Cleanup redundant link targets (#4719)
tustvold Aug 21, 2023
bce0b41
Add distinct kernels (#960) (#4438) (#4716)
tustvold Aug 21, 2023
90449ff
Prepare Arrow 46.0.0 (#4720)
tustvold Aug 21, 2023
d9381c6
Add `IpcError` variant to replace some uses of `IoError`that don't ha…
alexandreyc Aug 24, 2023
dfb1ea2
Fix new clippy lints (#4734)
tustvold Aug 24, 2023
f2ba18b
Add `with_proxy_ca_certificate` and `with_proxy_excludes` (#4714)
gordonwang0 Aug 25, 2023
f332e23
chore: fix libpython in CI (#4738)
crepererum Aug 25, 2023
2fe9ef1
Cleanup length and bit_length kernels (#4718)
tustvold Aug 25, 2023
4533271
feat: expose DoGet response headers & trailers (#4727)
crepererum Aug 25, 2023
221f5d2
Datum based like kernels (#4595) (#4732)
tustvold Aug 25, 2023
4888dbf
Clear row buffer before reuse (#4742)
yjshen Aug 27, 2023
cb793a5
Update nix requirement from 0.26.1 to 0.27.1 in /object_store (#4744)
viirya Aug 28, 2023
32e973d
Chrono deprecations (#4748)
tustvold Aug 29, 2023
735f48d
return error (#4752)
wjones127 Aug 30, 2023
eeba0a3
Relax constraints on PyArrowType (#4757)
tustvold Aug 31, 2023
4927c1e
Implement PyArrowType for `Box<dyn RecordBatchReader + Send>` (#4751)
wjones127 Sep 1, 2023
6e28c03
Make ObjectStore::copy Atomic and Automatically Create Parent Directo…
tustvold Sep 2, 2023
611b129
Remove unused dyn_cmp_dict feature (#4766)
tustvold Sep 3, 2023
587250c
fix: avoid panic if offset index not exists. (#4761)
RinChanNOWWW Sep 4, 2023
b66c57c
Bump actions/checkout from 3 to 4 (#4767)
dependabot[bot] Sep 4, 2023
db5314c
Fix List Sorting, Revert Removal of Rank Kernels (#4747)
tustvold Sep 5, 2023
65edbb1
Re-encode dictionaries in selection kernels (#3558)
tustvold Sep 5, 2023
0847c9a
Update object_store chrono deprecations (#4786)
tustvold Sep 7, 2023
6fdbc26
Make coalesce_ranges and collect_bytes available to the users (#4784)
sumerman Sep 7, 2023
83390ed
Fix DictionaryArray::normalized_keys (#4788) (#4789)
tustvold Sep 7, 2023
2fe71ca
Allow custom tree builder for parquet::record::RowIter (#4783)
yuri-rs Sep 7, 2023
dd0c4ab
Relaxing type bounds on coalesce_ranges and collect_bytes (#4787)
sumerman Sep 7, 2023
15dde87
Re-export array crate root (#4780) (#4779) (#4791)
tustvold Sep 7, 2023
1e46f8f
Best effort cleanup of staged upload files (#4778) (#4792)
tustvold Sep 8, 2023
878217b
Add docstring and example to `Scalar` (#4793)
alamb Sep 8, 2023
b4997bc
Improved csv_reader benchmarks with smaller integers (#4803)
tustvold Sep 10, 2023
77455d4
fix: entries field is non-nullable (#4808)
wjones127 Sep 10, 2023
2075cd1
csv: Add option to specify custom null values (#4795)
vrongmeal Sep 13, 2023
229bf8b
ObjectStore Wasm32 Fixes (#4775) (#4776) (#4796)
tustvold Sep 14, 2023
7355e83
Update proc-macro2 requirement from =1.0.66 to =1.0.67 (#4816)
dependabot[bot] Sep 14, 2023
d2be733
More chrono deprecations (#4822)
tustvold Sep 15, 2023
d960379
Do not check schema for equality in `concat_batches` (#4815)
alamb Sep 16, 2023
80b0888
fix: export record batch through stream (#4806)
wjones127 Sep 17, 2023
b64e362
Adaptive Row Block Size (#4812) (#4818)
tustvold Sep 17, 2023
9cb4a75
Stateless Row Encoding / Don't Preserve Dictionaries in `RowConverter…
tustvold Sep 17, 2023
7b78531
Update chrono pin (#4824)
tustvold Sep 17, 2023
a6dffca
Update prost (#4825)
tustvold Sep 17, 2023
95ee5d3
feat: FixedSizeBinaryArray::value_data return reference (#4821)
wjones127 Sep 18, 2023
175c776
Fix typo in docstring (#4826)
harryscholes Sep 18, 2023
33b881d
Fix like scalar null (#4832)
tustvold Sep 18, 2023
47e8a8d
Fix flight sql do put handling, add bind parameter support to FlightS…
suremarc Sep 18, 2023
f7464bc
Fix merge_dictionary_values in selection kernels (#4833)
tustvold Sep 19, 2023
e214d6b
Respect FormatOption::nulls for NullArray (#4836)
tustvold Sep 19, 2023
1d6feea
Prepare arrow 47.0.0 (#4827)
tustvold Sep 19, 2023
a03ce56
fix: object store http header last modified (#4834)
universalmind303 Sep 19, 2023
407e575
Error if Remote Ignores HTTP Range Header (#4841)
universalmind303 Sep 20, 2023
f9cd26f
Refine documentation to `Array::is_null` (#4838)
alamb Sep 20, 2023
8465ed4
Update tonic-build requirement from =0.10.0 to =0.10.1 (#4846)
dependabot[bot] Sep 22, 2023
1de21d2
fix: make_primitive_scalar bug (#4852)
JasonLi-cn Sep 23, 2023
431be3f
Perform HEAD request for HttpStore::head (#4837)
tustvold Sep 23, 2023
6d5d7e3
Minor: Improve object_store docs.rs landing page (#4849)
alamb Sep 23, 2023
72a2dab
Allow Constructing Non-Empty StructArray with no Fields (#4842) (#4845)
tustvold Sep 25, 2023
b35511d
Allow overriding azure endpoint (#4853) (#4854)
tustvold Sep 25, 2023
7e7ac15
fix: add missing precision overflow checking for `cast_string_to_deci…
jonahgao Sep 25, 2023
74e2c5c
Make ArrowRowGroupWriter Public and SerializedRowGroupWriter Send (#4…
devinjdangelo Sep 25, 2023
2c9e2e9
Add ObjectStore BufReader (#4762) (#4857)
tustvold Sep 25, 2023
fbd9008
Faster Serde Integration (~80% faster) (#4861)
tustvold Sep 26, 2023
4ef7917
Prepare object_store 0.7.1 (#4860)
tustvold Sep 26, 2023
8a07be3
Make align_buffers as public API (#4863)
viirya Sep 27, 2023
b600e20
Enable new integration tests (#4828) (#4862)
tustvold Sep 27, 2023
786760a
Flush in multiple_append test (#4868) (#4869)
tustvold Sep 27, 2023
62bb64c
Check precision overflow for casting floating to decimal (#4866)
viirya Sep 27, 2023
284556e
feat: improve flight CLI error handling (#4873)
crepererum Sep 28, 2023
471f6dd
Flush in creates_dir_if_not_present_append (#4872) (#4874)
tustvold Sep 28, 2023
0e04757
Update tonic-build requirement from =0.10.1 to =0.10.2 (#4881)
dependabot[bot] Sep 29, 2023
3ac0053
Support Encoding Parquet Columns in Parallel (#4871)
tustvold Oct 1, 2023
8c495b6
parquet: Read field IDs from Parquet Schema (#4878)
Samrose-Ahmed Oct 1, 2023
3b0ede4
Replace lz4 with lz4_flex Allowing Compilation for WASM (#4884)
tustvold Oct 2, 2023
d941ff1
Support parquet bloom filter length (#4885)
letian-jiang Oct 2, 2023
39e4d94
Update ring requirement from 0.16 to 0.17 in /object_store (#4887)
dependabot[bot] Oct 2, 2023
4320a75
Implement Take for UnionArray (#4883)
avantgardnerio Oct 2, 2023
f0455d1
Support Parsing Avro File Headers (#4888)
tustvold Oct 4, 2023
97a82c0
Fix integration tests (#4889)
tustvold Oct 4, 2023
208da03
mark OnCloseRowGroup Send (#4893)
devinjdangelo Oct 5, 2023
2214fda
fix(arrow-json)!: include null fields in schema inference with a type…
kskalski Oct 5, 2023
2c4bc54
Upgrade to Rust 1.73.0 (#4899)
carols10cents Oct 6, 2023
431df70
Update proc-macro2 requirement from =1.0.67 to =1.0.68 (#4900)
dependabot[bot] Oct 9, 2023
c791128
Update proc-macro2 requirement from =1.0.68 to =1.0.69 (#4907)
dependabot[bot] Oct 9, 2023
ed58e76
chore: add csv example (#4904)
fansehep Oct 9, 2023
2af5163
Allow merge of Null to any datatype. (#4902)
kskalski Oct 9, 2023
16f5905
feat: log headers/trailers in flight CLI (+ minor fixes) (#4898)
crepererum Oct 10, 2023
c6387c1
fix(csv)!: infer null for empty column. (#4910)
kskalski Oct 10, 2023
538a7bf
Specialize Thrift Decoding (~40% Faster) (#4891) (#4892)
tustvold Oct 10, 2023
d3e1302
Update regex-syntax requirement from 0.7.1 to 0.8.0 (#4914)
dependabot[bot] Oct 10, 2023
4aabd2c
feat: document & streamline flight SQL CLI (#4912)
crepererum Oct 10, 2023
181cb3d
Minor: Clarify rationale for FlightDataEncoder API, add examples (#4916)
alamb Oct 11, 2023
d83008b
Cleanup `object_store::retry` client error handling (#4915)
tustvold Oct 11, 2023
556c5ff
Cleanup CSV WriterBuilder, Default to AutoSI Second Precision (#4735)…
tustvold Oct 11, 2023
d5a655d
Add option to `FlightDataEncoder` to always resend batch dictionaries…
alexwilcoxson-rel Oct 12, 2023
11205a8
Add AWS presigned URL support (#4876)
carols10cents Oct 12, 2023
0503d65
Update zstd requirement from 0.12.0 to 0.13.0 (#4923)
dependabot[bot] Oct 12, 2023
6e49f31
feat: add method for async read bloom filter (#4917)
hengfeiyang Oct 12, 2023
90bc5ec
Support Arbitrary JSON values in JSON Reader (#4905) (#4911)
tustvold Oct 12, 2023
bb8e42f
Add GetOptions::head (#4931)
tustvold Oct 15, 2023
57cd094
Allow opting out of request signing (#4927) (#4929)
tustvold Oct 16, 2023
31bc84c
Default connection and request timeouts of 5 seconds (#4928)
tustvold Oct 16, 2023
4a23ab9
Update pyo3 requirement from 0.19 to 0.20 (#4941)
crepererum Oct 16, 2023
69c9375
Support service_account in ApplicationDefaultCredentials and Use Self…
tustvold Oct 16, 2023
ce2a958
Add `FileWriter` schema getter (#4940)
haixuanTao Oct 16, 2023
95b015c
Evaluate null_regex for string type in csv (now such values will be p…
haohuaijin Oct 16, 2023
ab87abd
Generate `ETag`s for `InMemory` and `LocalFileSystem` (#4879) (#4922)
tustvold Oct 17, 2023
d4d11fe
Assume Pages Delimit Records When Offset Index Loaded (#4921) (#4943)
tustvold Oct 17, 2023
fa7a61a
Remove Nested async and Fallibility from ObjectStore::list (#4930)
tustvold Oct 17, 2023
511ac44
Fix object_store docs (#4947)
tustvold Oct 17, 2023
952cd2e
Expose SubstraitPlan structure in arrow_flight::sql (#4932) (#4933)
amartins23 Oct 17, 2023
a94ccff
feat: support parsing for parquet writer option (#4938)
fansehep Oct 18, 2023
4964d84
Add `ReaderBuilder::with_header` for csv reader (#4949)
tustvold Oct 18, 2023
6e332b8
Prepare arrow 48.0.0 (#4948)
tustvold Oct 18, 2023
51ac6fe
Respect ARROW_TEST_DATA in apache-avro tests (#4950)
tustvold Oct 18, 2023
4cca029
Return `PutResult` with an ETag from ObjectStore::put (#4934) (#4944)
tustvold Oct 19, 2023
62ca5f3
Split aws Module (#4953)
tustvold Oct 19, 2023
a425e7e
Split azure Module (#4954)
tustvold Oct 19, 2023
efd4d19
Add module links in docs root (#4955)
tustvold Oct 19, 2023
f597d3a
Split gcp Module (#4956)
tustvold Oct 19, 2023
7e134f4
Use rustfmt default line width (#4960)
tustvold Oct 19, 2023
0b9105d
Temporarily Disable Java Integration Tests (#4957)
tustvold Oct 20, 2023
f4a2a88
Add ObjectMeta::version and GetOptions::version (#4925) (#4935)
tustvold Oct 20, 2023
03d0505
Add SchemaBuilder::remove (#4952) (#4964)
tustvold Oct 20, 2023
14d6c8d
Add arrow_cast::base64 and document usage in arrow_json (#4975)
tustvold Oct 23, 2023
e79b3bc
Support ImdsManagedIdentityProvider in Azure Functions (#4976) (#4977)
tustvold Oct 23, 2023
2f33795
Update quick-xml requirement from 0.30.0 to 0.31.0 in /object_store (…
dependabot[bot] Oct 24, 2023
4748b61
Bump actions/setup-node from 3 to 4 (#4982)
dependabot[bot] Oct 24, 2023
c90aff3
feat: support schema change by idx and reverse (#4985)
fansehep Oct 25, 2023
a33d42f
Increase default timeout to 30 seconds (#4989)
tustvold Oct 25, 2023
a6a512f
Fix pre commit (#4990)
Jefffrey Oct 25, 2023
e78d140
Add MultiPartStore (#4961) (#4608) (#4971)
tustvold Oct 25, 2023
570c91e
Support bucket name with `.` when parsing GCS URL (#4991) (#4992)
tustvold Oct 26, 2023
1708b0b
Support metadata in SchemaBuilder (#4987)
tustvold Oct 26, 2023
6b4fd2f
Don't panic on invalid Azure access key (#4972) (#4974)
tustvold Oct 26, 2023
3dbe45b
[MINOR] No need to jump to web pages (#4994)
smallzhongfeng Oct 26, 2023
b07dabe
Use new integration scripts (#4963) (#4988)
tustvold Oct 26, 2023
cd069ea
Update tracing-log requirement from 0.1 to 0.2 (#4998)
dependabot[bot] Oct 26, 2023
5601b7a
Add `Field::remove()`, `Schema::remove()`, and `RecordBatch::remove_c…
Folyd Oct 26, 2023
e3cce56
Conditional Put (#4879) (#4984)
tustvold Oct 27, 2023
e4bb1e9
Support list_with_offset for GCS (#4993)
tustvold Oct 30, 2023
11b2f5f
Object tagging (#4754) (#4999)
tustvold Oct 30, 2023
890823b
feat(flight-sql): Allow custom commands in get-flight-info (#4997)
amartins23 Oct 30, 2023
d9aaa43
Add `RecordReader` trait and proc macro to implement it for a struct …
Joseph-Rance Oct 30, 2023
cc23cac
Improve object_store docs (#4978)
tustvold Oct 30, 2023
be093cb
Simplify datetime conversion (#5006)
tustvold Oct 30, 2023
65f7be8
Return row count when inferring schema from JSON (#5008)
asayers Oct 31, 2023
94fe6bb
Remove ObjectStore::append (#5016)
tustvold Nov 1, 2023
ec788e1
Decode URL paths (#5017) (#5018)
tustvold Nov 1, 2023
7873500
ObjectStore: make error msg thrown from retry more detailed (#5012)
Rachelint Nov 1, 2023
7281a0c
Relax path safety (#5019) (#5020)
tustvold Nov 2, 2023
ab53d2d
Support onelake fabric paths in parse_url (#5000) (#5002)
tustvold Nov 2, 2023
ad211fe
Prepare object_store 0.8.0 (#5010) (#5023)
tustvold Nov 2, 2023
829708d
Verify object_store with all features (#5024)
tustvold Nov 2, 2023
e4689e6
Fix invalid_path test (#5026)
tustvold Nov 2, 2023
1807aba
chore: Update docs to refer to non deprecated function (`partition`) …
alamb Nov 2, 2023
a447bcf
doc: update comment on sort_to_indices to reflect that IEE 754 totalO…
westonpace Nov 2, 2023
7705aca
Support casting from integer to binary (#5015)
viirya Nov 3, 2023
dcbe546
Update object_store 0.8.0 (#5043)
tustvold Nov 6, 2023
91acfb0
Fix serialization of large integers (#5038) (#5042)
tustvold Nov 6, 2023
8c20c98
Cast from integer/timestamp to timestamp/integer (#5040)
viirya Nov 7, 2023
20f10dc
Fix RowSelection::intersection (#5036) (#5041)
tustvold Nov 7, 2023
ffeda62
Parquet f32/f64 handle signed zeros in statistics (#5048)
Jefffrey Nov 7, 2023
1d16937
Prepare arrow 49.0.0 (#5054)
tustvold Nov 7, 2023
747dcbf
Update parquet encoding docs (#5053)
tustvold Nov 7, 2023
1635f5b
Remove ByteBufferPtr and replace with Bytes (#5055)
Jefffrey Nov 8, 2023
0cb30bb
refactor: change `object_store` CA handling (#5056)
crepererum Nov 8, 2023
f53f284
docs: re-order `object_store` intro (#5058)
crepererum Nov 8, 2023
31b5724
Add a PR under "Breaking changes" in the object_store 0.8.0 changelog…
carols10cents Nov 10, 2023
924b6e9
IPC writer truncated sliced list/map values (#5071)
Jefffrey Nov 13, 2023
7ba36b0
Parquet: read/write f16 for Arrow (#5003)
Jefffrey Nov 13, 2023
7941577
Enable truncation of binary statistics columns (#5076)
emcake Nov 15, 2023
7fa78b7
Support multiple GZip members in parquet page (#4951)
tustvold Nov 15, 2023
4b9d789
Update itertools requirement from 0.11.0 to 0.12.0 in /object_store (…
dependabot[bot] Nov 15, 2023
aff86e7
Implement Arrow PyCapsule Interface (#5070)
kylebarron Nov 15, 2023
a3687a7
Add FFI from_raw (#5082)
tustvold Nov 17, 2023
873d277
Update prost-build requirement from =0.12.1 to =0.12.2 (#5088)
dependabot[bot] Nov 17, 2023
481652a
Fix latest clippy lints (#5090)
tustvold Nov 17, 2023
bfe396e
Ensure arrays passed to MutableArrayData have same type (#5091) (#5092)
tustvold Nov 17, 2023
dc75a28
feat: cast (Large)List to FixedSizeList (#5081)
wjones127 Nov 17, 2023
61da64a
Extend aggregation benchmarks (#5096)
jhorstmann Nov 18, 2023
6815bf1
Expand parquet crate overview doc (#5093)
mmaitre314 Nov 20, 2023
4d141a3
Allow 'zip' compute function to operate on Scalar arrays (#5086)
Nathan-Fenner Nov 20, 2023
b724849
Improve C Data Interface and Add Integration Testing Entrypoints (#5080)
pitrou Nov 20, 2023
fbbb61d
Allow writing null valued keys in JSON (#5065)
Jefffrey Nov 20, 2023
df69ef5
fix: coerce_primitive for serde decoded data (#5101)
fansehep Nov 21, 2023
06a3a2e
Fix integration tests (#5111)
tustvold Nov 22, 2023
410fcbb
Update prost-build requirement from =0.12.2 to =0.12.3 (#5112)
dependabot[bot] Nov 22, 2023
200e8c8
Implementing `ArrayBuilder` for `Box<dyn ArrayBuilder>` (#5109)
viirya Nov 22, 2023
e1bafdf
Remove empty source file (#5119)
viirya Nov 23, 2023
ef1cc38
Cleanup list casting and support nested lists (#5113) (#5124)
tustvold Nov 27, 2023
409bb81
[fix #5044] Support converting 'yyyymmdd' format to date (#5078)
Tangruilin Nov 27, 2023
d5a6cf4
Fix ObjectStore.LocalFileSystem.put_opts for blobfuse (#5094)
RobinLin666 Nov 27, 2023
435b53d
Cast from numeric/timestamp to timestamp/numeric (#5123)
viirya Nov 27, 2023
4b7405c
Improve cast docs (#5114)
tustvold Nov 27, 2023
e26fa4f
Update proc-macro2 requirement from =1.0.69 to =1.0.70 (#5131)
dependabot[bot] Nov 27, 2023
8a0b5cb
Fix negative decimal string (#5128)
viirya Nov 28, 2023
34a816d
Parquet: derive boundary order when writing (#5110)
Jefffrey Nov 28, 2023
58c80e6
Update localstack to 3.0.1 (#5028)
tustvold Nov 28, 2023
a361ce1
Fix 'ColumnPath not found' error reading Parquet files with nested RE…
mmaitre314 Nov 28, 2023
093a10e
Parquet: Make `MetadataLoader` public (#5137)
AdamGS Nov 28, 2023
c161456
Support casting of Float16 with other numeric types (#5139)
viirya Nov 28, 2023
ef6932f
JSON: write struct array nulls as null (#5133)
Jefffrey Nov 28, 2023
cfdb505
Deprecate Fields::remove Schema::remove (#5144)
tustvold Nov 29, 2023
8867a1f
Allow 403 for overwrite prevention (#5134)
emcake Nov 29, 2023
6d4b8bb
Support nested schema projection (#5148) (#5149)
tustvold Nov 29, 2023
f621d28
Parquet: omit min/max for interval columns when writing stats (#5147)
Jefffrey Nov 30, 2023
5788c69
object_store: fix failing doctest with default features (#5161)
Jefffrey Dec 5, 2023
9efaf06
Adding `is_null` datatype shortcut method (#5157)
comphead Dec 5, 2023
6b905fe
Removing ahash (#5156)
psvri Dec 5, 2023
b8d3f33
Parquet: don't truncate f16/decimal min/max stats (#5154)
Jefffrey Dec 5, 2023
f16d2f5
Parquet: write column_orders in FileMetaData (#5158)
Jefffrey Dec 5, 2023
a36bf7a
Parquet: clear metadata and project fields of ParquetRecordBatchStrea…
Jefffrey Dec 5, 2023
cb338c9
Update rustls-pemfile requirement from 1.0 to 2.0 in /object_store (#…
dependabot[bot] Dec 5, 2023
f352cdc
Bump actions/labeler from 4.3.0 to 5.0.0 (#5167)
dependabot[bot] Dec 5, 2023
46bbd7d
Removing redundant as casts (#5168)
psvri Dec 5, 2023
f4bad68
fix: ensure take_fixed_size_list can handle null indices (#5170)
westonpace Dec 6, 2023
ea3aca1
fix: Changed labeler.yml to latest format (#5172)
viirya Dec 6, 2023
32b68f7
Add LargeListBuilder to make_builder (#5171)
viirya Dec 6, 2023
95f03ac
improve: make RunArray displayable (#5166)
yukkit Dec 6, 2023
298ddfd
ci: Add cargo audit CI action (#5160)
Jefffrey Dec 6, 2023
2923243
Labeller attempt 2 (#5174)
tustvold Dec 6, 2023
1534cc1
Bump actions/setup-python from 4 to 5 (#5175)
dependabot[bot] Dec 6, 2023
490c080
Parquet: Ensure page statistics are written only when conifgured from…
AdamGS Dec 7, 2023
b06ab13
Use Total Ordering for Aggregates and Refactor for Better Auto-Vector…
jhorstmann Dec 7, 2023
d41e90e
Remove SIMD Feature (#5184)
tustvold Dec 8, 2023
93a28a5
Fixed issue where timestamp parser incorrectly accepted characters af…
razeghi71 Dec 8, 2023
c821ae7
Fix deprecated note for `Buffer::from_raw_parts` (#5190)
mbrobbel Dec 8, 2023
a43e82c
Add `BooleanArray::into_parts` method (#5191)
mbrobbel Dec 8, 2023
2a213bc
Remove ScalarBuffer from parquet (#1849) (#5177) (#5178)
tustvold Dec 8, 2023
7e28913
Simplify parquet statistics generation (#5183)
tustvold Dec 8, 2023
c5a9953
Clarify interval comparison behavior with documentation and tests (#5…
alamb Dec 8, 2023
9630aaf
Blockwise IO in IPC FileReader (#5153) (#5179)
tustvold Dec 8, 2023
1553545
impl `From<OffsetBuffer<T>>` for `ScalarBuffer<T>` (#5203)
mbrobbel Dec 11, 2023
8aa55dd
impl `From<BufferBuilder<T>>` for `Buffer` (#5202)
mbrobbel Dec 11, 2023
2a84e85
feat(object_store): use http1 by default (#5204)
wjones127 Dec 12, 2023
7fd2d42
impl `From<BufferBuilder<T>>` for `ScalarBuffer<T>` (#5201)
mbrobbel Dec 12, 2023
bc39f25
refactor: simplify cast_string_to_interval (#5195)
jackwener Dec 12, 2023
802ed42
Support quote and escape in Csv WriterBuilder (#5196)
my-vegetable-has-exploded Dec 12, 2023
d8203af
Merge remote-tracking branch 'upstream/master' into update-master
fsdvh Dec 13, 2023
7977195
fix few methods
fsdvh Dec 13, 2023
cf6c394
trigger cicd
fsdvh Dec 14, 2023
65012e6
VTX-3411: fix for etag in s3 object store (#50)
fsdvh Dec 14, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
8 changes: 7 additions & 1 deletion .asf.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -38,4 +38,10 @@ github:
# require branches to be up-to-date before merging
strict: true
# don't require any jobs to pass
contexts: []
contexts: []

# publishes the content of the `asf-site` branch to
# https://arrow.apache.org/rust/
publish:
whoami: asf-site
subdir: rust
9 changes: 3 additions & 6 deletions .gitattributes
Original file line number Diff line number Diff line change
@@ -1,6 +1,3 @@
r/R/RcppExports.R linguist-generated=true
r/R/arrowExports.R linguist-generated=true
r/src/RcppExports.cpp linguist-generated=true
r/src/arrowExports.cpp linguist-generated=true
r/man/*.Rd linguist-generated=true

parquet/src/format.rs linguist-generated
arrow-flight/src/arrow.flight.protocol.rs linguist-generated
arrow-flight/src/sql/arrow.flight.protocol.sql.rs linguist-generated
21 changes: 18 additions & 3 deletions .github/actions/setup-builder/action.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -20,8 +20,12 @@ description: 'Prepare Rust Build Environment'
inputs:
rust-version:
description: 'version of rust to install (e.g. stable)'
required: true
required: false
default: 'stable'
target:
description: 'target architecture(s)'
required: false
default: 'x86_64-unknown-linux-gnu'
runs:
using: "composite"
steps:
Expand Down Expand Up @@ -51,6 +55,17 @@ runs:
shell: bash
run: |
echo "Installing ${{ inputs.rust-version }}"
rustup toolchain install ${{ inputs.rust-version }}
rustup toolchain install ${{ inputs.rust-version }} --target ${{ inputs.target }}
rustup default ${{ inputs.rust-version }}
echo "CARGO_TARGET_DIR=/github/home/target" >> $GITHUB_ENV
- name: Disable debuginfo generation
# Disable full debug symbol generation to speed up CI build and keep memory down
# "1" means line tables only, which is useful for panic tracebacks.
shell: bash
run: echo "RUSTFLAGS=-C debuginfo=1" >> $GITHUB_ENV
- name: Enable backtraces
shell: bash
run: echo "RUST_BACKTRACE=1" >> $GITHUB_ENV
- name: Fixup git permissions
# https://github.com/actions/checkout/issues/766
shell: bash
run: git config --global --add safe.directory "$GITHUB_WORKSPACE"
11 changes: 9 additions & 2 deletions .github/dependabot.yml
Original file line number Diff line number Diff line change
Expand Up @@ -6,10 +6,17 @@ updates:
interval: daily
open-pull-requests-limit: 10
target-branch: master
labels: [auto-dependencies]
labels: [ auto-dependencies, arrow ]
- package-ecosystem: cargo
directory: "/object_store"
schedule:
interval: daily
open-pull-requests-limit: 10
target-branch: master
labels: [ auto-dependencies, object_store ]
- package-ecosystem: "github-actions"
directory: "/"
schedule:
interval: "daily"
open-pull-requests-limit: 10
labels: [auto-dependencies]
labels: [ auto-dependencies ]
200 changes: 111 additions & 89 deletions .github/workflows/arrow.yml
Original file line number Diff line number Diff line change
Expand Up @@ -18,15 +18,34 @@
# tests for arrow crate
name: arrow

concurrency:
group: ${{ github.repository }}-${{ github.head_ref || github.sha }}-${{ github.workflow }}
cancel-in-progress: true

on:
# always trigger
push:
branches:
- master
pull_request:
paths:
- arrow/**
- .github/**
- arrow-arith/**
- arrow-array/**
- arrow-buffer/**
- arrow-cast/**
- arrow-csv/**
- arrow-data/**
- arrow-integration-test/**
- arrow-ipc/**
- arrow-json/**
- arrow-avro/**
- arrow-ord/**
- arrow-row/**
- arrow-schema/**
- arrow-select/**
- arrow-string/**
- arrow/**

jobs:

Expand All @@ -36,24 +55,46 @@ jobs:
runs-on: ubuntu-latest
container:
image: amd64/rust
env:
# Disable full debug symbol generation to speed up CI build and keep memory down
# "1" means line tables only, which is useful for panic tracebacks.
RUSTFLAGS: "-C debuginfo=1"
steps:
- uses: actions/checkout@v3
- uses: actions/checkout@v4
with:
submodules: true
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Test
run: |
cargo test -p arrow
- name: Test --features=force_validate,prettyprint,ipc_compression,ffi,dyn_cmp_dict
run: |
cargo test -p arrow --features=force_validate,prettyprint,ipc_compression,ffi,dyn_cmp_dict
- name: Test arrow-buffer with all features
run: cargo test -p arrow-buffer --all-features
- name: Test arrow-data with all features
run: cargo test -p arrow-data --all-features
- name: Test arrow-schema with all features
run: cargo test -p arrow-schema --all-features
- name: Test arrow-array with all features
run: cargo test -p arrow-array --all-features
- name: Test arrow-select with all features
run: cargo test -p arrow-select --all-features
- name: Test arrow-cast with all features
run: cargo test -p arrow-cast --all-features
- name: Test arrow-ipc with all features
run: cargo test -p arrow-ipc --all-features
- name: Test arrow-csv with all features
run: cargo test -p arrow-csv --all-features
- name: Test arrow-json with all features
run: cargo test -p arrow-json --all-features
- name: Test arrow-avro with all features
run: cargo test -p arrow-avro --all-features
- name: Test arrow-string with all features
run: cargo test -p arrow-string --all-features
- name: Test arrow-ord with all features
run: cargo test -p arrow-ord --all-features
- name: Test arrow-arith with all features
run: cargo test -p arrow-arith --all-features
- name: Test arrow-row with all features
run: cargo test -p arrow-row --all-features
- name: Test arrow-integration-test with all features
run: cargo test -p arrow-integration-test --all-features
- name: Test arrow with default features
run: cargo test -p arrow
- name: Test arrow with all features except pyarrow
run: cargo test -p arrow --features=force_validate,prettyprint,ipc_compression,ffi,chrono-tz
- name: Run examples
run: |
# Test arrow examples
Expand All @@ -64,114 +105,95 @@ jobs:
- name: Run non-archery based integration-tests
run: cargo test -p arrow-integration-testing

# test compilaton features
# test compilation features
linux-features:
name: Check Compilation
runs-on: ubuntu-latest
container:
image: amd64/rust
env:
# Disable full debug symbol generation to speed up CI build and keep memory down
# "1" means line tables only, which is useful for panic tracebacks.
RUSTFLAGS: "-C debuginfo=1"
steps:
- uses: actions/checkout@v3
- uses: actions/checkout@v4
with:
submodules: true
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Check compilation
run: |
cargo check -p arrow
run: cargo check -p arrow
- name: Check compilation --no-default-features
run: |
cargo check -p arrow --no-default-features
run: cargo check -p arrow --no-default-features
- name: Check compilation --all-targets
run: |
cargo check -p arrow --all-targets
run: cargo check -p arrow --all-targets
- name: Check compilation --no-default-features --all-targets
run: |
cargo check -p arrow --no-default-features --all-targets
run: cargo check -p arrow --no-default-features --all-targets
- name: Check compilation --no-default-features --all-targets --features test_utils
run: |
cargo check -p arrow --no-default-features --all-targets --features test_utils

# test the --features "simd" of the arrow crate. This requires nightly Rust.
linux-test-simd:
name: Test SIMD on AMD64 Rust ${{ matrix.rust }}
runs-on: ubuntu-latest
container:
image: amd64/rust
env:
# Disable full debug symbol generation to speed up CI build and keep memory down
# "1" means line tables only, which is useful for panic tracebacks.
RUSTFLAGS: "-C debuginfo=1"
steps:
- uses: actions/checkout@v3
with:
submodules: true
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: nightly
- name: Run tests --features "simd"
run: |
cargo test -p arrow --features "simd"
- name: Check compilation --features "simd"
run: |
cargo check -p arrow --features simd
- name: Check compilation --features simd --all-targets
run: |
cargo check -p arrow --features simd --all-targets
run: cargo check -p arrow --no-default-features --all-targets --features test_utils
- name: Check compilation --no-default-features --all-targets --features ffi
run: cargo check -p arrow --no-default-features --all-targets --features ffi
- name: Check compilation --no-default-features --all-targets --features chrono-tz
run: cargo check -p arrow --no-default-features --all-targets --features chrono-tz


# test the arrow crate builds against wasm32 in stable rust
# test the arrow crate builds against wasm32 in nightly rust
wasm32-build:
name: Build wasm32
runs-on: ubuntu-latest
container:
image: amd64/rust
env:
# Disable full debug symbol generation to speed up CI build and keep memory down
# "1" means line tables only, which is useful for panic tracebacks.
RUSTFLAGS: "-C debuginfo=1"
steps:
- uses: actions/checkout@v3
- uses: actions/checkout@v4
with:
submodules: true
- name: Cache Cargo
uses: actions/cache@v3
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
path: /github/home/.cargo
key: cargo-wasm32-cache3-
- name: Setup Rust toolchain for WASM
run: |
rustup toolchain install nightly
rustup override set nightly
rustup target add wasm32-unknown-unknown
rustup target add wasm32-wasi
- name: Build
run: |
cd arrow
cargo build --no-default-features --features=json,csv,ipc,simd,ffi --target wasm32-unknown-unknown
cargo build --no-default-features --features=json,csv,ipc,simd,ffi --target wasm32-wasi
target: wasm32-unknown-unknown,wasm32-wasi
- name: Build wasm32-unknown-unknown
run: cargo build -p arrow --no-default-features --features=json,csv,ipc,ffi --target wasm32-unknown-unknown
- name: Build wasm32-wasi
run: cargo build -p arrow --no-default-features --features=json,csv,ipc,ffi --target wasm32-wasi

clippy:
name: Clippy
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v3
- uses: actions/checkout@v4
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Setup Clippy
run: |
rustup component add clippy
- name: Run clippy
run: |
cargo clippy -p arrow --features=prettyprint,csv,ipc,test_utils,ffi,ipc_compression,dyn_cmp_dict --all-targets -- -D warnings
run: rustup component add clippy
- name: Clippy arrow-buffer with all features
run: cargo clippy -p arrow-buffer --all-targets --all-features -- -D warnings
- name: Clippy arrow-data with all features
run: cargo clippy -p arrow-data --all-targets --all-features -- -D warnings
- name: Clippy arrow-schema with all features
run: cargo clippy -p arrow-schema --all-targets --all-features -- -D warnings
- name: Clippy arrow-array with all features
run: cargo clippy -p arrow-array --all-targets --all-features -- -D warnings
- name: Clippy arrow-select with all features
run: cargo clippy -p arrow-select --all-targets --all-features -- -D warnings
- name: Clippy arrow-cast with all features
run: cargo clippy -p arrow-cast --all-targets --all-features -- -D warnings
- name: Clippy arrow-ipc with all features
run: cargo clippy -p arrow-ipc --all-targets --all-features -- -D warnings
- name: Clippy arrow-csv with all features
run: cargo clippy -p arrow-csv --all-targets --all-features -- -D warnings
- name: Clippy arrow-json with all features
run: cargo clippy -p arrow-json --all-targets --all-features -- -D warnings
- name: Clippy arrow-avro with all features
run: cargo clippy -p arrow-avro --all-targets --all-features -- -D warnings
- name: Clippy arrow-string with all features
run: cargo clippy -p arrow-string --all-targets --all-features -- -D warnings
- name: Clippy arrow-ord with all features
run: cargo clippy -p arrow-ord --all-targets --all-features -- -D warnings
- name: Clippy arrow-arith with all features
run: cargo clippy -p arrow-arith --all-targets --all-features -- -D warnings
- name: Clippy arrow-row with all features
run: cargo clippy -p arrow-row --all-targets --all-features -- -D warnings
- name: Clippy arrow with all features
run: cargo clippy -p arrow --all-features --all-targets -- -D warnings
- name: Clippy arrow-integration-test with all features
run: cargo clippy -p arrow-integration-test --all-targets --all-features -- -D warnings
- name: Clippy arrow-integration-testing with all features
run: cargo clippy -p arrow-integration-testing --all-targets --all-features -- -D warnings
Loading
Loading