-
Notifications
You must be signed in to change notification settings - Fork 237
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fully support date/time legacy rebase for nested input [databricks] #9660
Fully support date/time legacy rebase for nested input [databricks] #9660
Conversation
Signed-off-by: Nghia Truong <[email protected]>
# Conflicts: # sql-plugin/src/main/scala/com/nvidia/spark/RebaseHelper.scala # sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuParquetScan.scala
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
…IA#9617)" This reverts commit 401d0d8. Signed-off-by: Nghia Truong <[email protected]> # Conflicts: # sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuParquetScan.scala
# Conflicts: # sql-plugin/src/main/scala/com/nvidia/spark/RebaseHelper.scala # sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuParquetScan.scala
# Conflicts: # integration_tests/src/main/python/parquet_test.py
# Conflicts: # integration_tests/src/main/python/parquet_test.py # sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuParquetScan.scala
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
# Conflicts: # integration_tests/src/main/python/parquet_test.py # integration_tests/src/main/python/parquet_write_test.py # sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuParquetScan.scala # sql-plugin/src/main/scala/com/nvidia/spark/rapids/datetimeRebaseUtils.scala
# Once https://github.com/NVIDIA/spark-rapids/issues/1126 is fixed delete this test and merge it | ||
# into test_parquet_read_roundtrip_datetime |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This deleted test is combined with the test_parquet_read_roundtrip_datetime
.
parquet_datetime_in_struct_gen = [ | ||
StructGen([['child' + str(ind), sub_gen] for ind, sub_gen in enumerate(parquet_datetime_gen_simple)])] | ||
parquet_datetime_in_array_gen = [ArrayGen(sub_gen, max_length=10) for sub_gen in | ||
parquet_datetime_gen_simple + parquet_datetime_in_struct_gen] | ||
parquet_nested_datetime_gen = parquet_datetime_gen_simple + parquet_datetime_in_struct_gen + \ | ||
parquet_datetime_in_array_gen |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Simplify the data generators a bit, since they are too heavy and the tests using them (especially in parquet read tests) become very very slow now.
Signed-off-by: Nghia Truong <[email protected]>
# Conflicts: # integration_tests/src/main/python/parquet_write_test.py
Signed-off-by: Nghia Truong <[email protected]>
build |
Signed-off-by: Nghia Truong <[email protected]>
build |
Signed-off-by: Nghia Truong <[email protected]>
build |
This adds full support for date/time legacy rebase for the input containing dates/timestamps nested under other columns.
Most related tests are updated to reflect the changes.
Depends on:
spark.sql.parquet.datetimeRebaseModeInRead=LEGACY
andspark.sql.parquet.int96RebaseModeInRead=LEGACY
[databricks] #9649spark.sql.parquet.int96RebaseModeInWrite=LEGACY
[databricks] #9658Closes #1126.
Warn: This PR contains code from the dependency PRs above and will be hidden after they are merged.