Merge branch-24.04 into main [skip ci] #55

NvTimLiu · 2024-03-28T14:28:47Z

Change version to 24.04.0

Note: merge this PR with Create a merge commit to merge

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Signed-off-by: Tim Liu <[email protected]>

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Signed-off-by: Robert (Bobby) Evans <[email protected]>

To fix: NVIDIA#10256 Bump up dependency version to 24.04.0-SNAPSHOT Signed-off-by: Tim Liu <[email protected]>

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Signed-off-by: Jason Lowe <[email protected]>

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Signed-off-by: Tim Liu <[email protected]>

* Distinct inner join Signed-off-by: Jason Lowe <[email protected]> * Distinct left join Signed-off-by: Jason Lowe <[email protected]> * Update to new API * Fix test --------- Signed-off-by: Jason Lowe <[email protected]>

Signed-off-by: Partho Sarthi <[email protected]>

…ment (NVIDIA#10564) * WIP Signed-off-by: Gera Shegalov <[email protected]> * WIP Signed-off-by: Gera Shegalov <[email protected]> * Enable specifying the pytest using file_or_dir args ```bash TEST_PARALLEL=0 \ SPARK_HOME=~/dist/spark-3.1.1-bin-hadoop3.2 \ TEST_FILE_OR_DIR=~/gits/NVIDIA/spark-rapids/integration_tests/src/main/python/arithmetic_ops_test.py::test_addition \ ./integration_tests/run_pyspark_from_build.sh --collect-only <Module src/main/python/arithmetic_ops_test.py> <Function test_addition[Byte]> <Function test_addition[Short]> <Function test_addition[Integer]> <Function test_addition[Long]> <Function test_addition[Float]> <Function test_addition[Double]> <Function test_addition[Decimal(7,3)]> <Function test_addition[Decimal(12,2)]> <Function test_addition[Decimal(18,0)]> <Function test_addition[Decimal(20,2)]> <Function test_addition[Decimal(30,2)]> <Function test_addition[Decimal(36,5)]> <Function test_addition[Decimal(38,10)]> <Function test_addition[Decimal(38,0)]> <Function test_addition[Decimal(7,7)]> <Function test_addition[Decimal(7,-3)]> <Function test_addition[Decimal(36,-5)]> <Function test_addition[Decimal(38,-10)]> ``` Signed-off-by: Gera Shegalov <[email protected]> Co-authored-by: Raza Jafri <[email protected]> * Changing to TESTS=module::method Signed-off-by: Gera Shegalov <[email protected]> --------- Signed-off-by: Gera Shegalov <[email protected]> Co-authored-by: Raza Jafri <[email protected]>

…VIDIA#10562) * Fix test_spark_from_json_date_with_format when run in a non-UTC TZ Signed-off-by: Robert (Bobby) Evans <[email protected]> * Copyright year --------- Signed-off-by: Robert (Bobby) Evans <[email protected]>

…0542) Signed-off-by: Andy Grove <[email protected]> Signed-off-by: Robert (Bobby) Evans <[email protected]> Co-authored-by: Andy Grove <[email protected]>

* WindowGroupLimit support for [databricks]. Fixes NVIDIA#10531. This is a followup to NVIDIA#10500, which added support to push down window-group-limit filters before the shuffle phase. NVIDIA#10500 inadvertently neglected to ensure that the optimization works on Databricks. (It turns out that window-group-limit was cherry-picked into Databricks 13.3, despite the nominal Spark version being `3.4.1`.) This change ensures that the same optimization is available on Databricks 13.3 (and beyond). --------- Signed-off-by: MithunR <[email protected]>

Signed-off-by: Alessandro Bellina <[email protected]>

…DIA#10572) Signed-off-by: Robert (Bobby) Evans <[email protected]>

Signed-off-by: Jason Lowe <[email protected]>

* Add in small optimization for instr comparison Signed-off-by: Robert (Bobby) Evans <[email protected]> * Review Comments --------- Signed-off-by: Robert (Bobby) Evans <[email protected]>

Fixes NVIDIA#10430. This PR ensures that Spark RAPIDS jobs are executed on supported GPU architectures without relying on manual configuration. ### Changes: 1. Processes `gpu_architectures` property from the `*version-info.properties` file generated by the native builds. 2. Verifies if the user is running the job on an architecture supported by the cuDF and JNI libraries and throws an exception if the architecture is unsupported. ### Testing Tested on a Dataproc VM running on Nvidia P4 (GPU Architecture 6.1) ``` 24/03/06 17:44:58 WARN RapidsPluginUtils: spark.rapids.sql.explain is set to `NOT_ON_GPU`. Set it to 'NONE' to suppress the diagnostics logging about the query placement on the GPU. 24/03/06 17:45:10 ERROR RapidsExecutorPlugin: Exception in the executor plugin, shutting down! java.lang.RuntimeException: Device architecture 61 is unsupported. Minimum supported architecture: 75. at com.nvidia.spark.rapids.RapidsPluginUtils$.checkGpuArchitectureInternal(Plugin.scala:366) at com.nvidia.spark.rapids.RapidsPluginUtils$.checkGpuArchitecture(Plugin.scala:375) at com.nvidia.spark.rapids.RapidsExecutorPlugin.init(Plugin.scala:461) ``` ### Related PR * NVIDIA/spark-rapids-jni#1840 * Add conf for minimum supported CUDA and error handling Signed-off-by: Partho Sarthi <[email protected]> * Revert "Add conf for minimum supported CUDA and error handling" This reverts commit 7b8eaea. * Verify the GPU architecture is supported by the plugin libraries Signed-off-by: Partho Sarthi <[email protected]> * Use semi-colon as delimiter and use intersection of supported gpu architectures Signed-off-by: Partho Sarthi <[email protected]> * Allow for compatibility with major architectures Signed-off-by: Partho Sarthi <[email protected]> * Check for version as integers Signed-off-by: Partho Sarthi <[email protected]> * Modify compatibility check for same major version and same or higher minor version Signed-off-by: Partho Sarthi <[email protected]> * Add a config to skip verification and refactor checking Signed-off-by: Partho Sarthi <[email protected]> * Update RapidsConf.scala Co-authored-by: Jason Lowe <[email protected]> * Update verification logic Signed-off-by: Partho Sarthi <[email protected]> * Update warning message Signed-off-by: Partho Sarthi <[email protected]> * Add unit tests and update warning message. Signed-off-by: Partho Sarthi <[email protected]> * Update exception class Signed-off-by: Partho Sarthi <[email protected]> * Address review comments Signed-off-by: Partho Sarthi <[email protected]> --------- Signed-off-by: Partho Sarthi <[email protected]> Co-authored-by: Jason Lowe <[email protected]>

Signed-off-by: YanxuanLiu <[email protected]>

…VIDIA#10575) Signed-off-by: Robert (Bobby) Evans <[email protected]>

Update CI script to support building and deploying using the same CUDA classifier Signed-off-by: Tim Liu <[email protected]>

Signed-off-by: Yinqing Hao <[email protected]>

…DIA#10602) * Disable InMemoryTableScanExec support by default for Spark 3.5+ * Signing off Signed-off-by: Raza Jafri <[email protected]> * Revert "Remove InMemoryTableScanExec support for Spark 3.5+" This reverts commit 8d38f8a. * Disable InMemoryTableScanExec by default for Spark versions 3.5+ --------- Signed-off-by: Raza Jafri <[email protected]>

…p command (NVIDIA#10599) * Call globStatus directly via PY4J instead of 'hadoop fs' Signed-off-by: Yinqing Hao <[email protected]> * Use pathlib.Path to handle local path only Signed-off-by: Yinqing Hao <[email protected]> * Use os.path.join to concatenate hdfs path and pattern Signed-off-by: Yinqing Hao <[email protected]> --------- Signed-off-by: Yinqing Hao <[email protected]>

* Do not replace TableCacheQueryStageExec * signoff Signed-off-by: Andy Grove <[email protected]> --------- Signed-off-by: Andy Grove <[email protected]> Co-authored-by: Raza Jafri <[email protected]>

Signed-off-by: Alessandro Bellina <[email protected]>

"shared-libs" had not been used since we switched to blossom Jenkins. We had ever used "shared-libs" before, but now only need "blossom-lib". Signed-off-by: Tim Liu <[email protected]>

* Pass metadata extractors to FileScanRDD * Signing off Signed-off-by: Raza Jafri <[email protected]> * addressed review comments * updated copyrights manually --------- Signed-off-by: Raza Jafri <[email protected]>

) Signed-off-by: Tim Liu <[email protected]>

* Add fix for removing internal metadata information from 350 shim Signed-off-by: Partho Sarthi <[email protected]> * Create a shim helper class wth the relevant change instead of duplicating code Signed-off-by: Partho Sarthi <[email protected]> * Remove extra whitespace Signed-off-by: Partho Sarthi <[email protected]> --------- Signed-off-by: Partho Sarthi <[email protected]>

* Use new kernel for getJsonObject Signed-off-by: Haoyang Li <[email protected]> * Use table to pass parsed path Signed-off-by: Haoyang Li <[email protected]> * use list/vector of instruction objects Signed-off-by: Haoyang Li <[email protected]> * fallback when nested too long Signed-off-by: Haoyang Li <[email protected]> * cancel xfail cases Signed-off-by: Haoyang Li <[email protected]> * cancel xfail cases Signed-off-by: Haoyang Li <[email protected]> * generated and modified docs Signed-off-by: Haoyang Li <[email protected]> * wip Signed-off-by: Haoyang Li <[email protected]> * wip Signed-off-by: Haoyang Li <[email protected]> * apply jni change and remove xpass Signed-off-by: Haoyang Li <[email protected]> * Adds test cases Signed-off-by: Haoyang Li <[email protected]> --------- Signed-off-by: Haoyang Li <[email protected]>

Signed-off-by: jenkins <jenkins@localhost>

NvTimLiu · 2024-03-28T14:28:51Z

[skip ci] as branch-24.04 already PASS the build

nvauto and others added 30 commits January 24, 2024 17:20

Merge pull request NVIDIA#10259 from NVIDIA/branch-24.02

edd0db9

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10265 from NVIDIA/branch-24.02

3672dcc

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10266 from NVIDIA/branch-24.02

f3c9bdd

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10269 from NVIDIA/branch-24.02

8ac471b

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10271 from NVIDIA/branch-24.02

b07c14d

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Init project version 24.04.0-SNAPSHOT (NVIDIA#10258)

ad646cc

Signed-off-by: Tim Liu <[email protected]>

Merge pull request NVIDIA#10280 from NVIDIA/branch-24.02

1da861e

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10283 from NVIDIA/branch-24.02

b9a1e85

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10293 from NVIDIA/branch-24.02

4a89338

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10294 from NVIDIA/branch-24.02

da1ca7b

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10303 from NVIDIA/branch-24.02

dc8ea95

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10304 from NVIDIA/branch-24.02

27d041f

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10305 from NVIDIA/branch-24.02

c7fea18

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10307 from NVIDIA/branch-24.02

fc6ac90

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10308 from NVIDIA/branch-24.02

5408b09

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10313 from NVIDIA/branch-24.02

3e4168a

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10314 from NVIDIA/branch-24.02

0e54176

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10315 from NVIDIA/branch-24.02

83d012e

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10320 from NVIDIA/branch-24.02

31da3a6

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10332 from NVIDIA/branch-24.02

14629cd

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10333 from NVIDIA/branch-24.02

ae54560

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10335 from NVIDIA/branch-24.02

d5ae6d9

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10338 from NVIDIA/branch-24.02

5b8ee36

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Add tryAcquire to GpuSemaphore (NVIDIA#10330)

803e498

Signed-off-by: Robert (Bobby) Evans <[email protected]>

Bump up dependency version to 24.04.0-SNAPSHOT (NVIDIA#10321)

bf67814

To fix: NVIDIA#10256 Bump up dependency version to 24.04.0-SNAPSHOT Signed-off-by: Tim Liu <[email protected]>

Merge pull request NVIDIA#10346 from NVIDIA/branch-24.02

0943b95

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10354 from NVIDIA/branch-24.02

862e8df

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Remove redundant joinOutputRows metric (NVIDIA#10348)

04b687d

Signed-off-by: Jason Lowe <[email protected]>

Merge pull request NVIDIA#10357 from NVIDIA/branch-24.02

7a34757

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

Merge pull request NVIDIA#10363 from NVIDIA/branch-24.02

3b14e01

[auto-merge] branch-24.02 to branch-24.04 [skip ci] [bot]

NvTimLiu and others added 27 commits March 7, 2024 22:56

Move K8s cloud name into common lib for Jenkins CI (NVIDIA#10538)

852b74a

Signed-off-by: Tim Liu <[email protected]>

Distinct left join (NVIDIA#10520)

e1cbd6e

* Distinct inner join Signed-off-by: Jason Lowe <[email protected]> * Distinct left join Signed-off-by: Jason Lowe <[email protected]> * Update to new API * Fix test --------- Signed-off-by: Jason Lowe <[email protected]>

Append new authorized user to blossom-ci safelist (NVIDIA#10563)

11fde83

Signed-off-by: Partho Sarthi <[email protected]>

Make JSON parsing common between JsonToStructs and ScanJson (NVIDIA#1…

3d3ade2

…0542) Signed-off-by: Andy Grove <[email protected]> Signed-off-by: Robert (Bobby) Evans <[email protected]> Co-authored-by: Andy Grove <[email protected]>

Add configuration to share JNI pinned pool with cuIO (NVIDIA#10568)

f26ce1f

Signed-off-by: Alessandro Bellina <[email protected]>

Improve performance of Sort for the common single batch use case (NVI…

9105fd7

…DIA#10572) Signed-off-by: Robert (Bobby) Evans <[email protected]>

Turn on transition logging in HostAllocSuite (NVIDIA#10590)

e0ef44a

Signed-off-by: Jason Lowe <[email protected]>

Add in small optimization for instr comparison (NVIDIA#10584)

4393e9f

* Add in small optimization for instr comparison Signed-off-by: Robert (Bobby) Evans <[email protected]> * Review Comments --------- Signed-off-by: Robert (Bobby) Evans <[email protected]>

add cloud guardword (NVIDIA#10597)

7177c3a

Signed-off-by: YanxuanLiu <[email protected]>

Update JsonToStructs and ScanJson to have white space normalization (N…

a56da0c

…VIDIA#10575) Signed-off-by: Robert (Bobby) Evans <[email protected]>

Scritp to build and deploy using the same classifier (NVIDIA#10598)

2700f1e

Update CI script to support building and deploying using the same CUDA classifier Signed-off-by: Tim Liu <[email protected]>

Update perfio.s3.enabled doc (NVIDIA#10608)

3ce6af1

Signed-off-by: Yinqing Hao <[email protected]>

Do not replace TableCacheQueryStageExec (NVIDIA#10610)

65ec25d

* Do not replace TableCacheQueryStageExec * signoff Signed-off-by: Andy Grove <[email protected]> --------- Signed-off-by: Andy Grove <[email protected]> Co-authored-by: Raza Jafri <[email protected]>

Turn off state logging in HostAllocSuite (NVIDIA#10615)

079d0fe

Signed-off-by: Alessandro Bellina <[email protected]>

Remove unused shared lib in Jenkins files (NVIDIA#10620)

579cc13

"shared-libs" had not been used since we switched to blossom Jenkins. We had ever used "shared-libs" before, but now only need "blossom-lib". Signed-off-by: Tim Liu <[email protected]>

Pass metadata extractors to FileScanRDD [databricks] (NVIDIA#10616)

09a0081

* Pass metadata extractors to FileScanRDD * Signing off Signed-off-by: Raza Jafri <[email protected]> * addressed review comments * updated copyrights manually --------- Signed-off-by: Raza Jafri <[email protected]>

Auto merge PRs to branch-24.06 from branch-24.04 [skip ci] (NVIDIA#10623

e833d39

) Signed-off-by: Tim Liu <[email protected]>

Merge branch-24.04 into main

eb6b83c

Change version to 24.04.0

404d937

Signed-off-by: jenkins <jenkins@localhost>

NvTimLiu merged commit 42adf6d into main Mar 28, 2024

NvTimLiu deleted the branch-24.04-to-main branch March 28, 2024 14:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge branch-24.04 into main [skip ci] #55

Merge branch-24.04 into main [skip ci] #55

NvTimLiu commented Mar 28, 2024

NvTimLiu commented Mar 28, 2024

Merge branch-24.04 into main [skip ci] #55

Merge branch-24.04 into main [skip ci] #55

Conversation

NvTimLiu commented Mar 28, 2024

NvTimLiu commented Mar 28, 2024