update the main branch for 2412 release #480

nvliyuan · 2024-12-24T06:59:49Z

update the main branch for v2412 release.
Please create a merge commit, not squash.

Signed-off-by: nvauto <[email protected]>

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

…NVIDIA#444) Signed-off-by: Peixin Li <[email protected]>

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

Signed-off-by: Bobby Wang <[email protected]>

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

1, Add the variables 'SPARK_MASTER_URL' and 'DATA_ROOT' to support automated testing from CI/CD jobs. 2, 'output_prefix' is NOT referenced in below lines Signed-off-by: timl <[email protected]>

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

* Add a TPC-DS SF 10 Notebook for locall Jupyter or Google Colab Signed-off-by: Gera Shegalov <[email protected]> * Update link to the current blob Signed-off-by: Gera Shegalov <[email protected]> --------- Signed-off-by: Gera Shegalov <[email protected]>

* Update "Open in Colab" link * Update README.md Signed-off-by: Gera Shegalov <[email protected]> --------- Signed-off-by: Gera Shegalov <[email protected]>

* Add Tools Notebooks for EMR * Update README * Sign-off commit Signed-off-by: Partho Sarthi <[email protected]> --------- Signed-off-by: Partho Sarthi <[email protected]>

* add notebook Signed-off-by: YanxuanLiu <[email protected]> * change default kernel Signed-off-by: YanxuanLiu <[email protected]> * add test Signed-off-by: YanxuanLiu <[email protected]> * print Signed-off-by: YanxuanLiu <[email protected]> * change spark master Signed-off-by: YanxuanLiu <[email protected]> * change path of data path Signed-off-by: YanxuanLiu <[email protected]> * remove old notebook Signed-off-by: YanxuanLiu <[email protected]> * optimize default value Signed-off-by: YanxuanLiu <[email protected]> * clear all output Signed-off-by: YanxuanLiu <[email protected]> * remove id Signed-off-by: YanxuanLiu <[email protected]> --------- Signed-off-by: YanxuanLiu <[email protected]>

* change data parh Signed-off-by: YanxuanLiu <[email protected]> * change data path Signed-off-by: YanxuanLiu <[email protected]> * save result to file Signed-off-by: YanxuanLiu <[email protected]> * change format of output Signed-off-by: YanxuanLiu <[email protected]> --------- Signed-off-by: YanxuanLiu <[email protected]>

Signed-off-by: Peixin Li <[email protected]>

…A#469) Signed-off-by: Partho Sarthi <[email protected]>

* Support running optuna on Spark * play around with optuna + xgboost + joblibspark * update * update * update * Optuna: Add how optuna + xgboost + spark works * deploy optuna examples on databricks * Cleanup and prepare for open source * Update README.md * Update run-optuna-spark-xgboost.sh * Update README.md with chmod for run scripts * update/add copyright * Replace RDD example with Dataframe example, cleanups to README and repo structure * Move files to separate dir for merge to spark-rapids-examples * separate dir * Merge branch-24.10 * Remove gitignore Signed-off-by: Rishi Chandra <[email protected]> * remove username * Update README.md * Update README.md * Update README.md * Update README.md * Fix run-sparkrapids-xgboost.sh implementation path * Fix corrupt mysql installation in init_optuna.sh * Fix corrupt mysql installation in init_optuna_xgboost.sh * Update run-joblibspark-xgboost.sh * Update run-sparkrapids-xgboost.sh * Update run-joblibspark-simple.sh * Update run-joblibspark-xgboost.sh * Update run-sparkrapids-xgboost.sh * Update to 24.10, include apt updates * use gpu_hist for older xgb versions * use gpu_hist for older xgb versions * Update sparkrapids-xgboost-read-per-worker.py * Update init script with mysql installation fixes * Address comments * Add cluster startup script * Move around files, add notebooks * Repo renovations * Update README, remove run scripts, fix PCA to use 24.10.1 * minor updates * Final cleanups, runs passed on databricks * README updates * Updates to comments, cleanup outputs * Address comments, minor reordering, update README * comment fix * remove unnecessary imports * tuning max bins and n_estimators bug fixes * 'max_bin' != 'max_bins' 🤦 * ensure QDM and XGB bins are the same * typos * cleanup * Address comments * Note about sampler serialization * Add link * Add link * Undo benchmark commit * typo --------- Signed-off-by: Rishi Chandra <[email protected]> Co-authored-by: Bobby Wang (SW-TEGRA) <[email protected]> Co-authored-by: Bobby Wang <[email protected]> Co-authored-by: Erik Ordentlich <[email protected]>

Signed-off-by: liyuan <[email protected]>

…ease

nvauto and others added 23 commits September 24, 2024 07:36

Init version 24.12.0-SNAPSHOT

6b1c689

Signed-off-by: nvauto <[email protected]>

Merge pull request NVIDIA#436 from NVIDIA/branch-24.10

77fb305

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

Merge pull request NVIDIA#441 from NVIDIA/branch-24.10

e596414

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

Merge pull request NVIDIA#443 from NVIDIA/branch-24.10

b4f10d0

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

Disable kvikio remote I/O to avoid openssl dependencies in cudf build (…

2641ebf

…NVIDIA#444) Signed-off-by: Peixin Li <[email protected]>

Merge pull request NVIDIA#445 from NVIDIA/branch-24.10

0ce65c7

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

Merge pull request NVIDIA#447 from NVIDIA/branch-24.10

1c36fb9

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

[xgboost] Fix eval dataset issues (NVIDIA#450)

8015ba4

Signed-off-by: Bobby Wang <[email protected]>

Merge pull request NVIDIA#454 from NVIDIA/branch-24.10

16ca6ec

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

Update the python notebooks for the customer churn examples (NVIDIA#452)

119897c

1, Add the variables 'SPARK_MASTER_URL' and 'DATA_ROOT' to support automated testing from CI/CD jobs. 2, 'output_prefix' is NOT referenced in below lines Signed-off-by: timl <[email protected]>

Merge pull request NVIDIA#455 from NVIDIA/branch-24.10

2121693

[auto-merge] branch-24.10 to branch-24.12 [skip ci] [bot]

Update "Open in Colab" link (NVIDIA#457)

6e7bfdd

* Update "Open in Colab" link * Update README.md Signed-off-by: Gera Shegalov <[email protected]> --------- Signed-off-by: Gera Shegalov <[email protected]>

Include Tools Notebook for EMR (NVIDIA#459)

d84c593

* Add Tools Notebooks for EMR * Update README * Sign-off commit Signed-off-by: Partho Sarthi <[email protected]> --------- Signed-off-by: Partho Sarthi <[email protected]>

Prepare the arg to disable kvikIO remote IO (NVIDIA#465)

d989b14

Signed-off-by: Peixin Li <[email protected]>

Remove the kvikIO workaround (NVIDIA#466)

af324af

Signed-off-by: Peixin Li <[email protected]>

Update udf native example cmake version to 3.28.6 (NVIDIA#468)

20556db

Signed-off-by: Peixin Li <[email protected]>

Updated title in README for EMR and Databricks Tools Notebooks (NVIDI…

d101065

…A#469) Signed-off-by: Partho Sarthi <[email protected]>

update for 2412 release (NVIDIA#478)

e3d16dd

Signed-off-by: liyuan <[email protected]>

Merge remote-tracking branch 'origin/branch-24.12' into main-2412-rel…

7da464e

…ease

nvliyuan merged commit e863522 into NVIDIA:main Dec 24, 2024
2 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update the main branch for 2412 release #480

update the main branch for 2412 release #480

nvliyuan commented Dec 24, 2024

update the main branch for 2412 release #480

update the main branch for 2412 release #480

Conversation

nvliyuan commented Dec 24, 2024