Fix Delta Lake atomic table operations on spark341db [databricks] #9729

jlowe · 2023-11-15T19:24:25Z

Fixes #9676. Databricks 13.3 includes the changes from SPARK-43088, where atomic table operations no longer include the query at the point where the staged table is created. Instead, they create the table and then append the data. For Delta Lake, this manifests as a subplan being executed that contains an AppendDataExecV1 using the table and WriteBuilder that were created to handle the original atomic table operation. That explains why the issue reported a tagging attempt on a RAPIDS Accelerator class.

The fix is relatively straightforward. We go ahead and let the AppendDataExecV1 use the RAPIDS Accelerator classes, but we recognize these classes in the tagging code and skip the CPU extraction code when that is the case. This keeps the append operation on the GPU because we're already using a table and writer geared to the GPU.
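As a rough illustration of the tagging decision described above, here is a minimal, self-contained sketch. Every name in it (`Write`, `GpuDeltaWrite`, `CpuDeltaWrite`, `OtherWrite`, `AppendPlan`, `TagAppend`, and the `TagResult` cases) is a hypothetical stand-in invented for this example, not one of the actual RAPIDS Accelerator or Delta Lake classes changed in this PR. It only models the idea: if the append already carries a GPU writer, skip the CPU extraction step and keep the operation on the GPU.

```scala
// Hypothetical stand-ins for the writer attached to an append node.
// These are NOT the real spark-rapids or Delta Lake types.
sealed trait Write
final case class CpuDeltaWrite(path: String) extends Write // ordinary CPU writer
final case class GpuDeltaWrite(path: String) extends Write // writer already built for the GPU
final case class OtherWrite(name: String) extends Write    // anything we do not recognize

// Hypothetical model of an append node that carries its writer.
final case class AppendPlan(write: Write)

// Possible outcomes of tagging an append node.
sealed trait TagResult
case object AlreadyOnGpu extends TagResult                  // nothing to extract, stay on GPU
final case class ConvertFromCpu(path: String) extends TagResult
final case class FallBack(reason: String) extends TagResult

object TagAppend {
  def tag(plan: AppendPlan): TagResult = plan.write match {
    // The writer was created by the GPU atomic table operation:
    // recognize it and skip the CPU extraction path entirely.
    case _: GpuDeltaWrite => AlreadyOnGpu
    // Normal case: pull what we need out of the CPU writer so it can be replaced.
    case CpuDeltaWrite(path) => ConvertFromCpu(path)
    // Unknown writer: leave the operation on the CPU.
    case OtherWrite(name) => FallBack(s"unsupported writer: $name")
  }
}
```

Without the first case, a GPU writer would fall into the CPU extraction logic, which is the kind of tagging attempt on an accelerator class that the issue reported.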

Signed-off-by: Jason Lowe <[email protected]>

jlowe · 2023-11-15T19:37:52Z

build

jlowe · 2023-11-15T22:46:53Z

build

jlowe · 2023-11-16T15:45:30Z

Build failed due to an unrelated test; posted a fix for that test at #9745.

jlowe · 2023-11-16T21:30:52Z

build

jlowe · 2023-11-17T16:05:20Z

build

Fix Delta Lake atomic table operations on spark341db

72bf836

Signed-off-by: Jason Lowe <[email protected]>

jlowe self-assigned this Nov 15, 2023

revans2 approved these changes Nov 15, 2023

jlowe merged commit 94b25db into NVIDIA:branch-23.12 Nov 17, 2023
37 checks passed

jlowe deleted the fix-delta-atomic-341db branch November 17, 2023 22:00

sameerz added the bug (Something isn't working) label Nov 20, 2023
