"
### Release Notes
-* New functionality and performance improvements for this release include:
-* Performance improvements for S3 reading.
-Refer to perfio.s3.enabled in [advanced_configs](./additional-functionality/advanced_configs.md) for more details.
-* Performance improvements when doing a joins on unique keys.
-* Enhanced decompression kernels for zstd and snappy.
-* Enhanced Parquet reading performance with modular kernels.
-* Added compatibility with Spark version 3.5.1.
-* Deprecated support for Databricks 10.4 ML LTS.
-* For updates on RAPIDS Accelerator Tools, please visit [this link](https://github.com/NVIDIA/spark-rapids-tools/releases).
+* Improved support for Unity Catalog on Databricks
+* Added support for parse_url PATH
+* Added support for array_filter
+* Added support for Spark 3.4.3
+* For updates on RAPIDS Accelerator Tools, please visit [this link](https://github.com/NVIDIA/spark-rapids-tools/releases)
For a detailed list of changes, please refer to the
[CHANGELOG](https://github.com/NVIDIA/spark-rapids/blob/main/CHANGELOG.md).
## Archived releases
-As new releases come out, previous ones will still be available in [archived releases](./archive.md).
\ No newline at end of file
+As new releases come out, previous ones will still be available in [archived releases](./archive.md).
diff --git a/docs/supported_ops.md b/docs/supported_ops.md
index a9cd9ec13cb..fbafcfbf81d 100644
--- a/docs/supported_ops.md
+++ b/docs/supported_ops.md
@@ -2288,6 +2288,74 @@ are limited.
|
+ArrayFilter |
+`filter` |
+Filter an input array using a given predicate |
+None |
+project |
+argument |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+PS UTC is only supported TZ for child TIMESTAMP; unsupported child types BINARY, CALENDAR, UDT |
+ |
+ |
+ |
+
+
+function |
+S |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+result |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+PS UTC is only supported TZ for child TIMESTAMP; unsupported child types BINARY, CALENDAR, UDT |
+ |
+ |
+ |
+
+
ArrayIntersect |
`array_intersect` |
Returns an array of the elements in the intersection of array1 and array2, without duplicates |
@@ -2518,6 +2586,32 @@ are limited.
|
+Expression |
+SQL Functions(s) |
+Description |
+Notes |
+Context |
+Param/Output |
+BOOLEAN |
+BYTE |
+SHORT |
+INT |
+LONG |
+FLOAT |
+DOUBLE |
+DATE |
+TIMESTAMP |
+STRING |
+DECIMAL |
+NULL |
+BINARY |
+CALENDAR |
+ARRAY |
+MAP |
+STRUCT |
+UDT |
+
+
ArrayRepeat |
`array_repeat` |
Returns the array containing the given input value (left) count (right) times |
@@ -2586,32 +2680,6 @@ are limited.
|
-Expression |
-SQL Functions(s) |
-Description |
-Notes |
-Context |
-Param/Output |
-BOOLEAN |
-BYTE |
-SHORT |
-INT |
-LONG |
-FLOAT |
-DOUBLE |
-DATE |
-TIMESTAMP |
-STRING |
-DECIMAL |
-NULL |
-BINARY |
-CALENDAR |
-ARRAY |
-MAP |
-STRUCT |
-UDT |
-
-
ArrayTransform |
`transform` |
Transform elements in an array using the transform function. This is similar to a `map` in functional programming |
@@ -2910,6 +2978,32 @@ are limited.
|
+Expression |
+SQL Functions(s) |
+Description |
+Notes |
+Context |
+Param/Output |
+BOOLEAN |
+BYTE |
+SHORT |
+INT |
+LONG |
+FLOAT |
+DOUBLE |
+DATE |
+TIMESTAMP |
+STRING |
+DECIMAL |
+NULL |
+BINARY |
+CALENDAR |
+ARRAY |
+MAP |
+STRUCT |
+UDT |
+
+
Asin |
`asin` |
Inverse sine |
@@ -3000,32 +3094,6 @@ are limited.
|
-Expression |
-SQL Functions(s) |
-Description |
-Notes |
-Context |
-Param/Output |
-BOOLEAN |
-BYTE |
-SHORT |
-INT |
-LONG |
-FLOAT |
-DOUBLE |
-DATE |
-TIMESTAMP |
-STRING |
-DECIMAL |
-NULL |
-BINARY |
-CALENDAR |
-ARRAY |
-MAP |
-STRUCT |
-UDT |
-
-
Asinh |
`asinh` |
Inverse hyperbolic sine |
@@ -3343,6 +3411,32 @@ are limited.
|
+Expression |
+SQL Functions(s) |
+Description |
+Notes |
+Context |
+Param/Output |
+BOOLEAN |
+BYTE |
+SHORT |
+INT |
+LONG |
+FLOAT |
+DOUBLE |
+DATE |
+TIMESTAMP |
+STRING |
+DECIMAL |
+NULL |
+BINARY |
+CALENDAR |
+ARRAY |
+MAP |
+STRUCT |
+UDT |
+
+
AttributeReference |
|
References an input column |
@@ -3391,32 +3485,6 @@ are limited.
NS |
-Expression |
-SQL Functions(s) |
-Description |
-Notes |
-Context |
-Param/Output |
-BOOLEAN |
-BYTE |
-SHORT |
-INT |
-LONG |
-FLOAT |
-DOUBLE |
-DATE |
-TIMESTAMP |
-STRING |
-DECIMAL |
-NULL |
-BINARY |
-CALENDAR |
-ARRAY |
-MAP |
-STRUCT |
-UDT |
-
-
BRound |
`bround` |
Round an expression to d decimal places using HALF_EVEN rounding mode |
@@ -4044,6 +4112,54 @@ are limited.
|
+BoundReference |
+ |
+Reference to a bound variable |
+None |
+project |
+result |
+S |
+S |
+S |
+S |
+S |
+S |
+S |
+S |
+PS UTC is only supported TZ for TIMESTAMP |
+S |
+S |
+S |
+S |
+NS |
+PS UTC is only supported TZ for child TIMESTAMP; unsupported child types CALENDAR, UDT |
+PS UTC is only supported TZ for child TIMESTAMP; unsupported child types CALENDAR, UDT |
+PS UTC is only supported TZ for child TIMESTAMP; unsupported child types CALENDAR, UDT |
+NS |
+
+
+AST |
+result |
+S |
+S |
+S |
+S |
+S |
+S |
+S |
+S |
+PS UTC is only supported TZ for TIMESTAMP |
+S |
+NS |
+NS |
+NS |
+NS |
+NS |
+NS |
+NS |
+NS |
+
+
CaseWhen |
`when` |
CASE WHEN expression |
@@ -8222,7 +8338,7 @@ are limited.
JsonTuple |
`json_tuple` |
Returns a tuple like the function get_json_object, but it takes multiple names. All the input parameters and output column types are string. |
-This is disabled by default because JsonTuple on the GPU does not support all of the normalization that the CPU supports. |
+This is disabled by default because it is an experimental feature that could be unstable or have performance issues. |
project |
json |
|
@@ -10817,7 +10933,7 @@ are limited.
|
|
|
-PS only support partToExtract = PROTOCOL | HOST | QUERY; Literal value only |
+PS only support partToExtract = PROTOCOL | HOST | QUERY | PATH; Literal value only |
|
|
|