Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] group_merge_tdigest() always produces one empty cluster regardless of the number of groups when all values are null #16881

Closed
jihoonson opened this issue Sep 23, 2024 · 0 comments · Fixed by #16897
Labels
bug Something isn't working

Comments

@jihoonson
Copy link
Contributor

Describe the bug
This bug was originally reported in NVIDIA/spark-rapids#11367.

When all values are null, group_merge_tdigest() always produces only one empty cluster regardless of the number of groups in the result key column. As a result, the result table cannot be created because of the column size mismatch between the key and the value columns.

Steps/Code to reproduce bug
Repro steps are described in NVIDIA/spark-rapids#11367 (comment).

Expected behavior
group_merge_tdigest() should return the same number of empty clusters as the keys in the result.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant