Add support of HiveTableScan and InsertIntoHive text-format #723

Merged (2 commits) on Jan 11, 2024

amahussein
Collaborator

Fixes #681

  • Add a unit test covering Hive text scans (scan-hive-text).
  • Check the SerDe class to determine whether the Hive format is supported.
  • Only the Hive text format is supported for now.
  • Update the read format to include ScanHive operations.
  • Add support for NativeScan as an alternative to the "Node Scan".
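
The SerDe check described in the list above can be sketched roughly as follows. This is an illustration in Python, not the Scala code from this PR: the function names, the lookup table, the `HiveText` label, and the sample node description are all assumptions made for the example. The only fixed fact it relies on is that Hive's default text SerDe class is `org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe`.

```python
from typing import Optional

# Fully qualified class name of Hive's default text SerDe.
TEXT_SERDE = "org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe"

# Hypothetical lookup table: SerDe class -> read-format label.
# Only the text format is listed, mirroring "only text-hive is supported for now".
SUPPORTED_SERDES = {TEXT_SERDE: "HiveText"}


def is_hive_scan(node_name: str) -> bool:
    """True if the plan node name looks like a Hive table scan (assumed prefix)."""
    return node_name.lower().startswith("scan hive")


def hive_read_format(node_desc: str) -> Optional[str]:
    """Return the read-format label when the node description names a
    supported SerDe class, otherwise None (format not supported)."""
    for serde_class, label in SUPPORTED_SERDES.items():
        if serde_class in node_desc:
            return label
    return None


# Illustrative node description; the exact text in a real event log may differ.
sample_desc = (
    "Scan hive default.people [id, name], "
    "HiveTableRelation [`default`.`people`, " + TEXT_SERDE + ", "
    "Data Cols: [id, name], Partition Cols: []]"
)
```

Under these assumptions, `hive_read_format(sample_desc)` returns `"HiveText"`, while a description that names a different SerDe (for example `org.apache.hadoop.hive.ql.io.orc.OrcSerde`) returns `None` and would be reported as unsupported.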

Signed-off-by: Ahmed Hussein (amahussein) <[email protected]>
@amahussein added the "feature request" (New feature or request) and "core_tools" (Scope the core module (scala)) labels on Jan 10, 2024
@amahussein requested a review from nartal1 on January 10, 2024 18:10
@amahussein self-assigned this on Jan 10, 2024
Collaborator

@nartal1 left a comment

Thanks @amahussein ! Overall LGTM. Just a few nits.

Signed-off-by: Ahmed Hussein (amahussein) <[email protected]>
Collaborator Author

@amahussein left a comment

Thanks @nartal1

@amahussein requested a review from parthosa on January 11, 2024 15:39
@amahussein merged commit 63b3671 into NVIDIA:dev on Jan 11, 2024
13 checks passed
@amahussein deleted the spark-rapids-tools-681-b branch on January 11, 2024 16:12
@parthosa
Collaborator

Thanks @amahussein for this change. Could we have a demo or brief walkthrough for this as I am not very familiar with Hive and would like to know more?

Labels
core_tools: Scope the core module (scala)
feature request: New feature or request
Development

Successfully merging this pull request may close these issues.

[FEA] Qualification tool: Parse HiveTableScan in read format and investigate InsertIntoHiveTable
3 participants