Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore(sink): exclude java iceberg and deltalake sink from connector node #14277

Merged
merged 9 commits into from
Jan 2, 2024

Conversation

wenym1
Copy link
Contributor

@wenym1 wenym1 commented Dec 29, 2023

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

Exclude the module of java iceberg and deltalake sink from connector node.

The previous deltalake_rust will take over the current java version of delta lake and deltalake sink created with connector = 'deltalake' will use the rust implementation now.

The package size reduced from 407,850,813 to 119,393,608. Fix #13113.

Checklist

  • I have written necessary rustdoc comments
  • I have added necessary unit tests and integration tests
  • I have added test labels as necessary. See details.
  • I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features Sqlsmith: Sql feature generation #7934).
  • My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
  • All checks passed in ./risedev check (or alias, ./risedev c)
  • My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)
  • My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

  • My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

  1. The previous connector = 'iceberg_java' is deprecated, and users can only create sink with rust version of iceberg.
  2. The deltalake sink will use the rust version implementation instead.

@wenym1 wenym1 force-pushed the yiming/remove-java-iceberg-deltalake branch from 122641a to 904bffa Compare December 29, 2023 14:56
@wenym1 wenym1 force-pushed the yiming/remove-java-iceberg-deltalake branch from b0f3eee to 5397607 Compare December 29, 2023 15:31
Copy link
Contributor

@tabVersion tabVersion left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice to switch the connector to rust version.

connector = 'deltalake_rust',
connector = 'deltalake',
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it a user facing change?

Comment on lines -154 to -156
// iceberg with multiple parallelism will fail easily with concurrent commit
// on metadata
// TODO: reset iceberg sink to have multiple parallelism
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What's the parallelism requirement for iceberg rust version

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The comment was for previous iceberg sink. Now there is no singleton requirement on both iceberg sink and delta lake sink.

ci/scripts/connector-node-integration-test.sh Outdated Show resolved Hide resolved
@wenym1 wenym1 added the user-facing-changes Contains changes that are visible to users label Jan 2, 2024
Copy link
Contributor

@liurenjie1024 liurenjie1024 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. BTW, why we don't remove all codes rather just remove the packaging?

@wenym1
Copy link
Contributor Author

wenym1 commented Jan 2, 2024

LGTM. BTW, why we don't remove all codes rather just remove the packaging?

Just save for reference given that there is a simple way to exclude it from packaging. We can remove the code when it needs extra effort to maintain in the future.

@wenym1 wenym1 enabled auto-merge January 2, 2024 06:52
@wenym1 wenym1 added this pull request to the merge queue Jan 2, 2024
@liurenjie1024
Copy link
Contributor

LGTM. BTW, why we don't remove all codes rather just remove the packaging?

Just save for reference given that there is a simple way to exclude it from packaging. We can remove the code when it needs extra effort to maintain in the future.

How about also remove related tests?

Merged via the queue into main with commit b38da9c Jan 2, 2024
31 of 32 checks passed
@wenym1 wenym1 deleted the yiming/remove-java-iceberg-deltalake branch January 2, 2024 07:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

refactor: remove hadoop-aws dependency for connector node
5 participants