Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(dml): sent dml data from the same session to a fixed worker node/channel #14380

Merged
merged 7 commits into from
Jan 8, 2024

Conversation

chenzl25
Copy link
Contributor

@chenzl25 chenzl25 commented Jan 5, 2024

… and let group batch perserve transaction order

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

Checklist

  • I have written necessary rustdoc comments
  • I have added necessary unit tests and integration tests
  • I have added test labels as necessary. See details.
  • I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features Sqlsmith: Sql feature generation #7934).
  • My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
  • All checks passed in ./risedev check (or alias, ./risedev c)
  • My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)
  • My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

  • My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.

… and let group batch perserve transaction order
@chenzl25 chenzl25 marked this pull request as draft January 5, 2024 08:25
@chenzl25 chenzl25 marked this pull request as ready for review January 5, 2024 09:04
Comment on lines -99 to +100
.get((txn_id % len as u64) as usize)
.get((session_id % len as u32) as usize)
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Use session id instead of txn_id to choose channel so that we can preserve transaction order in the same session

Comment on lines 584 to 586
candidates.choose(&mut rand::thread_rng()).unwrap().clone()
candidates
.get(stage.session_id.0 as usize % candidates.len())
.context("no available worker node for dml")?
.clone()
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Choose a worker based on session id.

Comment on lines 709 to 713
return Ok(Some(
candidates
.get(self.stage.session_id.0 as usize % candidates.len())
.context("no available worker node for dml")?
.clone(),
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Choose a worker based on session id.

Comment on lines +196 to +208
if !batch_group.is_empty() {
let vec = mem::take(&mut batch_group);
for chunk in vec {
for (op, row) in chunk.rows() {
if let Some(chunk) = builder.append_row(op, row) {
yield Message::Chunk(chunk);
}
}
}
if let Some(chunk) = builder.take() {
yield Message::Chunk(chunk);
}
}
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Always preserve transaction orders even with batch group optimization

Copy link
Member

@xxchan xxchan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Generally LGTM

Copy link
Member

@BugenZhao BugenZhao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rest LGTM

src/frontend/src/scheduler/distributed/stage.rs Outdated Show resolved Hide resolved
src/frontend/src/scheduler/local.rs Outdated Show resolved Hide resolved
src/source/src/table.rs Show resolved Hide resolved
@chenzl25 chenzl25 requested review from BugenZhao and xxchan January 8, 2024 05:05
Copy link
Member

@BugenZhao BugenZhao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@chenzl25 chenzl25 added this pull request to the merge queue Jan 8, 2024
Merged via the queue into main with commit 512e884 Jan 8, 2024
30 of 31 checks passed
@chenzl25 chenzl25 deleted the dylan/preserve_transaction_order_in_the_same_session branch January 8, 2024 06:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

feat: sent dml data from the same session to a fixed worker node/channel
3 participants