feat(stream): support row merge (a.k.a keyed merge) #17930
Conversation
… use reverse iterator
Seems this PR has a lot of duplication with the 2-phase approx percentile one; what about trying Graphite to construct a PR stack?
4485c01 to 3d92790
It's rebased on top of that PR. I have already changed the target to make it easier for reviewers.
src/stream/src/executor/row_merge.rs
Outdated
AlignedMessage::Left(chunk) => {
    lhs_buffer = Some(chunk);
}
AlignedMessage::Right(chunk) => {
    rhs_buffer = Some(chunk);
}
This executor looks like a very special executor which accepts one chunk from each input per barrier. In my mind, it should be a general merge executor that can process any number of chunks from the input and needs to maintain a hash map to match rows from both sides.
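Roughly what I have in mind (a hypothetical sketch with made-up names, not a concrete proposal):

use std::collections::HashMap;

// Hypothetical sketch of the generalized "keyed merge": rows from each side are
// matched by key via hash maps, and a merged pair is emitted once both sides
// have arrived. All names here are for illustration only.
struct KeyedMergeState<K, R> {
    lhs_pending: HashMap<K, R>,
    rhs_pending: HashMap<K, R>,
}

impl<K: std::hash::Hash + Eq, R> KeyedMergeState<K, R> {
    // Returns a matched (lhs, rhs) pair if the other side has already arrived,
    // otherwise buffers the row and waits.
    fn on_lhs(&mut self, key: K, row: R) -> Option<(R, R)> {
        match self.rhs_pending.remove(&key) {
            Some(rhs) => Some((row, rhs)),
            None => {
                self.lhs_pending.insert(key, row);
                None
            }
        }
    }

    fn on_rhs(&mut self, key: K, row: R) -> Option<(R, R)> {
        match self.lhs_pending.remove(&key) {
            Some(lhs) => Some((lhs, row)),
            None => {
                self.rhs_pending.insert(key, row);
                None
            }
        }
    }
}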
I think it's similar to the difference between hash agg and simple agg. Once we decide to implement 'keyed merge', we have to consider whether the number of rows coming from both sides will exceed the memory budget, and then consider whether to introduce a separate state table or a spill mechanism. Therefore, introducing a simple implementation that assumes the cardinality of both sides of the relation is 1 lets us quickly support percentile aggregation.
Prefer a simpler executor compared to a more generalized one. We can generalize it if the need arises.
Got it. The row merge is the same as the simple agg, and the generalized one should be called keyed merge. BTW, can we add an assertion to ensure lhs_buffer and rhs_buffer are None before assigning?
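For example, on the existing Left arm (just a sketch of the idea, not final code):

AlignedMessage::Left(chunk) => {
    // Sketch of the suggested assertion: the slot must still be empty before
    // we overwrite it within this epoch.
    assert!(lhs_buffer.is_none(), "lhs_buffer should be None before assigning");
    lhs_buffer = Some(chunk);
}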
Hmm, I think the normal agg executor may output multiple chunks in one epoch. I will instead parse the chunks to ensure consistent operations.
Edit: Never mind, this will add more complexity. I will instead just buffer everything in one epoch and flush them.
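A minimal sketch of what I mean by buffering per epoch (placeholder types and names, not the actual executor code):

// Sketch of buffering within one epoch and flushing on barrier; `Chunk` is a
// placeholder, not the real stream chunk type, and the structure is illustrative.
struct EpochBuffer<Chunk> {
    lhs: Vec<Chunk>,
    rhs: Vec<Chunk>,
}

impl<Chunk> EpochBuffer<Chunk> {
    fn push_lhs(&mut self, chunk: Chunk) {
        self.lhs.push(chunk);
    }

    fn push_rhs(&mut self, chunk: Chunk) {
        self.rhs.push(chunk);
    }

    // On barrier: pair up the chunks buffered in this epoch, hand each pair to
    // the merge routine, and clear the buffers for the next epoch.
    fn flush(&mut self, mut merge: impl FnMut(Chunk, Chunk)) {
        for (lhs, rhs) in self.lhs.drain(..).zip(self.rhs.drain(..)) {
            merge(lhs, rhs);
        }
    }
}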
Done in 677edab
src/stream/src/executor/row_merge.rs
Outdated
if !(1..=2).contains(&lhs_chunk.cardinality()) {
    bail!("lhs chunk cardinality should be 1 or 2");
}
if !(1..=2).contains(&rhs_chunk.cardinality()) {
    bail!("rhs chunk cardinality should be 1 or 2");
}
if lhs_chunk.cardinality() != rhs_chunk.cardinality() {
    bail!("lhs and rhs chunk cardinality should be the same");
}
Is this assumption a bit too strong? What we know at this operator is that, logically, the relations it inputs have only one row each, but we do not assume that only one chunk of changes will come during a barrier period, or that this chunk contains only one or two rows. Can we still parse each row of every incoming chunk when we receive it to get the current values on both sides? This way, we can handle chunks with ops like -+-+-+.
The semantics of row merge are such that each input chunk should contain only one operation, e.g. an update, insert, or delete.
An update will be a chunk with cardinality 2.
An insert will be a chunk with cardinality 1.
It's specific to simple normal agg and simple approx percentile agg, where the output cardinality of both sides should be at most 1 (insert or delete) or 2 (update).
We can generalize it later if needed.
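A minimal sketch of the merge step under this assumption (placeholder types; the row-by-row op agreement is my assumption here, not something enforced elsewhere):

// Sketch: within one epoch each side produces at most one logical change, so
// the chunks line up row by row (1 row for insert/delete, 2 rows for update),
// and the output row is the lhs columns followed by the rhs columns.
#[derive(Clone, Copy, PartialEq, Debug)]
enum Op { Insert, Delete, UpdateDelete, UpdateInsert }

fn merge_rows(
    lhs: &[(Op, Vec<i64>)],
    rhs: &[(Op, Vec<i64>)],
) -> Vec<(Op, Vec<i64>)> {
    assert_eq!(lhs.len(), rhs.len(), "both sides must have the same cardinality");
    lhs.iter()
        .zip(rhs)
        .map(|((op_l, row_l), (op_r, row_r))| {
            assert_eq!(op_l, op_r, "ops must agree row by row");
            let mut merged = row_l.clone();
            merged.extend_from_slice(row_r);
            (*op_l, merged)
        })
        .collect()
}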
Oh, I got what you mean. We can't simply overwrite the chunks. I thought you meant one chunk with 3 ops inside; I guess you mean 3 different update chunks.
Handled by 677edab
305c48d to 51cdb17
src/stream/src/executor/row_merge.rs
Outdated
self.ctx.id,
self.ctx.fragment_id,
self.ctx.streaming_metrics.clone(),
"Join",
Should be RowMerge?
Handled: ebf0545
src/stream/src/executor/row_merge.rs
Outdated
AlignedMessage::WatermarkLeft(watermark) => {
    yield Message::Watermark(watermark);
}
AlignedMessage::WatermarkRight(watermark) => {
    yield Message::Watermark(watermark);
}
Since we buffer the chunks, we can't emit watermark messages that bypass the buffered chunks; otherwise, the watermark guarantee cannot be maintained. It seems necessary to buffer the watermark messages as well. Refer to this PR for a solution to this issue.
I don't think we should see watermarks at all, actually. Agg with approx percentile should not propagate watermarks. I suggest just absorbing and ignoring any watermark, leaving a tracing::warn if we see unexpected watermarks.
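Something along these lines on top of the existing watermark arms (just a sketch, assuming the tracing crate is available here):

AlignedMessage::WatermarkLeft(_) | AlignedMessage::WatermarkRight(_) => {
    // Sketch: absorb the watermark instead of forwarding it, and leave a
    // warning since we don't expect to see one here at all.
    tracing::warn!("unexpected watermark in row merge executor, ignoring");
}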
Handled: ebf0545
LGTM!
I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.
What's changed and what's your intention?
Subsequently, we will also need to modify the simple agg executor to always output, as long as there are some input chunks in the current epoch.
Checklist
./risedev check (or alias, ./risedev c)
Documentation
Release note
If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.