Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ch-benchmark q6 mv has consistently throughput after the source stopped producing data #18055

Closed
cyliu0 opened this issue Aug 15, 2024 · 5 comments · Fixed by #18374
Closed
Assignees
Labels
type/bug Something isn't working
Milestone

Comments

@cyliu0 cyliu0 added the type/bug Something isn't working label Aug 15, 2024
@github-actions github-actions bot added this to the release-2.0 milestone Aug 15, 2024
@cyliu0
Copy link
Collaborator Author

cyliu0 commented Aug 16, 2024

@StrikeW
Copy link
Contributor

StrikeW commented Aug 20, 2024

@KeXiangWang could you help to take a look? The problem is q6 has continuous throughput (2 rows/sec) even the cdc source has finished loading all data from upstream, so that the test job cannot finish. The test script expects all mv throughput decrease to 0.

@KeXiangWang KeXiangWang assigned KeXiangWang and unassigned StrikeW Aug 20, 2024
@cyliu0
Copy link
Collaborator Author

cyliu0 commented Aug 30, 2024

This bug still exists in the upgrade test for v2.0.0-rt.2 https://buildkite.com/risingwave-test/upgrade/builds/59#0191a228-107d-4a5f-9305-ca32872835b0
image

@lmatz
Copy link
Contributor

lmatz commented Sep 2, 2024

is the problem solved by #18307 and #18303

close the issue now, we reopen if wrong

@lmatz lmatz closed this as completed Sep 2, 2024
@KeXiangWang KeXiangWang reopened this Sep 2, 2024
@KeXiangWang
Copy link
Contributor

KeXiangWang commented Sep 2, 2024

I have figured out the root cause. Starts from 17945, RW outputs noop rows every barrier in some cases. The fix is 18292, which has been included in v2.0.0-rc.2. So if you run ch-benchmark-pg-cdc q1-12 daily test directly with v2.0.0-rc.2, it will succeed.

However, the problem is fixed by adding noop_update_hint: true to the physical graph. As a result, if user create a MV with an older version (< v2.0.0), the physical graph would be the old version without noop_update_hint: true. When the user upgrade the cluster to v2.0.0, it would output noop rows because of the old version physical graph following the changes in 17945. cc @stdrc @kwannoel any ideas to fix this issue? Maybe it's also fine to keep it, as the extra rows are just noop actually.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type/bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants