fix: Correctly handle the Parallelism inference under the DefaultParallelism configuration. #15543

shanicky · 2024-03-08T06:41:25Z

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

This PR correctly handles the compatibility between default parallelism and table parallelism.

| Table parallelism | DefaultParallelism::Full | DefaultParallelism::Default(n) |
| --- | --- | --- |
| Adaptive | Adaptive, all | Fixed, n |
| Fixed(m) | Fixed(m), m | Fixed, m |
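
A minimal sketch of how the table above could be read as a function. The enum definitions, the `resolve` name, and the `available` parameter (total parallel units in the cluster) are assumptions for illustration, not the actual RisingWave types. It also assumes that "Fixed, n" means `Fixed(n)` running with n units.

```rust
/// Hypothetical stand-ins for the types named in the table; the real
/// definitions in RisingWave may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum DefaultParallelism {
    Full,
    Default(usize),
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TableParallelism {
    Adaptive,
    Fixed(usize),
}

/// Returns the resulting table parallelism and the number of parallel units
/// used, following the table. `available` is an assumed input: the total
/// number of parallel units in the cluster.
fn resolve(
    default: DefaultParallelism,
    table: TableParallelism,
    available: usize,
) -> (TableParallelism, usize) {
    match (table, default) {
        // Adaptive under Full: stay adaptive and use every available unit.
        (TableParallelism::Adaptive, DefaultParallelism::Full) => {
            (TableParallelism::Adaptive, available)
        }
        // Adaptive under Default(n): pinned to Fixed with n units.
        (TableParallelism::Adaptive, DefaultParallelism::Default(n)) => {
            (TableParallelism::Fixed(n), n)
        }
        // An explicit Fixed(m) is kept under either default.
        (TableParallelism::Fixed(m), _) => (TableParallelism::Fixed(m), m),
    }
}
```

For example, `resolve(DefaultParallelism::Default(4), TableParallelism::Adaptive, 16)` yields `(TableParallelism::Fixed(4), 4)` in this sketch. Clamping to the available units is deliberately left out.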

During restarts, the recovery logic also adjusts parallelism automatically to address the impact of older versions. For instance, when the default parallelism is set to n, jobs created by older versions may still be in Adaptive mode. On startup, this PR checks whether the actual parallelism is n and, if so, updates it to Fixed(n).
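
The startup check described above might look roughly like the following. This is a sketch under assumptions: the types and the function name `derive_on_recovery_sketch` are hypothetical, and it covers only the Adaptive-to-Fixed(n) case, not the full recovery logic in `src/meta/src/barrier/recovery.rs`.

```rust
// Hypothetical stand-ins; the real RisingWave types may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum DefaultParallelism {
    Full,
    Default(usize),
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TableParallelism {
    Adaptive,
    Fixed(usize),
}

/// Decides what a job's table parallelism should be after a restart.
/// `assigned` is the value stored for the job, and
/// `actual_fragment_parallelism` is what its fragments currently run with.
fn derive_on_recovery_sketch(
    default: DefaultParallelism,
    assigned: TableParallelism,
    actual_fragment_parallelism: usize,
) -> TableParallelism {
    match (assigned, default) {
        // Stored as Adaptive by an older version, but the config asks for n
        // and the job already runs with n: downgrade to Fixed(n).
        (TableParallelism::Adaptive, DefaultParallelism::Default(n))
            if actual_fragment_parallelism == n =>
        {
            TableParallelism::Fixed(n)
        }
        // Everything else keeps its stored setting in this sketch.
        (other, _) => other,
    }
}
```

With a default of `Default(4)`, an Adaptive job that is actually running with 4 units becomes `Fixed(4)`, while an Adaptive job running with 8 units is left as Adaptive.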

Checklist

I have written necessary rustdoc comments
I have added necessary unit tests and integration tests
I have added test labels as necessary. See details.
I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features; see "Sqlsmith: Sql feature generation" #7934.)
My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
All checks passed in ./risedev check (or alias, ./risedev c)
My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)

My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.

Signed-off-by: Shanicky Chen <[email protected]>

yezizp2012

LGTM, @neverchanje PTAL.

neverchanje · 2024-03-08T07:34:51Z

| DefaultParallelism::Full | DefaultParallelism::Default(n) |
| --- | --- |
| Adaptive | Fixed(n) for creating MV, while remaining Adaptive for existing MV |
| Fixed(m) | Fixed(m) |

If a user explicitly alters an MV to adaptive while default_parallelism is preconfigured as n, the parallelism should be changed to adaptive.

neverchanje · 2024-03-08T08:02:57Z

src/meta/src/barrier/recovery.rs

+    // If it's lower, we'll set it to Fixed.
+    // If it was previously set to Adaptive, but the default_parallelism in the configuration isn’t Full,
+    // and it matches the actual fragment parallelism, in this case, it will be handled by downgrading to Fixed.
+    fn derive_target_parallelism(


Is this function only called when an MV is being created? alter mv and create mv are two different handling paths. Only create mv should evaluate default_parallelism; alter mv should not.

The logic is pretty simple from my understanding.

    // Let's divide the handling into two phases:
    fn create_mv() {
        // Handle `default_parallelism` first.
        let mut parallelism = match config.default_parallelism {
            Some(n) => Fixed(n),
            None => Adaptive,
        };
        // Evaluate the session variable "streaming_parallelism".
        if let Some(p) = session_variable.streaming_parallelism {
            parallelism = p;
        }
    }

    fn alter_mv(p: TableParallelism) {
        // `p` will be the target parallelism, no matter what value `default_parallelism` is.
    }
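
For reference, a self-contained version of the two paths above that compiles on its own. The `Config` and `Session` structs and the function signatures are hypothetical, introduced only to make the pseudocode runnable. `None` for `default_parallelism` stands in for `DefaultParallelism::Full`.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TableParallelism {
    Adaptive,
    Fixed(usize),
}

/// Hypothetical config; `None` plays the role of `DefaultParallelism::Full`.
struct Config {
    default_parallelism: Option<usize>,
}

/// Hypothetical session state holding the `streaming_parallelism` variable.
struct Session {
    streaming_parallelism: Option<TableParallelism>,
}

/// Create path: start from the configured default, then let the session
/// variable override it when it is set.
fn create_mv(config: &Config, session: &Session) -> TableParallelism {
    let from_config = match config.default_parallelism {
        Some(n) => TableParallelism::Fixed(n),
        None => TableParallelism::Adaptive,
    };
    session.streaming_parallelism.unwrap_or(from_config)
}

/// Alter path: the requested target is used as-is; the config is accepted
/// only to show that it is ignored.
fn alter_mv(_config: &Config, target: TableParallelism) -> TableParallelism {
    target
}
```

Under these assumptions, altering an MV to Adaptive with `default_parallelism` set to 4 still returns Adaptive, which is the behavior requested in the review.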

This change handles offline scaling during startup recovery. It addresses the situation in version 1.7 where jobs newly created under default parallelism may have been given an adaptive policy, and it aims to prevent a mass scale-out immediately after going live.

new creations using default parallelism

It sounds good.

I am not familiar with the code, so I cannot tell if it's correct. Maybe you can manually test it and make sure:

alter mv will apply regardless of default_parallelism.

Restarting the cluster will not change the parallelism of existing materialized views back to default_parallelism.


shanicky added 3 commits March 8, 2024 13:41

tmp

69b4aca

Signed-off-by: Shanicky Chen <[email protected]>

Updated DefaultParallelism, added derive_target_parallelism

58dc774

Change default_parallelism from Default(4) to Full in config

4559350

shanicky requested review from yezizp2012 and neverchanje March 8, 2024 06:41

github-actions bot added the type/fix Bug fix label Mar 8, 2024

neverchanje requested a review from hzxa21 March 8, 2024 06:45

Update barrier recov. logic comments

724651c

shanicky added need-cherry-pick-release-1.7 labels Mar 8, 2024

yezizp2012 approved these changes Mar 8, 2024


neverchanje reviewed Mar 8, 2024


shanicky added this pull request to the merge queue Mar 8, 2024

Merged via the queue into main with commit e0f305e Mar 8, 2024
28 of 29 checks passed

shanicky deleted the peng/fix-default-as-fixed branch March 8, 2024 10:12

shanicky added a commit that referenced this pull request Mar 8, 2024

fix: Correctly handle the Parallelism inference under the DefaultPara…

cd2a040

…llelism configuration. (#15543) Signed-off-by: Shanicky Chen <[email protected]>

shanicky added a commit that referenced this pull request Mar 8, 2024

fix: Correctly handle the Parallelism inference under the DefaultPara…

7cfc7e9

…llelism configuration. (#15543) Signed-off-by: Shanicky Chen <[email protected]>

shanicky added a commit that referenced this pull request Mar 8, 2024

fix: Correctly handle the Parallelism inference under the DefaultPara…

cc6d10f

…llelism configuration. (#15543) Signed-off-by: Shanicky Chen <[email protected]>

shanicky added a commit that referenced this pull request Mar 8, 2024

fix: Correctly handle the Parallelism inference under the DefaultPara…

2b7e18f

…llelism configuration. (#15543) (#15568) Signed-off-by: Shanicky Chen <[email protected]>
