
feat(storage): only update related read version on flush finish #16725

Merged · 5 commits · May 16, 2024

Conversation

@wenym1 (Contributor) commented May 13, 2024

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

Currently in the event handler, when a flush task finishes, we update every instance of the read version, even the instances whose imms are not included in the flush task. In this PR, we instead store a mapping from instance id to its included imms in the flush task output, so that we know which instances should be updated and avoid updating all of them, as sketched below.
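
A minimal sketch of the idea, with hypothetical names (`FlushTaskOutput`, `imms_by_instance`, and `remove_flushed_imms` are illustrative, not the actual RisingWave definitions):

```rust
use std::collections::HashMap;

type LocalInstanceId = u64;
type ImmId = u64;

/// Stand-in for a read version instance; the real type is more involved.
struct ReadVersion;

impl ReadVersion {
    fn remove_flushed_imms(&mut self, _imm_ids: &[ImmId]) {
        // Elided here; the simplified removal check is sketched below.
    }
}

/// Output of a finished flush task.
struct FlushTaskOutput {
    /// Instance id -> ids of the imms this task flushed from that instance.
    imms_by_instance: HashMap<LocalInstanceId, Vec<ImmId>>,
}

fn on_flush_finish(
    output: &FlushTaskOutput,
    read_versions: &mut HashMap<LocalInstanceId, ReadVersion>,
) {
    // Visit only the instances that contributed imms to this task,
    // instead of iterating over every read version.
    for (instance_id, imm_ids) in &output.imms_by_instance {
        if let Some(read_version) = read_versions.get_mut(instance_id) {
            read_version.remove_flushed_imms(imm_ids);
        }
    }
}
```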

Besides, the algorithm to update each read version can be simplified. Previously, we had to run a time-consuming sanity check on every update. After this PR, since the input imms are separated per read version shard, updating a read version only requires checking that the input imms sit at the end of the imm queue, which is greatly simplified; see the sketch below.
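
A minimal sketch of the simplified check, assuming the imm queue keeps the newest imm at the front so that flushed (oldest) imms always sit at the back; the names are illustrative:

```rust
use std::collections::VecDeque;

type ImmId = u64;

struct ReadVersionShard {
    /// Imms ordered newest-first, so flushed imms are always a suffix.
    imm_queue: VecDeque<ImmId>,
}

impl ReadVersionShard {
    /// Instead of a full sanity scan, verify that the flushed imms are
    /// exactly the tail of the queue, then pop them off in one step.
    fn remove_flushed_imms(&mut self, flushed_newest_first: &[ImmId]) {
        let tail_start = self
            .imm_queue
            .len()
            .checked_sub(flushed_newest_first.len())
            .expect("flushed more imms than the queue holds");
        assert!(
            self.imm_queue
                .iter()
                .skip(tail_start)
                .eq(flushed_newest_first.iter()),
            "flushed imms must match the end of the imm queue",
        );
        self.imm_queue.truncate(tail_start);
    }
}
```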

The implementation of for_each_read_version is also modified. Previously, we would block waiting on one lock even when the locks of other read versions could already be acquired. In this PR, in the first round we only call try_write on each read version: if try_write succeeds, we do the work; otherwise, we push the instance id onto a queue. We then keep trying to acquire the write lock of the instance at the queue front for at most 10 milliseconds; if that fails, we move it to the queue end and continue with the next item. A sketch of this two-phase strategy follows.
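
A minimal sketch of the two-phase strategy, assuming the read versions are guarded by `parking_lot::RwLock` (whose `try_write` / `try_write_for` match the behavior described above); the function signature is illustrative:

```rust
use std::collections::VecDeque;
use std::sync::Arc;
use std::time::Duration;

use parking_lot::RwLock;

type LocalInstanceId = u64;

/// Applies `f` to every read version, visiting the uncontended ones
/// first instead of blocking on whichever lock happens to come first.
fn for_each_read_version<T>(
    read_versions: &[(LocalInstanceId, Arc<RwLock<T>>)],
    mut f: impl FnMut(LocalInstanceId, &mut T),
) {
    let mut pending: VecDeque<usize> = VecDeque::new();

    // Round 1: non-blocking attempt on every read version; do the work
    // immediately where the lock is free, queue the rest.
    for (idx, (instance_id, lock)) in read_versions.iter().enumerate() {
        match lock.try_write() {
            Some(mut guard) => f(*instance_id, &mut *guard),
            None => pending.push_back(idx),
        }
    }

    // Round 2: wait at most 10 ms for the instance at the queue front;
    // on timeout, rotate it to the back and try the next one.
    while let Some(idx) = pending.pop_front() {
        let (instance_id, lock) = &read_versions[idx];
        match lock.try_write_for(Duration::from_millis(10)) {
            Some(mut guard) => f(*instance_id, &mut *guard),
            None => pending.push_back(idx),
        }
    }
}
```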

The merge imm code is also removed in this PR, because supporting merged imms would require maintaining the relative order of normal imms and merged imms, which unnecessarily increases the complexity.

Checklist

  • I have written necessary rustdoc comments
  • I have added necessary unit tests and integration tests
  • I have added test labels as necessary. See details.
  • I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features; see Sqlsmith: Sql feature generation #7934.)
  • My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
  • All checks passed in ./risedev check (or alias, ./risedev c)
  • My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)
  • My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

  • My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.

@StrikeW (Contributor) commented May 15, 2024

> The merge imm code is also removed in this PR, because supporting merged imms would require maintaining the relative order of normal imms and merged imms, which unnecessarily increases the complexity.

IIRC we set checkpoint_interval to 1 by default, but when a memtable spill occurs, a state table can still generate multiple imms within a checkpoint epoch. Do you mean we don't need an optimization for the multiple-imm case, or is there another optimization that can handle it?

@wenym1 (Contributor, Author) commented May 15, 2024

> > The merge imm code is also removed in this PR, because supporting merged imms would require maintaining the relative order of normal imms and merged imms, which unnecessarily increases the complexity.
>
> IIRC we set checkpoint_interval to 1 by default, but when a memtable spill occurs, a state table can still generate multiple imms within a checkpoint epoch. Do you mean we don't need an optimization for the multiple-imm case, or is there another optimization that can handle it?

When a spill happens, it is likely to trigger a shared buffer compaction that merges multiple imms into SSTs, which serves a similar purpose to merge imm.

@Li0k (Contributor) left a comment

Remember to deprecate the config related to "merge imm". Rest LGTM.
