perf: table scan with too many delete tombstones is slow #12903

Open

BugenZhao opened this issue Oct 17, 2023 · 6 comments
BugenZhao (Member) commented Oct 17, 2023

To resolve the backfill live-lock described in #12680, we ensured in #12780 that backfill makes progress before getting canceled. However, the underlying performance issue is still not resolved. This may affect scan performance on tables with frequent deletes, like materialized views cleaned up by the temporal filter.

If I understand it correctly, all full scans on such tables will be affected, right? If we make specific fixes or improvements to Backfill, users may still find it confusing that batch scanning a temporal-filtered table is slow. This could be another reason to consider fixing it in a more general way...

Originally posted by @BugenZhao in #12680 (comment)

This can be easily reproduced:

dev=> create table t (v int);
CREATE_TABLE

Time: 27.679 ms
dev=> select count(*) from t;
 count 
-------
     0
(1 row)

Time: 4.860 ms

dev=> insert into t select generate_series(1, 10000000);
INSERT 0 10000000
Time: 40462.840 ms (00:40.463)
dev=> delete from t;
DELETE 10000000
Time: 12125.790 ms (00:12.126)

dev=> select count(*) from t;
 count 
-------
     0
(1 row)

Time: 2036.083 ms (00:02.036)

As expected, nearly all time is spent on rewinding the merge iterator.

[profiling screenshot: nearly all scan time is spent rewinding the merge iterator]
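To make the cause concrete, here is a minimal, self-contained Rust sketch (not RisingWave's actual iterator code; `UserIterator` and all other names are illustrative) of why the scan is slow: the user-facing iterator has to step over every delete tombstone before it can yield the next visible row, so a `count(*)` that returns 0 or 1 rows still visits millions of entries.

```rust
// Illustrative sketch only: a merged, key-sorted view of a table's entries,
// where deleted rows are represented by tombstones that must still be iterated.

#[derive(Clone)]
enum Value {
    Put(String),
    Delete, // tombstone left behind by `DELETE FROM t`
}

struct UserIterator {
    // Entries already merged and sorted by key, newest version wins.
    entries: Vec<(u64, Value)>,
    pos: usize,
}

impl UserIterator {
    fn new(entries: Vec<(u64, Value)>) -> Self {
        Self { entries, pos: 0 }
    }

    /// Returns the next *visible* row. Even if the answer is "no rows",
    /// this walks over every tombstone first, which is O(#tombstones).
    fn next_visible(&mut self) -> Option<(u64, String)> {
        while self.pos < self.entries.len() {
            let (key, value) = self.entries[self.pos].clone();
            self.pos += 1;
            match value {
                Value::Put(v) => return Some((key, v)),
                Value::Delete => continue, // skip tombstone, keep scanning
            }
        }
        None
    }
}

fn main() {
    // One million tombstones (the repro above deleted ten million rows),
    // followed by a single live row. Finding that row requires stepping
    // over every tombstone first.
    let mut entries: Vec<(u64, Value)> =
        (0..1_000_000u64).map(|k| (k, Value::Delete)).collect();
    entries.push((1_000_000, Value::Put("live".to_string())));

    let mut iter = UserIterator::new(entries);
    let mut visible = 0;
    while iter.next_visible().is_some() {
        visible += 1;
    }
    println!("visible rows: {visible}"); // prints 1
}
```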

Ideas given by @hzxa21:

Some ideas:

  • Storage exposes an "internal key" iterator to backfill so that backfill can use the tombstone key as the backfilled position. This ensures that backfill can make progress even when no data is returned.
  • Maintain table statistics to guide backfill. A simple statistic is the per-table min/max key. Backfill can use vnode | min_key as the start_key when it begins. This can solve a subset of the issues when the upstream table performs sequential deletes (e.g. TTL). (A sketch of this idea follows the list.)
  • Tune compaction to be more aggressive when the SST delete ratio is high. This can improve the situation but cannot solve it completely, since in order for compaction to clean up a tombstone key completely, a full compaction is needed. When the data volume is high, full compaction may not finish quickly even if compaction is triggered aggressively.
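As referenced in the second item, here is a hedged sketch of using a per-vnode min-key statistic as the backfill start key. All names (`TableStats`, `backfill_start_key`, the `VirtualNode` alias) are illustrative assumptions, not RisingWave's actual API:

```rust
use std::collections::HashMap;

type VirtualNode = u16;

/// Hypothetical per-table statistics maintained by the storage layer.
struct TableStats {
    /// Smallest live (non-deleted) user key observed for each vnode.
    min_key_per_vnode: HashMap<VirtualNode, Vec<u8>>,
}

impl TableStats {
    /// Compute the start key for a backfill scan on one vnode:
    /// `vnode (big-endian) | min_key`. If the min key is known, the scan can
    /// skip the prefix of tombstoned keys left behind by sequential deletes
    /// (e.g. TTL / temporal-filter cleanup).
    fn backfill_start_key(&self, vnode: VirtualNode) -> Vec<u8> {
        let mut key = vnode.to_be_bytes().to_vec();
        if let Some(min_key) = self.min_key_per_vnode.get(&vnode) {
            key.extend_from_slice(min_key);
        }
        // If no statistic is available, fall back to the vnode prefix alone,
        // i.e. scan the whole vnode as before.
        key
    }
}

fn main() {
    let stats = TableStats {
        min_key_per_vnode: HashMap::from([(3u16, vec![0x00, 0x98, 0x96, 0x81])]),
    };
    println!("{:02x?}", stats.backfill_start_key(3));
    println!("{:02x?}", stats.backfill_start_key(7)); // no stats: vnode prefix only
}
```

Since the fallback is simply the plain vnode prefix, this optimization would be purely additive and only helps when deletes are sequential.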
kwannoel (Contributor) commented Oct 17, 2023

Maintain table statistics to guide backfill. A simple statistic is the per-table min/max key. Backfill can use vnode | min_key as the start_key when it begins. This can solve a subset of the issues when the upstream table performs sequential deletes (e.g. TTL).

Is this generalizable to some sort of cache, such that we can handle tombstone ranges?
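One possible shape for such a cache, purely as an illustrative assumption (`TombstoneRangeCache` is not an existing RisingWave structure): keep non-overlapping key ranges known to contain only tombstones, and let the scan seek past a whole range instead of iterating through it.

```rust
/// Non-overlapping, sorted key ranges `[start, end)` known to be fully tombstoned.
struct TombstoneRangeCache {
    ranges: Vec<(Vec<u8>, Vec<u8>)>,
}

impl TombstoneRangeCache {
    /// If `key` falls inside a cached tombstone range, return the end of that
    /// range so the iterator can seek directly past it instead of stepping
    /// over every tombstone.
    fn skip_to(&self, key: &[u8]) -> Option<&[u8]> {
        self.ranges
            .iter()
            .find(|(start, end)| key >= start.as_slice() && key < end.as_slice())
            .map(|(_, end)| end.as_slice())
    }
}

fn main() {
    let cache = TombstoneRangeCache {
        ranges: vec![(vec![0x00], vec![0x80])],
    };
    // A scan positioned at 0x10 can jump straight to 0x80.
    assert_eq!(cache.skip_to(&[0x10]), Some(&[0x80u8][..]));
    // Keys outside the cached ranges are scanned normally.
    assert_eq!(cache.skip_to(&[0x90]), None);
    println!("ok");
}
```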

Tune compaction to be more aggressive when the SST delete ratio is high. This can improve the situation but cannot solve it completely, since in order for compaction to clean up a tombstone key completely, a full compaction is needed. When the data volume is high, full compaction may not finish quickly even if compaction is triggered aggressively.

I think @Li0k has worked on this.

kwannoel (Contributor) commented
Tune compaction to be more aggressive when the SST delete ratio is high. This can improve the situation but cannot solve it completely, since in order for compaction to clean up a tombstone key completely, a full compaction is needed. When the data volume is high, full compaction may not finish quickly even if compaction is triggered aggressively.

Can I confirm again, @Li0k? This is addressed by your PRs, right?

Li0k (Contributor) commented Oct 20, 2023

Tune compaction to be more aggressive when the SST delete ratio is high. This can improve the situation but cannot solve it completely, since in order for compaction to clean up a tombstone key completely, a full compaction is needed. When the data volume is high, full compaction may not finish quickly even if compaction is triggered aggressively.

Can I confirm again, @Li0k? This is addressed by your PRs, right?

Yeah, but it cannot solve the problem completely. If the tombstones are affecting performance, we can consider adjusting the frequency of the tombstone picker triggers.
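For illustration only, a rough sketch of what a delete-ratio-based trigger could look like; the struct, function, and threshold below are assumptions, not RisingWave's actual compaction picker or configuration:

```rust
/// Hypothetical per-SST summary exposed to a compaction picker.
struct SstSummary {
    total_entries: u64,
    delete_entries: u64,
}

/// Decide whether an SST should be picked for a tombstone-reclaiming compaction.
/// A lower `delete_ratio_threshold` means more frequent (more aggressive) picks.
fn should_pick_for_compaction(sst: &SstSummary, delete_ratio_threshold: f64) -> bool {
    if sst.total_entries == 0 {
        return false;
    }
    let delete_ratio = sst.delete_entries as f64 / sst.total_entries as f64;
    delete_ratio >= delete_ratio_threshold
}

fn main() {
    // After `DELETE FROM t`, the delete ratio is 1.0, so even a conservative
    // threshold picks this SST for compaction.
    let sst = SstSummary { total_entries: 10_000_000, delete_entries: 10_000_000 };
    println!("picked: {}", should_pick_for_compaction(&sst, 0.5));
}
```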

kwannoel (Contributor) commented
since in order for compaction to clean up a tombstone key completely, a full compaction is needed. When the data volume is high, full compaction may not finish quickly even if compaction is triggered aggressively.

Discussed further offline with @Li0k; this part is still under discussion.

Tune compaction to be more aggressive when the SST delete ratio is high.

This part is done: compaction will occur at 10-minute intervals. See #12776.

kwannoel (Contributor) commented
For testing compaction: https://github.com/risingwavelabs/kube-bench/issues/306

This issue has been open for 60 days with no activity. Could you please update the status? Feel free to continue discussion or close as not planned.

hzxa21 removed this from the release-1.7 milestone Oct 8, 2024