Observable write stalls on high load #3085

v0y4g3r · 2024-01-03T07:36:09Z

What type of bug is this?

Performance issue

What subsystems are affected?

Datanode

Minimal reproduce step

This can be reproduced with the TSBS benchmark suite.

What did you expect to see?

There should be no peaks and valleys in requests handled per second.

What did you see instead?

When running the benchmark on disks with average performance (such as AWS gp2/gp3 volumes with no extra IOPS budget), we observe noticeable write stalls in the metric mito_write_rows_total.

The stalls correlate in time with fsync operations in the WAL. Every time the WAL rotates, it allocates a new log file and fsyncs the previous log file to ensure durability, which causes high I/O utilization.

Some methods to mitigate these stalls:

Break the large fsync into more frequent, smaller fsyncs to amortize the cost.
Enable log recycling to reuse obsolete log files.
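
To illustrate the first mitigation, here is a minimal sketch of amortized syncing. It is not the GreptimeDB or raft-engine implementation. The function name `write_with_periodic_sync` is hypothetical, and the `bytes_per_sync` parameter name is only borrowed from the title of the raft-engine PoC referenced in this issue. The idea is that fsync runs whenever enough unsynced bytes have accumulated, so the sync at rotation time only has to flush a small tail.

```python
import os
import tempfile


def write_with_periodic_sync(path, chunks, bytes_per_sync):
    """Append chunks to path, calling fsync whenever at least
    bytes_per_sync unsynced bytes have accumulated.

    Returns the number of fsync calls issued (hypothetical helper,
    for illustration only).
    """
    sync_calls = 0
    unsynced = 0
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_APPEND, 0o644)
    try:
        for chunk in chunks:
            view = memoryview(chunk)
            # os.write may write fewer bytes than requested, so loop.
            while view:
                written = os.write(fd, view)
                view = view[written:]
                unsynced += written
            if unsynced >= bytes_per_sync:
                os.fsync(fd)  # small, frequent sync
                sync_calls += 1
                unsynced = 0
        if unsynced:
            os.fsync(fd)  # final sync covers only the remaining tail
            sync_calls += 1
    finally:
        os.close(fd)
    return sync_calls


with tempfile.TemporaryDirectory() as tmp:
    log_path = os.path.join(tmp, "wal.log")
    chunks = [b"x" * 4096] * 64  # 256 KiB of fake log records
    calls = write_with_periodic_sync(log_path, chunks, bytes_per_sync=64 * 1024)
    print(calls, os.path.getsize(log_path))
```

A smaller `bytes_per_sync` spreads the I/O more evenly but issues more fsync calls, so the value would need tuning against the disk's IOPS budget.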

What operating system did you use?

NA

v0y4g3r added the C-performance (Category Performance) and A-storage (Involves code in storage engines) labels Jan 3, 2024

v0y4g3r self-assigned this Jan 3, 2024

v0y4g3r mentioned this issue Jan 7, 2024

feat: add options to enable log recycle and periodical fsync #3114

Merged

3 tasks

v0y4g3r closed this as completed in #3114 Jan 9, 2024

This was referenced Jan 10, 2024

PoC bytes_per_sync tikv/raft-engine#346

Closed

Support asynchronously incrementally sync files to disk while they are being written tikv/raft-engine#348

Open
