Skip to content

There are too many small files in the delta-log directory in the collection #37438

Discussion options

You must be logged in to vote

The initial suggestion I have is to minimize the creation of excessive segments. It's likely that the frequent execution of #flush() calls or improperly configured import tasks have led to the generation of many small segments. You might consider modifying these operations to prevent the creation of additional small segments.

As for optimizing compaction, here are a couple of adjustments you could consider:

  1. Starting from version 2.4.14, setting datacoord.compaction.taskPrioritizer=mix will prioritize mix compactions over L0 compactions. Please be aware that this is an advanced configuration option, and it will block L0 compactions until the mix compactions are completed.
  2. Increasing the n…

Replies: 3 comments 12 replies

Comment options

You must be logged in to vote
1 reply
@AndrewOYLK
Comment options

Comment options

You must be logged in to vote
6 replies
@AndrewOYLK
Comment options

@AndrewOYLK
Comment options

@AndrewOYLK
Comment options

@xiaofan-luan
Comment options

@xiaofan-luan
Comment options

Comment options

You must be logged in to vote
5 replies
@AndrewOYLK
Comment options

@AndrewOYLK
Comment options

@tedxu
Comment options

Answer selected by AndrewOYLK
@xiaofan-luan
Comment options

@xiaofan-luan
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
4 participants