-
Notifications
You must be signed in to change notification settings - Fork 3.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to completely remove overshadowed segments under var/druid/segments/ ? #17486
Comments
I found this API endpoint achieved exactly what I want: |
@nibinqtl , typically, you shouldn't need to call the API explicitly. |
@kfaraz Thanks, but I already have
in the configuration, which does not seems to do anything. |
I just checked the content under var/druid/segments/, all the ingested small segments since 2024-11-18 (the day after I ran the DELETE API) are still there. Are they supposed to be killed 3 days after being compacted according to my configuration? Is there any other factors that can prevent the killing from happening? |
And those segments are not in the "sys.segments" table. Maybe this is why the killing is not working? |
sys.segments only shows "used" segments. So it is expected that unused segments will not show up there.
Based on Druid configuration docs,
the
the To debug your particular problem, you can check if any |
Affected Version
I'm running the currently latest 31.0.0
Description
I repeatedly use the "local input source" to ingest records in json format at about 10,000 rows every 20 seconds.
Each ingest end up as a segment.
Then I use auto-compaction to combine them into larger segments of about 3M rows. It worked really well as shown on the console. In the segments list, I can only see the compacted large segments and the recent small segments not yet compacted.
However, I noticed that the old (overshadowed) small segments are still in the storage taking disk space. They are at:
var/druid/segments/
none of them are actually removed.
The directory var/druid/segments-cache/ seems to contain only the active segments and its size matches the size of the data source shown on the console.
How can I configure druid to actually remove those old unused overshadowed segments from disk?
The text was updated successfully, but these errors were encountered: