Commit
add apache license
317brian committed Oct 20, 2023
1 parent fabee9e commit 387fd2b
Showing 2 changed files with 20 additions and 0 deletions.
1 change: 1 addition & 0 deletions docs/data-management/compaction.md
@@ -22,6 +22,7 @@ description: "Defines compaction and automatic compaction (auto-compaction or au
~ specific language governing permissions and limitations
~ under the License.
-->

Query performance in Apache Druid depends on optimally sized segments. Compaction is one strategy you can use to optimize segment size for your Druid database. Compaction tasks read an existing set of segments for a given time interval and combine the data into a new "compacted" set of segments. In some cases the compacted segments are larger, but there are fewer of them. In other cases the compacted segments may be smaller. Compaction tends to increase performance because optimized segments require less per-segment processing and less memory overhead for ingestion and for querying paths.

## Compaction guidelines
19 changes: 19 additions & 0 deletions docs/data-management/manual-compaction.md
@@ -3,6 +3,25 @@ id: manual-compaction
title: "Manual compaction"
---

<!--
~ Licensed to the Apache Software Foundation (ASF) under one
~ or more contributor license agreements. See the NOTICE file
~ distributed with this work for additional information
~ regarding copyright ownership. The ASF licenses this file
~ to you under the Apache License, Version 2.0 (the
~ "License"); you may not use this file except in compliance
~ with the License. You may obtain a copy of the License at
~
~ http://www.apache.org/licenses/LICENSE-2.0
~
~ Unless required by applicable law or agreed to in writing,
~ software distributed under the License is distributed on an
~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
~ KIND, either express or implied. See the License for the
~ specific language governing permissions and limitations
~ under the License.
-->

In Apache Druid, compaction is a special type of ingestion task that reads data from a Druid datasource and writes it back into the same datasource. A common use case for this is to [optimally size segments](../operations/segment-optimization.md) after ingestion to improve query performance.

You can perform manual compaction where you submit a one-time compaction task for a specific interval. Generally, you don't need to do this if you use [automatic compaction](./automatic-compaction.md), which is recommended for most workloads.
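
The one-time task described above could be sketched as follows. This is a minimal illustration, not part of the commit: the datasource name `wikipedia`, the interval, and the helper `build_compaction_task` are all hypothetical, and the spec shows only the fields a basic compaction task is generally expected to carry. The resulting JSON would typically be posted to the Overlord task endpoint (`/druid/indexer/v1/task`); check the Druid version in use before relying on the exact shape.

```python
import json


def build_compaction_task(datasource: str, interval: str) -> dict:
    """Build a minimal one-time compaction task spec for a single interval.

    Both arguments are illustrative; `interval` is an ISO 8601 interval
    string such as "2020-01-01/2021-01-01".
    """
    return {
        "type": "compact",
        "dataSource": datasource,
        "ioConfig": {
            "type": "compact",
            # Compact every segment that falls inside this interval.
            "inputSpec": {"type": "interval", "interval": interval},
        },
    }


# Hypothetical datasource and interval, chosen only for the example.
task = build_compaction_task("wikipedia", "2020-01-01/2021-01-01")
payload = json.dumps(task, indent=2)
print(payload)
```

Building the spec as a plain dictionary keeps the sketch free of network calls, so the payload can be inspected before it is submitted anywhere.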