Range query optim #3194

fulmicoton · 2023-04-19T02:39:30Z

We have two ways to deal with range queries...

a DocSet that uses the scans the columnar and creates a buffer of the next valid docs. In its current version it is already quite sophisticated... The buffering window change size to try to adapt to different match ratio dynamically, and the buffering itself relies on SIMD. The multilinear codec probably hinders perf a lot however.
a filter at the collector level.

Experiment with the two solutions, and see if the filter solution outperforms the docset solution for most of the queries.
If it is the case, we can then work on the QueryAST (once #3148 has landed) to bubble up range queries and extract range queries as filters.

trinity-1686a · 2023-05-22T10:16:56Z

#3329 will add support for warming up a range of inverted index. This means there is a 3rd option, using classic tantivy RangeQuery over the inverted index. This is likely slower on large ranges, but likely faster on smaller ranges.

PSeitz · 2023-11-22T11:31:20Z

Related issue: quickwit-oss/tantivy#2266

PSeitz · 2024-10-28T07:25:38Z

Related issue: quickwit-oss/tantivy#2531

fulmicoton added the enhancement New feature or request label Apr 19, 2023

fulmicoton assigned PSeitz Apr 19, 2023

fulmicoton mentioned this issue May 3, 2023

Extract timestamp range to enable pruning #3256

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Range query optim #3194

Range query optim #3194

fulmicoton commented Apr 19, 2023

trinity-1686a commented May 22, 2023 •

edited

Loading

PSeitz commented Nov 22, 2023

PSeitz commented Oct 28, 2024

Range query optim #3194

Range query optim #3194

Comments

fulmicoton commented Apr 19, 2023

trinity-1686a commented May 22, 2023 • edited Loading

PSeitz commented Nov 22, 2023

PSeitz commented Oct 28, 2024

trinity-1686a commented May 22, 2023 •

edited

Loading