-
Notifications
You must be signed in to change notification settings - Fork 332
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix(mito): pruning for mito2 #2525
Conversation
…group size for each flush task.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 😅
Codecov Report
Additional details and impacted files@@ Coverage Diff @@
## develop #2525 +/- ##
===========================================
- Coverage 85.28% 84.96% -0.32%
===========================================
Files 730 733 +3
Lines 116093 117035 +942
===========================================
+ Hits 99009 99444 +435
- Misses 17084 17591 +507 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
* fix: pruning for mito2 * chore: refactor projection parameters; add some tests; customize row group size for each flush task. * chore: pass whole RegionFlushRequest --------- Co-authored-by: Lei, HUANG <[email protected]>
I hereby agree to the terms of the GreptimeDB CLA
What's changed and what's your intention?
This PR implements row group pruning for the new mito engine.
Previous implementation uses the schema of the region to find the index of a column, which is incorrect under the new SST format. Due to the restriction that the row group metadata only contains the min/max values of primary keys, not tags, the
ParquetReader
can only use the first tag in the primary key to prune row groups.Checklist
Refer to a related PR or issue link (optional)
fixes #2524