Fix error: invalid child of chunk append #7514

erimatnor · 2024-12-03T15:48:53Z

When doing startup and runtime chunk exclusion, the chunk append node could sometimes throw an error: "invalid child of chunk append: Sort". The error has been reported by users, but so far it has not been reproduced in a test.

However, it seems clear that the error happens in ts_chunk_append_get_scan_plan() when it can't find a "known" plan node to use for chunk exclusion. This function should ideally never throw an error and should instead just return NULL, in which case chunk append falls back to not doing any exclusion.

It is also possible to improve the code and make it properly handle Sort and Result nodes by not special-casing them. By inspecting ts_chunk_append_get_scan_plan(), it is clear that it can only throw the error if it encounters a Result node with a Sort child, because in those two cases it didn't descend into the lefttree child node using a recursive call. Therefore, remove the special case and instead do a recursive call, similar to how other nodes are handled. Also remove the special case for the vector agg node, since it should be OK to recursively process all CustomScan nodes.
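To make the intended behavior concrete, here is a minimal, self-contained sketch of the idea. It is not the TimescaleDB code: `PlanNode`, `NodeKind`, `find_scan_plan()` and `can_do_exclusion()` are hypothetical stand-ins for the PostgreSQL plan structs and for ts_chunk_append_get_scan_plan(). The sketch only illustrates the two points above: Sort and Result are handled by recursing into the lefttree child, and an unrecognized node yields NULL instead of an error.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified model of plan nodes (not the PostgreSQL structs). */
typedef enum
{
	NODE_SEQ_SCAN,
	NODE_INDEX_SCAN,
	NODE_SORT,
	NODE_RESULT,
	NODE_OTHER /* any node type the lookup does not know about */
} NodeKind;

typedef struct PlanNode
{
	NodeKind kind;
	struct PlanNode *lefttree; /* single child, or NULL for a leaf */
} PlanNode;

/*
 * Return the scan node usable for chunk exclusion, or NULL if none is
 * found. The function never raises an error.
 */
static const PlanNode *
find_scan_plan(const PlanNode *plan)
{
	if (plan == NULL)
		return NULL;

	switch (plan->kind)
	{
		case NODE_SEQ_SCAN:
		case NODE_INDEX_SCAN:
			/* Scans are leaves in this simplified model. */
			assert(plan->lefttree == NULL);
			return plan;
		case NODE_SORT:
		case NODE_RESULT:
			/* No special-casing: descend into the child like any other node. */
			return find_scan_plan(plan->lefttree);
		default:
			/* Unknown node: report "not found" rather than throwing. */
			return NULL;
	}
}

/* Caller-side fallback: without a scan node, exclusion is simply skipped. */
static int
can_do_exclusion(const PlanNode *child)
{
	return find_scan_plan(child) != NULL;
}
```

With this shape, a Result over a Sort over a scan resolves to the scan, while a subtree rooted at an unrecognized node makes `can_do_exclusion()` return 0, so the caller proceeds without exclusion.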

codecov · 2024-12-03T18:56:38Z

Codecov Report

Attention: Patch coverage is 25.00000% with 3 lines in your changes missing coverage. Please review.

Project coverage is 82.18%. Comparing base (59f50f2) to head (e9a8fda).
Report is 648 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/nodes/chunk_append/planner.c | 25.00% | 3 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #7514      +/-   ##
==========================================
+ Coverage   80.06%   82.18%   +2.11%     
==========================================
  Files         190      230      +40     
  Lines       37181    43230    +6049     
  Branches     9450    10875    +1425     
==========================================
+ Hits        29770    35527    +5757     
- Misses       2997     3378     +381     
+ Partials     4414     4325      -89

akuzm · 2024-12-04T11:20:29Z

src/nodes/chunk_append/planner.c

+			 * nodes with one child, recurse into the child to find a baserel
+			 * scan. It is not clear what a CustomScan with multiple children
+			 * would mean here so we don't handle it. */
+			if (list_length(custom->custom_plans) == 1)


This part is not needed for the fix? I'd leave it as is; I'm not sure that recursing into arbitrary custom scan nodes is always correct for the ChunkAppend optimization. Technically, this might even be a plan node from another extension.

erimatnor · 2024-12-10

Yes, it is not needed to avoid the error. But why would it not be correct? It is a recursive call that will just do nothing if it doesn't find a baserel plan node it recognizes for the chunk append purpose.

erimatnor · 2024-12-11

I removed it for now, but I don't think this special case is needed.
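For readers following the thread, the check under discussion can be sketched in isolation. This is a hypothetical model, not the patch itself: `Node`, `Kind` and `find_baserel_scan()` are made-up names, and `nchildren` stands in for `list_length(custom->custom_plans)`. It shows what "will just do nothing" means here: a custom scan is only descended into when it has exactly one child, and the lookup returns NULL whenever no recognized scan is found. It does not settle whether recursing into a custom scan from another extension is appropriate for the ChunkAppend optimization.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified node model used only for this illustration. */
typedef enum
{
	KIND_SCAN,        /* a baserel scan the lookup recognizes */
	KIND_CUSTOM_SCAN, /* a custom scan with a list of child plans */
	KIND_UNKNOWN      /* anything else */
} Kind;

typedef struct Node
{
	Kind kind;
	size_t nchildren;       /* stands in for list_length(custom->custom_plans) */
	struct Node **children; /* child plans, or NULL when nchildren == 0 */
} Node;

/* Return the recognized scan below node, or NULL if there is none. */
static const Node *
find_baserel_scan(const Node *node)
{
	if (node == NULL)
		return NULL;

	switch (node->kind)
	{
		case KIND_SCAN:
			return node;
		case KIND_CUSTOM_SCAN:
			/* Only the single-child case is handled; multiple children are not. */
			if (node->nchildren == 1)
			{
				assert(node->children != NULL);
				return find_baserel_scan(node->children[0]);
			}
			return NULL;
		default:
			return NULL;
	}
}
```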

@bharrisau

This release contains performance improvements and bug fixes since the 2.17.2 release. We recommend that you upgrade at the next available opportunity.

**Features**
* timescale#6901 Add hypertable support for transition tables
* timescale#7104 Hypercore table access method
* timescale#7271 Push down ORDER BY in real time continuous aggregate queries
* timescale#7295 Support ALTER TABLE SET ACCESS METHOD on hypertable
* timescale#7390 Disable custom hashagg planner code
* timescale#7411 Change parameter name to enable Hypercore TAM
* timescale#7412 Add GUC for hypercore_use_access_method default
* timescale#7413 Add GUC for segmentwise recompression
* timescale#7443 Add Hypercore function and view aliases
* timescale#7455 Support DROP NOT NULL on compressed hypertables
* timescale#7486 Prevent building against postgres versions with broken ABI

**Bugfixes**
* timescale#7378 Remove obsolete job referencing policy_job_error_retention
* timescale#7409 Update bgw job table when altering procedure
* timescale#7410 Fix "aggregated compressed column not found" error on aggregation query
* timescale#7426 Fix datetime parsing error in chunk constraint creation
* timescale#7432 Verify that heap tuple is valid before using
* timescale#7434 Fix segfault when internally setting the replica identity for a given chunk
* timescale#7488 Emit error for transition table trigger on chunks
* timescale#7514 Fix error: invalid child of chunk append

**Thanks**
* @bharrisau for reporting the segfault when creating chunks
* @pgloader for reporting an issue with an internal background job
* @uasiddiqi for reporting the "aggregated compressed column not found" error

erimatnor added the bug label Dec 3, 2024

erimatnor requested review from akuzm and svenklemm December 3, 2024 15:48

erimatnor force-pushed the fix-chunk-append-error branch 3 times, most recently from c21e648 to c3a271b Compare December 3, 2024 16:07

akuzm reviewed Dec 4, 2024

akuzm approved these changes Dec 4, 2024

fabriziomello approved these changes Dec 4, 2024

fabriziomello assigned erimatnor Dec 6, 2024

erimatnor force-pushed the fix-chunk-append-error branch from c3a271b to e9a8fda Compare December 11, 2024 09:34

erimatnor enabled auto-merge (rebase) December 11, 2024 09:35

erimatnor merged commit 38dbe3c into timescale:main Dec 11, 2024
48 of 49 checks passed

timescale-automation mentioned this pull request Dec 11, 2024

Backport to 2.17.x: #7514: Fix error: invalid child of chunk append #7526

Merged

timescale-automation added the backported-2.17.x label Dec 11, 2024

erimatnor deleted the fix-chunk-append-error branch December 11, 2024 14:05

pallavisontakke mentioned this pull request Dec 13, 2024

Release 2.18.0 (Test PR) #7534

Closed

pallavisontakke mentioned this pull request Dec 16, 2024

Release 2.18.0 (Test) #7539

Closed

pallavisontakke mentioned this pull request Dec 16, 2024

Release 2.18.0 (Test) #7540

Closed
