
refactor(meta): clarify the completeness of internal table catalogs #18944

Merged: 5 commits, Oct 21, 2024

Conversation

BugenZhao
Member

@BugenZhao BugenZhao commented Oct 16, 2024

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

When creating a streaming job, the internal tables directly scraped from the frontend's StreamFragmentGraph are incomplete, because several fields like fragment_id or vnode_count are not filled yet and will be determined later.

Previously, we notified the frontend nodes with these catalogs but did not update them afterward. This causes problems now that variable vnode count support has been introduced, because the vnode count of a table is a significant property when scheduling a batch scan over it.

This PR changes the flow to notify the frontend of the internal table catalogs only after a TableFragments is built, at which point all information is final and complete.

This PR revises the documentation, comments, and naming of related snippets to clarify the completeness of internal table catalogs during various phases of creating a streaming job.
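To illustrate the idea, here is a minimal sketch (all types and field names are illustrative, not RisingWave's actual definitions) of why a catalog scraped from the frontend's StreamFragmentGraph is incomplete until scheduling fills in the remaining fields:

```rust
// Hypothetical model of an internal table catalog. `fragment_id` and
// `vnode_count` are unknown when the catalog is first scraped from the
// frontend graph; they are assigned by the meta node during scheduling.
#[derive(Debug, Clone)]
struct TableCatalog {
    id: u32,
    fragment_id: Option<u32>,
    vnode_count: Option<usize>,
}

impl TableCatalog {
    fn is_complete(&self) -> bool {
        self.fragment_id.is_some() && self.vnode_count.is_some()
    }
}

fn main() {
    // As scraped from the frontend's StreamFragmentGraph: incomplete.
    let mut table = TableCatalog { id: 1, fragment_id: None, vnode_count: None };
    assert!(!table.is_complete());

    // After TableFragments is built, the fields are filled; only now
    // should the catalog be broadcast to frontend nodes.
    table.fragment_id = Some(42);
    table.vnode_count = Some(256);
    assert!(table.is_complete());
}
```

The point of the refactor is that notifications should only ever carry catalogs for which `is_complete()` would hold.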

Checklist

  • I have written necessary rustdoc comments
  • I have added necessary unit tests and integration tests
  • All checks passed in ./risedev check (or alias, ./risedev c)

Documentation

  • My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.

Signed-off-by: Bugen Zhao <[email protected]>

only notify internal tables

Signed-off-by: Bugen Zhao <[email protected]>

add more comments

Signed-off-by: Bugen Zhao <[email protected]>
@BugenZhao BugenZhao force-pushed the bz/create-mv-only-notify-complete-catalogs branch from 0c7b344 to b365566 Compare October 18, 2024 02:55
@BugenZhao BugenZhao changed the title refactor(meta): only notify complete catalogs when creating mv fix(meta): notify complete internal table catalogs when creating mv Oct 18, 2024
@github-actions github-actions bot added the type/fix Bug fix label Oct 18, 2024
Signed-off-by: Bugen Zhao <[email protected]>
```rust
// so we need to notify the frontend to delete them here.
// The frontend will ignore the request if the object does not exist,
// so it's safe to always notify.
self.notify_frontend(Operation::Delete, build_relation_group_for_delete(objs))
```
@BugenZhao (Member, Author) commented Oct 21, 2024
Since the notification is now sent only after the job has been successfully scheduled and built, a failure here does not necessarily mean that the notification was ever sent. That's why we changed all DROP handlers in the frontend to behave like DROP .. IF EXISTS.

An alternative could be introducing a new operation like Cleanup for this purpose, ensuring that Delete remains strictly applied. But that seems too ad hoc to me, as there's not much other use for it.
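The "always safe to notify" property boils down to making the frontend's delete handler idempotent. A minimal sketch (names are illustrative, not RisingWave's actual API) of such a handler:

```rust
use std::collections::HashMap;

// Hypothetical frontend-side catalog: maps table id to table name.
struct FrontendCatalog {
    tables: HashMap<u32, String>,
}

impl FrontendCatalog {
    // Idempotent delete, mirroring "DROP .. IF EXISTS": removing a missing
    // object is a harmless no-op, so the meta node can always send a Delete
    // notification even when it is unsure whether the preceding Add was
    // ever delivered.
    fn drop_if_exists(&mut self, id: u32) -> bool {
        self.tables.remove(&id).is_some()
    }
}

fn main() {
    let mut catalog = FrontendCatalog {
        tables: HashMap::from([(1, "internal_table_1".to_string())]),
    };
    assert!(catalog.drop_if_exists(1)); // object existed: removed
    assert!(!catalog.drop_if_exists(1)); // already gone: no-op, no error
}
```

With this semantics, no Cleanup operation is needed: Delete itself tolerates the uncertainty about whether the Add was delivered.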

Contributor

Can there be a race condition, whereby we have a Delete of some catalog followed by an Add of that catalog?

@BugenZhao (Member, Author)

I think not, because try_abort_creating_streaming_job is called only after create_streaming_job_inner has returned with an error, by which point the Add is either already pushed into the notification queue or was never issued at all. After entering this try_abort function, no further Add operations will occur.

Member

🤔 We ensured that CREATE and DROP will be properly paired in #18476. Even if the CREATE notification fails to send due to either an FE or meta node reboot or a broken connection between them, the creation of the mview and its internal table catalogs will still be synchronized through an initial snapshot.

@BugenZhao
Member Author

Oops, I realized that we'll eventually notify the frontend nodes to complete the catalogs for both the job and the internal tables in finish_streaming_job.

```rust
// notify frontend: job, internal tables.
let internal_table_objs = Table::find()
    .find_also_related(Object)
    .filter(table::Column::BelongsToJobId.eq(job_id))
    .all(&txn)
    .await?;
let mut relations = internal_table_objs
    .iter()
    .map(|(table, obj)| PbRelation {
        relation_info: Some(PbRelationInfo::Table(
            ObjectModel(table.clone(), obj.clone().unwrap()).into(),
        )),
    })
    .collect_vec();
let mut notification_op = NotificationOperation::Add;
match job_type {
    ObjectType::Table => {
        let (table, obj) = Table::find_by_id(job_id)
            .find_also_related(Object)
            .one(&txn)
            .await?
            .ok_or_else(|| MetaError::catalog_id_not_found("table", job_id))?;
        if table.table_type == TableType::MaterializedView {
            notification_op = NotificationOperation::Update;
        }
```

So this PR is more of a refactor than a fix. Actually, the main motivation of this PR is to enable #18976, where we don't expect the frontend nodes to receive a VnodeCount::Placeholder, to avoid potential confusion.

However, I think it's also okay to bypass the check and allow VnodeCount::Placeholder if the table is in StreamJobStatus::Creating, because informing the frontends about these tables is primarily for reference in DROP statements, where querying them is not permitted. As a result, the refactoring in this PR may not be strictly necessary anymore.
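The relaxed check floated above could look roughly like this sketch (all names are illustrative, not RisingWave's actual types): a placeholder vnode count is tolerated only while the job is still being created, since such tables may be dropped but never queried.

```rust
#[derive(Debug, PartialEq)]
enum StreamJobStatus {
    Creating,
    Created,
}

#[derive(Debug)]
enum VnodeCount {
    Placeholder,
    Set(usize),
}

// Hypothetical relaxed check: return the vnode count if known, tolerate a
// placeholder while the job is still creating (its tables are only referenced
// by DROP, never scanned), and reject a placeholder once the job is created.
fn check_vnode_count(
    count: &VnodeCount,
    status: &StreamJobStatus,
) -> Result<Option<usize>, String> {
    match (count, status) {
        (VnodeCount::Set(n), _) => Ok(Some(*n)),
        (VnodeCount::Placeholder, StreamJobStatus::Creating) => Ok(None),
        (VnodeCount::Placeholder, StreamJobStatus::Created) => {
            Err("vnode count must be set once the job is created".to_string())
        }
    }
}

fn main() {
    assert_eq!(
        check_vnode_count(&VnodeCount::Set(256), &StreamJobStatus::Created),
        Ok(Some(256))
    );
    assert_eq!(
        check_vnode_count(&VnodeCount::Placeholder, &StreamJobStatus::Creating),
        Ok(None)
    );
    assert!(check_vnode_count(&VnodeCount::Placeholder, &StreamJobStatus::Created).is_err());
}
```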

Signed-off-by: Bugen Zhao <[email protected]>
@BugenZhao BugenZhao force-pushed the bz/create-mv-only-notify-complete-catalogs branch from a543ff2 to 5adb7fe Compare October 21, 2024 06:56
@BugenZhao BugenZhao changed the title fix(meta): notify complete internal table catalogs when creating mv refactor(meta): clarify the completeness of internal table catalogs Oct 21, 2024
@BugenZhao
Member Author

As a result, refactoring in this PR may not be necessary anymore.

Have updated the PR so that it only

revises the documentation, comments, and naming of related snippets to clarify the completeness of internal table catalogs during various phases of creating a streaming job.

```diff
@@ -1643,6 +1645,7 @@ impl DdlController {
             table_parallelism,
             max_parallelism.get(),
         );
+        let internal_tables = table_fragments.internal_tables();
```
@BugenZhao (Member, Author)

The internal_tables field in CreateStreamingJobContext will now be complete.

@BugenZhao BugenZhao added this pull request to the merge queue Oct 21, 2024
Merged via the queue into main with commit 9bbf418 Oct 21, 2024
36 of 37 checks passed
@BugenZhao BugenZhao deleted the bz/create-mv-only-notify-complete-catalogs branch October 21, 2024 08:29