-
Notifications
You must be signed in to change notification settings - Fork 590
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support pausing a decoupled Sink from SQL #17357
Comments
so why not directly drop the sink? |
If we sink a MV out, when recreate the sink we need to backfill historical data again to ensure all data go out to downstream. Backfill historical data will deliver data already in downstream and introduce unnecessary write traffic to downstream. |
We support a When |
Dropping the sink and recreate it with I can think of one case this feature is useful, which is when user's downstream system is overloaded and want to stop the traffic for a while before they can fix the downstream. Not sure whether it is valid though. If it is just a temporarily downstream failure, the current retry with backoff mechanism with sink decouple on seems good enough. |
That's it. To ensure at-least once delivery, backfilling is still needed.
Yes. We encounter a case that a user sinking a MV with large data into downstream PG but there is some issues with the PG cdc source they want to troubleshoot. |
Applying this idea here, I think we may do this in |
Is your feature request related to a problem? Please describe.
Usecase: sometimes user may want to prevent data coming out from RW but don't want to pause the entire cluster.
We don’t provide a way for user to just pause a Sink right now, I think for the Sink with
sink_decuple = true
it is feasible to provide a way to pause a Sink job from sql.Describe the solution you'd like
No response
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: