-
Notifications
You must be signed in to change notification settings - Fork 594
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug: panicked at should be valid staging_sst.size #17111
Comments
Related PR: #16725 |
branch test |
There was sync failure happened right before the panic.
Let's say in an instance, the imm order is [epoch1: [imm1], epoch2: [imm2]]. When sync failure of epoch1 happens, we simply notify the failure, but won't clear imm1 or clear the uploader. When the ongoing epoch2 sync finished, it will try to clear imm2, but then it will see that imm2 is not the oldest imm, which causes the panics. With #17113, the panic can be fixed, which clear the uploader state on sync error, and will do nothing until a recovery happens. |
Based on the above analysis, the race has existed before #16962 but we didn't see panic in |
Is #17113 ready? We can review and merge it and unlock the main branch. |
Yes. It and its preliminary PR has been approved. We can merge them one by one. |
I have been running a branch recently based on a May/21 commit. In all the runs prior to today, I didn't seen the panic reported in this issue but for some reason I see the panic in today's run. This means the panic exists at least from May/21 but for some reason it starts to trigger frequently since today. |
Describe the bug
Error message/log
To Reproduce
No response
Expected behavior
No response
How did you deploy RisingWave?
No response
The version of RisingWave
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: