-
Notifications
You must be signed in to change notification settings - Fork 597
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug: frontend need restart to recover from breakdown #9239
Comments
Is possible following case happen🤔: |
The error remains for a long time until restart. |
This comment was marked as resolved.
This comment was marked as resolved.
Do we have more concrate log to reproduce this bug? Such as what reschedule happen before compute join or leave the cluster. |
Informed @ZENOTME when it occurred once again recently. |
Describe the bug
We encounter frontend breakdown, as below:
After restarting frontend, the breakdown is gone.
To Reproduce
No response
Expected behavior
No response
Additional context
There has been compute node leaving/joining the cluster. Can it be related to frontend's compute node client pool?
The text was updated successfully, but these errors were encountered: