-
Notifications
You must be signed in to change notification settings - Fork 590
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Discussion: persist system events and query via SQL #13267
Comments
Implementation aside, I like the idea of having SQL commands for
Most of the time, these SQL commands should be run by user, not us. Therefore, I suggest making it simple (one-line). |
Overall, I think 2 is our major pain and I guess this is your motivation, so let's keep it simple now by only recording these events.
This seems to be the only feasible approach. I don't think it can rely on Hummock because Hummock may also be a source of failure. Additionally, according to my comments above, the true critical events mostly happen on Meta node. |
Is your feature request related to a problem? Please describe.
I'd like to discuss whether introducing such system events can help address trivial issues from users more efficiently, saving the time for communication and environment setup.
The system event I'm talking about is basically a duplication of most crucial logs from all worker nodes, with certain amendment and supplement (for metrics). It's persisted and queryable via SQL. It feels like implementing a logging service inside kernel 🥵 .
Note that system event is not a replacement of logs or metrics, but as a method to filter out simple cases in advance.
Below are some system events I think helpful.
Describe the solution you'd like
With regards to the implementation,
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: