-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Packet Loss Monitoring? #22
Comments
Interesting idea of whether use of the If a node or region had an issue, we would see all/most pings to it see latency/loss, and potentially pings out would never be recorded/captured. Maybe some sort of aggregation based on a target label and tracking deviations from averages? |
https://github.com/benjojo/sping might be another interesting tool to further investigate. Handily exposes |
I had seen https://grafana.com/grafana/plugins/grafana-synthetic-monitoring-app/ before, but had assumed it was Grafana Cloud only. In fact, it's all open source (plugin is AGPLv3, client is Apache 2), though I the reality is that it's just blackbox_exporter + logs, and pretty graphs. We could just use blackbox_exporter and the graphs as inspiration: And it's the graphs that give at least some idea how we could use the data. We would have a dashboard like that ^^ with var selectors for each instance. Nice world map would highlight issues between regions. Would we opt for every edge tracking every ingress + every ingress tracking every edge? That's the internal flow of data, so most relevant. I expect it'd be cheap to add more locations on diff hosting providers if we wanted an external perspective. == Thoughts @clone1018 |
I think this sounds like a good approach! |
Should we be monitoring packet loss between our Ingest / Edge servers? Should we be tracking NACK stats in prometheus as well?
The text was updated successfully, but these errors were encountered: