Uptime Monitoring: Share your experience and expectations #51110
Replies: 19 comments 10 replies
-
I like https://gatus.io/ feature rich, easy customization |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
We started using Grafana Synthetic Monitoring quite recently. |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
I have used:
Things I like:
You might notice I don't say anything much positive about Google's - it gives value, for sure, and it does the basic job. But beyond dumping an alert out if we've a long running database migration on deploy, it's not doing much more. So my future plan is this: https://github.com/dwd/sentrynet - which is (or will be, it's only doing very basic stuff now) an attempt to make a synthetics source we can deploy mega-cheap in a bunch of locations (hosting a small docker image is like $5 a month these days), and give us the shiny integration with Sentry (Issues and Performance) that my tiny heart craves. So... your questions:
|
Beta Was this translation helpful? Give feedback.
-
What I find really useful is an ability to create Selenium-like tests, and not just checking that a URL returns 200 to some response. Uptime.com has this feature. It's really useful for downtimes caused by interaction of frontend & backend (for example, backend using an incorrect CDN url, resulting in a blank page in production). Those are really hard to cover by regular tests, so having a tool notify you immediately after you broke something, instead of waiting for users to wake you up is really nice. The thing I disliked about the uptime.com implementation is that it runs at most every 5 minutes, which makes for a not-very-accurate uptime calculation, and also delays my average time to reaction by 2.5 minutes, which is a pretty big deal. It would be nice to see a once-a-minute option. |
Beta Was this translation helpful? Give feedback.
-
Definitely interested in this and it seems like a perfect feature for Sentry. |
Beta Was this translation helpful? Give feedback.
-
We've been using UptimeRobot and HetrixTools, free or paid, depending on the company. Both work quite well. Features that I like/need:
|
Beta Was this translation helpful? Give feedback.
-
Using freshping.io ATM Pros
Cons
|
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
@therealarkin any news on an offering in this space? If nothing in sentry, it would be great to just have an integration w/ https://uptime.betterstack.com/ to highlight periods with downtime |
Beta Was this translation helpful? Give feedback.
-
@therealarkin What is the status on this? Currently looking for a monitoring solution for my FastAPI's. All looks good and works great on the implementation. The Uptime Monitoring is not yet there I see, but is a requirement for a client of mine. Would love to have that in Sentry! Can you tell us a bit more on if this is still on the roadmap, if yes when can we expect it? |
Beta Was this translation helpful? Give feedback.
-
Also interested in this |
Beta Was this translation helpful? Give feedback.
-
We're currently using BetterUptime so fwiw my key points are similar to others comparing with BetterUptime. A key thing I would add, and that Sentry is in a perfect position to bring, would be to have more levels on public status pages (or internal dashboards for that matter). So rather than up/down, we'd also have "degraded" if errors are above a certain reference level. Even better yet, having this grouped by certain rules instead of just being based on a certain page returning a certain status code and/or string. I would imagine the main rule would be route segments for most users but I suppose that can be a bit arbitrary. For example, Sentry would know that I have a high error rate at example.com/dashboard but everything else is OK; so it could be "Up - Dashboard degraded". A high error rate at example.com/members/:memberId/edit could be "Up - Member management degraded". But then very high error rates thorough the whole app could just be classified as "down". This would be much smarter than the oversimplified up/down that most services currently allow. |
Beta Was this translation helpful? Give feedback.
-
This is 🔥 congrats to the team: https://sentry.io/changelog/uptime-monitoring-early-adopters-release. Looks like @gaprl will take it from here 😃 |
Beta Was this translation helpful? Give feedback.
-
Hi! Thanks! |
Beta Was this translation helpful? Give feedback.
-
Is dedicated Uptime Monitoring available in the open source self-hosted version as well? I am unable to find a way to enable it. |
Beta Was this translation helpful? Give feedback.
-
I would need response time monitoring. Having stats over the response time is a good metric if something is wrong. |
Beta Was this translation helpful? Give feedback.
-
Hello Everyone!
I'm Eran from the Sentry product team. We're always looking for opportunities to improve and extend our services to help you manage your applications more effectively. One such area that has come to our attention is Uptime Monitoring!
As we contemplate a service in this direction, we'd love to understand how you currently handle uptime monitoring, what challenges you face, and how an integrated Sentry service might add value.
Here are some starter questions to spur the conversation:
Your input is invaluable to us as we explore this new venture. All thoughts, experiences, and ideas are welcome! Even if you have never used an uptime monitoring service, we're curious to hear what you'd expect from one.
Thanks for taking the time to contribute. We're looking forward to hearing your thoughts and ideas!
Beta Was this translation helpful? Give feedback.
All reactions