Spike/monitor: Triage FormsAPI monitoring for Sept 30 deadline #15195
Labels
backend
Find a form
CMS managed product, owned by Public Websites team
Needs refining
Issue status
non-quarter-prio
(PW) not related to quarterly priorities
Public Websites
Scrum team in the Sitewide crew
sitewide
VA.gov frontend
CMS team practice area
Description
User story
AS A PO/PM
I WANT PW to manually monitor every error case we know of that Veterans can encounter on the FaF product
SO THAT Veteran's can't have problems we don't know about within 4 hours.
AS A PO/PM/Forms Stakeholder
I WANT members of the PW team to notice within 4 business hours when an API issue is impacting the Veteran Forms experience
SO THAT we can quickly respond to (or ideally anticipate) any Forms API issues that might impact Veterans.
Key failure modes we want to "alert" on (either be alerted automatically or notice through manual checking):
Engineering notes / background
Very helpful slack thread started by Jill on this topic: https://dsva.slack.com/archives/CE4304QPK/p1694461749806729
Monitoring tools notes:
Possible ACs
Asked Lighthouse if there is a notification system in place to warn teams when an endpoint's rate limit is being approachedwe think we can set it up in DatadogAcceptance criteria
The text was updated successfully, but these errors were encountered: