Skip to content

How to Evaluate statistics for generic "Lambda Error" Alerts posted to Slack

Michael J Burling edited this page Sep 9, 2024 · 14 revisions

Pre-Requisites

  • Access to CMS VPN
  • Access to BFD/CMS AWS account(s)
  • Installation of AWS CLI, properly configured for access to BFD/CMS AWS account
  • Installation of jq

Instructions

Determine which alarms have been tripped within the past N days (24h is default):

Execute script ops/ccs-ops-misc/scripts/lambda-error-metrics from our Github repository and proceed with further detailed investigation as warranted; optionally, add an argument for the number of historical days from current day for which to evaluate alarms.

Example 1: 1 day of metrics

$ ops/ccs-ops-misc/scripts/lambda-error-metrics

Example 2: 3 days of metrics

$ ops/ccs-ops-misc/scripts/lambda-error-metrics 3

Clone this wiki locally