Commit

Tweak MachineCapacityLow alert, lower repeat interval on Slack to 1h.
Gerrit91 committed Sep 21, 2023
1 parent 92ce365 commit 7a3415d
Showing 1 changed file with 2 additions and 1 deletion.
@@ -121,6 +121,7 @@ alertmanager:
- name: 'null'
{% if monitoring_slack_api_url is defined and monitoring_slack_notification_channel is defined %}
- name: slack
+ repeat_interval: 1h
slack_configs:
- channel: "{{ monitoring_slack_notification_channel }}"
send_resolved: true
@@ -207,7 +208,7 @@ additionalPrometheusRulesMap:
annotations:
description: "Partition {{ $labels.partition }} has {{ $value }} DEAD Machines"
- alert: MachineCapacityLow
- expr: (avg(metal_partition_capacity_total{size!="unknown"})by (partition, size) - avg(metal_partition_capacity_free{size!="unknown"}) by (partition, size) ) / avg(metal_partition_capacity_total{size!="unknown"})by (partition, size) *100 > 80
+ expr: avg(metal_partition_capacity_free{size!="unknown"} > 10) by (partition, size) / avg(metal_partition_capacity_total{size!="unknown"} > 10) by (partition, size) < 0.10
for: 10m
labels:
severity: "warning"
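
Note on the two MachineCapacityLow expressions: the following is a minimal Python sketch of how they appear to differ, not part of the commit. It assumes a single series per (partition, size) pair, so `avg` is the identity, and it assumes the first `expr` line in the hunk is the removed one and the second the added one. The function names and the sample counts are hypothetical.

```python
from typing import Optional


def old_rule(total: float, free: float) -> Optional[float]:
    """First expr: used capacity in percent, kept only when above 80."""
    if total == 0:
        # For non-negative free, PromQL yields NaN or -Inf here,
        # and neither passes the "> 80" comparison.
        return None
    used_percent = (total - free) / total * 100
    return used_percent if used_percent > 80 else None


def new_rule(total: float, free: float) -> Optional[float]:
    """Second expr: free/total ratio, kept only when below 0.10."""
    # "> 10" without the "bool" modifier acts as a filter: a series at or
    # below 10 is dropped, so the division has nothing to match against.
    if free <= 10 or total <= 10:
        return None
    ratio = free / total
    return ratio if ratio < 0.10 else None


# Hypothetical (total, free) machine counts for one partition/size pair.
samples = [(200, 30), (200, 15), (50, 4)]
for total, free in samples:
    old_fires = old_rule(total, free) is not None
    new_fires = new_rule(total, free) is not None
    print(f"total={total} free={free} old_fires={old_fires} new_fires={new_fires}")
```

Under these assumptions the first expression matches all three samples, while the second matches only (200, 15). The (50, 4) sample produces no result because its free count is at or below 10 and is filtered out before the division. Both rules still require the condition to hold for the `for: 10m` duration before the alert fires.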
