Skip to content

Commit

Permalink
Adds service alert for cabinet power loss
Browse files Browse the repository at this point in the history
  • Loading branch information
aturner-epcc committed Dec 1, 2024
1 parent 457ebcd commit 6131e35
Showing 1 changed file with 9 additions and 0 deletions.
9 changes: 9 additions & 0 deletions _alerts/2024-12-01_nodes-down.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
---
status: Ongoing
type: Service Alert
start_date: 2024-12-01 17:50 GMT
end_date:
scope: Subset of ARCHER2 compute nodes
impact: Around 1,000 compute nodes are unavailable. Jobs running on the affected nodes at the time of failure will have crashed. The rest of system remains available and jobs on unaffected nodes continue as usual.
reason: Loss of power to some ARCHER2 cabinets
---

0 comments on commit 6131e35

Please sign in to comment.