catch and display disk usage issue in wis2box-api #674

maaikelimper · 2024-05-06T07:40:10Z

While debugging and supporting users I have noted that a lack of disk space results in the error:
elasticsearch.exceptions.TransportError: TransportError(429, 'cluster_block_exception', 'index [stations] blocked by: [TOO_MANY_REQUESTS/12/disk usage exceeded flood-stage watermark, index has read-only-allow-delete block];')
which is raised inside the wis2box-api container.

This issue often goes unnoticed as the status of the wis2box-containers does not change and the error is not displayed in the Grafana Dashboard.

We should display this issue in the main Grafana dashboard and document it in the FAQ

The text was updated successfully, but these errors were encountered:

tomkralidis · 2024-07-24T13:52:38Z

2024-07-24:

@maaikelimper to check w/ @alimand on status/updates

maaikelimper · 2024-07-25T07:39:13Z

wis2box silently failing due the disk usage issue resulted in 5 days of data loss in the CMO-server, so I'm raising the priority of this issue.
I think instead of just displaying the disk usage issue we should consider disabling the wis2box as a whole when we detect that Elasticsearch stops functioning.

tomkralidis · 2024-08-14T11:29:25Z

2024-08-14:

PR in Issue 674 #721 (needs to resolve conflicts)
needs deploy/test/validate

alimand · 2024-08-14T13:08:22Z

Thanks for checking and now there is no conflicts with main branch

maaikelimper · 2024-08-18T13:50:06Z

I've tested and reviewed the fix for and merged this in #729

This fix adds a new panel to Grafana to display alerts and includes an alert triggered by the elasticsearch-exporter:

maaikelimper added the monitoring Monitoring label May 6, 2024

maaikelimper added this to the sprint-015 milestone May 6, 2024

maaikelimper self-assigned this May 6, 2024

maaikelimper assigned alimand Jul 3, 2024

maaikelimper assigned tomkralidis Jul 25, 2024

maaikelimper added the blocker Critical issue that should be given high priority label Jul 25, 2024

tomkralidis closed this as completed Aug 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

catch and display disk usage issue in wis2box-api #674

catch and display disk usage issue in wis2box-api #674

maaikelimper commented May 6, 2024

tomkralidis commented Jul 24, 2024

maaikelimper commented Jul 25, 2024

tomkralidis commented Aug 14, 2024

alimand commented Aug 14, 2024

maaikelimper commented Aug 18, 2024 •

edited

Loading

catch and display disk usage issue in wis2box-api #674

catch and display disk usage issue in wis2box-api #674

Comments

maaikelimper commented May 6, 2024

tomkralidis commented Jul 24, 2024

maaikelimper commented Jul 25, 2024

tomkralidis commented Aug 14, 2024

alimand commented Aug 14, 2024

maaikelimper commented Aug 18, 2024 • edited Loading

maaikelimper commented Aug 18, 2024 •

edited

Loading