TPP Job raised INTERNAL_ERROR #709
Comments
Interesting, looks like the state machine hit something unexpected:
From Honeycomb logs:
I note that we entered maintenance mode immediately afterwards, so my assumption would be that it's somehow related. We've had race-condition-like issues around this before.
I dug into it in the Slack thread, but here's my conclusion: db mode killed it. It looks like it was in EXECUTING state, but apparently wasn't.
Just having a read around this - the I think it's a reasonable assumption for maintenance mode to attempt to stop all jobs that could possibly be running rather than all jobs that are running, so maybe we should just update
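To make the distinction in the comment above concrete, here is a minimal sketch of "jobs that are running" versus "jobs that could possibly be running". Everything in it is hypothetical and for illustration only: the `State` enum, the `Job` dataclass, the `POSSIBLY_RUNNING` set and both helper functions are assumptions, not the project's actual code. Only the `EXECUTING` state name comes from this thread.

```python
from dataclasses import dataclass
from enum import Enum


class State(Enum):
    # Hypothetical state names; only EXECUTING is mentioned in this thread.
    PENDING = "pending"
    PREPARING = "preparing"
    EXECUTING = "executing"
    FINALIZING = "finalizing"
    FINALIZED = "finalized"


@dataclass
class Job:
    id: str
    recorded_state: State  # the state held in the database, which may be stale


# Assumption: any state between "not started" and "finished" might have a
# live process behind it, even if the recorded state is slightly out of date.
POSSIBLY_RUNNING = {State.PREPARING, State.EXECUTING, State.FINALIZING}


def jobs_recorded_as_running(jobs):
    """Narrow selection: only jobs whose recorded state is EXECUTING."""
    return [job for job in jobs if job.recorded_state is State.EXECUTING]


def jobs_possibly_running(jobs):
    """Broad selection: every job that could plausibly have a live process."""
    return [job for job in jobs if job.recorded_state in POSSIBLY_RUNNING]


jobs = [
    Job("a", State.PENDING),
    Job("b", State.PREPARING),
    Job("c", State.EXECUTING),
    Job("d", State.FINALIZED),
]

narrow_ids = [job.id for job in jobs_recorded_as_running(jobs)]
broad_ids = [job.id for job in jobs_possibly_running(jobs)]
```

With the broad selection, a stop request can reach a job whose recorded state does not match reality, so whatever handles the stop would need to tolerate that case rather than treat it as unexpected.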
Here's a demo of the behaviour: #710. I think this has fallen through the cracks because we have unit tests of
Ah, great, thanks for digging into this properly.
Pragmatically, both of these feel like the right immediate next steps to me.
Note: removing tech-support label, as this is scheduled to be fixed during deck-scrubbing.
Honeycomb reports that a TPP Job raised an INTERNAL_ERROR.[1] The affected jobs in Job Request 22288 are:
I've scanned the logs on L3 (both are 53,466 lines long 🤔), but can't determine why the jobs failed.
Footnotes
[1] https://bennettoxford.slack.com/archives/C0270Q313H7/p1708990931394309