Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Avoid skipping triage jobs #309

Closed
wants to merge 2 commits into from
Closed

Avoid skipping triage jobs #309

wants to merge 2 commits into from

Conversation

olupton
Copy link
Collaborator

@olupton olupton commented Oct 16, 2023

Avoid skipping triage jobs in case of failure. See here in the docs; without this then I believe the failed status of the earlier run-jobs.outcome job step caused the triage job to be skipped.

@terrykong
Copy link
Contributor

Thanks for tackling this @olupton ! Can you confirm if this would address this: #306 ?

Two questions:

  1. Do you know why it failed in the first place and why needs.publish-completion.outputs.STATUS != 'success' was insufficient?
  2. Would needs.<job_id>.result work?

@olupton
Copy link
Collaborator Author

olupton commented Oct 17, 2023

Can you confirm if this would address this: #306 ?

I'm not sure about that. It's not obvious to me why the triage job ran in https://github.com/NVIDIA/JAX-Toolbox/actions/runs/6516962688, I'm not optimistic that this change will have fixed that issue.

  1. Do you know why it failed in the first place and why needs.publish-completion.outputs.STATUS != 'success' was insufficient?

Based on https://docs.github.com/en/actions/learn-github-actions/expressions#failure-with-conditions:

You can include extra conditions for a step to run after a failure, but you must still include failure() to override the default status check of success() that is automatically applied to if conditions that don't contain a status check function.

I understand there was an implicit success() in the if clause of the triage job, so the (intentional) failure of run-jobs.outcome (because Slurm jobs failed) stopped the triage job from running.

  1. Would needs.<job_id>.result work?

I guess yes if we made the triage job explicitly depend (needs) on run-jobs (which is the job whose overall status should be red in the case that I understand is supposed to trigger the triage job). Not sure what the advantage would be.

I launched https://github.com/NVIDIA/JAX-Toolbox/actions/runs/6545350101 manually using the latest version of this PR.

@olupton olupton changed the title triage: avoid skipping due to failed status of run-jobs.outcome Handle multi-arch container images; avoid skipping triage jobs Oct 17, 2023
yhtang pushed a commit that referenced this pull request Dec 7, 2023
Closes #263. See also #309 for other fixes to the pipeline that relies
on these helpers.

---------

Co-authored-by: Terry Kong <[email protected]>
@olupton olupton changed the title Handle multi-arch container images; avoid skipping triage jobs Avoid skipping triage jobs Dec 7, 2023
@olupton
Copy link
Collaborator Author

olupton commented Apr 3, 2024

Outdated.

@olupton olupton closed this Apr 3, 2024
@olupton olupton deleted the olupton/ga-tweak branch April 3, 2024 15:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants