Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DAT-18204: fix concurrent runs #156

Merged
merged 17 commits into from
Jul 29, 2024
Merged

DAT-18204: fix concurrent runs #156

merged 17 commits into from
Jul 29, 2024

Conversation

sayaliM0412
Copy link
Contributor

@sayaliM0412 sayaliM0412 commented Jul 19, 2024

feat: .github/workflows/lth.yml: use workflow_dispatch to run manually and fix concurrent runs issue

@sayaliM0412 sayaliM0412 changed the title DAT-18204: use workflow_dispatch DAT-18204: fix concurrent runs Jul 19, 2024
@sayaliM0412 sayaliM0412 marked this pull request as ready for review July 19, 2024 15:37
# ensure only one run or job within the group runs at a time.
concurrency:
group: ${{ github.workflow }}-${{ github.ref }}
cancel-in-progress: true
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

    strategy:
      max-parallel: 1

was working fine. I think the issues is when a matrix jobs fails and the schema is not correctly removed.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we do have this step here to ensure removal of the test schema even if one or more matrix jobs fail. : https://github.com/liquibase/liquibase-databricks/blob/main/.github/workflows/lth.yml#L73 destroying the schema

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice but it seems it does not destroy the endpoint, see https://github.com/liquibase/liquibase-databricks/actions/runs/10009689797/job/27669131489#step:11:14, also cancel-in-progress: true will abort TH jobs. The team would have to manually re-run the canceled ones, right? If the infra is not removed then all jobs will fail.

Copy link
Contributor Author

@sayaliM0412 sayaliM0412 Jul 24, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

right, I was looking at a way to destroy the resources if creation fails as the resources already exists, but TF doesnt have the context and therefore shows "No changes" : tested here: https://github.com/liquibase/liquibase-databricks/actions/runs/10081580514/job/27873923161#step:6:12. I have set cancel-in-progress: false for the same reason.

# ensure only one run or job within the group runs at a time.
concurrency:
group: ${{ github.workflow }}-${{ github.ref }}
cancel-in-progress: true
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice but it seems it does not destroy the endpoint, see https://github.com/liquibase/liquibase-databricks/actions/runs/10009689797/job/27669131489#step:11:14, also cancel-in-progress: true will abort TH jobs. The team would have to manually re-run the canceled ones, right? If the infra is not removed then all jobs will fail.

@jandroav jandroav self-requested a review July 29, 2024 07:04
@sayaliM0412 sayaliM0412 merged commit b02507e into main Jul 29, 2024
4 of 5 checks passed
@sayaliM0412 sayaliM0412 deleted the DAT-18204 branch July 29, 2024 13:10
Copy link

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants