Verification tests fail in CI with 403 status codes #4434

iaindillingham · 2024-07-09T16:10:28Z

It's not uncommon for the verification tests to fail in CI with 403 (forbidden) status codes.¹ The verification tests aren't run locally (e.g. with just test), so there's no reason to think the failures are specific to CI, however.

The failures result in long stack traces that conclude with coverage errors. However, coverage isn't to blame. When looking at the following examples, focus on the "short test summary info" section:

Notice that the associated PRs may not touch the tests or the functions under test. Indeed, they may relate to JavaScript rather than to Python.

Considerations

Are we hitting GitHub's API too many times in one run?
Or in several concurrent runs?
Can we run the verification tests nightly?
Are we hitting primary or secondary rate limits?
Are we following best practices?
Should we set GITHUB_TOKEN? (CI test run can be flaky due to exceeding GitHub API limits reports#613)

https://bennettoxford.slack.com/archives/C069SADHP1Q/p1719926997444489?thread_ts=1719821734.604979&cid=C069SADHP1Q ↩

The text was updated successfully, but these errors were encountered:

evansd · 2024-07-16T16:58:07Z

Sorry, I've had the Slack conversation regarding this bookmarked to come back to for ages but only just got round to it.

My high-level comment is that I think the right approach here is to not run these tests as part of CI and instead to run them on a schedule, as you suggest:

Can we run the verification tests nightly?

I think it's really important for developer sanity that CI be, as far as possible, deterministic and hermetic: CI should fail because of some change in the PR, and for no other reason.

Tests that external services still behave as expected are important, but don't relate to specific code changes and so should be run on a schedule rather than in CI. And that's better because operationally as well because you find out about potential breakages soon after they occur, rather than the next time you happen to open a PR.

Of course that still leaves the problem that the scheduled tests might end up being flaky too! I don't have any particular expertise to offer here, but the things you've already listed cover everything I'd look at in the first instance.

StevenMaude mentioned this issue Jul 30, 2024

Flaky test: test_get_issue_number_with_sort_order #4501

Closed

Jongmassey self-assigned this Aug 7, 2024

Jongmassey mentioned this issue Aug 8, 2024

Move verification tests to own workflow #4522

Merged

Jongmassey closed this as completed in #4522 Aug 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verification tests fail in CI with 403 status codes #4434

Verification tests fail in CI with 403 status codes #4434

iaindillingham commented Jul 9, 2024

evansd commented Jul 16, 2024

Verification tests fail in CI with 403 status codes #4434

Verification tests fail in CI with 403 status codes #4434

Comments

iaindillingham commented Jul 9, 2024

Considerations

Footnotes

evansd commented Jul 16, 2024