Fix flaky `test_batching_equivalence` #35564

ydshieh · 2025-01-08T15:42:19Z

What does this PR do?

I am the king serial killer of flaky tests in transformers!

The ratio of failures:

tests/models/flaubert/test_modeling_flaubert.py::FlaubertModelTest::test_batching_equivalence:
- 3000 runs:
  - main: 1.2 %
  - PR: 0 %
tests/models/mobilevitv2/test_modeling_mobilevitv2.py::MobileViTV2ModelTest::test_batching_equivalence:
- 300 runs:
  - main: 2.3 %
  - PR: 0 %
tests/models/xlm/test_modeling_xlm.py::XLMModelTest::test_batching_equivalence:
- 300 runs:
  - main: 3.3 %
  - PR: 0 %
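The thread does not say how these ratios were measured. One way to estimate such a ratio, as a minimal sketch, is to call a test repeatedly and count the runs that raise `AssertionError`. The helper name `failure_percentage` and the `run_test` callable are hypothetical and are not part of transformers.

```python
from typing import Callable


def failure_percentage(run_test: Callable[[], None], n_runs: int) -> float:
    """Call `run_test` `n_runs` times; return the percentage of calls that raised AssertionError.

    Hypothetical helper for illustration only, not a transformers API.
    """
    if n_runs <= 0:
        raise ValueError("n_runs must be positive")
    failures = 0
    for _ in range(n_runs):
        try:
            run_test()
        except AssertionError:
            # Only assertion failures count as a flaky failure here;
            # any other exception propagates to the caller.
            failures += 1
    return 100.0 * failures / n_runs
```

Running the same helper on `main` and on the PR branch with the same number of runs would give directly comparable percentages.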

I will apply the same changes to overwritten tests wherever they are flaky too.

ydshieh · 2025-01-08T16:04:06Z

@gante @zucchini-nlp

For context: see previous similar fixes #34995

HuggingFaceDocBuilderDev · 2025-01-08T16:30:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ydshieh · 2025-01-08T18:48:25Z

src/transformers/testing_utils.py

+    if hasattr(test_case.model_tester, "out_features") or hasattr(test_case.model_tester, "out_indices"):
+        target_num_hidden_layers = None


For some vision models, it is hard to adjust the number of layers, because other parameters sometimes have to be changed at the same time.
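The effect of this check can be sketched in isolation. The two lines in the diff are the only part taken from the PR. The helper names, the fallback value of `1`, and the `SimpleNamespace` stand-ins for model testers are assumptions made for illustration.

```python
from types import SimpleNamespace
from typing import Optional


def pick_target_num_hidden_layers(model_tester, default: int = 1) -> Optional[int]:
    """Return the layer count to force on the tester, or None to leave it alone.

    `default=1` is an assumed value for this sketch.
    """
    # Same condition as in the diff: testers that expose `out_features` or
    # `out_indices` are skipped, since other parameters may depend on the
    # number of layers.
    if hasattr(model_tester, "out_features") or hasattr(model_tester, "out_indices"):
        return None
    return default


def apply_target_num_hidden_layers(model_tester, default: int = 1) -> None:
    """Override `num_hidden_layers` only when a target was chosen (hypothetical helper)."""
    target = pick_target_num_hidden_layers(model_tester, default)
    if target is not None and hasattr(model_tester, "num_hidden_layers"):
        model_tester.num_hidden_layers = target


# Stand-ins for real model testers.
text_tester = SimpleNamespace(num_hidden_layers=5)
backbone_tester = SimpleNamespace(num_hidden_layers=5, out_indices=[1, 2])

apply_target_num_hidden_layers(text_tester)      # layer count is reduced
apply_target_num_hidden_layers(backbone_tester)  # left untouched
```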

zucchini-nlp

Yay, less flaky tests! Thanks!

ydshieh · 2025-01-09T12:57:55Z

Some model test cases overwrite the common test and mark it with is_flaky.
I have decided to merge this PR first and try to remove those overwritten tests in a follow-up PR.

yes!

39d3a35

ydshieh force-pushed the fix_flaky_test_batching_equivalence branch from 223bfe2 to 39d3a35 Compare January 8, 2025 16:01

ydshieh marked this pull request as ready for review January 8, 2025 16:01

ydshieh requested review from ArthurZucker, gante and zucchini-nlp January 8, 2025 16:02

oh no!!!

4a92e8a

ydshieh requested a review from Rocketknight1 as a code owner January 8, 2025 16:32

ydshieh added 4 commits January 8, 2025 17:40

oh no!!!

b87a811

style

9120a53

oh no!!!

2cd007b

oh no!!!

de7664d

ydshieh marked this pull request as draft January 8, 2025 17:45

ydshieh added 2 commits January 8, 2025 19:05

oh no!!!

1b42ba1

oh no!!!

9eb2ea6

ydshieh marked this pull request as ready for review January 8, 2025 18:47


zucchini-nlp approved these changes Jan 9, 2025


ydshieh removed request for gante, Rocketknight1 and ArthurZucker January 9, 2025 10:04

ydshieh merged commit 1b2f942 into main Jan 9, 2025
26 checks passed

ydshieh deleted the fix_flaky_test_batching_equivalence branch January 9, 2025 13:00
