
Support Transformers 4.43 #856

Merged · merged 42 commits · Aug 6, 2024

Conversation

IlyasMoutawwakil
Member

@IlyasMoutawwakil IlyasMoutawwakil commented Jul 31, 2024

What does this PR do?

Adds support for transformers 4.43, as in huggingface/optimum#1971, mainly:

  • whisper introduces a new forward input argument, cache_position.
  • _reorder_cache was removed from some causal models in favor of Cache.reorder_cache.
  • the missing model.dtype is used by transformers pipelines to cast some features (pixels, audio samples, etc.)
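The cache_position change can be illustrated with a minimal sketch (the helper below is hypothetical; in transformers 4.43, cache_position carries the indices that the new tokens occupy in the KV cache):

```python
# Illustrative sketch (hypothetical helper): how a decoding loop could
# maintain `cache_position`, the cache indices written by the new tokens,
# as introduced for whisper's forward in transformers 4.43.

def next_cache_position(past_length: int, num_new_tokens: int) -> list[int]:
    """Positions the new tokens will occupy in the KV cache."""
    return list(range(past_length, past_length + num_new_tokens))

# Prefill: 3 prompt tokens land at positions 0..2.
prefill = next_cache_position(past_length=0, num_new_tokens=3)
# Decode step: one new token lands right after the cached ones.
step = next_cache_position(past_length=3, num_new_tokens=1)
```

The real argument is a torch tensor, but the bookkeeping is the same: it advances by the number of tokens already cached.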

The rest of the changes adapt the testing suite to the modeling changes in the library.
Also fixes the failing OV training tests on main.
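For the _reorder_cache change, a plain-Python stand-in for what cache reordering does during beam search (the real Cache.reorder_cache re-indexes batched key/value tensors along the batch dimension; the names here are illustrative):

```python
# Hypothetical stand-in: during beam search, each layer's cached
# keys/values must be re-indexed so they follow the surviving beams.

def reorder_cache(cache, beam_idx):
    """cache: per-layer list of per-beam entries; beam_idx: surviving beams."""
    return [[layer[i] for i in beam_idx] for layer in cache]

cache = [["k0", "k1", "k2"]]                 # one layer, three beams
reordered = reorder_cache(cache, [2, 2, 0])  # beams 2, 2 and 0 survive
```

Moving this onto the Cache object lets generation reorder any cache implementation uniformly instead of each model defining its own static _reorder_cache.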

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@IlyasMoutawwakil
Member Author

I think it's ready.
ITREX still needs to fix qbits modeling for WOQ; OV looks good to me (compared with the CI on main).

@IlyasMoutawwakil
Member Author

IlyasMoutawwakil commented Aug 2, 2024

There's one last qwen test that fails with transformers 4.37, even though I did add the transformers_stream_generator package:
https://github.com/huggingface/optimum-intel/actions/runs/10213421527/job/28258759384?pr=856#step:5:2163

Collaborator

@echarlaix echarlaix left a comment


Looks great, thanks a lot @IlyasMoutawwakil !

optimum/intel/openvino/modeling_seq2seq.py (review thread resolved)
Collaborator

@helena-intel helena-intel left a comment


Thank you! Also really appreciate keeping transformers 4.36 support.

tests/openvino/test_exporters_cli.py (outdated, resolved)
optimum/intel/openvino/configuration.py (outdated, resolved)
optimum/intel/openvino/modeling_base.py (outdated, resolved)
@IlyasMoutawwakil
Member Author

Ready to merge.

@echarlaix echarlaix force-pushed the support-transformers-4.43 branch from cb1714e to 815d0ed on August 5, 2024 at 15:14
@echarlaix echarlaix added the openvino-test Trigger OpenVINO slow tests label Aug 5, 2024
@@ -943,7 +943,7 @@ def test_beam_search(self, model_arch):
             eos_token_id=None,
         )

-        if model_arch == "minicpm":
+        if model_arch in ["minicpm", "internlm2"]:
Member Author


Is this okay? I don't know why the generation loop is not deterministic with internlm2.

Collaborator


Yes, it is due to the random initialization of the PyTorch model.
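Since the flakiness comes from random initialization, a common remedy (sketched here with the stdlib random module; the real tests would seed torch as well, and this PR instead skips the determinism check for internlm2) is to seed the RNG before building the model, so every run sees the same weights:

```python
import random

def seeded_init(seed: int, n: int) -> list[float]:
    # Stand-in for seeding the RNGs before building a randomly
    # initialized test model: same seed -> same "weights".
    rng = random.Random(seed)
    return [rng.random() for _ in range(n)]

a = seeded_init(42, 4)
b = seeded_init(42, 4)
assert a == b  # reproducible across runs
```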

@IlyasMoutawwakil
Member Author

IlyasMoutawwakil commented Aug 6, 2024

Looks good. The failing tests on Windows are mostly OS errors in the testing logic (use of unsafe rmtree); I can try fixing them in another PR.
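On Windows, a plain shutil.rmtree raises when it hits read-only files (common in git checkouts); a sketch of the usual workaround, clearing the read-only bit in an error handler and retrying (helper names are illustrative, not from this PR):

```python
import os
import shutil
import stat
import tempfile

def _make_writable_and_retry(func, path, exc_info):
    # Windows marks some files read-only; clear the bit and retry the
    # failed operation (unlink/rmdir).
    os.chmod(path, stat.S_IWRITE)
    func(path)

def safe_rmtree(path: str) -> None:
    shutil.rmtree(path, onerror=_make_writable_and_retry)

# Usage: remove a directory containing a read-only file.
d = tempfile.mkdtemp()
f = os.path.join(d, "locked.txt")
open(f, "w").close()
os.chmod(f, stat.S_IREAD)
safe_rmtree(d)
```

On Python 3.12+ the handler can be passed as onexc instead; onerror still works but is deprecated.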

@IlyasMoutawwakil IlyasMoutawwakil merged commit 12438c4 into main Aug 6, 2024
22 of 24 checks passed
@IlyasMoutawwakil IlyasMoutawwakil deleted the support-transformers-4.43 branch August 6, 2024 11:43
IlyasMoutawwakil added a commit that referenced this pull request Aug 6, 2024
* install from pr

* updates

* fix

* update TRANSFORMERS_MAX_VERSION

* fix sdpa in training

* fix whisper

* fix

* whisper calibration checks

* fix OVTrainerTextClassificationTrainingTest's expected fake quantize

* fix OVCLIExportTestCase's expected_int4

* update min ci transformers version to 4.37

* fix OVQuantizerTest's expected fake quantize

* reorder_cache

* fix expected compressed matmuls

* fix test_exporters_cli_int4_with_local_model_and_default_config

* fix qwen custom modeling test

* fix failing ipex tests

* fix ipex

* fix the last ipex failing test_compare_with_and_without_past_key_values

* use minimal prepare_inputs_for_generation in OVModelForSpeechSeq2Seq

* keeping compatibility with transformers 4.36

* keep support of whisper using WhisperGenerationMixin.generate and dummy model fix

* trigger

* fix

* device property

* standardize .device and ._device attributes/properties

* fix

* fix

* revert

Co-authored-by: Ella Charlaix <[email protected]>

* use falcon

* torch.device property always cpu

* style

* resolve conflicts

* decoder_attention_mask for older versions

* optimum main

* limit inc transformers version

* fix pipeline missing dtype

* add dtype for seq to seq models

* pass phi beam search test and skip internlm2

* fix for internlm2

---------

Co-authored-by: Ella Charlaix <[email protected]>
Labels
openvino-test Trigger OpenVINO slow tests
5 participants