[hotfix] fix bug in flash_attention_patch.py #4908
Merged
This PR keeps the patch compatible with a recent change in the Transformers library, where a new argument `padding_mask` was added to the forward function of the attention layer. huggingface/transformers#25598
📌 Checklist before creating the PR
[doc/gemini/tensor/...]: A concise description
🚨 Issue number
Closes #4903 (comment)
📝 What does this PR do?
Add `**kwargs` to the original `attention_forward` function so that `flash_attention_patch` remains compatible with the new change in the Transformers library. A minimal sketch of the idea is shown below.
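For illustration only, a minimal sketch of the patched signature (argument names follow the LLaMA-style attention forward in Transformers; the flash-attention body is elided and the exact ColossalAI code may differ). The only relevant change is the trailing `**kwargs`, which absorbs new keyword arguments such as `padding_mask` so the monkey-patched function does not raise a `TypeError` on newer Transformers versions.

```python
from typing import Optional, Tuple

import torch


def attention_forward(
    self,
    hidden_states: torch.Tensor,
    attention_mask: Optional[torch.Tensor] = None,
    position_ids: Optional[torch.LongTensor] = None,
    past_key_value: Optional[Tuple[torch.Tensor]] = None,
    output_attentions: bool = False,
    use_cache: bool = False,
    **kwargs,  # absorbs extra keywords (e.g. `padding_mask`) passed by newer Transformers releases
) -> Tuple[torch.Tensor, Optional[torch.Tensor], Optional[Tuple[torch.Tensor]]]:
    # ... flash-attention computation unchanged ...
    ...
```

Accepting `**kwargs` keeps the patch forward-compatible: older Transformers versions that do not pass `padding_mask` continue to work, and newer versions that do pass it no longer break the patched call site.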
💥 Checklist before requesting a review
⭐️ Do you enjoy contributing to Colossal-AI?
Tell us more if you don't enjoy contributing to Colossal-AI.