Enable `flash_attn_func` using AMD's Triton flash attention kernel.
This enables `flash_attn_func`. There will be follow-up PRs to enable the forward pass of the other flash attention APIs. We skip building the CUDA kernels and use the Triton kernel on AMD. This is fine for our fork, but upstream will probably require a way to ship a single wheel.
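For illustration, here is a minimal sketch of how a caller would exercise `flash_attn_func` on a ROCm build where this change routes the computation to the Triton kernel. The tensor layout follows the usual flash-attn convention (batch, seqlen, nheads, headdim); that the dispatch to the Triton kernel happens transparently on AMD is an assumption based on this PR's description rather than a documented guarantee.

```python
# Sketch: calling flash_attn_func on an AMD (ROCm) build of PyTorch.
# Assumes fp16 inputs on the GPU and the standard flash-attn layout
# (batch, seqlen, nheads, headdim). With this PR, the CUDA kernels are
# not built on AMD, so the call is assumed to go through the Triton kernel.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 16, 64
q = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)
k = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)
v = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)

# torch.version.hip is non-None on ROCm builds, which is one way a caller
# can check it is actually running on AMD hardware.
assert torch.version.hip is not None

out = flash_attn_func(q, k, v, causal=True)
print(out.shape)  # (batch, seqlen, nheads, headdim)
```

Since only the forward pass of `flash_attn_func` is covered here, code that calls `.backward()` through this path would still need the follow-up PRs mentioned above.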