ROCm / flash-attention Public

forked from Dao-AILab/flash-attention

Notifications You must be signed in to change notification settings
Fork 48
Star 148

Code
Issues 23
Pull requests 11
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: ROCm/flash-attention

Labels 17 Milestones 0

New pull request New

11 Open 58 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Support for Sliding Window Attention

#109 opened Dec 10, 2024 by alexkranias-amd

Loading…

fp8 bwd

#108 opened Dec 9, 2024 by micmelesse • Draft

Clean Up

#107 opened Dec 6, 2024 by micmelesse • Draft

[Do not merge] vllm layout varlen WIP

work in progress

#106 opened Dec 3, 2024 by rocking5566

Loading…

Added Benchmark for Rotary Decode Kernel + Performance Speed Up for Rotary Kernel

#102 opened Nov 22, 2024 by alexkranias-amd

Loading…

Added Dropout BWD

#95 opened Nov 5, 2024 by alexkranias-amd

Loading…

Fix stride issues in flash_attn_interface

#58 opened May 31, 2024 by clintg6

Loading…

GPUAI-1250 - Flash Attention v2.04 two modules layer_norm cannot be used fixed

#52 opened Apr 3, 2024 by xiaoxiangAMD

Loading…

add FA api benchmark csv

#48 opened Mar 7, 2024 by fsx950223

Loading…

GPUAI-1250 - Flash Attention v2.04 module rotary cannot be used code fixed

#47 opened Mar 1, 2024 by xiaoxiangAMD

Loading…

Flash attention for rocm

#1 opened Feb 17, 2023 by groenenboomj

Loading…

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly