Pull requests: ROCm/flash-attention (forked from Dao-AILab/flash-attention)
#99: Added Support for Rotary Positional Embeddings (both non-fused and fused kernel), opened Nov 13, 2024 by alexkranias-amd (non-fused usage sketched after this list)
#83: Integrated Rotary Positional Embeddings (RoPEs) into flash_attn_kvcache, opened Sep 27, 2024 by alexkranias-amd (decode-time usage sketched after this list)
#52: GPUAI-1250 - Flash Attention v2.04 two modules layer_norm cannot be used fixed, opened Apr 3, 2024 by xiaoxiangAMD
#47: GPUAI-1250 - Flash Attention v2.04 module rotary cannot be used code fixed, opened Mar 1, 2024 by xiaoxiangAMD
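For context on what #99 targets, here is a minimal sketch of the non-fused path, assuming the fork follows the upstream flash_attn.layers.rotary.apply_rotary_emb and flash_attn_func interfaces; the shapes, rotary base of 10000, and dtype below are illustrative choices, not values taken from the PR.

```python
import torch
from flash_attn import flash_attn_func
from flash_attn.layers.rotary import apply_rotary_emb

batch, seqlen, nheads, headdim = 2, 1024, 16, 128
device, dtype = "cuda", torch.float16

q = torch.randn(batch, seqlen, nheads, headdim, device=device, dtype=dtype)
k = torch.randn(batch, seqlen, nheads, headdim, device=device, dtype=dtype)
v = torch.randn(batch, seqlen, nheads, headdim, device=device, dtype=dtype)

# Precompute cos/sin tables for the rotary dimension (full head dim rotated here).
rotary_dim = headdim
inv_freq = 1.0 / (10000 ** (torch.arange(0, rotary_dim, 2, device=device).float() / rotary_dim))
positions = torch.arange(seqlen, device=device).float()
freqs = torch.outer(positions, inv_freq)              # (seqlen, rotary_dim / 2)
cos, sin = freqs.cos().to(dtype), freqs.sin().to(dtype)

# Non-fused path: rotate q and k explicitly, then call the attention kernel.
q_rot = apply_rotary_emb(q, cos, sin, interleaved=False)
k_rot = apply_rotary_emb(k, cos, sin, interleaved=False)
out = flash_attn_func(q_rot, k_rot, v, causal=True)   # (batch, seqlen, nheads, headdim)
```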
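For #83, a minimal decode-step sketch of fused RoPE in the KV-cache path, assuming the fork exposes the upstream flash_attn_with_kvcache signature with its rotary_cos, rotary_sin, and rotary_interleaved parameters; shapes and the cached-length value are again illustrative.

```python
import torch
from flash_attn import flash_attn_with_kvcache

batch, nheads, headdim, max_seqlen = 2, 16, 128, 4096
device, dtype = "cuda", torch.float16

k_cache = torch.zeros(batch, max_seqlen, nheads, headdim, device=device, dtype=dtype)
v_cache = torch.zeros(batch, max_seqlen, nheads, headdim, device=device, dtype=dtype)
cache_seqlens = torch.full((batch,), 512, dtype=torch.int32, device=device)  # tokens already cached

# One new token per sequence for this decode step.
q = torch.randn(batch, 1, nheads, headdim, device=device, dtype=dtype)
k_new = torch.randn(batch, 1, nheads, headdim, device=device, dtype=dtype)
v_new = torch.randn(batch, 1, nheads, headdim, device=device, dtype=dtype)

# Rotary tables; with causal=True the kernel applies RoPE to q and k_new at the
# positions given by cache_seqlens before appending k_new/v_new to the cache.
rotary_dim = headdim
inv_freq = 1.0 / (10000 ** (torch.arange(0, rotary_dim, 2, device=device).float() / rotary_dim))
positions = torch.arange(max_seqlen, device=device).float()
freqs = torch.outer(positions, inv_freq)              # (max_seqlen, rotary_dim / 2)
rotary_cos, rotary_sin = freqs.cos().to(dtype), freqs.sin().to(dtype)

out = flash_attn_with_kvcache(
    q, k_cache, v_cache, k=k_new, v=v_new,
    rotary_cos=rotary_cos, rotary_sin=rotary_sin,
    cache_seqlens=cache_seqlens, causal=True, rotary_interleaved=False,
)  # (batch, 1, nheads, headdim)
```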