-
Notifications
You must be signed in to change notification settings - Fork 45
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flash attention for rocm #1
base: main
Are you sure you want to change the base?
Commits on Feb 16, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f5d8763 - Browse repository at this point
Copy the full SHA f5d8763View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5c257c9 - Browse repository at this point
Copy the full SHA 5c257c9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 63405db - Browse repository at this point
Copy the full SHA 63405dbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 17ea3a7 - Browse repository at this point
Copy the full SHA 17ea3a7View commit details
Commits on Feb 17, 2023
-
Configuration menu - View commit details
-
Copy full SHA for d1bf99a - Browse repository at this point
Copy the full SHA d1bf99aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 84ed6d5 - Browse repository at this point
Copy the full SHA 84ed6d5View commit details
Commits on Feb 20, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 43f28bd - Browse repository at this point
Copy the full SHA 43f28bdView commit details
Commits on Feb 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for c730d50 - Browse repository at this point
Copy the full SHA c730d50View commit details
Commits on Feb 25, 2023
-
Run the FlashAttention benchmark on more configs and on forward pass only.
Configuration menu - View commit details
-
Copy full SHA for 24c81ea - Browse repository at this point
Copy the full SHA 24c81eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 228bb1a - Browse repository at this point
Copy the full SHA 228bb1aView commit details
Commits on Feb 27, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 53dd6cd - Browse repository at this point
Copy the full SHA 53dd6cdView commit details
Commits on Feb 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 55165c0 - Browse repository at this point
Copy the full SHA 55165c0View commit details
Commits on Mar 1, 2023
-
Configuration menu - View commit details
-
Copy full SHA for c7ec4c0 - Browse repository at this point
Copy the full SHA c7ec4c0View commit details -
Configuration menu - View commit details
-
Copy full SHA for b1473a8 - Browse repository at this point
Copy the full SHA b1473a8View commit details
Commits on Mar 2, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f788e7d - Browse repository at this point
Copy the full SHA f788e7dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9f6d0ae - Browse repository at this point
Copy the full SHA 9f6d0aeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7164b75 - Browse repository at this point
Copy the full SHA 7164b75View commit details
Commits on Mar 3, 2023
-
Configuration menu - View commit details
-
Copy full SHA for de43726 - Browse repository at this point
Copy the full SHA de43726View commit details
Commits on Mar 6, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f51255f - Browse repository at this point
Copy the full SHA f51255fView commit details
Commits on Mar 7, 2023
-
Configuration menu - View commit details
-
Copy full SHA for fb1be67 - Browse repository at this point
Copy the full SHA fb1be67View commit details -
Configuration menu - View commit details
-
Copy full SHA for f6b11c7 - Browse repository at this point
Copy the full SHA f6b11c7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9de5c29 - Browse repository at this point
Copy the full SHA 9de5c29View commit details
Commits on Mar 8, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f1eb89e - Browse repository at this point
Copy the full SHA f1eb89eView commit details -
Configuration menu - View commit details
-
Copy full SHA for f26ced0 - Browse repository at this point
Copy the full SHA f26ced0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 93677be - Browse repository at this point
Copy the full SHA 93677beView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9b94f55 - Browse repository at this point
Copy the full SHA 9b94f55View commit details -
Configuration menu - View commit details
-
Copy full SHA for 40978cd - Browse repository at this point
Copy the full SHA 40978cdView commit details -
Configuration menu - View commit details
-
Copy full SHA for ccd80bf - Browse repository at this point
Copy the full SHA ccd80bfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 05aec02 - Browse repository at this point
Copy the full SHA 05aec02View commit details
Commits on Mar 9, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 75458cb - Browse repository at this point
Copy the full SHA 75458cbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 83c46c8 - Browse repository at this point
Copy the full SHA 83c46c8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 27f84e8 - Browse repository at this point
Copy the full SHA 27f84e8View commit details
Commits on Mar 10, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 06acbdb - Browse repository at this point
Copy the full SHA 06acbdbView commit details
Commits on Mar 11, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 065c2f0 - Browse repository at this point
Copy the full SHA 065c2f0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 324bcbf - Browse repository at this point
Copy the full SHA 324bcbfView commit details
Commits on Mar 13, 2023
-
Configuration menu - View commit details
-
Copy full SHA for d3b9fc6 - Browse repository at this point
Copy the full SHA d3b9fc6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 890091e - Browse repository at this point
Copy the full SHA 890091eView commit details
Commits on Mar 14, 2023
-
Merge pull request #14 from fsx950223/mlperf_test2
Optimized api and enabled bwd pass
Configuration menu - View commit details
-
Copy full SHA for cefe848 - Browse repository at this point
Copy the full SHA cefe848View commit details -
Configuration menu - View commit details
-
Copy full SHA for 80b3a49 - Browse repository at this point
Copy the full SHA 80b3a49View commit details -
Configuration menu - View commit details
-
Copy full SHA for d4d0c6f - Browse repository at this point
Copy the full SHA d4d0c6fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 92cedaf - Browse repository at this point
Copy the full SHA 92cedafView commit details
Commits on Mar 15, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 0103fcb - Browse repository at this point
Copy the full SHA 0103fcbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2d64089 - Browse repository at this point
Copy the full SHA 2d64089View commit details
Commits on Apr 11, 2023
-
Merge pull request #12 from ROCmSoftwarePlatform/dropout-verify
Added dropout for flash_attention_for_rocm
Configuration menu - View commit details
-
Copy full SHA for a3ecabe - Browse repository at this point
Copy the full SHA a3ecabeView commit details
Commits on Apr 13, 2023
-
Configuration menu - View commit details
-
Copy full SHA for d0cc349 - Browse repository at this point
Copy the full SHA d0cc349View commit details -
Configuration menu - View commit details
-
Copy full SHA for 79d7ca1 - Browse repository at this point
Copy the full SHA 79d7ca1View commit details -
Configuration menu - View commit details
-
Copy full SHA for f4827f8 - Browse repository at this point
Copy the full SHA f4827f8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 325367c - Browse repository at this point
Copy the full SHA 325367cView commit details
Commits on Apr 14, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 7a81af7 - Browse repository at this point
Copy the full SHA 7a81af7View commit details -
Configuration menu - View commit details
-
Copy full SHA for a67bc9c - Browse repository at this point
Copy the full SHA a67bc9cView commit details
Commits on Apr 19, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 36de0b6 - Browse repository at this point
Copy the full SHA 36de0b6View commit details
Commits on Apr 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for e3ff7b1 - Browse repository at this point
Copy the full SHA e3ff7b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for f7d1133 - Browse repository at this point
Copy the full SHA f7d1133View commit details -
Configuration menu - View commit details
-
Copy full SHA for e84f4a0 - Browse repository at this point
Copy the full SHA e84f4a0View commit details
Commits on Apr 27, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 963dfb9 - Browse repository at this point
Copy the full SHA 963dfb9View commit details
Commits on May 22, 2023
-
udpate dockerfile for ROCm 5.4 and Py3.8; modify patch path
Junhao committedMay 22, 2023 Configuration menu - View commit details
-
Copy full SHA for 9ee09b1 - Browse repository at this point
Copy the full SHA 9ee09b1View commit details
Commits on May 30, 2023
-
add switch for RTZ and deterministic
Junhao committedMay 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 3b883df - Browse repository at this point
Copy the full SHA 3b883dfView commit details -
add switches for RTZ and deterministic
Junhao committedMay 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 58b0844 - Browse repository at this point
Copy the full SHA 58b0844View commit details -
Junhao committed
May 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 44a17a5 - Browse repository at this point
Copy the full SHA 44a17a5View commit details
Commits on May 31, 2023
-
Junhao committed
May 31, 2023 Configuration menu - View commit details
-
Copy full SHA for 66cd14d - Browse repository at this point
Copy the full SHA 66cd14dView commit details
Commits on Jun 1, 2023
-
python runtime api for deterministic and performance mode
Junhao committedJun 1, 2023 Configuration menu - View commit details
-
Copy full SHA for b6b4090 - Browse repository at this point
Copy the full SHA b6b4090View commit details -
Junhao committed
Jun 1, 2023 Configuration menu - View commit details
-
Copy full SHA for 618918f - Browse repository at this point
Copy the full SHA 618918fView commit details -
Junhao committed
Jun 1, 2023 Configuration menu - View commit details
-
Copy full SHA for c0be910 - Browse repository at this point
Copy the full SHA c0be910View commit details -
Junhao committed
Jun 1, 2023 Configuration menu - View commit details
-
Copy full SHA for 261c92a - Browse repository at this point
Copy the full SHA 261c92aView commit details -
Junhao committed
Jun 1, 2023 Configuration menu - View commit details
-
Copy full SHA for f4854a2 - Browse repository at this point
Copy the full SHA f4854a2View commit details -
Junhao committed
Jun 1, 2023 Configuration menu - View commit details
-
Copy full SHA for 93844af - Browse repository at this point
Copy the full SHA 93844afView commit details -
Junhao committed
Jun 1, 2023 Configuration menu - View commit details
-
Copy full SHA for f638aa6 - Browse repository at this point
Copy the full SHA f638aa6View commit details
Commits on Jun 2, 2023
-
Junhao Zhang authored
Jun 2, 2023 Configuration menu - View commit details
-
Copy full SHA for 6cb6b26 - Browse repository at this point
Copy the full SHA 6cb6b26View commit details -
Junhao committed
Jun 2, 2023 Configuration menu - View commit details
-
Copy full SHA for d5d80c5 - Browse repository at this point
Copy the full SHA d5d80c5View commit details -
Junhao Zhang authored
Jun 2, 2023 Configuration menu - View commit details
-
Copy full SHA for adcd98f - Browse repository at this point
Copy the full SHA adcd98fView commit details -
Merge pull request #15 from ROCmSoftwarePlatform/jhzhan/release_test
Release merged with test_rtz
Configuration menu - View commit details
-
Copy full SHA for 0c84715 - Browse repository at this point
Copy the full SHA 0c84715View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7633247 - Browse repository at this point
Copy the full SHA 7633247View commit details -
Merge pull request #2 from ROCmSoftwarePlatform/jhzhan/release_test
Jhzhan/release test
Configuration menu - View commit details
-
Copy full SHA for 782e7ab - Browse repository at this point
Copy the full SHA 782e7abView commit details -
modify readme and minor changes
Junhao committedJun 2, 2023 Configuration menu - View commit details
-
Copy full SHA for 918cd00 - Browse repository at this point
Copy the full SHA 918cd00View commit details -
Junhao committed
Jun 2, 2023 Configuration menu - View commit details
-
Copy full SHA for cfb7f3f - Browse repository at this point
Copy the full SHA cfb7f3fView commit details -
Junhao committed
Jun 2, 2023 Configuration menu - View commit details
-
Copy full SHA for 2205fdc - Browse repository at this point
Copy the full SHA 2205fdcView commit details -
Update flash_attn_interface.py
Junhao Zhang authoredJun 2, 2023 Configuration menu - View commit details
-
Copy full SHA for 9273197 - Browse repository at this point
Copy the full SHA 9273197View commit details
Commits on Jun 5, 2023
-
Junhao committed
Jun 5, 2023 Configuration menu - View commit details
-
Copy full SHA for 7e6a96a - Browse repository at this point
Copy the full SHA 7e6a96aView commit details
Commits on Jun 6, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 9c01c25 - Browse repository at this point
Copy the full SHA 9c01c25View commit details -
unify data types of input, output, and gemm in either FP16 or BF16 fo…
…r tuning performance; refactor codes
Junhao committedJun 6, 2023 Configuration menu - View commit details
-
Copy full SHA for ceea624 - Browse repository at this point
Copy the full SHA ceea624View commit details
Commits on Jun 7, 2023
-
using BF16 as GEMM type in performance mode
Junhao committedJun 7, 2023 Configuration menu - View commit details
-
Copy full SHA for d565fad - Browse repository at this point
Copy the full SHA d565fadView commit details -
Merge branch 'flash_attention_for_rocm' of https://github.com/ROCmSof…
…twarePlatform/flash-attention into flash_attention_for_rocm
Junhao committedJun 7, 2023 Configuration menu - View commit details
-
Copy full SHA for e488af5 - Browse repository at this point
Copy the full SHA e488af5View commit details
Commits on Jun 15, 2023
-
change random seeds api in accordance with PyTorch 1.13.1+
Junhao committedJun 15, 2023 Configuration menu - View commit details
-
Copy full SHA for ee0665c - Browse repository at this point
Copy the full SHA ee0665cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8559ccd - Browse repository at this point
Copy the full SHA 8559ccdView commit details
Commits on Jun 19, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 9887a29 - Browse repository at this point
Copy the full SHA 9887a29View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e1f9ea - Browse repository at this point
Copy the full SHA 3e1f9eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8512242 - Browse repository at this point
Copy the full SHA 8512242View commit details
Commits on Jun 20, 2023
-
Configuration menu - View commit details
-
Copy full SHA for beab3fb - Browse repository at this point
Copy the full SHA beab3fbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d05af4 - Browse repository at this point
Copy the full SHA 4d05af4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9838670 - Browse repository at this point
Copy the full SHA 9838670View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0317244 - Browse repository at this point
Copy the full SHA 0317244View commit details -
Configuration menu - View commit details
-
Copy full SHA for 662535c - Browse repository at this point
Copy the full SHA 662535cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6a51836 - Browse repository at this point
Copy the full SHA 6a51836View commit details
Commits on Jun 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for ad3259a - Browse repository at this point
Copy the full SHA ad3259aView commit details -
Merge remote-tracking branch 'public/flash_attention_for_rocm2' into …
…flash_attention_for_rocm2
Configuration menu - View commit details
-
Copy full SHA for 99637e4 - Browse repository at this point
Copy the full SHA 99637e4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 983d299 - Browse repository at this point
Copy the full SHA 983d299View commit details -
Configuration menu - View commit details
-
Copy full SHA for e90010b - Browse repository at this point
Copy the full SHA e90010bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 63ce40f - Browse repository at this point
Copy the full SHA 63ce40fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 78aada9 - Browse repository at this point
Copy the full SHA 78aada9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1a11344 - Browse repository at this point
Copy the full SHA 1a11344View commit details
Commits on Jun 26, 2023
-
Configuration menu - View commit details
-
Copy full SHA for dedea21 - Browse repository at this point
Copy the full SHA dedea21View commit details
Commits on Jun 27, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 424141b - Browse repository at this point
Copy the full SHA 424141bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 994eca4 - Browse repository at this point
Copy the full SHA 994eca4View commit details -
Configuration menu - View commit details
-
Copy full SHA for f67f948 - Browse repository at this point
Copy the full SHA f67f948View commit details
Commits on Jun 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 7e190a4 - Browse repository at this point
Copy the full SHA 7e190a4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 98258ef - Browse repository at this point
Copy the full SHA 98258efView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8bb4d98 - Browse repository at this point
Copy the full SHA 8bb4d98View commit details
Commits on Jul 4, 2023
-
Configuration menu - View commit details
-
Copy full SHA for ab576b9 - Browse repository at this point
Copy the full SHA ab576b9View commit details
Commits on Jul 5, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6834c97 - Browse repository at this point
Copy the full SHA 6834c97View commit details
Commits on Jul 11, 2023
-
This reverts commit 8559ccd. Revert "change random seeds api in accordance with PyTorch 1.13.1+" This reverts commit ee0665c. Revert "using BF16 as GEMM type in performance mode" This reverts commit d565fad. Revert "unify data types of input, output, and gemm in either FP16 or BF16 for tuning performance; refactor codes" This reverts commit ceea624. Revert "update docker and readme to remove private reference" This reverts commit 9c01c25. Revert "Update dockerfile" This reverts commit 7e6a96a.
Configuration menu - View commit details
-
Copy full SHA for 10d7481 - Browse repository at this point
Copy the full SHA 10d7481View commit details -
Configuration menu - View commit details
-
Copy full SHA for 777e166 - Browse repository at this point
Copy the full SHA 777e166View commit details -
Configuration menu - View commit details
-
Copy full SHA for b8f2ee6 - Browse repository at this point
Copy the full SHA b8f2ee6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e4f367 - Browse repository at this point
Copy the full SHA 3e4f367View commit details -
Configuration menu - View commit details
-
Copy full SHA for bfb1d75 - Browse repository at this point
Copy the full SHA bfb1d75View commit details -
Configuration menu - View commit details
-
Copy full SHA for b83723c - Browse repository at this point
Copy the full SHA b83723cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6e2a304 - Browse repository at this point
Copy the full SHA 6e2a304View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ad9386 - Browse repository at this point
Copy the full SHA 5ad9386View commit details -
Configuration menu - View commit details
-
Copy full SHA for e29d75f - Browse repository at this point
Copy the full SHA e29d75fView commit details -
Configuration menu - View commit details
-
Copy full SHA for deb2e94 - Browse repository at this point
Copy the full SHA deb2e94View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f5297b - Browse repository at this point
Copy the full SHA 3f5297bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 22b64b1 - Browse repository at this point
Copy the full SHA 22b64b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 535f1b7 - Browse repository at this point
Copy the full SHA 535f1b7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1ddabb8 - Browse repository at this point
Copy the full SHA 1ddabb8View commit details -
Configuration menu - View commit details
-
Copy full SHA for db62edc - Browse repository at this point
Copy the full SHA db62edcView commit details -
Configuration menu - View commit details
-
Copy full SHA for cf2ffe1 - Browse repository at this point
Copy the full SHA cf2ffe1View commit details
Commits on Jul 12, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6aacb04 - Browse repository at this point
Copy the full SHA 6aacb04View commit details
Commits on Jul 14, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 67d897b - Browse repository at this point
Copy the full SHA 67d897bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0cb0cd5 - Browse repository at this point
Copy the full SHA 0cb0cd5View commit details -
Merge pull request #6 from ROCmSoftwarePlatform/attn-qloop-kloop-v2
Enable both Qloop and Kloop
Configuration menu - View commit details
-
Copy full SHA for 9551449 - Browse repository at this point
Copy the full SHA 9551449View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0ba1882 - Browse repository at this point
Copy the full SHA 0ba1882View commit details -
Configuration menu - View commit details
-
Copy full SHA for 489a673 - Browse repository at this point
Copy the full SHA 489a673View commit details -
Configuration menu - View commit details
-
Copy full SHA for a988787 - Browse repository at this point
Copy the full SHA a988787View commit details -
Configuration menu - View commit details
-
Copy full SHA for 34e29f7 - Browse repository at this point
Copy the full SHA 34e29f7View commit details
Commits on Jul 31, 2023
-
Reduce the compiling time by spliting into several cpp files (#7)
Tested the elapsed time of "python setup.py install" on ROCm5.7/PyTorch 1.13.1: Older version: 26m1.244s This version: 4m11.111s on PyTorch 1.13.1;3m39.470s on PyTorch 2.0.1 Unit tests passed on ROCm5.7 + PyTorch 1.13.1:2113 passed, 2848 skipped in 119.70s * refactoring code * update ignores * bug fixes * patch updates * fix test cases * remove useless fils * update ck --------- Co-authored-by: Junhao <[email protected]> Co-authored-by: fsx950223 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0821eb0 - Browse repository at this point
Copy the full SHA 0821eb0View commit details
Commits on Aug 7, 2023
-
Remove PyTorch patch by updating PyTorch
Use a version of PyTorch with the hipify changes included.
Configuration menu - View commit details
-
Copy full SHA for 05d45e4 - Browse repository at this point
Copy the full SHA 05d45e4View commit details
Commits on Aug 11, 2023
-
root committed
Aug 11, 2023 Configuration menu - View commit details
-
Copy full SHA for a2e81ca - Browse repository at this point
Copy the full SHA a2e81caView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0627500 - Browse repository at this point
Copy the full SHA 0627500View commit details
Commits on Aug 12, 2023
-
Configuration menu - View commit details
-
Copy full SHA for ed7ccb3 - Browse repository at this point
Copy the full SHA ed7ccb3View commit details
Commits on Aug 14, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b6d78bd - Browse repository at this point
Copy the full SHA b6d78bdView commit details
Commits on Aug 16, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 21b45c3 - Browse repository at this point
Copy the full SHA 21b45c3View commit details -
Merge pull request #8 from ROCmSoftwarePlatform/remove_patch
Remove patch
Configuration menu - View commit details
-
Copy full SHA for 52427b5 - Browse repository at this point
Copy the full SHA 52427b5View commit details
Commits on Aug 17, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6d88e70 - Browse repository at this point
Copy the full SHA 6d88e70View commit details -
Configuration menu - View commit details
-
Copy full SHA for eabcebf - Browse repository at this point
Copy the full SHA eabcebfView commit details
Commits on Aug 18, 2023
-
Configuration menu - View commit details
-
Copy full SHA for fe1cb5a - Browse repository at this point
Copy the full SHA fe1cb5aView commit details
Commits on Aug 22, 2023
-
Merge pull request #10 from ROCmSoftwarePlatform/inference-opt
Optimization based on profiling for forward.
Configuration menu - View commit details
-
Copy full SHA for 4619d9c - Browse repository at this point
Copy the full SHA 4619d9cView commit details
Commits on Aug 28, 2023
-
root committed
Aug 28, 2023 Configuration menu - View commit details
-
Copy full SHA for c902c75 - Browse repository at this point
Copy the full SHA c902c75View commit details
Commits on Aug 29, 2023
-
Junhao committed
Aug 29, 2023 Configuration menu - View commit details
-
Copy full SHA for ce59e9f - Browse repository at this point
Copy the full SHA ce59e9fView commit details
Commits on Aug 31, 2023
-
Optimized API for packed conditions (#12)
* optimized api for fwd in packed conditions * optimized api for bwd
Configuration menu - View commit details
-
Copy full SHA for d394549 - Browse repository at this point
Copy the full SHA d394549View commit details
Commits on Sep 13, 2023
-
compatiable with xformers (#13)
* compatiable with xformers * add get_package_version function
Configuration menu - View commit details
-
Copy full SHA for efd5e04 - Browse repository at this point
Copy the full SHA efd5e04View commit details
Commits on Sep 15, 2023
-
Merge tag 'v2.0.0' of https://github.com/Dao-AILab/flash-attention in…
…to junhzhan/ifu-v2.0.0
Junhao Zhang committedSep 15, 2023 Configuration menu - View commit details
-
Copy full SHA for 0b037c2 - Browse repository at this point
Copy the full SHA 0b037c2View commit details -
added setup.py for ROCm; increase code readability; rename files.
Junhao Zhang committedSep 15, 2023 Configuration menu - View commit details
-
Copy full SHA for 0de9665 - Browse repository at this point
Copy the full SHA 0de9665View commit details -
modified mha_fwd; added mha_varlen_fwd
Junhao Zhang committedSep 15, 2023 Configuration menu - View commit details
-
Copy full SHA for 48f57bf - Browse repository at this point
Copy the full SHA 48f57bfView commit details
Commits on Sep 18, 2023
-
enable mha_bwd + mha_varlen_bwd
Junhao Zhang committedSep 18, 2023 Configuration menu - View commit details
-
Copy full SHA for cef81d1 - Browse repository at this point
Copy the full SHA cef81d1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 16b0c17 - Browse repository at this point
Copy the full SHA 16b0c17View commit details -
Junhao Zhang committed
Sep 18, 2023 Configuration menu - View commit details
-
Copy full SHA for 1e9ddf8 - Browse repository at this point
Copy the full SHA 1e9ddf8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7581be2 - Browse repository at this point
Copy the full SHA 7581be2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 94273a8 - Browse repository at this point
Copy the full SHA 94273a8View commit details
Commits on Sep 19, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b13603f - Browse repository at this point
Copy the full SHA b13603fView commit details -
Junhao Zhang authored
Sep 19, 2023 Configuration menu - View commit details
-
Copy full SHA for f978f3e - Browse repository at this point
Copy the full SHA f978f3eView commit details -
Configuration menu - View commit details
-
Copy full SHA for c656139 - Browse repository at this point
Copy the full SHA c656139View commit details -
Junhao Zhang authored
Sep 19, 2023 Configuration menu - View commit details
-
Copy full SHA for 3f53461 - Browse repository at this point
Copy the full SHA 3f53461View commit details -
Configuration menu - View commit details
-
Copy full SHA for 631f027 - Browse repository at this point
Copy the full SHA 631f027View commit details -
Junhao Zhang committed
Sep 19, 2023 Configuration menu - View commit details
-
Copy full SHA for a6900a4 - Browse repository at this point
Copy the full SHA a6900a4View commit details -
Junhao Zhang committed
Sep 19, 2023 Configuration menu - View commit details
-
Copy full SHA for dc98ee5 - Browse repository at this point
Copy the full SHA dc98ee5View commit details -
Junhao committed
Sep 19, 2023 Configuration menu - View commit details
-
Copy full SHA for 37e5961 - Browse repository at this point
Copy the full SHA 37e5961View commit details
Commits on Sep 20, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 609262f - Browse repository at this point
Copy the full SHA 609262fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8216584 - Browse repository at this point
Copy the full SHA 8216584View commit details -
Configuration menu - View commit details
-
Copy full SHA for de70d9d - Browse repository at this point
Copy the full SHA de70d9dView commit details
Commits on Sep 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for e82b97a - Browse repository at this point
Copy the full SHA e82b97aView commit details -
Configuration menu - View commit details
-
Copy full SHA for d7be208 - Browse repository at this point
Copy the full SHA d7be208View commit details
Commits on Sep 22, 2023
-
Merge pull request #15 from ROCmSoftwarePlatform/bwd-prof-opt
* updated ck and removed kloop * removed kloop related files * updated test file * modified test file * added bwd light version * optimize code for light * stage process for bwd nonpadding * modified ratit for bwd * added padding branch * removed kloop stuff * added rtn to ck TBD: Fix accuracy degradation since introduction of int8 drop.
Configuration menu - View commit details
-
Copy full SHA for 444e15a - Browse repository at this point
Copy the full SHA 444e15aView commit details
Commits on Sep 28, 2023
-
Junhao committed
Sep 28, 2023 Configuration menu - View commit details
-
Copy full SHA for 1d4913f - Browse repository at this point
Copy the full SHA 1d4913fView commit details -
Junhao committed
Sep 28, 2023 Configuration menu - View commit details
-
Copy full SHA for 39c6578 - Browse repository at this point
Copy the full SHA 39c6578View commit details -
Junhao Zhang committed
Sep 28, 2023 Configuration menu - View commit details
-
Copy full SHA for 0d557df - Browse repository at this point
Copy the full SHA 0d557dfView commit details -
Merge pull request '#15' into junhzhan/ifu-v2.0.0;
Junhao Zhang committedSep 28, 2023 Configuration menu - View commit details
-
Copy full SHA for 623ffbb - Browse repository at this point
Copy the full SHA 623ffbbView commit details -
Junhao Zhang authored
Sep 28, 2023 Configuration menu - View commit details
-
Copy full SHA for e61ba7a - Browse repository at this point
Copy the full SHA e61ba7aView commit details
Commits on Oct 7, 2023
-
Junhao Zhang authored
Oct 7, 2023 Configuration menu - View commit details
-
Copy full SHA for 94b9dd5 - Browse repository at this point
Copy the full SHA 94b9dd5View commit details -
Junhao Zhang authored
Oct 7, 2023 Configuration menu - View commit details
-
Copy full SHA for 67162e3 - Browse repository at this point
Copy the full SHA 67162e3View commit details -
bug fixes for batched template
Junhao Zhang committedOct 7, 2023 Configuration menu - View commit details
-
Copy full SHA for fe31011 - Browse repository at this point
Copy the full SHA fe31011View commit details -
bug fixes for batched template
Junhao Zhang committedOct 7, 2023 Configuration menu - View commit details
-
Copy full SHA for ae87e65 - Browse repository at this point
Copy the full SHA ae87e65View commit details
Commits on Oct 8, 2023
-
Junhao Zhang committed
Oct 8, 2023 Configuration menu - View commit details
-
Copy full SHA for 89806a4 - Browse repository at this point
Copy the full SHA 89806a4View commit details
Commits on Oct 11, 2023
-
params -> BaseParams for static members
Junhao Zhang authoredOct 11, 2023 Configuration menu - View commit details
-
Copy full SHA for aa59b0f - Browse repository at this point
Copy the full SHA aa59b0fView commit details -
hpp suffix is prefered in cpp hence changed
Junhao Zhang authoredOct 11, 2023 Configuration menu - View commit details
-
Copy full SHA for aa96f3e - Browse repository at this point
Copy the full SHA aa96f3eView commit details -
removing deprecated files for ifu readiness
Junhao Zhang authoredOct 11, 2023 Configuration menu - View commit details
-
Copy full SHA for f47a112 - Browse repository at this point
Copy the full SHA f47a112View commit details -
Junhao Zhang authored
Oct 11, 2023 Configuration menu - View commit details
-
Copy full SHA for 86964ee - Browse repository at this point
Copy the full SHA 86964eeView commit details -
Junhao Zhang committed
Oct 11, 2023 Configuration menu - View commit details
-
Copy full SHA for 185fd79 - Browse repository at this point
Copy the full SHA 185fd79View commit details -
Junhao Zhang authored
Oct 11, 2023 Configuration menu - View commit details
-
Copy full SHA for 46172fb - Browse repository at this point
Copy the full SHA 46172fbView commit details
Commits on Oct 12, 2023
-
Junhao Zhang committed
Oct 12, 2023 Configuration menu - View commit details
-
Copy full SHA for bc37a40 - Browse repository at this point
Copy the full SHA bc37a40View commit details -
Junhao Zhang committed
Oct 12, 2023 Configuration menu - View commit details
-
Copy full SHA for 14df6f1 - Browse repository at this point
Copy the full SHA 14df6f1View commit details -
Junhao Zhang committed
Oct 12, 2023 Configuration menu - View commit details
-
Copy full SHA for 149c2b4 - Browse repository at this point
Copy the full SHA 149c2b4View commit details
Commits on Oct 13, 2023
-
Junhao Zhang committed
Oct 13, 2023 Configuration menu - View commit details
-
Copy full SHA for 8cfc14c - Browse repository at this point
Copy the full SHA 8cfc14cView commit details -
Junhao Zhang committed
Oct 13, 2023 Configuration menu - View commit details
-
Copy full SHA for f047ddb - Browse repository at this point
Copy the full SHA f047ddbView commit details -
Junhao Zhang committed
Oct 13, 2023 Configuration menu - View commit details
-
Copy full SHA for 2338516 - Browse repository at this point
Copy the full SHA 2338516View commit details -
Junhao Zhang committed
Oct 13, 2023 Configuration menu - View commit details
-
Copy full SHA for 7a382ae - Browse repository at this point
Copy the full SHA 7a382aeView commit details -
Junhao Zhang committed
Oct 13, 2023 Configuration menu - View commit details
-
Copy full SHA for 889d4a8 - Browse repository at this point
Copy the full SHA 889d4a8View commit details
Commits on Oct 18, 2023
-
Junhao Zhang authored
Oct 18, 2023 Configuration menu - View commit details
-
Copy full SHA for 182ef77 - Browse repository at this point
Copy the full SHA 182ef77View commit details -
Junhao Zhang authored
Oct 18, 2023 Configuration menu - View commit details
-
Copy full SHA for 89e44a0 - Browse repository at this point
Copy the full SHA 89e44a0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 790eca7 - Browse repository at this point
Copy the full SHA 790eca7View commit details -
Junhao Zhang authored
Oct 18, 2023 Configuration menu - View commit details
-
Copy full SHA for 6bcce4f - Browse repository at this point
Copy the full SHA 6bcce4fView commit details -
Junhao Zhang authored
Oct 18, 2023 Configuration menu - View commit details
-
Copy full SHA for 393b1ad - Browse repository at this point
Copy the full SHA 393b1adView commit details -
Junhao Zhang authored
Oct 18, 2023 Configuration menu - View commit details
-
Copy full SHA for cbca76f - Browse repository at this point
Copy the full SHA cbca76fView commit details -
Junhao Zhang committed
Oct 18, 2023 Configuration menu - View commit details
-
Copy full SHA for 490a01b - Browse repository at this point
Copy the full SHA 490a01bView commit details -
Junhao Zhang committed
Oct 18, 2023 Configuration menu - View commit details
-
Copy full SHA for 232e5a9 - Browse repository at this point
Copy the full SHA 232e5a9View commit details -
Junhao Zhang authored
Oct 18, 2023 Configuration menu - View commit details
-
Copy full SHA for db9541b - Browse repository at this point
Copy the full SHA db9541bView commit details
Commits on Oct 20, 2023
-
Junhao Zhang committed
Oct 20, 2023 Configuration menu - View commit details
-
Copy full SHA for f046d04 - Browse repository at this point
Copy the full SHA f046d04View commit details -
Junhao Zhang committed
Oct 20, 2023 Configuration menu - View commit details
-
Copy full SHA for 1fe24cf - Browse repository at this point
Copy the full SHA 1fe24cfView commit details
Commits on Oct 24, 2023
-
Junhao Zhang committed
Oct 24, 2023 Configuration menu - View commit details
-
Copy full SHA for f5783bb - Browse repository at this point
Copy the full SHA f5783bbView commit details -
Junhao Zhang committed
Oct 24, 2023 Configuration menu - View commit details
-
Copy full SHA for cd463f9 - Browse repository at this point
Copy the full SHA cd463f9View commit details -
Merge branch 'junhzhan/ifu-v2.0.0' of https://github.com/ROCmSoftware…
…Platform/flash-attention into junhzhan/ifu-v2.0.0
Junhao Zhang committedOct 24, 2023 Configuration menu - View commit details
-
Copy full SHA for 9f90750 - Browse repository at this point
Copy the full SHA 9f90750View commit details
Commits on Oct 25, 2023
-
added optional FP32 dQKV for unit tests
Junhao Zhang committedOct 25, 2023 Configuration menu - View commit details
-
Copy full SHA for 5d1365a - Browse repository at this point
Copy the full SHA 5d1365aView commit details -
pass qkv.contiguous() instead of assigning values
Junhao Zhang committedOct 25, 2023 Configuration menu - View commit details
-
Copy full SHA for a807948 - Browse repository at this point
Copy the full SHA a807948View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3a31e7e - Browse repository at this point
Copy the full SHA 3a31e7eView commit details
Commits on Oct 26, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f4c8dde - Browse repository at this point
Copy the full SHA f4c8ddeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6bc3374 - Browse repository at this point
Copy the full SHA 6bc3374View commit details -
Configuration menu - View commit details
-
Copy full SHA for b4d20b2 - Browse repository at this point
Copy the full SHA b4d20b2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 15c19e2 - Browse repository at this point
Copy the full SHA 15c19e2View commit details -
fix dropout z tensors allocation; enable unit test
Junhao Zhang committedOct 26, 2023 Configuration menu - View commit details
-
Copy full SHA for 0c5b579 - Browse repository at this point
Copy the full SHA 0c5b579View commit details -
Merge branch 'junhzhan/ifu-v2.0.0' of https://github.com/ROCmSoftware…
…Platform/flash-attention into junhzhan/ifu-v2.0.0
Junhao Zhang committedOct 26, 2023 Configuration menu - View commit details
-
Copy full SHA for d7b631a - Browse repository at this point
Copy the full SHA d7b631aView commit details -
Configuration menu - View commit details
-
Copy full SHA for b5ba498 - Browse repository at this point
Copy the full SHA b5ba498View commit details -
Configuration menu - View commit details
-
Copy full SHA for cc78698 - Browse repository at this point
Copy the full SHA cc78698View commit details
Commits on Oct 27, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6daeb0c - Browse repository at this point
Copy the full SHA 6daeb0cView commit details -
Configuration menu - View commit details
-
Copy full SHA for b6a9f6e - Browse repository at this point
Copy the full SHA b6a9f6eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5e80fc7 - Browse repository at this point
Copy the full SHA 5e80fc7View commit details
Commits on Oct 30, 2023
-
Merge pull request #16 from ROCmSoftwarePlatform/ifu-mqa
Add MQA & GQA
Configuration menu - View commit details
-
Copy full SHA for 02c234b - Browse repository at this point
Copy the full SHA 02c234bView commit details -
Junhao Zhang authored
Oct 30, 2023 Configuration menu - View commit details
-
Copy full SHA for b27bd1d - Browse repository at this point
Copy the full SHA b27bd1dView commit details -
Junhao Zhang authored
Oct 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 9a5273d - Browse repository at this point
Copy the full SHA 9a5273dView commit details -
Junhao committed
Oct 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 2d11119 - Browse repository at this point
Copy the full SHA 2d11119View commit details -
Junhao committed
Oct 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 4d79450 - Browse repository at this point
Copy the full SHA 4d79450View commit details -
update RTN swtich; enable MQA/GQA UT
Junhao committedOct 30, 2023 Configuration menu - View commit details
-
Copy full SHA for a197406 - Browse repository at this point
Copy the full SHA a197406View commit details -
Merge branch 'junhzhan/ifu-v2.0.0' of https://github.com/ROCmSoftware…
…Platform/flash-attention into junhzhan/ifu-v2.0.0
Junhao committedOct 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 8da5b66 - Browse repository at this point
Copy the full SHA 8da5b66View commit details
Commits on Oct 31, 2023
-
Junhao committed
Oct 31, 2023 Configuration menu - View commit details
-
Copy full SHA for 5378a20 - Browse repository at this point
Copy the full SHA 5378a20View commit details -
Junhao committed
Oct 31, 2023 Configuration menu - View commit details
-
Copy full SHA for 1b808f4 - Browse repository at this point
Copy the full SHA 1b808f4View commit details -
Merge branch 'junhzhan/ifu-v2.0.0' of https://github.com/ROCmSoftware…
…Platform/flash-attention into junhzhan/ifu-v2.0.0
Junhao committedOct 31, 2023 Configuration menu - View commit details
-
Copy full SHA for 23ee8fb - Browse repository at this point
Copy the full SHA 23ee8fbView commit details -
Junhao committed
Oct 31, 2023 Configuration menu - View commit details
-
Copy full SHA for 0c92f31 - Browse repository at this point
Copy the full SHA 0c92f31View commit details
Commits on Nov 1, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 2c057b4 - Browse repository at this point
Copy the full SHA 2c057b4View commit details
Commits on Nov 3, 2023
-
Junhao Zhang authored
Nov 3, 2023 Configuration menu - View commit details
-
Copy full SHA for 1cd7f89 - Browse repository at this point
Copy the full SHA 1cd7f89View commit details -
Merge pull request #14 from ROCmSoftwarePlatform/junhzhan/ifu-v2.0.0
IFU to v2.0.4 Add MQA/GQA but MQA UT is disabled due to some failures. Support new hardware.
Configuration menu - View commit details
-
Copy full SHA for edc7698 - Browse repository at this point
Copy the full SHA edc7698View commit details
Commits on Nov 17, 2023
-
Remove Hardcoded Building Options (#19)
* Update README * Update Dockerfile for customized image building * Sync test scripts * Remove internal cmake file since no longer worked * Remove headers that is used for internal testing * Refine and add options for different GCN archs * Add clang-format file * Remove dockerfile that is no longer used * Chang utils location
Junhao Zhang authoredNov 17, 2023 Configuration menu - View commit details
-
Copy full SHA for 5f1ae07 - Browse repository at this point
Copy the full SHA 5f1ae07View commit details
Commits on Nov 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 675d324 - Browse repository at this point
Copy the full SHA 675d324View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a77d72 - Browse repository at this point
Copy the full SHA 8a77d72View commit details
Commits on Nov 29, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 18060ee - Browse repository at this point
Copy the full SHA 18060eeView commit details -
Configuration menu - View commit details
-
Copy full SHA for fa589c3 - Browse repository at this point
Copy the full SHA fa589c3View commit details -
Configuration menu - View commit details
-
Copy full SHA for fa285bf - Browse repository at this point
Copy the full SHA fa285bfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3d2b6f5 - Browse repository at this point
Copy the full SHA 3d2b6f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b786a2 - Browse repository at this point
Copy the full SHA 3b786a2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 820b2b1 - Browse repository at this point
Copy the full SHA 820b2b1View commit details
Commits on Dec 5, 2023
-
Merge pull request #23 from Naomiusearch/flash_attention_for_rocm
Make installation steps look better
Configuration menu - View commit details
-
Copy full SHA for 68aac13 - Browse repository at this point
Copy the full SHA 68aac13View commit details
Commits on Jan 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b64f45e - Browse repository at this point
Copy the full SHA b64f45eView commit details
Commits on Feb 4, 2024
-
Merge pull request #38 from luizanao/add-support-gfx908
Allow gfx908 to build
Configuration menu - View commit details
-
Copy full SHA for ae7928c - Browse repository at this point
Copy the full SHA ae7928cView commit details
Commits on Mar 8, 2024
-
* add benchmark script * fix bugs * fix a bug * add output csv
Configuration menu - View commit details
-
Copy full SHA for 2554f49 - Browse repository at this point
Copy the full SHA 2554f49View commit details