This repository has been archived by the owner on Aug 30, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 38
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[BesTLA] Support int5&int6 for kernels and models (#259)
* add initial of int5 * add all gemv of int5 * finish all avx2 kernels of int5 * add benchmark of int5 * add avx512f s5_s8, s5_fp * add avx512f kernels of int5 * test LLaMa2-7B with int5, sym and asym. * fix code scan * clang-format * add avx2 decompress kernels of int6 * add avx2 gemv kernels for int6 * add avx512f kernels for int6 * clang-format * fix UT * fix UT bug * add UTs for new bits * update doc * fix UT bug * fix ISA check * fix bug of AVX2 s6_s8 * update dtypes in advanced_usage.md
- Loading branch information
1 parent
3257516
commit 68d2cff
Showing
18 changed files
with
7,447 additions
and
2,585 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.