[Tentative] Adding new intrinsics for gemm. #98

Narsil · 2023-07-06T20:13:31Z

Hi there.

I am attempting to port ggml's matrix multiplication into a standalone crate: https://github.com/Narsil/ggblas

For most of the operations, I was able to leverage the existing intrinsics: https://doc.rust-lang.org/core/arch/arm/index.html
However, for M1 (aarch64), some SIMD f16 intrinsics are missing.

https://developer.arm.com/documentation/101028/0012/13--Advanced-SIMD--Neon--intrinsics

I'm not sure the approach I suggest here is viable; my understanding of low-level primitives such as these is fairly limited.

Happy to run a more complete set of operations if this is indeed deemed interesting.

It seems the proper implementation in the compiler itself would be something like rust-lang/stdarch#344.

That's why I felt the intrinsics would have their place here.
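For illustration, here is a minimal sketch of what one such intrinsic could look like when written with stable inline assembly. The names (`fmla_f16x8`, `imp::fmla`) and the raw `[u16; 8]` bit-pattern interface are assumptions made for this example, not the API added in this PR. The accumulator is passed as `inout` because `fmla` both reads and writes its destination register.

```rust
/// Lane-wise fused multiply-add on eight binary16 values given as raw bits.
/// Returns `acc + a * b`, or `None` when the target or CPU lacks `fp16`.
/// (Hypothetical helper, for illustration only.)
pub fn fmla_f16x8(acc: [u16; 8], a: [u16; 8], b: [u16; 8]) -> Option<[u16; 8]> {
    #[cfg(target_arch = "aarch64")]
    {
        if std::arch::is_aarch64_feature_detected!("fp16") {
            // SAFETY: the `fp16` feature was detected at runtime just above.
            return Some(unsafe { imp::fmla(acc, a, b) });
        }
    }
    // Not aarch64, or no half-precision arithmetic support on this CPU.
    let _ = (acc, a, b);
    None
}

#[cfg(target_arch = "aarch64")]
mod imp {
    use core::arch::aarch64::uint16x8_t;
    use core::arch::asm;
    use core::mem::transmute;

    /// Caller must ensure the CPU supports the `fp16` feature.
    #[target_feature(enable = "fp16")]
    pub unsafe fn fmla(acc: [u16; 8], a: [u16; 8], b: [u16; 8]) -> [u16; 8] {
        unsafe {
            // Reinterpret the bit patterns as 128-bit NEON registers.
            let mut acc: uint16x8_t = transmute(acc);
            let a: uint16x8_t = transmute(a);
            let b: uint16x8_t = transmute(b);
            asm!(
                // acc <- acc + a * b, on eight half-precision lanes.
                "fmla {acc:v}.8h, {a:v}.8h, {b:v}.8h",
                // `inout`: the destination register is also an input.
                acc = inout(vreg) acc,
                a = in(vreg) a,
                b = in(vreg) b,
                options(pure, nomem, nostack),
            );
            transmute(acc)
        }
    }
}
```

On an M1, calling `fmla_f16x8` with all lanes set to 1.0, 2.0 and 3.0 (bit patterns `0x3C00`, `0x4000`, `0x4200`) should give 7.0 (`0x4700`) in every lane; on targets without `fp16` it returns `None`.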

Cheers!

Other refs: rust-lang/rfcs#3451

Still failing when compiling `fmlaq` without black_box.

starkat99 · 2023-08-05T00:09:54Z

I'm fine with putting these in the crate. Please make sure they don't overlap with the existing aarch64 assembly in the crate, though, and if there is any overlap, make the existing code use the new names.

However, I don't want to publicly expose the binary16 module; that's an internal structural implementation detail. Perhaps just expose these at half::arch::aarch64?

HuggingFace-MacMini-Wozniak and others added 5 commits July 6, 2023 22:00

Adding new intrinsics for ggblas.

68a2955

Remove nightly requirements.

dd6ce8e

Going around clippy.

31ef63b

More intrinsics.

a627771

Still failing when compiling `fmlaq` without black_box.

Fix.

4d309a0

Narsil mentioned this pull request Aug 1, 2023

[DIRTY] Using m1 intrinsics for f16xf16 LaurentMazare/gemm#4

Closed

in -> inout

d3e042a

Narsil changed the title ~~[Tentative] Adding new intrinsics for ggblas.~~ [Tentative] Adding new intrinsics for gemm. Aug 1, 2023

This was referenced Aug 1, 2023

F16 intrinsics standalone sarah-quinones/gemm#14

Closed

F16 intrinsics standalone LaurentMazare/gemm#5

Merged
