Add a quantize_symmetric operation and the corresponding CPU kernel #83

dacorvo · 2024-02-09T15:14:21Z

This is mainly to test the library design with a different operation.

The quantize_symmetric operation simply rescales and maps the source tensor to either int8 of float8.

The pull-request also includes a minimal C++ kernel for CPU devices, that only supports per-tensor int8 quantization for now.

On a Macbook air, the optimized kernel is only 10 % faster than the default implementation using python torch operators.

For now only per-tensor quantization is supported.

dacorvo added 3 commits February 9, 2024 16:17

feat(library): allow fallback when kernel fails

d443109

feat(library): add quantize_symmetric op

89e365a

feat(cpp): add quantize_symmetric CPU kernel

d612b6b

For now only per-tensor quantization is supported.

dacorvo force-pushed the quantize_kernel branch from 557251b to d612b6b Compare February 9, 2024 15:19

dacorvo merged commit 36124db into main Feb 9, 2024
3 checks passed

dacorvo deleted the quantize_kernel branch February 9, 2024 15:26

Provide feedback