Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #4067

Annotations

2 errors

unit-tests

failed Jan 25, 2025 in 6h 0m 13s