Skip to content

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #4067

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #4067