Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #4067
Annotations
2 errors
unit-tests
The job running on runner GitHub Actions 713 has exceeded the maximum execution time of 360 minutes.
|
unit-tests
The operation was canceled.
|