DeepSparse v0.11.0

jeanniefinks released this 11 Mar 18:31

· 2 commits to release/0.11 since this release

New Features:

High-performance sparse quantized convolutional neural networks supported on AVX2 systems.
CCX detection added to the DeepSparse Engine for AMD systems.
deepsparse.server integration and CLIs added with Hugging Face transformers pipelines support.

Changes:

Performance improvements made for

FP32 sparse BERT models
batch size 1 networks
quantized sparse BERT models
Pooling operations

Resolved Issues:

When hyperthreads are disabled in the BIOS, core/socket information on certain systems can now be detected.
Hugging Face transformers validation flows for QQP now giving correct accuracy metrics.
PyTorch downloaded for YOLO model stubs now supported.

Known Issues:

When running NanoDet-Plus-m, the DeepSparse Engine will fail with an assertion (See #279). A hotfix is being pursued.

Assets 8