Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #11461

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #11461

unit-tests (3.10)

succeeded Jan 21, 2025 in 1m 49s