nv-pre-compile-ops

Enabled high-performance Automatic Tensor Parallelism (auto TP) for the Qwen2-MoE and DeepSeek-V2 models on multiple GPUs/HPUs #9304

Sign in to view logs

Summary
Jobs
- unit-tests
Run details
- Usage
- Workflow file

Re-run triggered January 24, 2025 18:01

loadams

#6964

gyou2021:autoTP_Qwen2Moe_DeepSeekv2

Status Success

Total duration 14m 56s

Artifacts –

nv-pre-compile-ops.yml

on: pull_request