reduce all-to-all communication volume when both expert and non-expert are tensor-parallel #10174

Re-run triggered June 9, 2024 22:36
Status Success
Total duration 2h 1m 59s

nv-torch-latest-v100.yml

on: pull_request