Skip to content

Actions: microsoft/DeepSpeed

nv-pre-compile-ops

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,867 workflow runs
4,867 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Autotp training
nv-pre-compile-ops #9293: Pull request #6922 synchronize by inkcherry
January 23, 2025 07:50 14m 59s inkcherry:autotp_training
January 23, 2025 07:50 14m 59s
nv-pre-compile-ops
nv-pre-compile-ops #9292: Scheduled
January 23, 2025 00:02 15m 4s master
January 23, 2025 00:02 15m 4s
[DEBUG] Add diagnostics for cpu-torch-latest intermittent hang
nv-pre-compile-ops #9291: Pull request #6942 synchronize by loadams
January 22, 2025 23:14 15m 14s loadams/cpu-runner-debug
January 22, 2025 23:14 15m 14s
Update A6000 workflows to use newer docker container - 24.09 vs 24.03
nv-pre-compile-ops #9290: Pull request #6967 synchronize by loadams
January 22, 2025 23:07 15m 24s loadams/update-a6000-workflows
January 22, 2025 23:07 15m 24s
Tecorigin sdaa accelerator
nv-pre-compile-ops #9289: Pull request #6903 synchronize by tjruwase
January 22, 2025 22:25 Action required siqi654321:Tecorigin-SDAA-accelerator
January 22, 2025 22:25 Action required
Update sharded_moe.py to support top2 gate with Tutel
nv-pre-compile-ops #9285: Pull request #6948 synchronize by loadams
January 22, 2025 17:16 15m 11s xenshinu:patch-1
January 22, 2025 17:16 15m 11s
Precisely track nvme optimizer offload
nv-pre-compile-ops #9284: Pull request #6963 synchronize by tjruwase
January 22, 2025 15:54 15m 1s olruwase/ds_4998
January 22, 2025 15:54 15m 1s
Autotp training
nv-pre-compile-ops #9283: Pull request #6922 synchronize by inkcherry
January 22, 2025 05:40 15m 31s inkcherry:autotp_training
January 22, 2025 05:40 15m 31s
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-pre-compile-ops #9282: Pull request #6553 synchronize by gyou2021
January 22, 2025 03:03 Action required gyou2021:configurable_autoTP
January 22, 2025 03:03 Action required
nv-pre-compile-ops
nv-pre-compile-ops #9281: Scheduled
January 22, 2025 00:02 15m 48s master
January 22, 2025 00:02 15m 48s
Explicitly use the linalg.vector_norm call in comm/
nv-pre-compile-ops #9280: Pull request #6960 synchronize by loadams
January 21, 2025 22:35 14m 56s loadams/fix-torch-linalg-norm
January 21, 2025 22:35 14m 56s
generalize deepspeed linear and implement it for non cuda systems
nv-pre-compile-ops #9279: Pull request #6932 synchronize by loadams
January 21, 2025 22:34 15m 0s oelayan7:linear
January 21, 2025 22:34 15m 0s
Update version.txt after 0.16.3 release
nv-pre-compile-ops #9278: Pull request #6965 opened by loadams
January 21, 2025 22:31 14m 52s AutoPR/0.16.3
January 21, 2025 22:31 14m 52s
generalize deepspeed linear and implement it for non cuda systems
nv-pre-compile-ops #9276: Pull request #6932 synchronize by loadams
January 21, 2025 21:54 Action required oelayan7:linear
January 21, 2025 21:54 Action required
Add Cache to Comm Group
nv-pre-compile-ops #9275: Pull request #4849 synchronize by loadams
January 21, 2025 20:42 15m 22s cholmes/comm-group-cache
January 21, 2025 20:42 15m 22s
Explicitly use the linalg.vector_norm call in comm/
nv-pre-compile-ops #9274: Pull request #6960 synchronize by loadams
January 21, 2025 20:35 15m 8s loadams/fix-torch-linalg-norm
January 21, 2025 20:35 15m 8s
Fix: forbid repeated deepspeed.initialize on training objects
nv-pre-compile-ops #9271: Pull request #6874 synchronize by loadams
January 21, 2025 19:49 Action required traincheck-team:fix-6848-forbid-repeated-init
January 21, 2025 19:49 Action required
fix: RuntimeError for UCP large DP
nv-pre-compile-ops #9268: Pull request #6918 synchronize by loadams
January 21, 2025 18:51 15m 30s saforem2/ucp-bug
January 21, 2025 18:51 15m 30s