Actions: microsoft/DeepSpeed

nv-torch-latest-v100

4,912 workflow runs

Autotp training
nv-torch-latest-v100 #12992: Pull request #6922 synchronize by delock
January 24, 2025 07:34 Action required inkcherry:autotp_training
Update A6000 workflows to use newer docker container - 24.06 vs 24.03
nv-torch-latest-v100 #12991: Pull request #6967 synchronize by loadams
January 24, 2025 01:42 1h 40m 22s loadams/update-a6000-workflows
Update A6000 workflows to use newer docker container - 24.06 vs 24.03
nv-torch-latest-v100 #12990: Pull request #6967 synchronize by loadams
January 24, 2025 01:41 1m 49s loadams/update-a6000-workflows
Update A6000 workflows to use newer docker container - 24.06 vs 24.03
nv-torch-latest-v100 #12989: Pull request #6967 synchronize by loadams
January 24, 2025 01:31 9m 34s loadams/update-a6000-workflows
nv-torch-latest-v100
nv-torch-latest-v100 #12988: Scheduled
January 24, 2025 00:20 1h 31m 40s master
Tecorigin sdaa accelerator
nv-torch-latest-v100 #12987: Pull request #6903 synchronize by loadams
January 23, 2025 23:49 Action required siqi654321:Tecorigin-SDAA-accelerator
generalize deepspeed linear and implement it for non cuda systems
nv-torch-latest-v100 #12986: Pull request #6932 synchronize by loadams
January 23, 2025 18:35 2h 34m 15s oelayan7:linear
nv-torch-latest-v100
nv-torch-latest-v100 #12985: Merge group checks requested
January 23, 2025 16:42 1h 35m 12s
Autotp training
nv-torch-latest-v100 #12984: Pull request #6922 synchronize by inkcherry
January 23, 2025 07:50 3m 6s inkcherry:autotp_training
nv-torch-latest-v100
nv-torch-latest-v100 #12983: Scheduled
January 23, 2025 00:20 1h 37m 20s master
[DEBUG] Add diagnostics for cpu-torch-latest intermittent hang
nv-torch-latest-v100 #12982: Pull request #6942 synchronize by loadams
January 22, 2025 23:14 6h 0m 23s loadams/cpu-runner-debug
Update A6000 workflows to use newer docker container - 24.06 vs 24.03
nv-torch-latest-v100 #12981: Pull request #6967 synchronize by loadams
January 22, 2025 23:07 1h 31m 7s loadams/update-a6000-workflows
Tecorigin sdaa accelerator
nv-torch-latest-v100 #12980: Pull request #6903 synchronize by tjruwase
January 22, 2025 22:25 Action required siqi654321:Tecorigin-SDAA-accelerator
Update A6000 workflows to use newer docker container - 24.06 vs 24.03
nv-torch-latest-v100 #12979: Pull request #6967 synchronize by loadams
January 22, 2025 18:53 1h 31m 3s loadams/update-a6000-workflows
Update sharded_moe.py to support top2 gate with Tutel
nv-torch-latest-v100 #12976: Pull request #6948 synchronize by loadams
January 22, 2025 17:16 1h 34m 0s xenshinu:patch-1
Precisely track nvme optimizer offload
nv-torch-latest-v100 #12975: Pull request #6963 synchronize by tjruwase
January 22, 2025 15:54 1h 31m 38s olruwase/ds_4998
Autotp training
nv-torch-latest-v100 #12974: Pull request #6922 synchronize by inkcherry
January 22, 2025 05:40 1h 34m 53s inkcherry:autotp_training
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-torch-latest-v100 #12973: Pull request #6553 synchronize by gyou2021
January 22, 2025 03:03 Action required gyou2021:configurable_autoTP
nv-torch-latest-v100
nv-torch-latest-v100 #12972: Scheduled
January 22, 2025 00:20 1h 33m 57s master
Explicitly use the linalg.vector_norm call in comm/
nv-torch-latest-v100 #12971: Pull request #6960 synchronize by loadams
January 21, 2025 22:35 1h 30m 47s loadams/fix-torch-linalg-norm
generalize deepspeed linear and implement it for non cuda systems
nv-torch-latest-v100 #12970: Pull request #6932 synchronize by loadams
January 21, 2025 22:34 1h 40m 12s oelayan7:linear
Update version.txt after 0.16.3 release
nv-torch-latest-v100 #12969: Pull request #6965 opened by loadams
January 21, 2025 22:31 1h 19m 46s AutoPR/0.16.3
generalize deepspeed linear and implement it for non cuda systems
nv-torch-latest-v100 #12967: Pull request #6932 synchronize by loadams
January 21, 2025 21:54 Action required oelayan7:linear