nv-torch-latest-v100

nv-torch-latest-v100.yml

4,912 workflow runs

Autotp training nv-torch-latest-v100 #12992: Pull request #6922 synchronize by delock

January 24, 2025 07:34

Action required inkcherry:autotp_training

Update A6000 workflows to use newer docker container - 24.06 vs 24.03 nv-torch-latest-v100 #12991: Pull request #6967 synchronize by loadams

January 24, 2025 01:42

1h 40m 22s loadams/update-a6000-workflows

Update A6000 workflows to use newer docker container - 24.06 vs 24.03 nv-torch-latest-v100 #12990: Pull request #6967 synchronize by loadams

January 24, 2025 01:41

1m 49s loadams/update-a6000-workflows

Update A6000 workflows to use newer docker container - 24.06 vs 24.03 nv-torch-latest-v100 #12989: Pull request #6967 synchronize by loadams

January 24, 2025 01:31

9m 34s loadams/update-a6000-workflows

nv-torch-latest-v100 nv-torch-latest-v100 #12988: Scheduled

January 24, 2025 00:20

1h 31m 40s master

Tecorigin sdaa accelerator nv-torch-latest-v100 #12987: Pull request #6903 synchronize by loadams

January 23, 2025 23:49

Action required siqi654321:Tecorigin-SDAA-accelerator

generalize deepspeed linear and implement it for non cuda systems nv-torch-latest-v100 #12986: Pull request #6932 synchronize by loadams

January 23, 2025 18:35

2h 34m 15s oelayan7:linear

nv-torch-latest-v100 nv-torch-latest-v100 #12985: Merge group checks requested

January 23, 2025 16:42

1h 35m 12s

Autotp training nv-torch-latest-v100 #12984: Pull request #6922 synchronize by inkcherry

January 23, 2025 07:50

3m 6s inkcherry:autotp_training

nv-torch-latest-v100 nv-torch-latest-v100 #12983: Scheduled

January 23, 2025 00:20

1h 37m 20s master

[DEBUG] Add diagnostics for cpu-torch-latest intermittent hang nv-torch-latest-v100 #12982: Pull request #6942 synchronize by loadams

January 22, 2025 23:14

6h 0m 23s loadams/cpu-runner-debug

Update A6000 workflows to use newer docker container - 24.06 vs 24.03 nv-torch-latest-v100 #12981: Pull request #6967 synchronize by loadams

January 22, 2025 23:07

1h 31m 7s loadams/update-a6000-workflows

Tecorigin sdaa accelerator nv-torch-latest-v100 #12980: Pull request #6903 synchronize by tjruwase

January 22, 2025 22:25

Action required siqi654321:Tecorigin-SDAA-accelerator

Update A6000 workflows to use newer docker container - 24.06 vs 24.03 nv-torch-latest-v100 #12979: Pull request #6967 synchronize by loadams

January 22, 2025 18:53

1h 31m 3s loadams/update-a6000-workflows

Update A6000 workflows to use newer docker container - 24.06 vs 24.03 nv-torch-latest-v100 #12978: Pull request #6967 opened by loadams

January 22, 2025 18:40

13m 25s loadams/update-a6000-workflows

Use default value of initial_scale_power if FP16 scaling params not provided nv-torch-latest-v100 #12977: Pull request #4986 synchronize by loadams

January 22, 2025 17:28

1h 19m 13s ShukantPal:shukant/fix-initial-scale

Update sharded_moe.py to support top2 gate with Tutel nv-torch-latest-v100 #12976: Pull request #6948 synchronize by loadams

January 22, 2025 17:16

1h 34m 0s xenshinu:patch-1

Precisely track nvme optimizer offload nv-torch-latest-v100 #12975: Pull request #6963 synchronize by tjruwase

January 22, 2025 15:54

1h 31m 38s olruwase/ds_4998

Autotp training nv-torch-latest-v100 #12974: Pull request #6922 synchronize by inkcherry

January 22, 2025 05:40

1h 34m 53s inkcherry:autotp_training

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models nv-torch-latest-v100 #12973: Pull request #6553 synchronize by gyou2021

January 22, 2025 03:03

Action required gyou2021:configurable_autoTP

nv-torch-latest-v100 nv-torch-latest-v100 #12972: Scheduled

January 22, 2025 00:20

1h 33m 57s master

Explicitly use the linalg.vector_norm call in comm/ nv-torch-latest-v100 #12971: Pull request #6960 synchronize by loadams

January 21, 2025 22:35

1h 30m 47s loadams/fix-torch-linalg-norm

generalize deepspeed linear and implement it for non cuda systems nv-torch-latest-v100 #12970: Pull request #6932 synchronize by loadams

January 21, 2025 22:34

1h 40m 12s oelayan7:linear

Update version.txt after 0.16.3 release nv-torch-latest-v100 #12969: Pull request #6965 opened by loadams

January 21, 2025 22:31

1h 19m 46s AutoPR/0.16.3

generalize deepspeed linear and implement it for non cuda systems nv-torch-latest-v100 #12967: Pull request #6932 synchronize by loadams

January 21, 2025 21:54

Action required oelayan7:linear
