nv-pre-compile-ops

nv-pre-compile-ops.yml

4,867 workflow runs

Autotp training nv-pre-compile-ops #9293: Pull request #6922 synchronize by inkcherry

January 23, 2025 07:50

14m 59s inkcherry:autotp_training

nv-pre-compile-ops nv-pre-compile-ops #9292: Scheduled

January 23, 2025 00:02

15m 4s master

[DEBUG] Add diagnostics for cpu-torch-latest intermittent hang nv-pre-compile-ops #9291: Pull request #6942 synchronize by loadams

January 22, 2025 23:14

15m 14s loadams/cpu-runner-debug

Update A6000 workflows to use newer docker container - 24.09 vs 24.03 nv-pre-compile-ops #9290: Pull request #6967 synchronize by loadams

January 22, 2025 23:07

15m 24s loadams/update-a6000-workflows

Tecorigin sdaa accelerator nv-pre-compile-ops #9289: Pull request #6903 synchronize by tjruwase

January 22, 2025 22:25

Action required siqi654321:Tecorigin-SDAA-accelerator

Update A6000 workflows to use newer docker container - 24.09 vs 24.03 nv-pre-compile-ops #9288: Pull request #6967 synchronize by loadams

January 22, 2025 18:53

15m 7s loadams/update-a6000-workflows

Update A6000 workflows to use newer docker container - 24.09 vs 24.03 nv-pre-compile-ops #9287: Pull request #6967 opened by loadams

January 22, 2025 18:40

13m 10s loadams/update-a6000-workflows

Use default value of initial_scale_power if FP16 scaling params not provided nv-pre-compile-ops #9286: Pull request #4986 synchronize by loadams

January 22, 2025 17:28

15m 8s ShukantPal:shukant/fix-initial-scale

Update sharded_moe.py to support top2 gate with Tutel nv-pre-compile-ops #9285: Pull request #6948 synchronize by loadams

January 22, 2025 17:16

15m 11s xenshinu:patch-1

Precisely track nvme optimizer offload nv-pre-compile-ops #9284: Pull request #6963 synchronize by tjruwase

January 22, 2025 15:54

15m 1s olruwase/ds_4998

Autotp training nv-pre-compile-ops #9283: Pull request #6922 synchronize by inkcherry

January 22, 2025 05:40

15m 31s inkcherry:autotp_training

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models nv-pre-compile-ops #9282: Pull request #6553 synchronize by gyou2021

January 22, 2025 03:03

Action required gyou2021:configurable_autoTP

nv-pre-compile-ops nv-pre-compile-ops #9281: Scheduled

January 22, 2025 00:02

15m 48s master

Explicitly use the linalg.vector_norm call in comm/ nv-pre-compile-ops #9280: Pull request #6960 synchronize by loadams

January 21, 2025 22:35

14m 56s loadams/fix-torch-linalg-norm

generalize deepspeed linear and implement it for non cuda systems nv-pre-compile-ops #9279: Pull request #6932 synchronize by loadams

January 21, 2025 22:34

15m 0s oelayan7:linear

Update version.txt after 0.16.3 release nv-pre-compile-ops #9278: Pull request #6965 opened by loadams

January 21, 2025 22:31

14m 52s AutoPR/0.16.3

generalize deepspeed linear and implement it for non cuda systems nv-pre-compile-ops #9276: Pull request #6932 synchronize by loadams

January 21, 2025 21:54

Action required oelayan7:linear

Add Cache to Comm Group nv-pre-compile-ops #9275: Pull request #4849 synchronize by loadams

January 21, 2025 20:42

15m 22s cholmes/comm-group-cache

Explicitly use the linalg.vector_norm call in comm/ nv-pre-compile-ops #9274: Pull request #6960 synchronize by loadams

January 21, 2025 20:35

15m 8s loadams/fix-torch-linalg-norm

Support the parallel conversion from ZeRO checkpoints to FP32/FP16/BF16 param weight nv-pre-compile-ops #9273: Pull request #6655 synchronize by xylian86

January 21, 2025 20:33

14m 40s xylian86:parallel_zero_to_fp32_conversion

Support the parallel conversion from ZeRO checkpoints to FP32/FP16/BF16 param weight nv-pre-compile-ops #9272: Pull request #6655 synchronize by xylian86

January 21, 2025 20:17

Action required xylian86:parallel_zero_to_fp32_conversion

Fix: forbid repeated deepspeed.initialize on training objects nv-pre-compile-ops #9271: Pull request #6874 synchronize by loadams

January 21, 2025 19:49

Action required traincheck-team:fix-6848-forbid-repeated-init

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models nv-pre-compile-ops #9270: Pull request #6553 synchronize by loadams

January 21, 2025 19:49

15m 36s gyou2021:configurable_autoTP

Enabled high-performance Automatic Tensor Parallelism (auto TP) for the Qwen2-MoE and DeepSeek-V2 models on multiple GPUs/HPUs nv-pre-compile-ops #9269: Pull request #6964 synchronize by loadams

January 21, 2025 19:02

Action required gyou2021:autoTP_Qwen2Moe_DeepSeekv2

fix: RuntimeError for UCP large DP nv-pre-compile-ops #9268: Pull request #6918 synchronize by loadams

January 21, 2025 18:51

15m 30s saforem2/ucp-bug
