Skip to content

Z3: optimizations for grad norm calculation and gradient clipping #10012

Z3: optimizations for grad norm calculation and gradient clipping

Z3: optimizations for grad norm calculation and gradient clipping #10012

Triggered via pull request May 23, 2024 21:13
Status Failure
Total duration 5h 14m 8s
Artifacts

nv-torch-latest-v100.yml

on: pull_request
Fit to window
Zoom out
Zoom in

Annotations

1 error
unit-tests
The self-hosted runner: ds-nv-v100-cu117-runner-9e4b8c3c lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.