Skip to content
This repository has been archived by the owner on Oct 11, 2024. It is now read-only.

Misc CI/CD updates #240

Merged
merged 5 commits into from
May 14, 2024
Merged

Misc CI/CD updates #240

merged 5 commits into from
May 14, 2024

Conversation

dbarbuzzi
Copy link

This PR includes a handful of small fixes for some of the recently updated workflows:

  • Include the multi-GPU test job in the WEEKLY run
  • Disable a model in the full accuracy test suite (no currently available runners have enough VRAM to run it)
  • Increase the server startup wait time (both in accuracy tests and benchmarking)
  • Unify the naming for the two accuracy jobs

Including the green remote-push workflow on the latest commit in the branch, there’s also a (mostly) green nightly run here: https://github.com/neuralmagic/nm-vllm/actions/runs/9066138594

  • In this run, there appears to be a single test failure which has been failing intermittently (perhaps related to non-determinism of marlin?) unrelated to these changes (tests/models/test_marlin.py::test_models[5-32-half-model_pair0])

Copy link
Member

@andy-neuma andy-neuma left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cool

@dbarbuzzi dbarbuzzi merged commit 3a25456 into main May 14, 2024
23 of 24 checks passed
@dbarbuzzi dbarbuzzi deleted the gha-fixes branch May 14, 2024 14:41
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants