
feat: support vllm in controller #635

Open
wants to merge 2 commits into main
Conversation

@zhuangqh (Collaborator) commented on Oct 17, 2024

Reason for Change:

  • support vLLM runtime deployments
  • add a feature gate that sets vLLM as the default runtime
  • allow selecting the runtime via an annotation tag (see the sketch below)

Requirements

  • added unit tests and e2e tests (if applicable).
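To make the annotation-plus-feature-gate behavior concrete, here is a minimal sketch of how a controller could resolve the runtime. The annotation key, runtime names, and feature-gate variable are illustrative assumptions, not necessarily the identifiers introduced by this PR.

```go
// Sketch only: pick the inference runtime from a workspace annotation,
// falling back to a feature-gated default. All identifiers below are
// hypothetical.
package runtime

const (
	// Hypothetical annotation key for overriding the runtime per workspace.
	AnnotationRuntime = "kaito.sh/runtime"

	RuntimeVLLM         = "vllm"
	RuntimeTransformers = "transformers"
)

// VLLMDefaultEnabled stands in for a feature gate that makes vLLM the default.
var VLLMDefaultEnabled = true

// SelectRuntime returns the runtime for a workspace: an explicit annotation
// wins; otherwise the feature gate decides the default.
func SelectRuntime(annotations map[string]string) string {
	if r, ok := annotations[AnnotationRuntime]; ok && r != "" {
		return r
	}
	if VLLMDefaultEnabled {
		return RuntimeVLLM
	}
	return RuntimeTransformers
}
```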

@@ -5,31 +5,39 @@ ARG MODEL_TYPE
ARG VERSION
zhuangqh (Collaborator Author) commented:

.
├── chat_templates
│   ├── alpaca.jinja
│   ├── amberchat.jinja
│   ├── chatml.jinja
│   ├── chatqa.jinja
│   ├── falcon-instruct.jinja
│   ├── gemma-it.jinja
│   ├── llama-2-chat.jinja
│   ├── llama-3-instruct.jinja
│   ├── mistral-instruct.jinja
│   ├── openchat-3.5.jinja
│   ├── phi-3-small.jinja
│   ├── phi-3.jinja
│   ├── saiga.jinja
│   ├── solar-instruct.jinja
│   ├── vicuna.jinja
│   └── zephyr.jinja
├── tfs
│   ├── cli.py
│   ├── dataset.py
│   ├── fine_tuning.py
│   ├── inference-requirements.txt
│   ├── inference_api.py
│   ├── metrics_server.py
│   ├── parser.py
│   ├── tuning-requirements.txt
│   └── weights -> /workspace/weights
├── version.txt
├── vllm
│   ├── inference-requirements.txt
│   ├── inference_api.py
│   └── weights -> /workspace/weights
└── weights

@zhuangqh zhuangqh marked this pull request as ready for review October 28, 2024 11:13
ModelRunParams map[string]string // Parameters for running the model training/inference.
}

func (p *PresetParam) DeepCopy() *PresetParam {
zhuangqh (Collaborator Author) commented:

We may update some parameters according to the node count, so we must deep-copy the preset parameters first.
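A minimal sketch of the DeepCopy rationale, assuming the map fields shown in the diff (ModelRunParams, TorchRunRdzvParams) are mutated per deployment; the field set here is abbreviated and partly assumed rather than copied from the PR.

```go
// Sketch only: maps must be copied key by key, otherwise per-deployment
// mutations (e.g., adjusting ModelRunParams for the node count) would leak
// back into the shared preset.
type PresetParam struct {
	BaseCommand        string
	Tag                string
	TorchRunRdzvParams map[string]string
	ModelRunParams     map[string]string
}

func (p *PresetParam) DeepCopy() *PresetParam {
	if p == nil {
		return nil
	}
	out := &PresetParam{
		BaseCommand: p.BaseCommand,
		Tag:         p.Tag,
	}
	if p.TorchRunRdzvParams != nil {
		out.TorchRunRdzvParams = make(map[string]string, len(p.TorchRunRdzvParams))
		for k, v := range p.TorchRunRdzvParams {
			out.TorchRunRdzvParams[k] = v
		}
	}
	if p.ModelRunParams != nil {
		out.ModelRunParams = make(map[string]string, len(p.ModelRunParams))
		for k, v := range p.ModelRunParams {
			out.ModelRunParams[k] = v
		}
	}
	return out
}
```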

- set vllm as the default runtime

Signed-off-by: jerryzhuang <[email protected]>
TorchRunRdzvParams map[string]string // Optional rendezvous parameters for distributed training/inference using torchrun (elastic).
BaseCommand string // The initial command (e.g., 'torchrun', 'accelerate launch') used in the command line.
ModelRunParams map[string]string // Parameters for running the model training/inference.
Tag string // The model image tag
@ishaansehgal99 (Collaborator) commented on Nov 14, 2024:
Does this Tag field get used?
