feat: support vllm in controller #635
base: main
936993c to 1fcd0b1
docker/presets/models/tfs/Dockerfile
Outdated
@@ -5,31 +5,39 @@ ARG MODEL_TYPE
ARG VERSION
.
├── chat_templates
│ ├── alpaca.jinja
│ ├── amberchat.jinja
│ ├── chatml.jinja
│ ├── chatqa.jinja
│ ├── falcon-instruct.jinja
│ ├── gemma-it.jinja
│ ├── llama-2-chat.jinja
│ ├── llama-3-instruct.jinja
│ ├── mistral-instruct.jinja
│ ├── openchat-3.5.jinja
│ ├── phi-3-small.jinja
│ ├── phi-3.jinja
│ ├── saiga.jinja
│ ├── solar-instruct.jinja
│ ├── vicuna.jinja
│ └── zephyr.jinja
├── tfs
│ ├── cli.py
│ ├── dataset.py
│ ├── fine_tuning.py
│ ├── inference-requirements.txt
│ ├── inference_api.py
│ ├── metrics_server.py
│ ├── parser.py
│ ├── tuning-requirements.txt
│ └── weights -> /workspace/weights
├── version.txt
├── vllm
│ ├── inference-requirements.txt
│ ├── inference_api.py
│ └── weights -> /workspace/weights
└── weights
ModelRunParams map[string]string // Parameters for running the model training/inference.
}

func (p *PresetParam) DeepCopy() *PresetParam {
We may update some params according to the node count, so we must deep-copy the preset first; otherwise the update would mutate the shared preset definition.
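To illustrate why the copy matters, here is a minimal sketch in Go. It is not the PR's actual implementation: the struct is reduced to the fields visible in this review, and `copyStringMap` is a hypothetical helper introduced only for this example. Go maps are reference types, so a plain struct copy would still share the underlying maps with the original preset.

```go
package main

// PresetParam is a reduced, illustrative version of the struct in the diff.
// Only the fields visible in this review are included (assumption: the real
// struct has more fields, which would need the same treatment).
type PresetParam struct {
	TorchRunRdzvParams map[string]string // Optional rendezvous parameters.
	BaseCommand        string            // e.g. "torchrun".
	ModelRunParams     map[string]string // Parameters for running the model.
	Tag                string            // The model image tag.
}

// copyStringMap is a hypothetical helper that returns an independent copy
// of m. A nil map stays nil so callers can still distinguish "unset".
func copyStringMap(m map[string]string) map[string]string {
	if m == nil {
		return nil
	}
	out := make(map[string]string, len(m))
	for k, v := range m {
		out[k] = v
	}
	return out
}

// DeepCopy returns a copy of p whose maps do not alias the original's maps.
func (p *PresetParam) DeepCopy() *PresetParam {
	if p == nil {
		return nil
	}
	out := *p // Copies strings by value; map fields still alias at this point.
	out.TorchRunRdzvParams = copyStringMap(p.TorchRunRdzvParams)
	out.ModelRunParams = copyStringMap(p.ModelRunParams)
	return &out
}
```

With a copy like this, the controller can adjust `ModelRunParams` per workspace (for example, a value derived from the node count) without changing the preset that other workspaces read.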
3dbfdb6 to 3b52cc4
- set vllm as the default runtime Signed-off-by: jerryzhuang <[email protected]>
3b52cc4 to 8244ebc
Signed-off-by: jerryzhuang <[email protected]>
TorchRunRdzvParams map[string]string // Optional rendezvous parameters for distributed training/inference using torchrun (elastic).
BaseCommand string // The initial command (e.g., 'torchrun', 'accelerate launch') used in the command line.
ModelRunParams map[string]string // Parameters for running the model training/inference.
Tag string // The model image tag
Does this Tag field get used?
Reason for Change:
Requirements