docker NVILA inference error #176

Open
JIA-HONG-CHU opened this issue Jan 2, 2025 · 2 comments
@JIA-HONG-CHU

I followed your instructions by running:

  1. docker build -t vila-server:latest .
  2. docker run --gpus all --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 \
         -v ./hub:/root/.cache/huggingface/hub \
         -it --rm -p 8000:8000 \
         -e VILA_MODEL_PATH=Efficient-Large-Model/NVILA-15B \
         -e VILA_CONV_MODE=auto \
         vila-server:latest
  3. from openai import OpenAI

     client = OpenAI(
         base_url="http://localhost:8000",
         api_key="MYKEY",
     )
     response = client.chat.completions.create(
         messages=[
             {
                 "role": "user",
                 "content": [
                     {"type": "text", "text": "What’s in this image?"},
                     {
                         "type": "image_url",
                         "image_url": {
                             "url": "https://blog.logomyway.com/wp-content/uploads/2022/01/NVIDIA-logo.jpg",
                             # Or you can pass in a base64 encoded image
                             # "url": "data:image/png;base64,<base64_encoded_image>",
                         },
                     },
                 ],
             }
         ],
         model="NVILA-15B",
     )
     print(response.choices[0].message.content)

Encountered the error:
openai.UnprocessableEntityError: Error code: 422 - {'detail': [{'loc': ['body', 'model'], 'msg': "unexpected value; permitted: 'VILA1.5-3B', 'VILA1.5-3B-AWQ', 'VILA1.5-3B-S2', 'VILA1.5-3B-S2-AWQ', 'Llama-3-VILA1.5-8B', 'Llama-3-VILA1.5-8B-AWQ', 'VILA1.5-13B', 'VILA1.5-13B-AWQ', 'VILA1.5-40B', 'VILA1.5-40B-AWQ'", 'type': 'value_error.const', 'ctx': {'given': 'NVILA-15B', 'permitted': ['VILA1.5-3B', 'VILA1.5-3B-AWQ', 'VILA1.5-3B-S2', 'VILA1.5-3B-S2-AWQ', 'Llama-3-VILA1.5-8B', 'Llama-3-VILA1.5-8B-AWQ', 'VILA1.5-13B', 'VILA1.5-13B-AWQ', 'VILA1.5-40B', 'VILA1.5-40B-AWQ']}}]}
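
For anyone reading the 422 above: the server validates the `model` field against a fixed list, and `NVILA-15B` is not on it. Below is a minimal sketch of pulling that permitted list out of the response body. The helper name `permitted_models` is hypothetical, and the dictionary is an abbreviated copy of the error shown above rather than a live response; how the body is obtained from the client exception is not covered here.

```python
from typing import Any


def permitted_models(error_body: dict[str, Any]) -> list[str]:
    """Collect the model names a 422 validation body lists as permitted."""
    names: list[str] = []
    for item in error_body.get("detail", []):
        # Only look at entries that complain about the request's "model" field.
        if item.get("loc") == ["body", "model"]:
            names.extend(item.get("ctx", {}).get("permitted", []))
    return names


# Abbreviated copy of the response body from the error above (assumption:
# the full list is longer; three entries are kept for readability).
body = {
    "detail": [
        {
            "loc": ["body", "model"],
            "type": "value_error.const",
            "ctx": {
                "given": "NVILA-15B",
                "permitted": ["VILA1.5-3B", "VILA1.5-13B", "VILA1.5-40B"],
            },
        }
    ]
}

allowed = permitted_models(body)
print("NVILA-15B" in allowed)  # prints False
```

With the full body from the error, the same check shows that only VILA1.5 variants are accepted by this server build.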

@Lyken17
Collaborator

Lyken17 commented Jan 7, 2025

Oh, the Docker image and serving scripts have not been updated to support NVILA yet. Please stay tuned; we will release an updated version in the coming week.

@Lyken17 Lyken17 self-assigned this Jan 7, 2025
@wesen02

wesen02 commented Jan 14, 2025

Is there any news about the docker updates?
