How to send image to vision model #283

engdante · 2024-09-17T13:39:54Z

Please tell me the correct way to send an image to the vision model.

this is my function:

def generate_image_description(image_path):
prompt = f"Describe the content of this image: {image_path}."
response = client.chat(model='llava-phi3:3.8b', messages=[
{
'role': 'user',
'content': prompt,
},
])
return response['message']['content']

karanravindra · 2024-11-05T20:40:24Z

Please refer to the definition of a "chat message" in the python code Message Type Dict.

The image can be passed in using the "images" key in your message dictionary. The "images" key is a sequence of "bytes" or "path-like str".

Here is an example:

import ollama

response = ollama.chat(
    model="moondream",
    messages=[
        {"role": "user", "content": "Describe the image", "images": ["./cat.jpeg"]}
    ],
)

print(response["message"]['role'])
print(response["message"]['content'])

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to send image to vision model #283

How to send image to vision model #283

engdante commented Sep 17, 2024

karanravindra commented Nov 5, 2024

How to send image to vision model #283

How to send image to vision model #283

Comments

engdante commented Sep 17, 2024

karanravindra commented Nov 5, 2024