Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added a Gradio UI for multi-modal inferencing using Llama 3.2 Vision/ #718

Open
wants to merge 13 commits into
base: main
Choose a base branch
from

Conversation

himanshushukla12
Copy link
Contributor

@himanshushukla12 himanshushukla12 commented Oct 8, 2024

What does this PR do?

This PR introduces multi-modal inference using the Gradio UI for Llama 3.2 vision models. The Gradio UI allows users to upload images and generate descriptive text based on a prompt, with adjustable parameters such as top-k, max-tokens, temperature and top-p for fine-tuning text generation. With chatbox like interface.

Additionally, this PR:

Integrates the transformers and accelerate libraries for efficient model loading and inference.
Implements memory management for releasing GPU resources after inference.
Adds support for Hugging Face tokens to authenticate and access Llama models.

Copy link
Contributor

@init27 init27 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the super fast PR! I left some requests

@himanshushukla12
Copy link
Contributor Author

@init27 I did the changes you asked, please check and let me know... I'll be happy to make it better

Modified readme for new code for passing token via argument
Used small case "g" in gradio
@himanshushukla12
Copy link
Contributor Author

@init27 added the changes you asked, please check...

@himanshushukla12
Copy link
Contributor Author

@init27 please let me know if anything required...
I'm waiting for your response😄

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants