Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor: reintroduce LLM_MODEL Env Variable for Flexible Model Implementation in chatqna-xeon-ui-server #1265

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

jotpalch
Copy link
Contributor

Description

This update reintroduces the environment variable LLM_MODEL to the chatqna-xeon-ui-server, enabling flexible model implementation.

Issues

When attempting to use a model other than Intel/neural-chat-7b-v3-3, the request header from the UI service to the backend still contains the fixed model identifier 'model': 'Intel/neural-chat-7b-v3-3', causing issues with model flexibility.

Screenshot:

image

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

Copy link

github-actions bot commented Dec 18, 2024

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants