You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have a main branch but also a feature branch in my HF model repo that has a new version of a lora model. I want to directly load in a specific branch based on my multi_lora.json config file.
Additional context
When I have the config above, the container build correctly in the system logs, however when i go to run inference against the 8000 endpoint I get this error:
for additional context, I am sort of implementing my code off this example, however I loading everything from HF instead of locally and saving it in the image. Triton Multi-lora example
The text was updated successfully, but these errors were encountered:
Is your feature request related to a problem? Please describe.
Currently we cant load different branches off of HF model repos directly from the HF repo.
Current set up of multi-lora.json:
Describe the solution you'd like
I have a main branch but also a feature branch in my HF model repo that has a new version of a lora model. I want to directly load in a specific branch based on my multi_lora.json config file.
Additional context
When I have the config above, the container build correctly in the system logs, however when i go to run inference against the 8000 endpoint I get this error:
for additional context, I am sort of implementing my code off this example, however I loading everything from HF instead of locally and saving it in the image. Triton Multi-lora example
The text was updated successfully, but these errors were encountered: