Commercial Use of Text Generation WebUI for High-Concurrency Applications #5915
Unanswered
SpaceDandy13
asked this question in
Q&A
Replies: 1 comment
-
Hi, |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello everyone,
I'm currently exploring the feasibility of deploying a text generation WebUI for commercial use and have a couple of questions regarding its licensing and performance capabilities.
Licensing Concerns with AGPL-3.0: I plan to deploy my own model as an API service without modifying the original WebUI code. Given that the WebUI is licensed under AGPL-3.0, it should be ok?
Performance for High-Concurrency: The service needs to support over 100 concurrent users. Does the Text Generation WebUI have the capability to handle such a level of concurrency efficiently, or would I be better off directly deploying the model using a more specialized framework like VLLM? Additionally, how does its performance compare to using llama.cpp directly?
I'm new to large language models, any related information would be appreciated!
Beta Was this translation helpful? Give feedback.
All reactions