Are API requests queued? If not, is it possible to overload GPU with a bunch of users calling the API simultaneously? #5958
Unanswered
regunakyle
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Title.
Could I possibly be DOS-ed by too many requests at the same time? If yes, is there a setting in text-generation-webui that can help prevent this?
Beta Was this translation helpful? Give feedback.
All reactions