Load/Unload model via API #1167

Jonseed · 2024-10-14T21:06:12Z

It would be great if the API could load a model into VRAM when a request arrives (if it is not already loaded) and unload it from VRAM on request, similar to keep_alive in Ollama. Keeping the model loaded for as long as the KoboldCpp server is running is a problem in ComfyUI workflows where other services and models (for example, image generation) also need VRAM. This might also allow swapping or changing models via the API.
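To make the requested behavior concrete, here is a rough sketch of load-on-request plus idle unloading, loosely modeled on the keep_alive idea. Everything in it is hypothetical: `ModelSlot`, `request`, `expire_if_idle`, and the `keep_alive` semantics (seconds, `None` meaning "stay loaded", `0` meaning "unload right after the request") are assumptions made for illustration, not an existing KoboldCpp API.

```python
import time
from typing import Callable, Optional


class ModelSlot:
    """Hypothetical holder for a single model; not part of KoboldCpp."""

    def __init__(self, loader: Callable[[str], object],
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._loader = loader      # stand-in for whatever loads weights into VRAM
        self._clock = clock        # injectable so the timeout can be tested
        self.name: Optional[str] = None
        self.model: Optional[object] = None
        self._expires_at: Optional[float] = None  # None means "never expires"

    def request(self, name: str, keep_alive: Optional[float] = 300.0) -> object:
        """Return the model for this request, loading or swapping if needed."""
        self.expire_if_idle()
        if self.model is None or self.name != name:
            self.unload()                      # free VRAM before swapping models
            self.model = self._loader(name)
            self.name = name
        model = self.model
        if keep_alive is None:
            self._expires_at = None            # stay loaded until told otherwise
        elif keep_alive <= 0:
            self.unload()                      # release VRAM right after this request
        else:
            self._expires_at = self._clock() + keep_alive
        return model

    def expire_if_idle(self) -> None:
        """Unload if the keep-alive window has passed (call periodically)."""
        if (self.model is not None and self._expires_at is not None
                and self._clock() >= self._expires_at):
            self.unload()

    def unload(self) -> None:
        """Drop the reference so the backend could release the VRAM."""
        self.model = None
        self.name = None
        self._expires_at = None


if __name__ == "__main__":
    slot = ModelSlot(loader=lambda name: f"<weights for {name}>")
    slot.request("text-model", keep_alive=0)   # load, use, unload immediately
    print(slot.model is None)
```

With something along these lines, a ComfyUI workflow could send a text request with `keep_alive=0` and have the VRAM free again before an image-generation step runs. Requesting a different model name would act as a model swap.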

Replies: 0 comments
