Custom model orchestration #6145
-
Hi,
Replies: 3 comments
-
Are you using a model to load/unload other models? That sounds like a poor separation of duties and can run into a lot of trouble. The intent of the model loading/unloading API is to be called by the user. Could you describe your use case further? Python backend models are created in their own processes, so it's possible they are not communicating on the same network as the Triton server. CC: @Tabrizian
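For context, here is a minimal sketch of the user-facing model control flow referred to above, assuming the server was started with `--model-control-mode=explicit` and the `tritonclient` Python package is installed; the model name `my_model` is just a placeholder:

```python
# Minimal sketch: a client calling Triton's model load/unload API,
# assuming the server runs with --model-control-mode=explicit.
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Ask the server to load a model from the model repository.
client.load_model("my_model")
print("ready:", client.is_model_ready("my_model"))

# Unload it when it is no longer needed, freeing its resources.
client.unload_model("my_model")
```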
-
Ray was designed to handle situations like this. https://github.com/autonomi-ai/nos supports serving multiple models on the same hardware (with loading/unloading) if you want to try it out.
-
@tingc9 We've added model load/unload support in the Python backend starting from 23.07, which might be what you're looking for: https://github.com/triton-inference-server/python_backend?tab=readme-ov-file#model-loading-api
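For reference, a minimal sketch of that in-process model loading API based on the linked README; it assumes the server runs in explicit model control mode, and `onnx_model` is only a placeholder for the model being orchestrated:

```python
# Minimal sketch of the Python backend model loading API (23.07+),
# assuming --model-control-mode=explicit; "onnx_model" is a placeholder.
import triton_python_backend_utils as pb_utils


class TritonPythonModel:
    def initialize(self, args):
        self.model_name = "onnx_model"
        # Load the dependent model from the model repository if it is not ready.
        if not pb_utils.is_model_ready(model_name=self.model_name):
            pb_utils.load_model(model_name=self.model_name)

    def execute(self, requests):
        # ... run inference, possibly delegating work to self.model_name ...
        return []

    def finalize(self):
        # Unload the dependent model when this orchestrator model is torn down.
        pb_utils.unload_model(model_name=self.model_name)
```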