SageMaker AI Inference Endpoint Support #601

JPfeifer21 · 2025-03-05T08:52:37Z

Is your feature request related to a problem? Please describe.

Our team is facing challenges with managing costs associated with AWS SageMaker AI Inference Endpoints. It is particularly frustrating when developers forget to manually scale down endpoints before leaving for the day, resulting in unnecessary operational costs during non-working hours in our test environments.

Describe the feature you'd like

We would like the AWS Instance Scheduler to support automatic scaling down and scaling up of SageMaker AI Inference Endpoints. This would allow endpoints to be managed based on a predefined schedule, similar to how EC2 and RDS instances are managed.

CrypticCabub · 2025-03-06T14:33:51Z

Thanks for submitting this feature request! We'll add it to our backlog. If anybody else would also like to see this feature, please thumbs-up the original post to help with priorititization.

JPfeifer21 added the enhancement label Mar 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SageMaker AI Inference Endpoint Support #601

SageMaker AI Inference Endpoint Support #601

JPfeifer21 commented Mar 5, 2025

CrypticCabub commented Mar 6, 2025

SageMaker AI Inference Endpoint Support #601

SageMaker AI Inference Endpoint Support #601

Comments

JPfeifer21 commented Mar 5, 2025

CrypticCabub commented Mar 6, 2025