SageMaker AI Inference Endpoint Support #601

Open
JPfeifer21 opened this issue Mar 5, 2025 · 1 comment

Comments

@JPfeifer21

Is your feature request related to a problem? Please describe.

Our team is facing challenges with managing costs associated with AWS SageMaker AI Inference Endpoints. It is particularly frustrating when developers forget to manually scale down endpoints before leaving for the day, resulting in unnecessary operational costs during non-working hours in our test environments.

Describe the feature you'd like

We would like the AWS Instance Scheduler to support automatic scaling down and scaling up of SageMaker AI Inference Endpoints. This would allow endpoints to be managed based on a predefined schedule, similar to how EC2 and RDS instances are managed.

@CrypticCabub
Member

Thanks for submitting this feature request! We'll add it to our backlog. If anybody else would also like to see this feature, please thumbs-up the original post to help with prioritization.
