-
Notifications
You must be signed in to change notification settings - Fork 187
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Autoscaling #1319
Comments
Hi @bkmgit - we have a "Cloud Working Group" tackling this right now. The goal is to have a cloud equivalent of the current on-premises recipes, but skipping the un-needed parts (Warewulf/xCAT) and instead dealing with automated scale up/down of compute nodes, as well as other considerations (which instance types make sense, what storage to use, etc). I'll drop another message here when there is something for you to try out. |
@ChrisDowning Thanks. Mailing list may be a helpful thing to have as indicated at openhpc/cloudwg#13 |
@ChrisDowning I'd definitely be interested in hearing what happens too. We have done proof-of-concept work on Slurm autoscaling using OpenHPC before, although it's not ready for production. |
@sjpb Great - will keep you in the loop. I've deployed auto-scaling using Slurm power-saving for customers a few times over the last ~18 months, just never using the OpenHPC build. Deploying the same basic functionality using the OpenHPC Slurm package is pretty trivial, so we need to just get it documented first then move on to the "best practices" and other considerations people might not be aware of if they are new to cloud. |
A friendly reminder that this issue had no activity for 30 days. |
Slurm supports autoscaling which is very helpful for cloud deployments. Might this be something that can be included and made relatively easy to configure?
The text was updated successfully, but these errors were encountered: