Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate best way to do scaling of bedrock pods #1298

Open
duallain opened this issue May 1, 2020 · 3 comments
Open

Investigate best way to do scaling of bedrock pods #1298

duallain opened this issue May 1, 2020 · 3 comments

Comments

@duallain
Copy link
Contributor

duallain commented May 1, 2020

It seems like cpu/memory may not be the best signal to know when to spin up new bedrock instances. If we choose a metric that is better fitted to bedrock's capabilities we can likely deploy fewer pods, and need less k8s infrastructure resources. We should investigate a few items:

  1. How do we expose additional metrics to HPA objects. (There are quite a few choices here, what's best?)
  2. What metric is most reflective of load on bedrock (this implies some perf testing)
  3. (generate stories for) deploying new metrics for HPA
  4. (generate stories for) deploying bedrock prod/stg/dev with the new HPA metrics
  5. Test the hpa works, ensure latency between origin/cdn doesn't change.
@bookshelfdave
Copy link
Contributor

see also: https://github.com/kubernetes/autoscaler/tree/master/vertical-pod-autoscaler

@duallain
Copy link
Contributor Author

duallain commented May 7, 2020

https://github.com/FairwindsOps/goldilocks can maybe help with tuning the VPA. It seems like you basically install goldilocks + vpa and then get changes recommended. Which is rad as all get out.

@bookshelfdave
Copy link
Contributor

let's give Goldilocks a shot!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants