RUN-20245 Support NodePool GPU Topology #86
gshaibi commented Jul 15, 2024
- private
- BaseTopology -> ClusterTopology
deploy/fake-gpu-operator/values.yaml
Outdated
gpuMemory: 11441
nodePoolLabelKey: run.ai/simulated-gpu-node-pool
mig-strategy: mixed
Why is it mig-strategy rather than migStrategy?
Oh thanks, my mistake
// labelNode labels the node with required labels for the fake-gpu-operator to function.
func (p *NodeHandler) labelNode(node *v1.Node) error {
	err := p.patchNodeLabels(node, map[string]interface{}{
		"nvidia.com/gpu.deploy.dcgm-exporter": "true",
Would it be helpful to make the labels consts?
Yes, absolutely
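For illustration, the constants suggestion might look roughly like the sketch below. Only the nvidia.com/gpu.deploy.dcgm-exporter key is taken from the diff above; the constant names and the requiredNodeLabels helper are hypothetical and not part of this PR, and the snippet is written as a standalone program so it runs without the operator's packages.

```go
package main

import (
	"fmt"
	"sort"
)

// Label keys and values gathered in one place. The constant names are
// assumptions made for this sketch; only the dcgm-exporter key itself
// appears in the diff under review.
const (
	labelDcgmExporterDeploy = "nvidia.com/gpu.deploy.dcgm-exporter"
	labelValueTrue          = "true"
)

// requiredNodeLabels is a hypothetical helper that builds the map which
// labelNode would hand to patchNodeLabels.
func requiredNodeLabels() map[string]interface{} {
	return map[string]interface{}{
		labelDcgmExporterDeploy: labelValueTrue,
	}
}

func main() {
	labels := requiredNodeLabels()

	// Sort the keys so the output order is stable.
	keys := make([]string, 0, len(labels))
	for k := range labels {
		keys = append(keys, k)
	}
	sort.Strings(keys)

	for _, k := range keys {
		fmt.Printf("%s=%v\n", k, labels[k])
	}
}
```

With the keys named once, labelNode and any code that later removes or checks the labels can share the same identifiers instead of repeating string literals.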