UnexpectedAdmissionError when creating a pod #596

Open
jfboy233 opened this issue Nov 6, 2024 · 1 comment

Comments

@jfboy233

jfboy233 commented Nov 6, 2024

Please provide an in-depth description of the question you have:
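When I create a pod, it fails with UnexpectedAdmissionError. The message is: "Allocate failed due to requested number of devices unavailable for nvidia.com/gpu. Requested: 1, Available: 0, which is unexpected". If I delete the nvidia.com/gpu:1 line from the YAML, the pod is scheduled successfully, but it does not show up on the 31993/metrics endpoint.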
What do you think about this question?:
Where should I start troubleshooting this?
Environment:

  • HAMi version:
  • Kubernetes version:
  • Others:

@Nimbus318
Contributor

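  1. Please provide the annotations of the GPU node.
  2. Are there any other Running GPU pods on this node?
  3. Your HAMi version may be somewhat old. Try upgrading to 2.4.0; a number of issues were fixed along the way.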
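
The first two checks can be sketched in Python. This is only an illustrative helper, not part of HAMi: the function names `node_annotations` and `running_gpu_pods` are hypothetical, and it assumes the input is the JSON that `kubectl get node <node-name> -o json` and `kubectl get pods -A -o json` produce, with GPU requests expressed as integer `nvidia.com/gpu` limits.

```python
GPU_RESOURCE = "nvidia.com/gpu"  # resource name taken from the error message


def node_annotations(node: dict) -> dict:
    """Return the annotations of a node object (empty dict if none are set)."""
    return node.get("metadata", {}).get("annotations") or {}


def running_gpu_pods(pod_list: dict, node_name: str) -> list:
    """Return (namespace/name, gpu_count) for Running pods on node_name
    whose containers set a limit for GPU_RESOURCE."""
    result = []
    for pod in pod_list.get("items", []):
        spec = pod.get("spec", {})
        if spec.get("nodeName") != node_name:
            continue
        if pod.get("status", {}).get("phase") != "Running":
            continue
        total = 0
        for container in spec.get("containers", []):
            limits = container.get("resources", {}).get("limits") or {}
            # Assumes whole-number quantities such as "1", as in the issue.
            total += int(limits.get(GPU_RESOURCE, 0))
        if total > 0:
            meta = pod.get("metadata", {})
            result.append((f'{meta.get("namespace")}/{meta.get("name")}', total))
    return result
```

To use it, save the two kubectl outputs to files, load them with `json.load`, and pass the resulting dicts to the functions. Any pod returned by `running_gpu_pods` is already holding a GPU on that node, which is one possible explanation for "Available: 0".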
