Bug: operator-inventory fails to detect GPU/CPU, causing unlabeled nodes and null in GRPC status
#240
Labels: repo/provider (Akash provider-services repo issues)
Comments
Have observed the same on
Fixed by bouncing the
I've noticed that operator-inventory is consuming 100% CPU (out of the 2 CPUs it is allocated via the Helm chart). That might be contributing to the issue.
The operator-inventory occasionally fails to detect the GPU/CPU, resulting in worker nodes remaining unlabeled. Consequently, the GRPC status endpoint returns null for cpu_info and/or gpu_info, which in turn affects the Cloudmos / Console API statistics.
SW versions
Logs
https://gist.github.com/andy108369/49bcc40a15b85de75cb3f1808a32c1f9
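To check whether a provider is currently affected, one option is to dump the status response as JSON and scan it for null cpu_info / gpu_info values. The snippet below is only a rough sketch: the helper name find_null_hw_info is made up for illustration, and because the exact layout of the status payload is not shown in this issue, it walks the whole decoded document instead of assuming any particular nesting. Only the two field names come from the report above.

```python
from typing import Any, Iterator

# Field names taken from the issue description.
FIELDS = ("cpu_info", "gpu_info")


def find_null_hw_info(payload: Any, path: str = "$") -> Iterator[str]:
    """Yield a JSON-style path for every cpu_info/gpu_info that is null.

    `payload` is any structure produced by json.loads(); the nesting of the
    real status response is an assumption-free unknown here, so every dict
    and list is searched recursively.
    """
    if isinstance(payload, dict):
        for key, value in payload.items():
            child = f"{path}.{key}"
            if key in FIELDS and value is None:
                yield child
            else:
                yield from find_null_hw_info(value, child)
    elif isinstance(payload, list):
        for index, item in enumerate(payload):
            yield from find_null_hw_info(item, f"{path}[{index}]")
```

Feeding it the result of json.loads() on a saved status dump and printing the yielded paths should show which entries came back without CPU or GPU details. An empty result would suggest the inventory was detected for every entry in that dump.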