Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Storage discovery: handle ResourceNotReady #167

Open
defo89 opened this issue Nov 6, 2024 · 3 comments · May be fixed by #174
Open

Storage discovery: handle ResourceNotReady #167

defo89 opened this issue Nov 6, 2024 · 3 comments · May be fixed by #174
Labels
bug Something isn't working

Comments

@defo89
Copy link
Contributor

defo89 commented Nov 6, 2024

Describe the bug
While server is in Discovery phase and is restarting or booting, Storage resource may not be ready to respond to request. Maybe this can be handled somehow?

2024-11-06T10:48:35.544570402Z 2024-11-06T10:48:35Z	ERROR	Reconciler error	{"controller": "server", "controllerGroup": "metal.ironcore.dev", "controllerKind": "Server", "Server": {"name":"node009-pod001-system-0"}, "namespace": "", "name": "node009-pod001-system-0", "reconcileID": "a313bec7-2675-419c-83ae-46b9bbac1bfa", "error": "failed to ensure server state transition: failed to get storages for Server: failed to retrieve some items: [{\"link\":\"/redfish/v1/Systems/1/Storage/\",\"error\":\"400: {\\\"error\\\":{\\\"code\\\":\\\"iLO.0.10.ExtendedInfo\\\",\\\"message\\\":\\\"See @Message.ExtendedInfo for more information.\\\",\\\"@Message.ExtendedInfo\\\":[{\\\"MessageArgs\\\":[\\\"5, (82,00,00)\\\"],\\\"MessageId\\\":\\\"iLO.2.27.ResourceNotReadyRetry\\\"}]}}\"}]"}
2024-11-06T10:48:35.544614080Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).reconcileHandler
2024-11-06T10:48:35.544621554Z 	/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:316
2024-11-06T10:48:35.544627533Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).processNextWorkItem
2024-11-06T10:48:35.544631934Z 	/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:263
2024-11-06T10:48:35.544657010Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).Start.func2.2
2024-11-06T10:48:35.544661277Z 	/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:224
@defo89 defo89 added the bug Something isn't working label Nov 6, 2024
@defo89
Copy link
Contributor Author

defo89 commented Nov 6, 2024

This particular error happened with

  manufacturer: HPE
  model: ProLiant DL380 Gen11

Will observe how it look on other servers.

@afritzler
Copy link
Member

Seems like a similar issue we are seeing here: #170

@stefanhipfel
Copy link
Contributor

Seems like a similar issue we are seeing here: #170

I use polling now to wait for redfish resources

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: Backlog
Development

Successfully merging a pull request may close this issue.

3 participants