In some cluster deployments, an init container that connects to the API server very quickly can hit a connection error. "Connection refused" and HTTP 500 errors should be retryable for a configurable amount of time, defaulting to 60s with a 2s sleep between attempts.
I guess the main benefit here would be to retry things based on our own retry logic instead of relying on container restarts, which would also retry those requests. Overall I'm not sure this is super critical. It is obviously "nicer" to retry and not panic, but if you have a k8s setup with a ton of connection issues when talking to the API server, then it's probably more important to fix those issues.
this actually came up in EKS, where AWS handles scaling of the control plane. Sometimes it's just transient network issues; other times it could be a delay in the CNI
humio-operator/images/helper/main.go
Line 428 in da5e41a