When any one or more than one worker node is down, checkmk does not provide metrics from other nodes which are in ready state and gives below error, here worker-2 instance is stopped

CMK version:
2.1.0
OS version:
Kubernetes 1.23.1
Error message:
[special_kube] Agent exited with code 1: Failed to establish a connection to 10.160.0.12:6443 at URL /api/v1/nodes/worker-3/proxy/healthzCRIT , Got no information from hostCRIT , execution time 49.6 sec

Not able to get metrics for any worker nodes which are up and ready, when any one of worker is down…However able to get metrics from all worker nodes when all nodes are in ready state…
Below is screenshot of kubernetes cluster Dashboard

Output of “cmk --debug -vvn hostname”: (If it is a problem with checks or plugins)

This topic was automatically closed 365 days after the last reply. New replies are no longer allowed. Contact an admin if you think this should be re-opened.