Kubernetes / OKD restarting containers

r.sander · March 19, 2020, 8:56am

Hi,

how do you deal with containers as hosts in checkmk and k8s / okd restarting the pod creating new container instances but not instantly deleting the old ones that are not running any more?

Checkmk’s dcd creates new hosts for the new container instances but will not remove the not running containers resulting in down hosts alerts.

Do we need to adjust the aggressiveness of k8s’ “garbage collection” to remove old non-running containers earlier?

rprengel · March 19, 2020, 12:39pm

We have similar problems but how should check-mk decide that a deleted container is not a problem?
Missing systems with a defined tag or all system in a seperate site are ok?
Ralf

r.sander · March 19, 2020, 1:44pm

That’s exactly the issue. What is the reason for k8s to have the old containers still around?

rprengel · March 19, 2020, 2:20pm

I m working with rancher and don t know k8s.
Perhaps a tag or marker that blocks deleting.
My strategy is that all hosts in a special site can be deleted if there is a problem.
Not the perfect way but ok for us.
Ralf

andreas-doehler · March 19, 2020, 2:21pm

kubelet Garbage Collection
Is not the “MinAge” a good option to configure how long a dead container is still existing inside the k8s.
And for retrieving log information from a dead container it is not so bad to have it accessible for some time.

r.sander · March 19, 2020, 2:55pm

Why should there be log information inside a container?

rprengel · March 19, 2020, 4:33pm

Perhaps as an option to solve your problem ,-)
Container technology needs some more years to „grow up“ allthough containers are a really big thing.
Ralf

andreas-doehler · March 20, 2020, 7:57am

Inside a raw k8s the last information about a dead container is only inside the container if i remember it correctly. But if you don’t need these information for troubleshooting then the “MinAge” setting with a value of 0 should remove every dead container.

system · April 24, 2020, 8:55pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.