Host checks become stale

Hi,

We have one of our slaves in distributed monitoring that cannot seem to run every host check appropriately. All of it’s hosts checks go stale eventually and I cannot seem to get smartping to work on all of them at once.

I can get a portion of the hosts to work with smartping, but if I set all of them at once on smartping, everything goes down. When I try switching to regular ping instead, everything goes stale and I simply never updates.

We use 1.6.0p6, the server has quite a lot of snmp going on and about 1600 hosts. The machine has 24gb ram and the load is at about 4-5, on 8 cores.

I tried to remove as many rules as possible to keep the host checks “vanilla”, and it doesn’t seem to work.

Any suggestions? thanks.

First suggestion is update to 1.6.0p11 there was a bug until p9 or p10 where active checks (if you don’t use smart ping your host check is also an active check) where scheduled in the past. This leads to these checks are not executed.

I had this problem on some of my older updated systems and the problem is now gone after the upgrade to p11.

Updating to 1.6.0p11 fixed our problem. Thanks!

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.