How to Remove the Critical/Warning alarm on the following services Check_MK/Check_MK Discovery

Hi folks, I’m not really good at this, can you help me please remove the alarms below?
Thank you in advance.

Hi @myeric23

the clue is in the service description:
The Checkm_MK service is CRIT because there was a timeout (the host is flapping as well, as indicated by the red and green dot icon)

So there seems to be some kind of connectivity issue with the host. You seem to be using SNMP as a datasource, did you run a connection test?
image

The reasons for this can be manifold, good starting points in troubleshooting are:

and

The WARN on the discovery is because the last time the discovery ran, a new host label was discovered.
image

To get rid of that, you can update the host labels (Service Discovery → Actions → Update Host Labels)

But I guess you should try to fix the connectivity issue first…

1 Like

Thank you @elias, the update host labels worked!
For the Check_MK in CRIT I used SNMP as a datasource and did run a connection test and the results are okay.
However, I am still getting the same alarm, does that mean the IP of the host that I enrolled is intermittent or flapping?

Well, I am glad that we figured out one of the two. :slight_smile:

As for flapping: That one’s tricky, as it can have many different reasons.

It might be that the device is just slow: SNMP Service Alert: Check_MK flapping
—> Increase check interval to e.g. 5 minutes, see if the problem persists (read more here: Monitoring via SNMP - Monitoring of SNMP devices with Checkmk)

You might be reaching the limits of what the RAW can do (unlikely): CheckMK Raw Host & Service Check Timeouts - #4 by rprengel

or about 17 other reasons…

I would start with the check interval and retries and see if that fixes things.

1 Like

@elias.jan-niklas appreciate so much of your help, will try your recommendations later, I will update you. Thank you! :smile:

Wrong Elias, but happy to help :smiley:

1 Like

Apologies for the wrong mention, I think the CRIT alarm is also resolved now.
I think it’s just flapping earlier, the alarm cleared on its own and I did not change nor modify anything. I will bookmark the links you shared for my future reference as well.
Appreciate your help @elias.voelker . :smiley:

1 Like

Great. Sometimes problems just take care of themselves.
Happy monitoring!

1 Like

This topic was automatically closed 365 days after the last reply. New replies are no longer allowed. Contact an admin if you think this should be re-opened.