Lots of false "Check_MK: host UP->Down" and vice versa

neilyoung · December 19, 2025, 6:55am

CMK version: 2.2.0p22
OS version: Server Ubuntu 20.04 ARM, Host Ubuntu 22.04 AMD64

Error message: Host Up→Down / Down→ Up, false alerts

Output of “cmk --debug -vvn hostname”: (If it is a problem with checks or plugins)

OMD[site]:~$ cmk --debug -vvn

Checkmk version 2.2.0p22

FETCHING DATA
Source: SourceInfo(hostname=‘’, ipaddress=‘’, ident=‘agent’, fetcher_type=TCP)
Read from cache: AgentFileCache(, path=/omd/sites//tmp/check_mk/cache/)
Connecting via TCP to :6556
Detected transport protocol: PLAIN
Reading data from agent
Write data to cache file /omd/sites//tmp/check_mk/cache/

PARSE FETCHER RESULTS
<<<check_mk>>>
<<<df_v2>>>
<<<systemd_units>>>
<<<ps_lnx>>>
<<>>
<<>>
<<>>
<<<lnx_if>>>
<<>>
<<>>
<<>>
<<<lnx_thermal>>>
<<>>
<<>>

Host ‘’ → sections parsed successfully
No piggyback data received

SERVICE RESULTS

HTTP-Service-1        HTTP check OK
HTTP-Service-2        HTTP check OK
Check_MK Agent        Version: 2.2.0p22, OS: linux, TLS not activated
CPU load              15 min load: 0.00
CPU utilization       Total CPU: 0.36%
Disk IO SUMMARY       Read: 0.00 B/s, Write: 12.6 kB/s
Filesystem /          Used: 8.57% of total
Network Interface 1   Speed: 1 GBit/s (expected: 100 MBit/s)(!)
Network Interface 2   Speed: unknown
Systemd Services      Total: 218, Failed: 0
Memory                Used: 8.34% of total
TCP Connections       Established: 8
Temperature Zone 0    27.8 °C
Temperature Zone 1    32.0 °C
Temperature Zone 2    33.0 °C
Uptime                183 days
Custom Job            last run 75 hours ago

No piggyback files found

[agent] Success, [piggyback] Success (no data)
Execution time: ~2.0 sec

Both node and host using a tailscale tunnel only to communicate, firewall closed for anything else on the host (node is in a DMZ). It generally works fine, but maybe due to tunnelling problems I’m constantly getting message like these for 30 seconds or a minute duration:

followed by

Those are just temporary issues and I would like to get rid of these, tried TCP already and a higher timeout, but all this wasn’t helpful.

bernhard.dolezal · December 19, 2025, 7:13am

Hi,

I had the same problem with the smart ping, this is because, the smart ping accepts the tcp packets as ok.

I run a ping command from the shell to check if this is the problem, sometimes the ping is disabled.

If you want a bit more infos: Special characteristics of the CMC

BR

Berni

neilyoung · December 19, 2025, 7:43am

Hmm. A shell ping from my server to the host works fine, not so the other way…It clearly addresses the proper tunnel endpoint, but no way…. Strange.

But what is the problem with the so called Smart Ping? If it accepts TCP as OK, that would be fine. For me it seems, the tunnel just looses some of those packets…

neilyoung · December 19, 2025, 7:46am

Oh, the mystery of the failing PING from node to server is solved. It was a problem with the tailscale ACL.

neilyoung · December 19, 2025, 7:37pm

Can the smart ping be disabled somehow?

andreas-doehler · December 20, 2025, 6:18am

In your screenshots there it is not using smart ping but a normal ping check.

If you use smart ping (enterprise/cloud only) then you have no metrics available for the host check. But if you have enterprise edition in use you can decide for the host check what type of check it should use (ping / smart ping / service state and so on).

neilyoung · December 20, 2025, 11:17am

Ok, so no way in CE I guess

andreas-doehler · December 20, 2025, 11:39am

In Raw / Community edtion you can also change the host check command to something else than ping. Most times where i cannot ping a device i use the status of the Check_MK service as host status.

neilyoung · December 20, 2025, 2:45pm

Oh that would help. Maybe it is documented how to achieve that?

Thanks

EDIT: Found it, activated it. Let’s see if it works

rdot84 · December 20, 2025, 6:32pm

So how did you change the host check command to something else than ping?

Jay2k1 · December 20, 2025, 7:25pm

Click Setup and search for host check command, then create a rule there for your host and choose an alternative.

neilyoung · December 20, 2025, 7:30pm

Exactly like so. Thanks

openmindz · December 20, 2025, 7:39pm

Unrelated to topic in and on itself, just a “language” note (please forgive me):

No, it doesn’t. It seems that it loses packets, with one “o”

Helpful reference: Lose vs. Loose: How to Use Each Correctly | Merriam-Webster

Regards,
Thomas