CMK version:
Open Monitoring Distribution Version 2.1.0p24.cfe
OS version:
Ubuntu 20.04.6 LTS
Error message:
I am a real newbie, I just got to know checkmk only yesterday and I have installed it today. The purpose is that we do have some dedicated hosts on serverloft and some workloads on AWS. In AWS we are only interested in 2 EC2 instances. They are used as encryption domain for some VPNs we will need the agents on this EC2 boxes to monitor upstream services that we depend on. After installing the agent on the EC2 boxes (Ubuntu 18.04.3 LTS) they both reported down with:
CRIT Check_MK [agent] Communication failed: timed outCRIT, execution time 5.0 sec
CRIT Check_MK Discovery no unmonitored services found, no vanished services found, no new host labels, [agent] Communication failed: timed out
Then I added 2 serverloft hosts to it to find out whether I am the one making mistakes. That as well turns out to be interesting. Both are 2 but not without issues.
the first one is up but with warning because I didn’t choose to monitor all the services. Wondering how to clear that warning.
The second one though saying it’s up, the Check_MK and Check_MK Discovery are both saying the same message as the EC2 boxes that are shown as down. So I am confused.
I would like to use this opportunities to understand the tool and build some experience on how to troubleshoot , what to check and how to check it.
So community please where do I start. Thanks in advance