OMD Site Status Stopped & incorrect Check_MK Agent version reported

CMK version: 2.1.0p20 CRE
OS version: Ubuntu 22.04 VM

Error message:

Output of “cmk --debug -vvn hostname”:

Checkmk version 2.1.0p20
Try license usage history update.
Trying to acquire lock on /omd/sites/metro/var/check_mk/license_usage/next_run
Got lock on /omd/sites/metro/var/check_mk/license_usage/next_run
Trying to acquire lock on /omd/sites/metro/var/check_mk/license_usage/history.json
Got lock on /omd/sites/metro/var/check_mk/license_usage/history.json
Next run time has not been reached yet. Abort.
Releasing lock on /omd/sites/metro/var/check_mk/license_usage/history.json
Released lock on /omd/sites/metro/var/check_mk/license_usage/history.json
Releasing lock on /omd/sites/metro/var/check_mk/license_usage/next_run
Released lock on /omd/sites/metro/var/check_mk/license_usage/next_run
+ FETCHING DATA
  Source: SourceType.HOST/FetcherType.PIGGYBACK
[cpu_tracking] Start [7facabd334f0]
[PiggybackFetcher] Fetch with cache settings: NoCache(checkmk-metro, base_path=/omd/sites/metro/tmp/check_mk/data_source_cache/piggyback, max_age=MaxAge(checking=0, discovery=120, inventory=120), disabled=True, use_outdated=False, simulation=False)
Not using cache (Cache usage disabled)
[PiggybackFetcher] Execute data source
No piggyback files for 'checkmk-metro'. Skip processing.
No piggyback files for '127.0.1.1'. Skip processing.
Not using cache (Cache usage disabled)
[cpu_tracking] Stop [7facabd334f0 - Snapshot(process=posix.times_result(user=0.0, system=0.0, children_user=0.0, children_system=0.0, elapsed=0.0))]
+ PARSE FETCHER RESULTS
  Source: SourceType.HOST/FetcherType.PIGGYBACK
No persisted sections
  -> Add sections: []
Received no piggyback data
[cpu_tracking] Start [7facabd33a30]
value store: synchronizing
Trying to acquire lock on /omd/sites/metro/tmp/check_mk/counters/checkmk-metro
Got lock on /omd/sites/metro/tmp/check_mk/counters/checkmk-metro
value store: loading from disk
Releasing lock on /omd/sites/metro/tmp/check_mk/counters/checkmk-metro
Released lock on /omd/sites/metro/tmp/check_mk/counters/checkmk-metro
No piggyback files for 'checkmk-metro'. Skip processing.
No piggyback files for '127.0.1.1'. Skip processing.
[cpu_tracking] Stop [7facabd33a30 - Snapshot(process=posix.times_result(user=0.050000000000000044, system=0.0, children_user=0.0, children_system=0.0, elapsed=0.05000000074505806))]
execution time 0.1 sec | execution_time=0.050 user_time=0.050 system_time=0.000 children_user_time=0.000 children_system_time=0.000 cmk_time_agent=0.000

A little history first - the Site was initially installed at v 2.1.0p13 & upgrade to p18 - then finally today to p20. I also changed some settings in the gui & nagios.conf because of high cpu usage as per this post: https://forum.checkmk.com/t/checkmk-raw-host-service-check-timeouts/29378/7 which did significantly reduce my CPU usage, I currently have 1300+ services monitored.

I then noticed that the OMD site status had stopped for my CheckMK server. On further checking - I can also see that the Check_MK Agent is reported as 2.1.0p13 & the Livestatus version: 2.1.0p18. I reinstalled the p20 version of the agent - restarted the server & it’s still the same. I also deleted the host & it’s also still the same.

For completeness:

$ omd status
agent-receiver: running
mkeventd:       running
rrdcached:      running
npcd:           running
nagios:         running
apache:         running
redis:          running
crontab:        running
-----------------------
Overall state:  running
$ omd status --bare --auto
agent-receiver 0
mkeventd 0
rrdcached 0
npcd 0
nagios 0
apache 0
redis 0
crontab 0
OVERALL 0```

This sounds like a wild mixture of issues.
First: The agent and the site itself are not connected version-wise, meaning the agent can have a different version than the site.
What is your current status?

This topic was automatically closed 365 days after the last reply. New replies are no longer allowed. Contact an admin if you think this should be re-opened.