"Check_MK" service in CRIT state on Windows

Hi, a few days ago I started testing this software. After a few tests I installed it on a VM running Linux Mint 20.3. Installing it, setting up the site and registering the localhost went without problems, but the Windows hosts keep showing a CRIT state for the Check_MK service.
I’ve installed the agent and successfully registered it on Windows 7 x64 and Windows Server 2012,
but although the connection tests complete OK, the error persists.
I’ve read some posts, carried out the suggested tests, and uninstalled, reinstalled and registered the agent again.
Thanks in advance for any guidance that could help me.

CMK version: Raw Edition 2.1.0p2
OS version: Linux Mint 20.3

**Check_MK service in CRIT state.**

*Output of “cmk --debug -vvn hostname”:*

"Checkmk version 2.1.0p2
Try license usage history update.
Trying to acquire lock on /omd/sites/monitoring/var/check_mk/license_usage/next_run
Got lock on /omd/sites/monitoring/var/check_mk/license_usage/next_run
Trying to acquire lock on /omd/sites/monitoring/var/check_mk/license_usage/history.json
Got lock on /omd/sites/monitoring/var/check_mk/license_usage/history.json
Next run time has not been reached yet. Abort.
Releasing lock on /omd/sites/monitoring/var/check_mk/license_usage/history.json
Released lock on /omd/sites/monitoring/var/check_mk/license_usage/history.json
Releasing lock on /omd/sites/monitoring/var/check_mk/license_usage/next_run
Released lock on /omd/sites/monitoring/var/check_mk/license_usage/next_run
+ FETCHING DATA
  Source: SourceType.HOST/FetcherType.TCP
[cpu_tracking] Start [7f0433f28820]
[TCPFetcher] Fetch with cache settings: DefaultAgentFileCache(pc_amti01, base_path=/omd/sites/monitoring/tmp/check_mk/cache, max_age=MaxAge(checking=0, discovery=120, inventory=120), disabled=False, use_outdated=False, simulation=False)
Not using cache (Too old. Age is 350 sec, allowed is 0 sec)
[TCPFetcher] Execute data source
Connecting via TCP to 10.93.8.3:6556 (5.0s timeout)
Detected transport protocol: TransportProtocol.TLS (b'16')
Reading data from agent via TLS socket
Reading data from agent
Detected transport protocol: TransportProtocol.PLAIN (b'<<')
Write data to cache file /omd/sites/monitoring/tmp/check_mk/cache/pc_amti01
Trying to acquire lock on /omd/sites/monitoring/tmp/check_mk/cache/pc_amti01
Got lock on /omd/sites/monitoring/tmp/check_mk/cache/pc_amti01
Releasing lock on /omd/sites/monitoring/tmp/check_mk/cache/pc_amti01
Released lock on /omd/sites/monitoring/tmp/check_mk/cache/pc_amti01
Closing TCP connection to 10.93.8.3:6556
[cpu_tracking] Stop [7f0433f28820 - Snapshot(process=posix.times_result(user=0.010000000000000231, system=0.0, children_user=0.0, children_system=0.0, elapsed=3.9200000017881393))]
  Source: SourceType.HOST/FetcherType.PIGGYBACK
[cpu_tracking] Start [7f0433f28f10]
[PiggybackFetcher] Fetch with cache settings: NoCache(pc_amti01, base_path=/omd/sites/monitoring/tmp/check_mk/data_source_cache/piggyback, max_age=MaxAge(checking=0, discovery=120, inventory=120), disabled=True, use_outdated=False, simulation=False)
Not using cache (Cache usage disabled)
[PiggybackFetcher] Execute data source
No piggyback files for 'pc_amti01'. Skip processing.
No piggyback files for '10.93.8.3'. Skip processing.
Not using cache (Cache usage disabled)
[cpu_tracking] Stop [7f0433f28f10 - Snapshot(process=posix.times_result(user=0.0, system=0.010000000000000009, children_user=0.0, children_system=0.0, elapsed=0.009999997913837433))]
+ PARSE FETCHER RESULTS
  Source: SourceType.HOST/FetcherType.TCP
<<<check_mk>>> / Transition NOOPParser -> HostSectionParser
<<<cmk_agent_ctl_status:sep(0)>>> / Transition HostSectionParser -> HostSectionParser
<<<wmi_cpuload:sep(124)>>> / Transition HostSectionParser -> HostSectionParser
<<<uptime>>> / Transition HostSectionParser -> HostSectionParser
<<<mem>>> / Transition HostSectionParser -> HostSectionParser
<<< fileinfo:sep(124)>>> / Transition HostSectionParser -> HostSectionParser
<<< df:sep(9)>>> / Transition HostSectionParser -> HostSectionParser
<<<checkmk_agent_plugins_win:sep(0)>>> / Transition HostSectionParser -> HostSectionParser
<<<services>>> / Transition HostSectionParser -> HostSectionParser
<<<logwatch>>> / Transition HostSectionParser -> HostSectionParser
<<< ps:sep(9)>>> / Transition HostSectionParser -> HostSectionParser
<<<dotnet_clrmemory:sep(124)>>> / Transition HostSectionParser -> HostSectionParser
<<<winperf_phydisk>>> / Transition HostSectionParser -> HostSectionParser
<<<winperf_if>>> / Transition HostSectionParser -> HostSectionParser
<<<winperf_processor>>> / Transition HostSectionParser -> HostSectionParser
Transition HostSectionParser -> NOOPParser
Transition NOOPParser -> NOOPParser
<<<systemtime>>> / Transition NOOPParser -> HostSectionParser
No persisted sections
  -> Add sections: ['check_mk', 'checkmk_agent_plugins_win', 'cmk_agent_ctl_status', 'df', 'dotnet_clrmemory', 'fileinfo', 'logwatch', 'mem', 'ps', 'services', 'systemtime', 'uptime', 'winperf_if', 'winperf_phydisk', 'winperf_processor', 'wmi_cpuload']
  Source: SourceType.HOST/FetcherType.PIGGYBACK
No persisted sections
  -> Add sections: []
Received no piggyback data
Received no piggyback data
[cpu_tracking] Start [7f0433f287c0]
value store: synchronizing
Trying to acquire lock on /omd/sites/monitoring/tmp/check_mk/counters/pc_amti01
Got lock on /omd/sites/monitoring/tmp/check_mk/counters/pc_amti01
value store: loading from disk
Releasing lock on /omd/sites/monitoring/tmp/check_mk/counters/pc_amti01
Released lock on /omd/sites/monitoring/tmp/check_mk/counters/pc_amti01
No piggyback files for 'pc_amti01'. Skip processing.
No piggyback files for '10.93.8.3'. Skip processing.
[cpu_tracking] Stop [7f0433f287c0 - Snapshot(process=posix.times_result(user=0.0, system=0.0, children_user=0.0, children_system=0.0, elapsed=0.0))]
[agent] Success, execution time 3.9 sec | execution_time=3.930 user_time=0.010 system_time=0.010 children_user_time=0.000 children_system_time=0.000 cmk_time_agent=3.910"


*Result of “cmk-agent-ctl.exe status” on the Windows Server host:*

Version: 2.1.0p2
Agent socket: operational
IP allowlist: any


Connection: 172.32.0.70:8000/monitoring
        UUID: 6ffc3200-5df9-4240-9b95-61943792d46c
        Local:
                Connection type: pull-agent
                Certificate issuer: Site 'monitoring' local CA
                Certificate validity: Fri, 10 Jun 2022 23:36:16 +0000 - Wed, 11 Oct 3020 23:36:16 +0000
        Remote:
                Connection type: pull-agent
                Registration state: operational
                Host name: srvant-01

I see no error message in your command-line output. The server receives all the data from the agent, as you can see from the sections it finds.
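
If you want to check this quickly yourself, you can pull the section names out of the debug output. The snippet below is only a sketch: `list_sections` is a helper name I made up, and it relies on nothing more than the “-> Add sections: [...]” line format that appears in the log above.

```shell
# list_sections: read "cmk --debug -vv..." output on stdin and print
# every section name it reports, one per line.
# Hypothetical helper; it only assumes lines of the form
#   -> Add sections: ['check_mk', 'df', ...]
list_sections() {
    grep -e '-> Add sections: \[' \
        | sed -e 's/.*\[//' -e 's/\].*//' \
        | tr ',' '\n' \
        | tr -d "' " \
        | sed '/^$/d'
}
```

On the monitoring server you would pipe the command from the first post into it, for example `cmk --debug -vvn hostname 2>&1 | list_sections`. An empty result would mean the agent delivered no sections at all.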

What does “cmk --debug -vvII hostname” print on the command line?
If it runs without errors and ends with a list of discovered services, you can run “cmk -R” to activate these new services.
After that you should no longer see an error message in the web frontend.
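
The two commands can be chained as follows. This is only a sketch: `rediscover_and_reload` and the `CMK` override variable are hypothetical names of mine, and it assumes you run it as the site user (for example after `omd su monitoring`, with the site name taken from the paths in your log).

```shell
# rediscover_and_reload HOST
# Hypothetical helper: rediscover the services of HOST, then reload the core.
# CMK can be set to another command name for a dry run; it defaults to "cmk".
rediscover_and_reload() {
    host="$1"
    cmk_bin="${CMK:-cmk}"

    if [ -z "$host" ]; then
        echo "usage: rediscover_and_reload HOST" >&2
        return 2
    fi

    # Step 1: full rediscovery with debug output; stop here if it fails.
    "$cmk_bin" --debug -vvII "$host" || return 1

    # Step 2: regenerate the core configuration and restart the core.
    "$cmk_bin" -R || return 1
}
```

You would call it as `rediscover_and_reload hostname`. The reload is skipped when the discovery step fails, so a broken discovery does not get activated by accident.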

My only guess is that you forgot to activate the discovered services in the web interface. Could that be it?

Hi, thanks for the reply.
This is what I get when running the debug command:

Discovering services and host labels on: srvant-01
srvant-01:
+ FETCHING DATA
  Source: SourceType.HOST/FetcherType.TCP
[cpu_tracking] Start [7f48048d0040]
[TCPFetcher] Fetch with cache settings: DefaultAgentFileCache(srvant-01, base_path=/omd/sites/monitoring/tmp/check_mk/cache, max_age=MaxAge(checking=0, discovery=120, inventory=120), disabled=False, use_outdated=False, simulation=False)
Not using cache (Too old. Age is 224823 sec, allowed is 120 sec)
[TCPFetcher] Execute data source
Connecting via TCP to 172.32.0.92:6556 (5.0s timeout)
Detected transport protocol: TransportProtocol.TLS (b'16')
Reading data from agent via TLS socket
Reading data from agent
Detected transport protocol: TransportProtocol.PLAIN (b'<<')
Write data to cache file /omd/sites/monitoring/tmp/check_mk/cache/srvant-01
Trying to acquire lock on /omd/sites/monitoring/tmp/check_mk/cache/srvant-01
Got lock on /omd/sites/monitoring/tmp/check_mk/cache/srvant-01
Releasing lock on /omd/sites/monitoring/tmp/check_mk/cache/srvant-01
Released lock on /omd/sites/monitoring/tmp/check_mk/cache/srvant-01
Closing TCP connection to 172.32.0.92:6556
[cpu_tracking] Stop [7f48048d0040 - Snapshot(process=posix.times_result(user=0.009999999999999787, system=0.0, children_user=0.0, children_system=0.0, elapsed=6.8599999994039536))]
  Source: SourceType.HOST/FetcherType.PIGGYBACK
[cpu_tracking] Start [7f48048d0910]
[PiggybackFetcher] Fetch with cache settings: NoCache(srvant-01, base_path=/omd/sites/monitoring/tmp/check_mk/data_source_cache/piggyback, max_age=MaxAge(checking=0, discovery=120, inventory=120), disabled=True, use_outdated=False, simulation=False)
Not using cache (Cache usage disabled)
[PiggybackFetcher] Execute data source
No piggyback files for 'srvant-01'. Skip processing.
No piggyback files for '172.32.0.92'. Skip processing.
Not using cache (Cache usage disabled)
[cpu_tracking] Stop [7f48048d0910 - Snapshot(process=posix.times_result(user=0.0, system=0.0, children_user=0.0, children_system=0.0, elapsed=0.0))]
+ PARSE FETCHER RESULTS
  Source: SourceType.HOST/FetcherType.TCP
<<<check_mk>>> / Transition NOOPParser -> HostSectionParser
<<<cmk_agent_ctl_status:sep(0)>>> / Transition HostSectionParser -> HostSectionParser
<<<wmi_cpuload:sep(124)>>> / Transition HostSectionParser -> HostSectionParser
<<<uptime>>> / Transition HostSectionParser -> HostSectionParser
<<<mem>>> / Transition HostSectionParser -> HostSectionParser
<<< fileinfo:sep(124)>>> / Transition HostSectionParser -> HostSectionParser
<<<checkmk_agent_plugins_win:sep(0)>>> / Transition HostSectionParser -> HostSectionParser
<<<services>>> / Transition HostSectionParser -> HostSectionParser
<<< df:sep(9)>>> / Transition HostSectionParser -> HostSectionParser
<<<winperf_phydisk>>> / Transition HostSectionParser -> HostSectionParser
<<<winperf_if>>> / Transition HostSectionParser -> HostSectionParser
<<<winperf_processor>>> / Transition HostSectionParser -> HostSectionParser
<<< ps:sep(9)>>> / Transition HostSectionParser -> HostSectionParser
<<<logwatch>>> / Transition HostSectionParser -> HostSectionParser
<<<dotnet_clrmemory:sep(124)>>> / Transition HostSectionParser -> HostSectionParser
Transition HostSectionParser -> NOOPParser
Transition NOOPParser -> NOOPParser
<<<systemtime>>> / Transition NOOPParser -> HostSectionParser
No persisted sections
  -> Add sections: ['check_mk', 'checkmk_agent_plugins_win', 'cmk_agent_ctl_status', 'df', 'dotnet_clrmemory', 'fileinfo', 'logwatch', 'mem', 'ps', 'services', 'systemtime', 'uptime', 'winperf_if', 'winperf_phydisk', 'winperf_processor', 'wmi_cpuload']
  Source: SourceType.HOST/FetcherType.PIGGYBACK
No persisted sections
  -> Add sections: []
Received no piggyback data
Received no piggyback data
+ ANALYSE DISCOVERED HOST LABELS
Trying host label discovery with: check_mk, checkmk_agent_plugins_win, cmk_agent_ctl_status, df, dotnet_clrmemory, fileinfo, logwatch, mem, ps, services, systemtime, uptime, winperf_if, winperf_phydisk, winperf_processor, wmi_cpuload
  cmk/os_family: windows (check_mk)
Trying host label discovery with: 
Trying to acquire lock on /omd/sites/monitoring/var/check_mk/discovered_host_labels/srvant-01.mk
Got lock on /omd/sites/monitoring/var/check_mk/discovered_host_labels/srvant-01.mk
Releasing lock on /omd/sites/monitoring/var/check_mk/discovered_host_labels/srvant-01.mk
Released lock on /omd/sites/monitoring/var/check_mk/discovered_host_labels/srvant-01.mk
SUCCESS - Found 1 host labels
+ ANALYSE DISCOVERED SERVICES
+ EXECUTING DISCOVERY PLUGINS (27)
  Trying discovery with: systemtime, fileinfo, uptime, domino_tasks, wmi_cpuload, fileinfo_groups, winperf_if, mem_vmalloc, services, esx_vsphere_hostsystem_cpu_usage, mem_win, mssql_datafiles, logwatch_groups, dotnet_clrmemory, df, mem_linux, winperf_phydisk, logwatch_ec, logwatch_ec_single, docker_container_status_uptime, services_summary, logwatch, mssql_transactionlogs, checkmk_agent, winperf_processor_util, check_mk_only_from, ps
  1 checkmk_agent
  1 df
  1 dotnet_clrmemory
  9 logwatch
  1 mem_win
  1 services_summary
  1 systemtime
  1 uptime
  3 winperf_if
  1 winperf_phydisk
  1 winperf_processor_util
  1 wmi_cpuload
SUCCESS - Found 22 services

The result of “cmk -R”:
Generating configuration for core (type nagios)…
Precompiling host checks…OK
Validating Nagios configuration…OK
Restarting monitoring core…OK

The Windows hosts are still marked as CRIT.
And yes, I’ve activated the services in the web interface. You mean the yellow notice at the top right, right?
This is what my web interface looks like:
CMK Interface on 13062022.

And again, thanks for any guidance :slight_smile:

Can you please provide a screenshot showing what is CRIT?
On the linked screenshot I only see the two Check_MK services with the output “null”.
That is a problem with CMK 2.0 and the Raw Edition. I hadn’t seen it in 2.1 until now.
But it should be gone after one check interval.

Thanks for your time and replies. I noticed today that after running “cmk -R” the error with the agent disappeared.
