Hello, we have some monitored nodes that sometimes alert because there was no information from the host via snmp. The next check, it’s ok again. This happens throughout the day for this and a few other devices. If there’s any advice or suggestions, I’d appreciate it.
The snmp timing is 5 minute intervals. All devices are on inlinesnmp unless there is an issue. I tried disabling inline snmp for this device and it did not fix the issue. I’ve tried extending the snmp timeout to no avail.
Service Metrics execution_time=0.040 user_time=0.020 system_time=0.010 children_user_time=0.000 children_system_time=0.000 cmk_time_snmp=0.000
CMK version: 2.1.0p12
OS version: Oracle Linux Server 8.6 x86_64
Error message: [snmp] Success, Got no information from host CRIT , execution time 0.0 sec
Output of “cmk --debug -vvn hostname”: (If it is a problem with checks or plugins)
Checkmk version 2.1.0p12
Try license usage history update.
Trying to acquire lock on /omd/sites/default_site/var/check_mk/license_usage/next_run
Got lock on /omd/sites/default_site/var/check_mk/license_usage/next_run
Trying to acquire lock on /omd/sites/default_site/var/check_mk/license_usage/history.json
Got lock on /omd/sites/default_site/var/check_mk/license_usage/history.json
Next run time has not been reached yet. Abort.
Releasing lock on /omd/sites/default_site/var/check_mk/license_usage/history.json
Released lock on /omd/sites/default_site/var/check_mk/license_usage/history.json
Releasing lock on /omd/sites/default_site/var/check_mk/license_usage/next_run
Released lock on /omd/sites/default_site/var/check_mk/license_usage/next_run
+ FETCHING DATA
Source: SourceType.HOST/FetcherType.SNMP
[cpu_tracking] Start [7f2db980b520]
[SNMPFetcher] Fetch with cache settings: SNMPFileCache(server-1-iLO, base_path=/omd/sites/default_site/tmp/check_mk/data_source_cache/snmp, max_age=MaxAge(checking=0, discovery=120, inventory=120), disabled=False, use_outdated=False, simulation=False)
Not using cache (Too old. Age is 40 sec, allowed is 0 sec)
[SNMPFetcher] Execute data source
SNMP scan:
Getting OID .1.3.6.1.2.1.1.1.0: Executing SNMP GET of .1.3.6.1.2.1.1.1.0 on server-1-iLO
=> [b'Integrated Lights-Out 5 2.33 Dec 09 2020'] OCTETSTR
b'Integrated Lights-Out 5 2.33 Dec 09 2020'
Getting OID .1.3.6.1.2.1.1.2.0: Executing SNMP GET of .1.3.6.1.2.1.1.2.0 on server-1-iLO
=> [b'.1.3.6.1.4.1.232.9.4.11'] OBJECTID
b'.1.3.6.1.4.1.232.9.4.11'
Using cached OID .1.3.6.1.2.1.1.1.0: 'Integrated Lights-Out 5 2.33 Dec 09 2020'
SNMP scan found snmp_uptime
Trying to acquire lock on /omd/sites/default_site/tmp/check_mk/snmp_scan_cache/server-1-iLO.10.30.22.11
Got lock on /omd/sites/default_site/tmp/check_mk/snmp_scan_cache/server-1-iLO.10.30.22.11
Releasing lock on /omd/sites/default_site/tmp/check_mk/snmp_scan_cache/server-1-iLO.10.30.22.11
Released lock on /omd/sites/default_site/tmp/check_mk/snmp_scan_cache/server-1-iLO.10.30.22.11
Write data to cache file /omd/sites/default_site/tmp/check_mk/data_source_cache/snmp/checking/server-1-iLO
Trying to acquire lock on /omd/sites/default_site/tmp/check_mk/data_source_cache/snmp/checking/server-1-iLO
Got lock on /omd/sites/default_site/tmp/check_mk/data_source_cache/snmp/checking/server-1-iLO
Releasing lock on /omd/sites/default_site/tmp/check_mk/data_source_cache/snmp/checking/server-1-iLO
Released lock on /omd/sites/default_site/tmp/check_mk/data_source_cache/snmp/checking/server-1-iLO
[cpu_tracking] Stop [7f2db980b520 - Snapshot(process=posix.times_result(user=0.04999999999999982, system=0.0, children_user=0.0, children_system=0.0, elapsed=0.05999999959021807))]
+ PARSE FETCHER RESULTS
Source: SourceType.HOST/FetcherType.SNMP
Trying to acquire lock on /omd/sites/default_site/var/check_mk/persisted_sections/snmp/server-1-iLO
Got lock on /omd/sites/default_site/var/check_mk/persisted_sections/snmp/server-1-iLO
Releasing lock on /omd/sites/default_site/var/check_mk/persisted_sections/snmp/server-1-iLO
Released lock on /omd/sites/default_site/var/check_mk/persisted_sections/snmp/server-1-iLO
Stored persisted sections: hp_proliant_cpu, hp_proliant_fans, hp_proliant_mem, hp_proliant_power, hp_proliant_psu, hp_proliant_temp, snmp_info, snmp_uptime
Using persisted section SectionName('hp_proliant_cpu')
Using persisted section SectionName('hp_proliant_fans')
Using persisted section SectionName('hp_proliant_mem')
Using persisted section SectionName('hp_proliant_power')
Using persisted section SectionName('hp_proliant_psu')
Using persisted section SectionName('hp_proliant_temp')
Using persisted section SectionName('snmp_info')
Using persisted section SectionName('snmp_uptime')
-> Add sections: ['hp_proliant_cpu', 'hp_proliant_fans', 'hp_proliant_mem', 'hp_proliant_power', 'hp_proliant_psu', 'hp_proliant_temp', 'snmp_info', 'snmp_uptime']
Received no piggyback data
[cpu_tracking] Start [7f2db980bb20]
value store: synchronizing
Trying to acquire lock on /omd/sites/default_site/tmp/check_mk/counters/server-1-iLO
Got lock on /omd/sites/default_site/tmp/check_mk/counters/server-1-iLO
value store: loading from disk
Releasing lock on /omd/sites/default_site/tmp/check_mk/counters/server-1-iLO
Released lock on /omd/sites/default_site/tmp/check_mk/counters/server-1-iLO
HW CPU 0 CPU0 "Intel(R) Xeon(R) Gold 5118 CPU @ 2.30GHz" in slot 0 is in state "ok"
HW CPU 1 CPU1 "Intel(R) Xeon(R) Gold 5118 CPU @ 2.30GHz" in slot 0 is in state "ok"
HW FAN1 (system) FAN Sensor 1 "system", Speed is normal, State is ok
HW FAN2 (system) FAN Sensor 2 "system", Speed is normal, State is ok
HW FAN3 (system) FAN Sensor 3 "system", Speed is normal, State is ok
HW FAN4 (system) FAN Sensor 4 "system", Speed is normal, State is ok
HW FAN5 (system) FAN Sensor 5 "system", Speed is normal, State is ok
HW FAN6 (system) FAN Sensor 6 "system", Speed is normal, State is ok
HW FAN7 (system) FAN Sensor 7 "system", Speed is normal, State is ok
HW Mem 14 Board: 0, Number: 14, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 15 Board: 0, Number: 15, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 16 Board: 0, Number: 16, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 17 Board: 0, Number: 17, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 18 Board: 0, Number: 18, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 19 Board: 0, Number: 19, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 2 Board: 0, Number: 2, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 20 Board: 0, Number: 20, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 21 Board: 0, Number: 21, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 3 Board: 0, Number: 3, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 4 Board: 0, Number: 4, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 5 Board: 0, Number: 5, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 6 Board: 0, Number: 6, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 7 Board: 0, Number: 7, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 8 Board: 0, Number: 8, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW Mem 9 Board: 0, Number: 9, Type: unknown (19), Size: 16.0 GiB, Status: good, Condition: ok
HW PSU 0/1 Chassis 0/Bay 1, State: "ok", Usage: 47 Watts
HW PSU 0/2 Chassis 0/Bay 2, State: "ok", Usage: 40 Watts
HW PSU Total Usage: 87 Watts
HW Power Meter Current reading: 87.00 Watts
SNMP Info Integrated Lights-Out 5 2.33 Dec 09 2020, ilo-au-ev-esxprod01., L2 AHS, NEP IT
Temperature 1 (ambient) 23.0 °C
Temperature 10 (memory) 28.0 °C
Temperature 15 (ambient) 25.0 °C
Temperature 16 (system) 31.0 °C
Temperature 17 (system) 30.0 °C
Temperature 18 (system) 30.0 °C
Temperature 19 (system) 30.0 °C
Temperature 2 (cpu) 40.0 °C
Temperature 20 (system) 30.0 °C
Temperature 21 (system) 31.0 °C
Temperature 22 (system) 39.0 °C
Temperature 23 (system) 67.0 °C
Temperature 24 (system) 37.0 °C
Temperature 29 (system) 31.0 °C
Temperature 3 (cpu) 40.0 °C
Temperature 31 (ioBoard) 31.0 °C
Temperature 33 (ioBoard) 30.0 °C
Temperature 37 (system) 31.0 °C
Temperature 38 (powerSupply) 29.0 °C
Temperature 39 (powerSupply) 31.0 °C
Temperature 4 (memory) 27.0 °C
Temperature 40 (powerSupply) 40.0 °C
Temperature 41 (powerSupply) 40.0 °C
Temperature 42 (powerSupply) 25.0 °C
Temperature 43 (powerSupply) 30.0 °C
Temperature 6 (memory) 28.0 °C
Temperature 8 (memory) 28.0 °C
Uptime Up since Sep 20 2021 07:44:26, Uptime: 1 year 44 days
[cpu_tracking] Stop [7f2db980bb20 - Snapshot(process=posix.times_result(user=0.009999999999999787, system=0.0, children_user=0.0, children_system=0.0, elapsed=0.009999999776482582))]
[snmp] Success, execution time 0.1 sec | execution_time=0.070 user_time=0.060 system_time=0.000 children_user_time=0.000 children_system_time=0.000 cmk_time_snmp=0.010