CMK 2.3p39, redfish 2.3.77, Dell iDRAC fw 7.20.60.50: services keep vanishing/reappearing

CMK version: 2.3p39
OS version: virt1

There seems to be an odd behaviour with Dell iDRAC fw 7.20.60.50: certain services keep vanishing (→ UNKN), become available again after a few check attempts, then vanish again a few checks later, and so on.

The most affected services are Temp, Fan, Memory and hard drives.

Has anyone seen this too?

BR
Thomas

I think the problem here is the iDRAC version. iDRAC 7 supports only some Redfish functions, not all. With iDRAC 7 it is the same problem as with iLO 4: some services work without any problem and some show things like your UNKN state.

If I have such a problematic interface in my monitoring, I only select the system state to be fetched. If there is a problem, the rollup state changes and I have to look at the management interface directly.
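
A minimal Python sketch of the rollup idea, assuming the standard Redfish `Status` object with its `Health` and `HealthRollup` fields on the system resource. The function name `rollup_state`, the sample payload and the mapping to monitoring state numbers are assumptions for illustration; this is not how the redfish plugin is implemented.

```python
# Hypothetical helper, not part of the Checkmk redfish plugin.
# Maps the Redfish Status object of a system resource to a monitoring state.

# Monitoring state numbers: 0 = OK, 1 = WARN, 2 = CRIT, 3 = UNKNOWN
HEALTH_TO_STATE = {"OK": 0, "Warning": 1, "Critical": 2}


def rollup_state(system: dict) -> tuple[int, str]:
    """Return (state, summary) from a parsed Redfish system resource."""
    status = system.get("Status") or {}
    # HealthRollup covers the resource and its subordinates; fall back to Health.
    health = status.get("HealthRollup") or status.get("Health")
    state = HEALTH_TO_STATE.get(health, 3)
    return state, f"Rollup health: {health or 'not reported'}"


# Sample payload shaped like a Redfish system resource (values invented).
sample = {
    "Id": "System.Embedded.1",
    "Status": {"State": "Enabled", "Health": "OK", "HealthRollup": "Warning"},
}
print(rollup_state(sample))  # (1, 'Rollup health: Warning')
```

With only this one value monitored, a failing fan or DIMM still shows up as a changed rollup, but the details have to be read from the management interface itself.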

I think there is no real solution for problems with older management interfaces.

Hi andi

do you mean iDRAC version or iDRAC fw version?

Best

Thomas

Sorry, your system is an iDRAC 9, right?

If it is iDRAC 9 and some services go UNKN from time to time and later return to OK, please check the following things.

Firmware inventory needs to be fetched with a high cache time or, if it is not needed, disabled.

For the Checkmk service of these devices it can also help to set the maximum timeout to 120 seconds. I had some Dell systems that responded quickly when only the normal data was fetched, but took over 1 minute with firmware inventory.
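
To see which sections are the slow ones, the individual Redfish paths can be timed one by one. A rough Python sketch under some assumptions: `slow_paths` is a hypothetical helper, `fake_fetch` only stands in for a real HTTP GET against the iDRAC (its delays are invented), and the two paths are examples that may differ on your systems.

```python
import time
from typing import Callable, Iterable


def slow_paths(
    fetch: Callable[[str], object], paths: Iterable[str], limit_s: float
) -> dict[str, float]:
    """Call fetch(path) for each path and return those slower than limit_s."""
    slow = {}
    for path in paths:
        start = time.monotonic()
        fetch(path)
        elapsed = time.monotonic() - start
        if elapsed > limit_s:
            slow[path] = elapsed
    return slow


# Stand-in for a real HTTP GET against the iDRAC; the delays are invented.
def fake_fetch(path: str) -> None:
    time.sleep(0.2 if "FirmwareInventory" in path else 0.01)


paths = [
    "/redfish/v1/Chassis/System.Embedded.1/Thermal",
    "/redfish/v1/UpdateService/FirmwareInventory",
]
result = slow_paths(fake_fetch, paths, limit_s=0.1)
print(sorted(result))  # only the firmware inventory path exceeds the limit
```

Replacing `fake_fetch` with a real request function and a limit close to the configured timeout shows whether firmware inventory alone is what pushes the check over the edge.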

Hi andi

Yeah, iDRAC 9 with fw 7.xxx.xxx, but it seems not all of those are affected.
Atm it seems to make a difference how many hops the iDRACs are away from the Checkmk server.
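
One rough way to compare near and far iDRACs is to measure the TCP connect time to the HTTPS port from the Checkmk server. A small Python sketch, assuming port 443 and a hypothetical helper name `connect_time`; it only measures the connection setup, not the Redfish response time itself.

```python
import socket
import time


def connect_time(host: str, port: int = 443, timeout: float = 5.0) -> float:
    """Return the seconds needed to open a TCP connection to host:port."""
    start = time.monotonic()
    # create_connection raises OSError if the host is unreachable or times out.
    with socket.create_connection((host, port), timeout=timeout):
        return time.monotonic() - start


# Usage (host name is a placeholder):
#   print(connect_time("idrac-host.example"))
```

Running this a few times per host and comparing affected with unaffected iDRACs should show whether the distance really correlates with the UNKN services.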

We will try it out and report back ^^

Best
Tom