Raid Information per IPMI

Wie in dem Thread hier geschrieben würde es mich interessieren wie per Free oder Open-IPMI Informationen von Raids ordentlich ausgelesen werden können.
Das wäre für Systeme relevant welche nicht richtig (weil zu alt oder sonstiges) per Redfish ausgelesen werden können.

Wenn das ohne große Probleme ginge wäre für ältere Hardware kein SNMP notwendig.
Der User welcher ja für IPMI verwendet wird sollte ja wie im Redfish ein User sein welcher sich im BMC anmelden kann.

Gruß
Andreas

Hi Andreas, wie versprochen hier die Details:

So rufen wir den special agenten auf:

/omd/sites/AUG/share/check_mk/agents/special/agent_ipmi_sensors '10.20.30.40' 'user' 'password' 'freeipmi' 'user' '--driver_type' 'LAN_2_0' '--sdr_cache_recreate' '--interpret_oem_data' '--output_sensor_state' '--output_sensor_thresholds'

Ältere Hardware unterstützt kein ‘–driver_type’ ‘LAN_2_0’. Wir haben deshalb zwei Hosttypen. Die NG Variante unterstütz kein LAN_2_0.

Hier die Matrix für die Modelle die wir derzeit getestet haben:

Hardware Host Type
M3 IPMI Server Hardware
M4 IPMI Server Hardware NG
M5 IPMI Server Hardware NG
SR650 IPMI Server Hardware NG
DELL DRAC IPMI Server Hardware

Hier der Agent Output von einer Lenovo SR650. Der kann direkt für die Simulation in checkmk verwendet werden.

<<<ipmi_sensors:sep(124)>>>
2   | CPUs             | Critical Interrupt          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
3   | All CPUs         | Processor                   | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
4   | One of CPUs      | Processor                   | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
5   | CPU 1 Status     | Processor                   | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Processor Presence detected'
6   | CPU 2 Status     | Processor                   | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
11  | PSU Mismatch     | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
12  | PS Heavy Load    | Power Unit                  | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'State Deasserted'
13  | Power Resource   | Power Unit                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
14  | Power Supply 1   | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Presence detected'
15  | PSU1 Failure     | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
16  | PSU1 PF Failure  | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
17  | PSU1 IN Failure  | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
18  | PSU1 AC In Pwr   | Current                     | Nominal  | 60.00      | W           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
19  | PSU1 DC Out Pwr  | Current                     | Nominal  | 50.00      | W           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
20  | Power Supply 2   | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Presence detected'
21  | PSU2 Failure     | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
22  | PSU2 PF Failure  | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
23  | PSU2 IN Failure  | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
24  | PSU2 AC In Pwr   | Current                     | Nominal  | 55.00      | W           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
25  | PSU2 DC Out Pwr  | Current                     | Nominal  | 45.00      | W           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
26  | IPMI Watchdog    | Watchdog 2                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
27  | SEL Fullness     | Event Logging Disabled      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
28  | Host Power       | Power Unit                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
29  | All DIMMs        | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
30  | DIMMs            | Critical Interrupt          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
31  | One of DIMMs     | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
32  | Backup Memory    | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
33  | Lane Failover    | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
34  | Memory Resized   | POST Memory Resize          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
35  | Non-Auth DIMMs   | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
36  | SMI Lane Failed  | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
37  | Aux Log          | System Event                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
38  | Firmware Error   | System Firmware Progress    | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
39  | GPT Status       | System Firmware Progress    | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
40  | No Boot Device   | Boot Error                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
41  | No Op ROM Space  | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
42  | Progress         | System Firmware Progress    | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
43  | Sec Rollback Err | System Firmware Progress    | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
44  | Secure Boot Err  | System Firmware Progress    | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
45  | Sys Boot Status  | System Firmware Progress    | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
46  | TPM_TCM Init Err | System Firmware Progress    | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
47  | TPM_TCM Lock     | Button/Switch               | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
48  | Phy Presence Set | Button/Switch               | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
49  | TXT ACM Module   | System Firmware Progress    | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
50  | SMM Mode         | System Event                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
51  | SMM Monitor      | System Event                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
52  | XCC FW Corrupted | System Firmware Progress    | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
53  | XCC Switchover   | Version Change              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
54  | Low Security Jmp | Management Subsystem Health | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Device Removed/Device Absent'
55  | Phy Presence Jmp | System Firmware Progress    | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Device Removed/Device Absent'
56  | XCC DB Status    | Management Subsystem Health | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
57  | NMI State        | Critical Interrupt          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
58  | PCI 1 Fault      | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
59  | PCI 2 Fault      | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
60  | PCI 3 Fault      | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
61  | PCI 4 Fault      | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
62  | PCI 5 Fault      | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
63  | PCI 6 Fault      | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
64  | PCI 7 Fault      | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Slot/Connector Device installed/attached'
65  | PCIs             | Critical Interrupt          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
66  | All PCI Error    | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
67  | One of PCIs      | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
68  | No I/O Resources | System Firmware Progress    | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
69  | Front Panel      | Entity Presence             | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Entity Present'
70  | COMport Card     | Cable/Interconnect          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
71  | BP 1 Status      | Entity Presence             | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Entity Present'
72  | BP 2 Status      | Entity Presence             | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
73  | BP 3 Status      | Entity Presence             | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
74  | Rear BP Status   | Entity Presence             | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
75  | RAID Vol State   | Drive Slot                  | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
76  | Ext Drive        | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
77  | Ext Enclosure PS | Power Supply                | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
78  | RAID Log         | System Event                | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
79  | Front Video      | Cable/Interconnect          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
80  | PHY Card Fault   | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Slot/Connector Device installed/attached'
81  | M2 Card Fault    | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
82  | M2 Drive 0       | Drive Slot                  | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
83  | M2 Drive 1       | Drive Slot                  | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
84  | PCI Riser 1      | Entity Presence             | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
85  | PCI Riser 2      | Entity Presence             | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
86  | ML2 Card Fault   | Slot/Connector              | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
87  | Pwr Rail ABC Err | Voltage                     | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
88  | Pwr Rail DEF Err | Voltage                     | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
89  | Pwr Rail GHJ Err | Voltage                     | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
90  | Pwr Rail KLM Err | Voltage                     | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
91  | CPU1 VR          | Voltage                     | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
92  | CPU2 VR          | Voltage                     | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
93  | SysBrd Vol Fault | Voltage                     | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
94  | CMOS Battery     | Voltage                     | Nominal  | 3.15       | V           | N/A        | 2.25       | 2.39       | N/A        | N/A        | N/A        | 'OK'
95  | SysBrd 3.3V      | Voltage                     | Nominal  | 3.33       | V           | N/A        | 2.98       | N/A        | N/A        | 3.63       | N/A        | 'OK'
96  | SysBrd 5V        | Voltage                     | Nominal  | 5.03       | V           | N/A        | 4.49       | N/A        | N/A        | 5.50       | N/A        | 'OK'
97  | SysBrd 12V       | Voltage                     | Nominal  | 12.10      | V           | N/A        | 10.81      | N/A        | N/A        | 13.22      | N/A        | 'OK'
98  | Sys Utilization  | Other Units Based Sensor    | Nominal  | 17.00      | %           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
99  | CPU Utilization  | Other Units Based Sensor    | Nominal  | 23.00      | %           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
100 | Mem Utilization  | Other Units Based Sensor    | Nominal  | 3.00       | %           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
101 | IO Utilization   | Other Units Based Sensor    | Nominal  | 0.00       | %           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
102 | ME Status        | Management Subsystem Health | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
103 | Sys Power        | Current                     | Nominal  | 125.00     | W           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
104 | CPU Power        | Current                     | Nominal  | 36.00      | W           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
105 | Mem Power        | Current                     | Nominal  | 6.00       | W           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
106 | CPU 1 PECI       | Critical Interrupt          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
107 | CPU 2 PECI       | Critical Interrupt          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
108 | CPU 1 QPILinkErr | Cable/Interconnect          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
109 | CPU 2 QPILinkErr | Cable/Interconnect          | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
110 | CPU1 OverTemp    | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
111 | CPU1 Temp        | Temperature                 | Nominal  | 31.00      | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
112 | CPU1 DTS         | Temperature                 | Nominal  | -51.00     | unspecified | N/A        | N/A        | N/A        | N/A        | -0.20      | 0.00       | 'OK'
113 | CPU2 OverTemp    | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
114 | CPU2 Temp        | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
115 | CPU2 DTS         | Temperature                 | N/A      | N/A        | unspecified | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
116 | RAID Card 1 Ctrl | Add In Card                 | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
117 | RAID Card 2 Ctrl | Add In Card                 | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
118 | RAID Card 3 Ctrl | Add In Card                 | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
119 | RAID Card 4 Ctrl | Add In Card                 | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
120 | RAID Card 5 Ctrl | Add In Card                 | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
121 | RAID Card 6 Ctrl | Add In Card                 | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
122 | Lom link state   | LAN                         | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
123 | DIMMs Disabled   | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
158 | Drive 0          | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Drive Presence'
159 | Drive 1          | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Drive Presence'
160 | Drive 2          | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Drive Presence'
161 | Drive 3          | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Drive Presence'
162 | Drive 4          | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Drive Presence' 'Hot Spare'
163 | Drive 5          | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
164 | Drive 6          | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
165 | Drive 7          | Drive Slot                  | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
166 | Drive Mismatch   | Drive Slot                  | N/A      | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
167 | DIMM 1           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Presence detected'
168 | DIMM 1 Temp      | Temperature                 | Nominal  | 27.00      | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
171 | DIMM 2           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
172 | DIMM 2 Temp      | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
180 | DIMM 3           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Presence detected'
189 | DIMM 3 Temp      | Temperature                 | Nominal  | 27.00      | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
191 | DIMM 4           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
192 | DIMM 4 Temp      | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
194 | DIMM 5           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Presence detected'
195 | DIMM 5 Temp      | Temperature                 | Nominal  | 27.00      | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
197 | DIMM 6           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
198 | DIMM 6 Temp      | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
200 | DIMM 7           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
201 | DIMM 7 Temp      | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
203 | DIMM 8           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'Presence detected'
204 | DIMM 8 Temp      | Temperature                 | Nominal  | 25.00      | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
206 | DIMM 9           | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
207 | DIMM 9 Temp      | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
209 | DIMM 10          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
210 | DIMM 10 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
212 | DIMM 11          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
213 | DIMM 11 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
215 | DIMM 12          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
216 | DIMM 12 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
218 | DIMM 13          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
219 | DIMM 13 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
221 | DIMM 14          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
222 | DIMM 14 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
224 | DIMM 15          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
225 | DIMM 15 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
227 | DIMM 16          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
228 | DIMM 16 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
232 | DIMM 17          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
233 | DIMM 17 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
236 | DIMM 18          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
237 | DIMM 18 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
239 | DIMM 19          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
240 | DIMM 19 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
242 | DIMM 20          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
243 | DIMM 20 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
245 | DIMM 21          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
246 | DIMM 21 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
248 | DIMM 22          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
249 | DIMM 22 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
251 | DIMM 23          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
252 | DIMM 23 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
254 | DIMM 24          | Memory                      | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
255 | DIMM 24 Temp     | Temperature                 | N/A      | N/A        | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
257 | PCH OverTemp     | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
258 | PCH Temp         | Temperature                 | Nominal  | 49.00      | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
259 | Ambient Temp     | Temperature                 | Nominal  | 23.00      | C           | N/A        | N/A        | N/A        | 43.00      | 47.00      | 50.00      | 'OK'
260 | PCI 1 OverTemp   | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
261 | PCI 2 OverTemp   | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
262 | PCI 3 OverTemp   | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
263 | PCI 4 OverTemp   | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
264 | PCI 5 OverTemp   | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
265 | PCI 6 OverTemp   | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
266 | PCI 7 OverTemp   | Temperature                 | Nominal  | N/A        | N/A         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'transition to OK'
267 | Fan 1 Tach       | Fan                         | Nominal  | 3339.00    | RPM         | N/A        | 630.00     | N/A        | N/A        | N/A        | N/A        | 'OK'
268 | Fan 2 Tach       | Fan                         | Nominal  | 3339.00    | RPM         | N/A        | 630.00     | N/A        | N/A        | N/A        | N/A        | 'OK'
269 | Fan 3 Tach       | Fan                         | Nominal  | 3339.00    | RPM         | N/A        | 630.00     | N/A        | N/A        | N/A        | N/A        | 'OK'
270 | Fan 4 Tach       | Fan                         | Nominal  | 3339.00    | RPM         | N/A        | 630.00     | N/A        | N/A        | N/A        | N/A        | 'OK'
271 | Fan 5 Tach       | Fan                         | Nominal  | 3339.00    | RPM         | N/A        | 630.00     | N/A        | N/A        | N/A        | N/A        | 'OK'
272 | Fan 6 Tach       | Fan                         | N/A      | N/A        | RPM         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | N/A
273 | System Air Flow  | Cooling Device              | Nominal  | 35.00      | CFM         | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
274 | Exhaust Temp     | Temperature                 | Nominal  | 28.00      | C           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'
275 | Sys Fan Pwr      | Current                     | Nominal  | 4.00       | W           | N/A        | N/A        | N/A        | N/A        | N/A        | N/A        | 'OK'

Hallo Andreas,

ich habe mir mal bei alten HP-Servern mit cpqacuxe oder ssacli die Informationen aus dem installierten Betriebssystem besorgt. Allerdings nicht für Checkmk sondern um Konfigurationen zu übertragen. Wenn der BMC eine einigermaßen brauchbare “Shell” zur Verfügung stellt, kann man das bestimmt auch über Telnet/SSH herauskriegen. Die Frage ist: Wie alt ist “alt” und wieviel Energie möchte man aufwenden?

Gruß
Stefan

Die Frage war hier eher - krieg ich per IPMI gleichwertige Daten wie über Redfish oder SNMP. Oder vielleicht besser gesagt wie ist die Datenqualität der verschiedenen verfügbaren Management Protokolle.
Hier will ich explizit OS Tools ausschließen.

Ich denke mal bei @mike1098 ist es auch so, dass hier das OS selbst möglichst nicht angetastet werden soll für das Hardware Monitoring. So halte ich es halt.

Der Output passt mit dem zusammen was ich so kenne. Für die meisten meiner Anwendungsfälle ist mir der Inhalt zu wenig oder zu unspezifisch.
Bei den meisten Einträgen beschränkt sich die Info einfach auf “OK” oder nicht. Wenn das reicht dann ist das bestimmt auch per IMPI alles abbildbar.
Am besten finde ich immer den Status “Transition to ok”.

Ich hänge mal als Beispiel die Daten von einem einzelnen Memory Modul an welches ich per Redfish abfage.
Ob halt Daten so ausführlich relevant sind oder nicht muss jeder für sich oder seine Umgebung entscheiden.
Hab auch mal die einzelnen Hersteller ran geschrieben um zu sehen wie unterschiedlich da auch die Daten sein können.

# HPE Memory
{
    "@odata.context": "/redfish/v1/$metadata#Memory.Memory",
    "@odata.etag": 'W/"E6EC3A2C"',
    "@odata.id": "/redfish/v1/Systems/1/Memory/proc1dimm12",
    "@odata.type": "#Memory.v1_7_1.Memory",
    "Id": "proc1dimm12",
    "BaseModuleType": "RDIMM",
    "BusWidthBits": 72,
    "CacheSizeMiB": 0,
    "CapacityMiB": 32768,
    "DataWidthBits": 64,
    "DeviceLocator": "PROC 1 DIMM 12",
    "ErrorCorrection": "MultiBitECC",
    "LogicalSizeMiB": 0,
    "Manufacturer": "HPE",
    "MemoryDeviceType": "DDR4",
    "MemoryLocation": {"Channel": 3, "MemoryController": 1, "Slot": 12, "Socket": 1},
    "MemoryMedia": ["DRAM"],
    "MemoryType": "DRAM",
    "Name": "proc1dimm12",
    "NonVolatileSizeMiB": 0,
    "Oem": {
        "Hpe": {
            "@odata.context": "/redfish/v1/$metadata#HpeMemoryExt.HpeMemoryExt",
            "@odata.type": "#HpeMemoryExt.v2_5_0.HpeMemoryExt",
            "Attributes": ["HpeSmartMemory"],
            "BaseModuleType": "RDIMM",
            "DIMMManufacturingDate": "1746",
            "DIMMStatus": "GoodInUse",
            "MaxOperatingSpeedMTs": 2666,
            "MinimumVoltageVoltsX10": 12,
            "VendorName": "Micron",
        }
    },
    "OperatingMemoryModes": ["Volatile"],
    "OperatingSpeedMhz": 2666,
    "PartNumber": "36ASF4G72PZ-2G6D1  ",
    "PersistentRegionSizeLimitMiB": 0,
    "RankCount": 2,
    "SecurityCapabilities": {},
    "SerialNumber": "19A45CE9",
    "Status": {"Health": "OK", "State": "Enabled"},
    "VendorID": "11264",
    "VolatileRegionSizeLimitMiB": 32768,
    "VolatileSizeMiB": 32768,
}
# Dell Memory
{
    "@odata.context": "/redfish/v1/$metadata#Memory.Memory",
    "@odata.type": "#Memory.v1_9_2.Memory",
    "@odata.id": "/redfish/v1/Systems/System.Embedded.1/Memory/DIMM.Socket.A5",
    "AllowedSpeedsMHz": [3200],
    "AllowedSpeedsMHz@odata.count": 1,
    "Assembly": {
        "@odata.id": "/redfish/v1/Systems/System.Embedded.1/Memory/DIMM.Socket.A5/Assembly"
    },
    "BaseModuleType": None,
    "BusWidthBits": 72,
    "CacheSizeMiB": 0,
    "CapacityMiB": 65536,
    "DataWidthBits": 64,
    "Description": "DIMM A5",
    "DeviceLocator": "DIMM A5",
    "ErrorCorrection": "MultiBitECC",
    "FirmwareRevision": None,
    "Id": "DIMM.Socket.A5",
    "Links": {
        "Chassis": {"@odata.id": "/redfish/v1/Chassis/System.Embedded.1"},
        "Oem": {
            "Dell": {
                "@odata.type": "#DellOem.v1_2_0.DellOemLinks",
                "CPUAffinity": [
                    {
                        "@odata.id": "/redfish/v1/Systems/System.Embedded.1/Processors/CPU.Socket.1"
                    }
                ],
                "CPUAffinity@odata.count": 1,
            }
        },
    },
    "LogicalSizeMiB": 0,
    "Manufacturer": "Hynix Semiconductor",
    "MaxTDPMilliWatts": [],
    "MaxTDPMilliWatts@odata.count": 0,
    "MemoryDeviceType": "DDR4",
    "MemorySubsystemControllerManufacturerID": None,
    "MemorySubsystemControllerProductID": None,
    "MemoryType": "DRAM",
    "Metrics": {
        "@odata.id": "/redfish/v1/Systems/System.Embedded.1/Memory/DIMM.Socket.A5/MemoryMetrics"
    },
    "ModuleManufacturerID": None,
    "ModuleProductID": None,
    "Name": "DIMM A5",
    "NonVolatileSizeMiB": 0,
    "Oem": {
        "Dell": {
            "@odata.type": "#DellOem.v1_2_0.DellOemResources",
            "DellMemory": {
                "BankLabel": "A",
                "Id": "DIMM.Socket.A5",
                "LastSystemInventoryTime": "2023-08-03T05:36:12+00:00",
                "LastUpdateTime": "2021-01-04T23:35:24+00:00",
                "ManufactureDate": "Mon Nov 02 13:00:00 2020 UTC",
                "MemoryTechnology": "DRAM",
                "Model": "DDR4 DIMM",
                "Name": "DellMemory",
                "RemainingRatedWriteEndurancePercent": None,
                "SystemEraseCapability": "NotSupported",
                "@odata.context": "/redfish/v1/$metadata#DellMemory.DellMemory",
                "@odata.type": "#DellMemory.v1_1_0.DellMemory",
                "@odata.id": "/redfish/v1/Systems/System.Embedded.1/Memory/DIMM.Socket.A5/Oem/Dell/DellMemory/DIMM.Socket.A5",
            },
        }
    },
    "OperatingMemoryModes": ["Volatile"],
    "OperatingMemoryModes@odata.count": 1,
    "OperatingSpeedMhz": 3200,
    "PartNumber": "HMAA8GR7AJR4N-XN",
    "RankCount": 2,
    "SerialNumber": "2265D090",
    "Status": {"Health": "OK", "State": "Enabled"},
    "VolatileSizeMiB": 65536,
}
# Supermicro
{
    "@odata.context": "/redfish/v1/$metadata#Memory.Memory",
    "@odata.etag": '"1678366478"',
    "@odata.id": "/redfish/v1/Systems/Self/Memory/DevType2_DIMM13",
    "@odata.type": "#Memory.v1_8_0.Memory",
    "Actions": {
        "Oem": {
            "#AmiBios.ChangeState": {
                "@Redfish.ActionInfo": "/redfish/v1/Systems/Self/Memory/DevType2_DIMM13/ChangeStateActionInfo",
                "target": "/redfish/v1/Systems/Self/Memory/DevType2_DIMM13/Actions/AmiBios.ChangeState",
            }
        }
    },
    "AllowedSpeedsMHz": [3200],
    "BaseModuleType": "RDIMM",
    "BusWidthBits": 72,
    "CacheSizeMiB": 0,
    "CapacityMiB": 8192,
    "DataWidthBits": 64,
    "DeviceLocator": "DIMM_G2",
    "FirmwareRevision": "Unknown",
    "Id": "DevType2_DIMM13",
    "Links": {"Chassis": {"@odata.id": "/redfish/v1/Chassis/Self"}},
    "LogicalSizeMiB": 0,
    "Manufacturer": "ATP Electronics Inc",
    "MemoryDeviceType": "DDR4",
    "MemoryLocation": {"Channel": 0, "Slot": 1, "Socket": 1},
    "MemorySubsystemControllerManufacturerID": "0x0000",
    "MemorySubsystemControllerProductID": "0x0000",
    "MemoryType": "DRAM",
    "ModuleManufacturerID": "0xE386",
    "ModuleProductID": "0x0000",
    "Name": "DevType2_DIMM13",
    "NonVolatileSizeMiB": 0,
    "OperatingMemoryModes": ["Volatile"],
    "OperatingSpeedMhz": 3200,
    "PartNumber": "X4B08QD8BNWESO-7-TN1",
    "RankCount": 1,
    "SecurityCapabilities": {
        "ConfigurationLockCapable": False,
        "DataLockCapable": False,
        "PassphraseCapable": False,
    },
    "SerialNumber": "20D4C5FD",
    "Status": {"Health": "OK", "State": "Enabled"},
    "VolatileSizeMiB": 8192,
}
# Cisco
{
    "@odata.context": "/redfish/v1/$metadata#Memory.Memory",
    "@odata.id": "/redfish/v1/Systems/WZP26490C67/Memory/DIMM_P1_G1",
    "@odata.type": "#Memory.v1_7_0.Memory",
    "CapacityMiB": 16384,
    "DataWidthBits": 64,
    "Description": "Computer Memory",
    "DeviceLocator": "DIMM_P1_G1",
    "Id": "13",
    "Links": {"Chassis": {"@odata.id": "/redfish/v1/Chassis/1"}},
    "Manufacturer": "0x2C00",
    "MemoryDeviceType": "DDR4",
    "MemoryLocation": {"Channel": 6, "Slot": 0, "Socket": 0},
    "MemoryMedia": ["DRAM"],
    "MemoryType": "DRAM",
    "Name": "DIMM_P1_G1",
    "OperatingMemoryModes": ["Volatile"],
    "OperatingSpeedMhz": 3200,
    "PartNumber": "18ASF2G72PZ-3G2R1   ",
    "SecurityCapabilities": {"PassphraseCapable": False},
    "SerialNumber": "2249-3CF55BE0",
    "Status": {"State": "Enabled", "Health": "OK"},
}

Generell ist es richtig dass IPMI natürlich nur die Daten liefern kann für die Sensoren die der Hersteller im BMC implementiert hat. Das wird aber bei RedFish etc. auch nicht anders sein.

Wie freeipmi die vom Sensor gelieferten messages interpretiert kann man unter /etc/freeipmi/freeipmi_interpret_sel.conf einsehen.
So wie ich IPMI verstanden habe liefert es ‘nur’ events und keine ‘states’ der Sensoren so wie wir es von Nagios her kennen. Wenn sich ein Sensor Status geändert hat, dann wird eine neue Nachricht rausgehauen. Daher ‘Transition to’. Ich hatte da auch längere Disussionen mit den Entwicklern des special agents.

Die Spezifikation zu IPMI v2 kann man sich hier ansehen:

https://www.intel.com/content/dam/www/public/us/en/documents/product-briefs/ipmi-second-gen-interface-spec-v2-rev1-1.pdf?mwg_rnd=8863218

Nicht ganz, was du im Output siehst sind schon die aktuellen Statusinformationen. Hast ja auch extra beim Aufruf “output_sensor_state” dabei.

Laut Spec hat das nix mit einer Nachricht zu tun. Besonders “lustig” finde ich hier das Kleingedruckte am Ende.

This is used to know if an entity is in trouble or other state information.
Valid values are:
00h - transition to ok
01h - transition to non-critical from ok.
02h - transition to critical from less critical.
03h - transition to non-recoverable from less critical.
04h - transition to non-critical from more critical.
05h - transition to critical from non-recoverable.
06h - transition to non-recoverable.1
07h - monitor
08h - informational

The actual meaning of these is not defined by the spec

Das bestätigt wieder meine Meinung zu IPMI Ausgabe - meist ist es ein lustiges Ratespiel was wirklich gerade los ist.

Alles was ich dazu noch sagen möchte ist dass wir IPMI nun seit über einer Dekade bei uns im großen Stil erfolgreich einsetzen. Wir hatten unter Nagios auch schon ein angepasstes plugin von Tomas Krenn im Einsatz.
Z.t. haben wir 1124 Server mit LAN_2.0 und 331 ältere Server in checkmk weltweit.
Das checkmk plugin hatte von Version zu Version einige Probleme mit der Interpretation der freeipmi Ergebnisse. Wir hatten das jeweils von einem externen Entwickler beheben lassen und seit der 2.0.0p36 sollte das endlich auch im offiziellen Code behoben sein.

Ich glaube ich habe damit alle Informationen geliefert und bin damit auch raus.

Viel Erfolg

Michael