After installing the update, I see a problem with the service alive checks on Linux hosts (check_mk-systemd_units_services_summary and check_mk-systemd_units_sockets_summary).
The output is no longer “readable”, as only the count of failed services is shown, but not the actual names of the failed services:
“Total: 186, Disabled: 9, Failed: 1 CRIT”
I have an enterprise version, so the agents should update automatically. But even when doing a manual update of the agent, the problem persists.
I think even the number of failed services is a “false positive”.
I think I am facing a similar situation on my Check-MK instance.
I upgraded it from “Checkmk Raw Edition 2.3.0p23” to “Checkmk Raw Edition 2.3.0p27”.
After the upgrade to version 2.3.0p27, some of the monitored Linux servers show the following information:
Systemd Service Summary Total: 169, Disabled: 9, Failed: 2 CRIT
What I found out is the following:
All affected servers have an “Exclude units matching provided regex patterns” rule to ignore failed services. It now seems that the rule no longer works. It still recognizes the excluded service (which is why the name of the failed service isn’t shown), but it counts the service as critical and triggers an alert instead of ignoring it.
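For illustration, the behavior expected from the exclude rule can be sketched in Python as follows. This is not Checkmk's implementation: the function `summarize_units`, the unit names, and the use of `re.match` (matching from the start of the unit name) are all assumptions made for this example. The point is only that a failed unit matching an exclude pattern should not contribute to the CRIT state.

```python
import re


def summarize_units(units, exclude_patterns):
    """Hypothetical summary logic, NOT Checkmk's actual code.

    units: dict mapping unit name -> state ("active", "failed", ...)
    exclude_patterns: list of regex strings; matching units are ignored
    """
    compiled = [re.compile(p) for p in exclude_patterns]
    failed = []    # failed units that should raise an alert
    excluded = []  # failed units suppressed by an exclude pattern

    for name, state in sorted(units.items()):
        if state != "failed":
            continue
        # Assumption: a pattern matches from the beginning of the unit name.
        if any(rx.match(name) for rx in compiled):
            excluded.append(name)
        else:
            failed.append(name)

    return {
        "total": len(units),
        "failed": failed,
        "excluded": excluded,
        # Only non-excluded failures should make the summary critical.
        "status": "CRIT" if failed else "OK",
    }


# Made-up example data: one failed unit that the rule is meant to ignore.
units = {
    "sshd.service": "active",
    "cron.service": "active",
    "backup.service": "failed",
}

with_rule = summarize_units(units, [r"backup\.service$"])
without_rule = summarize_units(units, [])
```

With the exclude pattern in place, `with_rule["status"]` is `"OK"` and the failed unit only appears under `excluded`. Without the pattern, `without_rule["status"]` is `"CRIT"` and the unit name is listed under `failed`. The behavior reported above looks like a mix of the two: the name is hidden as if excluded, but the state is still CRIT.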
I hope this information will help to fix this bug.
I had been using the rules for several months before upgrading to 2.3.0p27, and they always worked fine.
If you want, I can send you some screenshots from my system (as a new user, I’m not allowed to upload files directly to this post).
If one of the answers helped you solve your question, please mark it as the solution. This way, you thank the person who helped you and also indicate that the question has been resolved. This, in turn, helps others who come across the same question.