Monitor CPU over time without thresholds or alerts

We have a High Performance Computing environment where nodes running at high load are not an issue. 100% CPU utilization is common, and we currently disable CPU monitoring because we don’t want the alerts. Is there a way to set up CPU monitoring so that I can get graphs of utilization over time? Please let me know if I’m not explaining this well.

You can use the rule “Enable/disable notifications for services”.
For configuring notifications, the notifications documentation from Checkmk is very good.

If you are using any of the commercial Checkmk editions, you can also use the Service State translation to always show the service as OK.

Usually you can set threshold limits to 101% to avoid WARN and CRIT states.
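
To illustrate why a 101% threshold never fires, here is a minimal sketch of an upper-level check. This is not Checkmk's actual code: the `State` enum, the `cpu_state` function, and the "reaches or exceeds" comparison are assumptions made for illustration only. The point is simply that utilization cannot exceed 100%, so levels set above that are never reached.

```python
from enum import Enum


class State(Enum):
    OK = 0
    WARN = 1
    CRIT = 2


def cpu_state(utilization_percent: float, warn: float, crit: float) -> State:
    """Hypothetical upper-level check (assumption, not Checkmk's implementation).

    A level triggers once the measured value reaches or exceeds it.
    """
    if utilization_percent >= crit:
        return State.CRIT
    if utilization_percent >= warn:
        return State.WARN
    return State.OK


# A fully loaded node with levels at 101%: neither level can be reached.
print(cpu_state(100.0, warn=101.0, crit=101.0))

# The same node with more typical (illustrative) levels of 80% / 90%.
print(cpu_state(100.0, warn=80.0, crit=90.0))
```

The metric is still recorded either way, so the utilization graphs remain available; only the state evaluation changes.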

I would rather use one of the other approaches mentioned, as this one has the biggest potential for confusion.

IMHO, @r.sander has the best approach, because it avoids not only notifications but also state changes, which might be puzzling if you expect high(est) utilization.

Thanks everyone for all the helpful comments! This community is the best!
