I’m now trying a combination of rules for this server.
Notified Events for Service
- any -> CRIT
- any -> OK
This should only send CRIT and OK notifications.
Enable / Disable Flapping Detection for Service
- Disable flapping detection
The service regularly went into the flapping state because of sudden temperature changes.
Service Period for Services
- With this rule I set up a service / backup / cleanup time period during which no notifications are sent
Many of the notifications I receive come from the service period at night, when backup and cleanup jobs are running.
Maximum Number of Check Attempts for Service
- This rule is set to 10 attempts for all “Temperature Zone” Services
With all of these rules combined, I should have the following setup
- Only receive CRIT or OK messages
- Do not receive any messages at night (cleanup and backup time period)
- Only receive messages after 10 check retries (10 minutes)
- No Flapping detection and notification
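To sanity-check how these rules interact, here is a rough Python sketch of the combined behaviour. It is a simplified model, not the monitoring core's actual logic: the `Service` class, `in_quiet_period`, `process`, and the 22:00-06:00 window are all made up for illustration, and it assumes a problem only becomes "hard" (and eligible for notification) after the configured number of consecutive non-OK checks.

```python
from dataclasses import dataclass
from datetime import time

OK, WARN, CRIT = "OK", "WARN", "CRIT"


@dataclass
class Service:
    max_attempts: int = 10   # "Maximum Number of Check Attempts" from the rule
    hard_state: str = OK     # last confirmed (hard) state
    attempt: int = 0         # consecutive non-OK results while still soft


def in_quiet_period(t, start=time(22, 0), end=time(6, 0)):
    """True inside the nightly window. The 22:00-06:00 range is hypothetical."""
    return t >= start or t < end


def process(svc, result, t):
    """Feed one check result; return the notification text, or None."""
    previous = svc.hard_state
    if result == OK:
        # Recovery is accepted immediately and resets the retry counter.
        svc.attempt = 0
        svc.hard_state = OK
    elif previous == OK:
        # New problem: stay soft until max_attempts is reached.
        svc.attempt += 1
        if svc.attempt < svc.max_attempts:
            return None
        svc.hard_state = result
    else:
        # Already in a hard problem state: WARN <-> CRIT applies directly.
        svc.hard_state = result

    if svc.hard_state == previous:
        return None                      # no hard state change
    if svc.hard_state not in (OK, CRIT):
        return None                      # only "any -> CRIT" and "any -> OK"
    if in_quiet_period(t):
        return None                      # nightly backup / cleanup window
    return f"{previous} -> {svc.hard_state}"


if __name__ == "__main__":
    svc = Service()
    noon = time(12, 0)
    for _ in range(10):
        print(process(svc, CRIT, noon))
    print(process(svc, OK, noon))
```

In this model a short temperature spike (a few CRIT results followed by OK) produces no message at all, and nothing is sent inside the nightly window. With a one-minute check interval, which the list above implies, the 10 attempts work out to roughly 10 minutes before the first CRIT message.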
If this is not enough, I will probably disable the WARN values (if possible) for this server.
This should lead to only OK->CRIT and CRIT->OK events.
I will let this run for a few days and will report back afterwards.
Thank you everyone for your help.