Hi all,
Not necessarily looking for an exact answer, but just wanting to get a few responses of how you handle outages of monitoring when you are doing maintenance that requires an outage?
Do you just not have monitoring data during operating system or Checkmk version upgrades?
Do you have a pair of Checkmk appliances in a failover cluster or perhaps have HA implemented at the virtualisation level?
Do you do distributed monitoring with multiple sites that limit the outage time across your whole environment?
Just looking for ideas for how to increase the total availability of monitoring and happy to hear how you’ve chosen to do implement it.