Distributed monitoring - Activating changes for multiple remotes stuck indefinitely

CMK version:
Checkmk Enterprise Edition 2.3.0p11

OS version:
Ubuntu 22.04.3 LTS

Error message:
The activiation process mostly gets stuck during the “synchronising” step to the remotes without any error (see screenshot). Sometimes it gets stuck a step later during the
“activating” step of the remotes. It does not have to affect all remotes on the same time, mostly one remote is working (without any pattern) and the other remotes are getting stuck.

If I apply the changes to the local site central first and afterwards to one remote after the other it works without any problems. So I think there is no general problem in the configuration/synchronisation, but a problem when activating multiple remotes at once (maybe a race condition?). I think it started after I added the third remote.

I already waited more than 30 minutes without any visible change in the ui. So once stuck I have to delete the folders /opt/omd/sites/central/tmp/check_mk/wato/activation/* and to start the activiation process again.

My setup is as followed:
location 1: local site central and remote site st001 (both on same server)
location 2: remote site st002 (accessed via vpn)
location 3: remote site st003 (accessed via vpn)

The vpn connection to location 2 and 3 is fast and stable so there is no issue with that. I tried with and without “Persistent Connection” and with increasing timeout limits, but without luck.

1 Like

Any updates on this? Can someone take a look into that?

This topic was automatically closed 365 days after the last reply. New replies are no longer allowed. Contact an admin if you think this should be re-opened.