**CMK version:2.1.0p38.cee Operating system version: CentOS7
Our Check_Mk is slow. Any change takes a lot of time. And sometimes the server also crashes and is displayed as dead.
If I use omd status, I see that some services restart randomly
Where should I look, to debug performance in checkmk ?
Regarding the above i would recommend starting with updates/upgrades, as newer version solve a lot of issues.
Next to that you have not given any detail regarding the sizing of the server CMK is running on, making it very hard for us on the forum to even make a wild guess.
Hi, we are running about 300 instances with 2.1.0 on CentOS 7 without any issues. I would recommend to start looking at the log files in the ~/var/log of the site user. It depends on the service which is crashing but I recommend starting with web.log and cmc.log. In general settings you can even increase the log level to debug for certain services.
Beside that you need a minimum of 4 cores, 8 GByte of RAM and free diskspace.
We have 4 cmk server with a total of 634 Host.
On the server which are slow are:
346 host
20 CPU’s
32 RAM
i check the logs, but i not sure what is costing the Performance Problems.
1
on ~/var/log/cmc.log, we get multiple errors regarding
[rrdcached] [rrdcached at "/omd/sites/$site/tmp/run/rrdcached.sock"] [log] -1 No such file: /data/omd/sites/$site/var/pnp4nagios/perfdata/$hostname/Filesystem__dev_shm_fs_size.rrd
starting up
checking for journal files
replaying from journal: /data/omd/sites/$site/var/rrdcached/rrd.journal.1726565091.678467
Replayed 25358 entries (1328731 failures)
replaying from journal: /data/omd/sites/$site/var/rrdcached/rrd.journal.1726572291.678848
Replayed 307534 entries (975047 failures)
replaying from journal: /data/omd/sites/$site/var/rrdcached/rrd.journal.1726579417.913437
Replayed 515367 entries (308101 failures)
replaying from journal: /data/omd/sites/$site/var/rrdcached/rrd.journal.1726584088.666084
Replayed 0 entries (0 failures)
replaying from journal: /data/omd/sites/$site/var/rrdcached/rrd.journal.1726584214.798012
Replayed 22249 entries (0 failures)
started new journal /data/omd/sites/$site/var/rrdcached/rrd.journal.1726584404.961194
journal processing complete
listening for connections
in file /data/omd/sites/$site/var/rrdcached/rrd.journal.1726565091.678467 are a loot off erros like this update /data/omd/sites/$site/var/check_mk/rrd/$hostname/NTP.rrd 1726565295:-0.000899:2:2:0:0
Check in file ~/etc/init.d/rrdcached following lines:
# Please do not touch the code below
CACHE_DIR="/omd/sites/test/tmp/rrdcached"
JOURNAL_DIR="/omd/sites/test/var/rrdcached"
SOCKET="/omd/sites/test/tmp/run/rrdcached.sock"
PIDFILE="/omd/sites/test/tmp/rrdcached.pid"
LOGFILE="/omd/sites/test/var/log/rrdcached.log"
USER="test"
GROUP="test"
My site name is ‘test’, please replace with your site name.
This should solve your RRD issue at least. “mysite” is a placeholder for the name of your site.
In general, when looking at the official user guide or the knowledge base, strings starting with “my” always indicate a placeholder.