Hi everyone,
using checkmk raw since a couple of weeks and it fits our needs perfectly.
Checkmk Raw Edition 2.2.0p5is running on ubuntu 22.04 server (virtual machine running on vmware esxi, 16gb of RAM are assigned).
This night the ram usage of the server itself exceeded 80%. Interesting is that ram usage constantly increased since 4 days, before the ram usage was pretty constant with about 1.5 GB.
Any ideas how to check whats going on? There is nothing else installed (beside from common ubuntu live server installation) on the server so i think it has something to do with checkmk.
Running 31 host with 901 services.
In my situation I get higher ram usage when the amount of livestatus requests increase, but ram usage drops back to ‘normal’ afterwards.
Not sure what is happening on your server, but what I like about the graphs in Checkmk is that you can mark a position in time in a graph and that mark becomes visible on all graphs everywhere on all hosts. So you can make assumptions based on different graphs and investigate in those areas.
this looks like a Memory Leak to me.
A Checkmk Server with 31 hosts and 901 services does not need that much RAM.
With this command you should be able to monitor the top 15 processes that make up the memory usage:
watch "ps --sort -rss -eo pid,pmem,rss,vsz,comm | head -16"
To fix you issue for now a simple reboot of the server should do the job but this won’t fix the memory leak itself and that’s why it’s important to find the process/program that responsible for the issue.
This topic was automatically closed 365 days after the last reply. New replies are no longer allowed. Contact an admin if you think this should be re-opened.