Hope someone can help or point me in the right direction.
Have set the following:
Global settings: Enable regular service discovery checks (deprecated) = unset
Service Rule:
Periodic service discovery = every 2 hours
The problems are:
CPU load goes to 100% and the server becomes unresponsive (number of vCPUs is 2).
Adding vCPUs makes no difference; it still goes to 100% even with 8 vCPUs.
After discovery has finished, Check_MK Inventory fails; rescheduling the check fixes it.
How do I set the discovery to run the checks on each host after discovery is finished?
Have you tried splitting the “Periodic service discovery” rule into multiple rules for different groups, for example one for all Windows servers and one for all Linux servers?
Please keep in mind that only one “Periodic service discovery” rule per server can be active at the same time (the first matching rule defines the parameters).
The runtime of a single discovery is important in your case.
You have around 200 real hosts and some AWS instances.
If the discovery runs every two hours and all hosts start at the same time (which should only happen after the first activation of the discovery), the system triggers these 200-odd active checks at once. That is a significant load.
What is the normal check interval used on your system, the default 1 minute interval?
What I don’t know, as I have no AWS systems, is the normal runtime of the AWS special agent.
If a complete discovery takes between 5 and 10 seconds per host, one discovery run over about 200 hosts needs between 1000 and 2000 CPU seconds. You can measure the time per service discovery on the command line with “time cmk --check-discovery hostname”.
With 2 cores that means nearly 10 minutes at 100% usage, plus a high load,
but only if all checks trigger at the same time.
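The estimate above can be reproduced with a few lines of Python. This is only a rough sketch: the host count (200), the 5 to 10 seconds per discovery, and the 2 and 8 cores are the figures from this thread, while the function names are made up for illustration. It assumes that all discoveries start together, that the work spreads evenly over all cores, and that nothing else is running on the server.

```python
def total_cpu_seconds(hosts: int, seconds_per_host: float) -> float:
    """CPU time needed for one discovery run over all hosts."""
    return hosts * seconds_per_host


def busy_minutes(cpu_seconds: float, cores: int) -> float:
    """Wall-clock minutes at 100% usage.

    Assumption: the work is spread evenly over all cores.
    """
    return cpu_seconds / cores / 60


if __name__ == "__main__":
    hosts = 200  # approximate number of real hosts mentioned in this thread
    for seconds_per_host in (5, 10):
        cpu = total_cpu_seconds(hosts, seconds_per_host)
        for cores in (2, 8):
            minutes = busy_minutes(cpu, cores)
            print(
                f"{seconds_per_host} s/host, {cores} cores: "
                f"{cpu:.0f} CPU seconds, about {minutes:.1f} minutes busy"
            )
```

Replace the 5 and 10 with the times you actually measure with “time cmk --check-discovery hostname” to get a figure for your own site.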
The average run time for the AWS systems is 3 seconds.
I have the Periodic service discovery in rules, but did not set times on the rule.
I set the normal check interval to 5 minutes to see if that was causing the high CPU problem,
but would like to get it back to 1 minute.
I will try setting a different time for each rule.
Is there a way to set a time for a rule to run?
I can only find “Never do discovery or activate changes in the following time ranges”.
To run the rule every 2 hours at a set time, would I have to add a lot of time ranges?
So far so good: I have got the CPU load down, but there is still something running once a day
that is taking the server down.
I do not have any rules set for once a day and cannot see anything in the global settings.
It is something that starts when the site is started. Is there some daily job that runs?