Issue with bandwidth graphs

Hello all,
We have a virtual server with 8 cores and 32 GB RAM, on which we have set up more than 1000 services to monitor across 300 routers/switches, a number that will grow in the future.
At the moment the agents are not collecting the data for the bandwidth graphs, and we need to fix this because we cannot have periods of time without bandwidth graphs.

Here is an image of a router showing this issue:

Can you help us solve this issue? Are there any parameters to configure so that multiple assets can be read at the same time?
Thank you.

This looks like a reachability or timing issue to me.

Are there error messages in the “Check_MK” service of that host? Can you post a screenshot of the availability timeline?

All our hosts are reachable as you can see in the screenshot.

The availability of the router, in this case, is OK, and we can also see the bandwidth utilisation in numerical form, just not in graphical form.

As you can see from the setup page of the host, everything is OK except for the vanished services Interface 1 and Interface 2, which are the ones that monitor the bandwidth.

For the vanished service problem, you only need to refresh the service discovery.
Since the name is the same as before, this should not affect anything else.

For your graph problem, you should inspect the rrdcached and pnp4nagios log files to see whether there is a problem writing the performance data to disk.

We tried to find these two log files, but the rrdcached one is empty, as you can see from the image.

The other one, pnp4nagios, does not exist.

Can you tell me the exact paths where these files are located?

The pnp4nagios log files are under ~/var/pnp4nagios
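To check quickly whether anything is being written there, a small script along these lines may help. It is only a sketch: the helper name `summarize_logs` is made up for illustration, and it assumes you run it as the site user so that `~` resolves to the site's home directory. The two directories are the ones mentioned in this thread.

```python
from pathlib import Path


def summarize_logs(directory):
    """Return (relative name, size in bytes) for every regular file below
    `directory`, sorted by name, or None if the directory does not exist."""
    root = Path(directory).expanduser()
    if not root.is_dir():
        return None
    return sorted(
        (str(p.relative_to(root)), p.stat().st_size)
        for p in root.rglob("*")
        if p.is_file()
    )


if __name__ == "__main__":
    # Assumption: executed as the site user, so "~" is the site directory.
    for directory in ("~/var/pnp4nagios", "~/var/log"):
        entries = summarize_logs(directory)
        if entries is None:
            print(f"{directory}: directory not found")
        elif not entries:
            print(f"{directory}: no files")
        else:
            for name, size in entries:
                flag = " (empty)" if size == 0 else ""
                print(f"{directory}/{name}: {size} bytes{flag}")
```

A directory reported as "no files", or a log reported as "(empty)", would suggest that the component is not writing logs there at all, rather than that it is running without errors.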

The pnp4nagios folder is also empty.

Is it possible to arrange a remote session to help us fix this issue?

Are you aware that this is a community forum, not a commercial support channel? Just asking.

It is strange that the same check (e.g. “Interface 1”) appears under both Undecided services and Vanished services. Normally, this should not happen. Is there a minor spelling difference, such as a different number of blanks between “Interface” and “1”?
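A whitespace difference like this is hard to spot by eye. If you copy the service names out of the discovery page, a short script can make it visible. This is only a sketch: the two lists are hypothetical sample values typed in by hand, and the function names are made up for illustration.

```python
def normalize(name):
    """Collapse every run of whitespace to one space and strip both ends."""
    return " ".join(name.split())


def whitespace_only_mismatches(undecided, vanished):
    """Return (undecided, vanished) pairs that differ as raw strings
    but are identical once whitespace is normalized."""
    by_normalized = {}
    for name in vanished:
        by_normalized.setdefault(normalize(name), []).append(name)
    pairs = []
    for name in undecided:
        for other in by_normalized.get(normalize(name), []):
            if other != name:
                pairs.append((name, other))
    return pairs


if __name__ == "__main__":
    # Hypothetical names, copied by hand from the discovery page.
    undecided = ["Interface 1", "Interface 2"]
    vanished = ["Interface  1", "Interface 2"]
    for new, old in whitespace_only_mismatches(undecided, vanished):
        # repr() makes doubled blanks and non-breaking spaces visible.
        print(repr(new), "vs", repr(old))
```

If this reports a pair, the two entries are different services as far as the discovery is concerned, which would explain why one shows up as vanished and the other as undecided.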

Do you have a rule of type “Network interface and switch port discovery” for that host? If none, I would consider adding a rule, and select “Use alias” or “Use description” for the discovery (see screenshot).

So this is an Enterprise Edition, which was not mentioned in the first posts.
If you have problems with your RRD files and use the Enterprise Edition, then you need to look at “cmc.log” inside “~/var/log”.
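As a rough way to narrow that log down, something like the following could be used. It is a sketch under assumptions: the keywords `rrd` and `error` are guesses at what relevant lines might contain, not a documented log format, and `matching_lines` is a made-up helper name.

```python
from pathlib import Path


def matching_lines(log_path, keywords=("rrd", "error"), limit=50):
    """Return the last `limit` lines of the file that contain any of the
    keywords, compared case-insensitively."""
    path = Path(log_path).expanduser()
    hits = []
    with path.open(errors="replace") as handle:
        for line in handle:
            lowered = line.lower()
            if any(keyword in lowered for keyword in keywords):
                hits.append(line.rstrip("\n"))
    return hits[-limit:]


if __name__ == "__main__":
    # Assumption: executed as the site user, so "~" is the site directory.
    log = "~/var/log/cmc.log"
    try:
        for line in matching_lines(log):
            print(line)
    except FileNotFoundError:
        print(f"{log} not found")
```

If nothing matches, it is still worth reading the most recent part of the file directly, since the keywords above are only a starting point.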
