Hi Marcel,
thanks for the suggestions.
I stood up a brand new test server today. We don't use OMD but install
CMK via "setup.sh" from the sources. Nagios 3.5.0 packages are from EPEL
repositories, otherwise a "naked" Centos 6.3 server.
The only changes I made after the fresh install is add a file
"all_hosts.mk" to /opt/check_mk/etc/conf.d that has two host definitions
in it:
all_hosts += [
"testhost1",
"testhost2",
]
Ran an inventory with -II and started via -O.
Same problem - all my file system reports are way off:
testhost1:
testhost1 conf.d $ df -h
Filesystem Size Used Avail Use% Mounted on
/dev/sda2 244G 2.4G 229G 2% /
tmpfs 16G 0 16G 0% /dev/shm
/dev/sda1 485M 62M 398M 14% /boot
WebGui reports:
fs_/ OK 6.0% used (14.70 of 243.13 GB)
fs_/boot OK 17.9% used (86.69 of 484.22 MB)
For testhost2:
testhost2 ~ $ df -h
Filesystem Size Used Avail Use% Mounted on
/dev/sda2 244G 3.9G 227G 2% /
tmpfs 16G 0 16G 0% /dev/shm
/dev/sda1 485M 62M 398M 14% /boot
and on the GUI:
fs_/ OK 6.6% used (16.17 of 243.13 GB)
fs_/boot OK 17.9% used (86.67 of 484.22 MB)
I also checked both GUIs, Nagios and CMK's web page, both are reporting
the same values for both hosts, both are wrong.
The Check_Mk agent on the hosts is reporting correct numbers to the
server, though.
Any other ideas as to what might be going wrong here? A fresh install
using 1.2.2p2 works without issues and reports proper file system
numbers. I'm puzzled.
Thanks!
On 03/08/2016 01:19 AM, Marcel Schulte wrote:
Hi,
no problems like that here.
I'd install a fresh OMD testsite without any additional configuration,
add one of the hosts in question and check if the wrong filesystem sizes
show up. If they don't the problem lies somewheer in your configuration
(customized df check or df.include file, custom rules, whatever).
BTW: did you try to reinventorize ("TabulaRasa" in WATO or "cmk -II" in
terminal) one of the hosts, rstart and check again?
Regards,
Marcel
Brainslug <brainslug@freakmail.de <mailto:brainslug@freakmail.de>>
schrieb am Mo., 7. März 2016 um 20:51 Uhr:
Hi all,
after upgrading CheckMK from 1.2.2p2 to 1.2.6p15 we've
noticed that
disk usage reporting seems to be way off.
Example:
This is what "df (-h)" reports:
testhost2 ~ $ df
Filesystem 1K-blocks Used Available Use% Mounted on
/dev/sda2 254941748 3985516 237999204 2% /
tmpfs 16414592 0 16414592 0% /dev/shm
/dev/sda1 495844 63149 407095 14% /boot
testhost2 ~ $ df -h
Filesystem Size Used Avail Use% Mounted on
/dev/sda2 244G 3.9G 227G 2% /
tmpfs 16G 0 16G 0% /dev/shm
/dev/sda1 485M 62M 398M 14% /boot
Here's what the check_mk_agent reports to the server:
<<<df>>>
/dev/sda2 ext4 254941748 3985200 237999520 2% /
tmpfs tmpfs 16414592 0 16414592 0% /dev/shm
/dev/sda1 ext3 495844 63149 407095 14% /boot
<<<df>>>
And this is what the Nagios server reports on the web site and in the
pnp4nagios graphs:
fs_/ OK 6.6% used (*16.16* of 243.13 GB)
fs_/boot OK 17.9% used (*86.67* of 484.22 MB)
This is more than 400%(!) off for fs_/ and a good 25% off for fs_/boot.
The offset seems to be random but significant on every single host we
have in our cluster.
Is anybody else seeing this? Is there anything we need to change in our
configurations to have CheckMK report the right values on the web site
and in the rrd graphs?
Thanks!
_______________________________________________
checkmk-en mailing list
checkmk-en@lists.mathias-kettner.de
<mailto:checkmk-en@lists.mathias-kettner.de>
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
_______________________________________________
checkmk-en mailing list
checkmk-en@lists.mathias-kettner.de
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en