I’m new here.
I need to monitor a GPU server (multiple GPUs per host)
I was trying to find a service for nvidia-smi monitoring.
found nvidia-gpu-2.0.mkp and installed it thru the command line.
The plugin you mentioned consists of two components: The Checkmk server side and the agent side. You have to take the file local/share/check_mk/agents/plugins/nvidia_smi from your Checkmk site and deploy it to the directory /usr/lib/check_mk_agent/plugins/ on the host to be monitored. To test, just run it from there.
In the next step you can run the service discovery in Checkmk.
This topic was automatically closed 365 days after the last reply. New replies are no longer allowed. Contact an admin if you think this should be re-opened.