5.10.3. Ganglia collector [gmond] processes down alert for workers, NameNode, Job­Tracker, HBaseMaster

These alerts check if the Ganglia collector daemons (gmond) on the Ganglia server are running and listening on the network port. Ganglia uses collector daemons (gmond) on the Ganglia server: one for the Hadoop master daemon and one for aggregated metrics from the group of Hadoop slaves.This alert uses the Nagios check_tcp plugin.

 5.10.3.1. Potential causes
  • A gmond process is down

  • A gmond process is hanging

  • The network connection is down between the Nagois and Ganglia servers

 5.10.3.2. Possible remedies
  • Check the gmond related log in /var/log/messages for any errors

  • Check if ping works between Nagios and Ganglia servers.


loading table of contents...