5.3.6. NodeManager process

This host-level alert is triggered if the NodeManager process cannot be confirmed to be up and listening on the network within the configured critical threshold, given in seconds. It uses the Nagios check_tcp plugin.
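
The kind of test this alert performs can be approximated with a short Python sketch. It is not the check_tcp plugin itself, only an illustration of the same idea: try to open a TCP connection and treat failure to connect within the critical threshold as a critical state. The function name probe_tcp and its parameters are hypothetical, and the host and port must be supplied by the caller.

```python
import socket
import time


def probe_tcp(host, port, critical_secs=5.0):
    """Try to open a TCP connection to host:port.

    Returns ("OK", elapsed) if the connection opens within critical_secs,
    otherwise ("CRITICAL", elapsed). Hypothetical helper, not part of Nagios.
    """
    start = time.monotonic()
    try:
        # create_connection raises OSError (refused, timeout, DNS failure, ...)
        with socket.create_connection((host, port), timeout=critical_secs):
            pass
    except OSError:
        return "CRITICAL", time.monotonic() - start
    return "OK", time.monotonic() - start
```

For example, probe_tcp("nm-host.example.com", 8042, 5.0) would test a placeholder host on port 8042. Both values are assumptions for illustration; substitute the host name and the port your NodeManager is actually configured to listen on.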

 5.3.6.1. Potential causes
  • NodeManager process is down or not responding.

  • NodeManager is running but is not listening on the correct network port/address.

  • Nagios Server cannot connect to the NodeManager.

 5.3.6.2. Possible remedies
  • Check if the NodeManager is running.

  • Check for any errors in the NodeManager logs (/var/log/hadoop/yarn) and restart the NodeManager, if necessary.

  • Use ping to check the network connection between the Nagios Server and the NodeManager host.
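
Because ping only tests basic reachability of the host, a TCP-level probe run from the Nagios Server can complement it by hinting at which of the potential causes applies. The following is a minimal sketch under stated assumptions: the function name diagnose_tcp and the wording of its return values are hypothetical, and the interpretation of each error is a common rule of thumb rather than a definitive diagnosis (a firewall, for instance, can produce either a refusal or a timeout).

```python
import socket


def diagnose_tcp(host, port, timeout=5.0):
    """Classify the outcome of a TCP connection attempt (hypothetical helper)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "listening: a process accepted the connection"
    except socket.gaierror:
        # Name resolution failed before any connection was attempted.
        return "unresolvable: host name could not be resolved"
    except ConnectionRefusedError:
        # Host answered, but nothing accepts connections on this address/port.
        return "refused: process down or listening on another port/address"
    except socket.timeout:
        # No answer within the timeout; often a network or firewall problem.
        return "timeout: no response from host on this port"
    except OSError as exc:
        return "error: {}".format(exc)
```

A "refused" result suggests checking whether the NodeManager is running and which port/address it is bound to, while a "timeout" or "unresolvable" result points toward the network path between the Nagios Server and the NodeManager host.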

