3. Validate the Installation

Use these steps to validate your installation.

 3.1. Validate the Nagios Installation

Validate the installation.

nagios -v /etc/nagios/nagios.cfg

 3.2. Start Nagios and httpd

Start the Nagios server and httpd.

/etc/init.d/nagios start
/etc/init.d/httpd start

 3.3. Confirm Nagios is Running

Confirm the server is running.

/etc/init.d/nagios status

This should return:

nagios (pid #) is running...

 3.4. Test Nagios Services

Run the following command:

/usr/lib64/nagios/plugins/check_hdfs_capacity.php -h namenode_hostname -p 50070 -w 80% -c 90%

This should return:

OK: DFSUsedGB:<some#>, DFSTotalGB:<some#>

 3.5. Test Nagios Access

  1. Browse to the Nagios server:

    http://<nagios.server>/nagios
  2. Login using the Nagios admin username (nagiosadmin) and password (see Set the Nagios Admin Password).

  3. Click on hosts to validate that all the hosts in the cluster are listed.

  4. Click on services to validate all the Hadoop services are listed for each host.

 3.6. Test Nagios Alerts

  1. Login to one of your cluster DataNodes.

  2. Stop the TaskTracker service.

    su -l mapred -c "/usr/lib/hadoop/bin/hadoop-daemon.sh --config /etc/hadoop/conf stop tasktracker"
  3. Validate that you received an alert at the admin email address and that you have critical state showing on the console.

  4. Start the TaskTracker service.

    su -l mapred -c "/usr/lib/hadoop/bin/hadoop-daemon.sh --config /etc/hadoop/conf start tasktracker"
  5. Validate that you received an alert at the admin email address and that critical state is cleared on the console.


loading table of contents...