3. Validate the Installation

Use the following instructions to validate your installation:

  1. Start the Nagios server. Execute the following command to start the Nagios server:

    /etc/init.d/nagios start
  2. Confirm the server is running

    /etc/init.d/nagios status

    This should return:

    nagios (pid #) is running...
  3. Test Nagios Services. On the Nagios host machine, execute the following command:

    /usr/lib64/nagios/plugins/check_hdfs_capacity.php -h namenode_hostname -p 50070 -w 80% -c 90%

    This should return:

    OK: DFSUsedGB:<some#>, DFSTotalGB:<some#>
  4. Test Nagios Access.

    1. Browse to the Nagios server:

      http://$nagios.server/nagios
    2. Login using the Nagios admin username (nagiosadmin) and password.

    3. Click on hosts to validate that all the hosts in the cluster are listed.

    4. Click on services to validate all the Hadoop services are listed for each host.

  5. Test Nagios Alerts.

    1. Login to one of your cluster DataNodes.

    2. Stop the TaskTracker service.

      su -l mapred -c "/usr/lib/hadoop/bin/hadoop-daemon.sh --config /etc/hadoop/conf stop tasktracker"
    3. Validate that you received an alert at the admin email address and that you have critical state showing on the console.

    4. Start the TaskTracker service.

      su -l mapred -c "/usr/lib/hadoop/bin/hadoop-daemon.sh --config /etc/hadoop/conf start tasktracker"
    5. Validate that you received an alert at the admin email address and that critical state is cleared on the console.


loading table of contents...