2. Set Directories and Permissions

Create directories and configure ownership + permissions on the appropriate hosts as described below.

If any of these directories already exist, we recommend deleting and recreating them. Use the following instructions to set up Pig configuration files :

  1. We strongly suggest that you edit and source the files included in scripts.zip file (downloaded in  Download Companion Files).

    Alternatively, you can also copy the contents to your ~/.bash_profile) to set up these environment variables in your environment.

  2. Execute these commands on your WebHCat server machine to create log and pid directories.

    mkdir -p $WEBHCAT_LOG_DIR
    chown -R $WEBHCAT_USER:$HADOOP_GROUP $WEBHCAT_LOG_DIR
    chmod -R 755 $WEBHCAT_LOG_DIR

    mkdir -p $WEBHCAT_PID_DIR
    chown -R $WEBHCAT_USER:$HADOOP_GROUP $WEBHCAT_PID_DIR
    chmod -R 755 $WEBHCAT_PID_DIR

    where:

    • $WEBHCAT_LOG_DIR is the directory to store the WebHCat logs. For example, var/log/webhcat.

    • $WEBHCAT_PID_DIR is the directory to store the WebHCat process ID. For example, /var/run/webhcat.

    • $WEBHCAT_USER is the user owning the WebHCat services. For example, hcat.

    • $HADOOP_GROUP is a common group shared by services. For example, hadoop.


loading table of contents...