2. Modify the Configuration Files

In the temporary directory, locate the following files and modify their properties to match your environment. The properties to replace are marked with TODO in each file. See Define Environment Parameters for more information.
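To see which properties still need values, the files can be scanned for the TODO marker. The following is a minimal sketch, not part of the shipped tooling: the function name `find_todo_lines` is hypothetical, and it assumes the marker is the literal string TODO and that the files sit directly in the directory you pass in.

```python
from pathlib import Path


def find_todo_lines(config_dir, marker="TODO"):
    """Return (file name, line number, text) for every line containing the marker."""
    hits = []
    for path in sorted(Path(config_dir).iterdir()):
        if not path.is_file():
            continue  # skip subdirectories
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            if marker in line:
                hits.append((path.name, number, line.strip()))
    return hits
```

Calling `find_todo_lines` on the temporary directory returns one tuple per unresolved line. An empty list suggests that every placeholder has been replaced.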

 

Table 3.1. core-site.xml

Property | Example | Description
fs.default.name | hdfs://$namenode.full.hostname:8020 | Enter your NameNode hostname.
fs.checkpoint.dir | /grid/hadoop/hdfs/snn | A comma-separated list of paths. Use the list of directories from $FS_CHECKPOINT_DIR.
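Each row of the table corresponds to one `<property>` element inside the file's `<configuration>` root. As a rough illustration of the resulting layout, the sketch below renders the two core-site.xml properties. The helper `render_site_xml` and the hostname `namenode.example.com` are hypothetical, and the file shipped with your distribution may contain additional properties or comments.

```python
import xml.etree.ElementTree as ET


def render_site_xml(properties):
    """Build a Hadoop-style <configuration> document from a name -> value mapping."""
    root = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    if hasattr(ET, "indent"):  # pretty-printing is available from Python 3.9
        ET.indent(root)
    return ET.tostring(root, encoding="unicode")


# Hypothetical hostname; substitute your own NameNode host.
core_site = render_site_xml({
    "fs.default.name": "hdfs://namenode.example.com:8020",
    "fs.checkpoint.dir": "/grid/hadoop/hdfs/snn",
})
print(core_site)
```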


 

Table 3.2. hdfs-site.xml

Property | Example | Description
dfs.name.dir | /grid/hadoop/hdfs/nn,/grid1/hadoop/hdfs/nn | Comma-separated list of paths. Use the list of directories from $DFS_NAME_DIR.
dfs.data.dir | /grid/hadoop/hdfs/dn,/grid1/hadoop/hdfs/dn | Comma-separated list of paths. Use the list of directories from $DFS_DATA_DIR.
dfs.http.address | $namenode.full.hostname:50070 | Enter your NameNode hostname for HTTP access.
dfs.secondary.http.address | $secondary.namenode.full.hostname:50090 | Enter your SecondaryNameNode hostname.
dfs.https.address | $namenode.full.hostname:50470 | Enter your NameNode hostname for HTTPS access.
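A dropped leading slash is an easy mistake to make in the comma-separated directory lists. The sketch below flags entries that are not absolute paths before you save the file. It assumes that every directory is meant to be an absolute path, and the function name `check_path_list` is hypothetical.

```python
def check_path_list(value):
    """Split a comma-separated directory list and return entries that are not absolute paths."""
    entries = [entry.strip() for entry in value.split(",")]
    # An empty entry (for example from a trailing comma) is also reported.
    return [entry for entry in entries if not entry.startswith("/")]


good = check_path_list("/grid/hadoop/hdfs/nn,/grid1/hadoop/hdfs/nn")
bad = check_path_list("/grid/hadoop/hdfs/dn,grid1/hadoop/hdfs/dn")
print(good)  # []
print(bad)   # ['grid1/hadoop/hdfs/dn']
```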


Note

The NameNode new generation size should be 1/8 of the maximum heap size (-Xmx). Check this value, because the default setting may not be accurate. To change it, edit the /etc/hadoop/conf/hadoop-env.sh file and set the -XX:MaxNewSize parameter to 1/8 of the value of the maximum heap size (-Xmx) parameter. Also ensure that the NameNode and Secondary NameNode have identical memory settings.
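The 1/8 rule is simple arithmetic: a 1024 MB heap gives a 128 MB new generation. As a small sketch of that calculation, the hypothetical helper `max_new_size_flag` below derives the new generation flag from a heap flag. It assumes the heap is written with a k, m, or g suffix (or in plain bytes) and rounds down to whole megabytes.

```python
import re

_UNITS = {"k": 1024, "m": 1024 ** 2, "g": 1024 ** 3}


def max_new_size_flag(xmx_flag):
    """Given a flag such as '-Xmx1024m', return -XX:MaxNewSize set to 1/8 of that heap."""
    match = re.fullmatch(r"-Xmx(\d+)([kKmMgG]?)", xmx_flag)
    if match is None:
        raise ValueError(f"unrecognized heap flag: {xmx_flag!r}")
    # No suffix means the number is already in bytes.
    heap_bytes = int(match.group(1)) * _UNITS.get(match.group(2).lower(), 1)
    new_gen_mb = heap_bytes // 8 // (1024 ** 2)  # rounded down to whole MB
    return f"-XX:MaxNewSize={new_gen_mb}m"


print(max_new_size_flag("-Xmx1024m"))  # -XX:MaxNewSize=128m
print(max_new_size_flag("-Xmx4g"))     # -XX:MaxNewSize=512m
```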

 

Table 3.3. mapred-site.xml

Property | Example | Description
mapred.job.tracker | $jobtracker.full.hostname:50300 | Enter your JobTracker hostname.
mapred.job.tracker.http.address | $jobtracker.full.hostname:50030 | Enter your JobTracker hostname.
mapred.local.dir | /grid/hadoop/mapred,/grid1/hadoop/mapred | Comma-separated list of paths. Use the list of directories from $MAPREDUCE_LOCAL_DIR.
mapreduce.tasktracker.group | hadoop | Enter your group. Use the value of $HADOOP_GROUP.
mapreduce.history.server.http.address | $jobtracker.full.hostname:51111 | Enter your JobTracker hostname.


 

Table 3.4. taskcontroller.cfg

Property | Example | Description
mapred.local.dir | /grid/hadoop/mapred,/grid1/hadoop/mapred | Comma-separated list of paths. Use the list of directories from $MAPREDUCE_LOCAL_DIR.
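The example value for mapred.local.dir is the same in mapred-site.xml and taskcontroller.cfg. The sketch below checks that the two stay in step. It assumes taskcontroller.cfg is a plain key=value file and treats lines starting with # as comments, which is an assumption rather than documented behavior. The helper name `parse_cfg` is hypothetical.

```python
def parse_cfg(text):
    """Parse key=value lines, skipping blank lines and lines starting with '#'."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip()
    return settings


# Sample contents; in practice read these from the two edited files.
cfg_text = "mapred.local.dir=/grid/hadoop/mapred,/grid1/hadoop/mapred\n"
mapred_site_value = "/grid/hadoop/mapred,/grid1/hadoop/mapred"

settings = parse_cfg(cfg_text)
print(settings["mapred.local.dir"] == mapred_site_value)  # True
```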


