The following tables describe the directories for installation, configuration, data, process IDs, and logs, based on the Hadoop services you plan to install. Use these tables to define the directories you will use when setting up your environment.
Table 1.3. Define Directories for Core Hadoop
Hadoop Service | Parameter | Definition |
---|---|---|
HDFS | | Space-separated list of directories where NameNode should store the file system image. For example, |
HDFS | | Space-separated list of directories where DataNodes should store the blocks. For example, |
HDFS | | Space-separated list of directories where SecondaryNameNode should store the checkpoint image. For example, |
HDFS | | Directory for storing the HDFS logs. This directory name is a combination of a directory and the $HDFS_USER. For example, where |
HDFS | | Directory for storing the HDFS process ID. This directory name is a combination of a directory and the $HDFS_USER. For example, where |
HDFS | | Directory for storing the Hadoop configuration files. For example, |
MapReduce | | Space-separated list of directories where MapReduce should store temporary data. For example, |
MapReduce | | Directory for storing the MapReduce logs. For example, This directory name is a combination of a directory and the |
MapReduce | | Directory for storing the MapReduce process ID. For example, This directory name is a combination of a directory and the |
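
Two conventions in the table above can be illustrated with a short sketch: a space-separated list of directories, and a directory name formed by combining a base directory with a service user. The helper names and every path below are hypothetical placeholders chosen for illustration, not values taken from this guide; substitute your own mount points and user names.

```python
import posixpath


def split_dir_list(value):
    """Split a space-separated directory list into individual paths."""
    # str.split() with no argument also tolerates repeated spaces.
    return value.split()


def per_user_dir(base_dir, user):
    """Combine a base directory with a service user name."""
    return posixpath.join(base_dir, user)


# Hypothetical planning values (assumptions, not defaults from this guide).
name_dirs = split_dir_list("/grid/0/hadoop/hdfs/nn /grid/1/hadoop/hdfs/nn")
hdfs_log_dir = per_user_dir("/var/log/hadoop", "hdfs")

print(name_dirs)
print(hdfs_log_dir)
```

Listing more than one directory is typically done to spread data across separate disks, so each entry in such a list would normally sit on a different mount point.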
Table 1.4. Define Directories for Ecosystem Components
Hadoop Service | Parameter | Definition |
---|---|---|
Pig | | Directory to store the Pig configuration files. For example, |
Oozie | | Directory to store the Oozie configuration files. For example, |
Oozie | | Directory to store the Oozie data. For example, |
Oozie | | Directory to store the Oozie logs. For example, |
Oozie | | Directory to store the Oozie process ID. For example, |
Oozie | | Directory to store the Oozie temporary files. For example, |
Hive | | Directory to store the Hive configuration files. For example, |
Hive | | Directory to store the Hive logs. For example, |
Hive | | Directory to store the Hive process ID. For example, |
WebHCat | | Directory to store the WebHCat configuration files. For example, |
WebHCat | | Directory to store the WebHCat logs. For example, |
WebHCat | | Directory to store the WebHCat process ID. For example, |
HBase | | Directory to store the HBase configuration files. For example, |
HBase | | Directory to store the HBase logs. For example, |
HBase | | Directory to store the HBase process ID. For example, |
ZooKeeper | | Directory where ZooKeeper will store data. For example, |
ZooKeeper | | Directory to store the ZooKeeper configuration files. For example, |
ZooKeeper | | Directory to store the ZooKeeper logs. For example, |
ZooKeeper | | Directory to store the ZooKeeper process ID. For example, |
Sqoop | | Directory to store the Sqoop configuration files. For example, |
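
Once you have filled in a directory for each row, a quick sanity check of the plan can catch simple mistakes before installation. The sketch below is one possible approach, not part of this guide: the `check_plan` helper, the plan layout, and every path are hypothetical, and the rule that each role gets its own directory is a planning assumption rather than a requirement stated in the tables.

```python
import posixpath


def check_plan(plan):
    """Return a list of problems found in a {(service, role): path} plan."""
    problems = []
    seen = {}  # normalized path -> first (service, role) that claimed it
    for key, path in plan.items():
        if not posixpath.isabs(path):
            problems.append(f"{key}: {path!r} is not an absolute path")
        normalized = posixpath.normpath(path)
        if normalized in seen:
            problems.append(f"{key}: {path!r} is also used by {seen[normalized]}")
        else:
            seen[normalized] = key
    return problems


# Hypothetical entries (assumptions for illustration only).
plan = {
    ("Oozie", "data"): "/var/db/oozie",
    ("Oozie", "logs"): "/var/log/oozie",
    ("Hive", "logs"): "/var/log/hive",
    ("ZooKeeper", "data"): "/grid/0/hadoop/zookeeper/data",
}

for problem in check_plan(plan):
    print(problem)
```

An empty result means only that the plan passed these two checks; it does not confirm that the directories exist or have the right ownership.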