Before You Begin Installing CDH 5 Manually
- This section contains instructions for new installations. If you need to upgrade from an earlier release, see Upgrading from CDH 4 to CDH 5.
- For a list of supported operating systems, see CDH 5 Requirements and Supported Versions.
- These instructions assume that the sudo command is configured on the hosts where you will be doing the installation. If this is not the case, you will need the root user (superuser) to configure it.
If you are migrating from MapReduce v1 (MRv1) to MapReduce v2 (MRv2, YARN), see Migrating from MapReduce v1 (MRv1) to MapReduce v2 (MRv2, YARN) for important information and instructions.
When starting, stopping and restarting CDH components, always use the service (8) command rather than running scripts in /etc/init.d directly. This is important because service sets the current working directory to / and removes most environment variables (passing only LANG and TERM) so as to create a predictable environment in which to administer the service. If you run the scripts in /etc/init.d, any environment variables you have set remain in force, and could produce unpredictable results. (If you install CDH from packages, service will be installed as part of the Linux Standard Base (LSB).)
- Java Development Kit: if you have not already done so, install the Oracle Java Development Kit (JDK); see Java Development Kit Installation.
- Scheduler defaults: note the
following differences between MRv1 and MRv2 (YARN).
- MRv1:
- Cloudera Manager sets the default to FIFO.
- CDH 5 sets the default to FIFO, with FIFO, Fair Scheduler, and Capacity Scheduler on the classpath by default.
- MRv2 (YARN):
- Cloudera Manager sets the default to Fair Scheduler.
- CDH 5 sets the default to Fair Scheduler, with FIFO and Fair Scheduler on the classpath by default.
- YARN does not support Capacity Scheduler.
- MRv1:
High Availability
- For more information and instructions on setting up a new HA configuration, see the
CDH 5 High Availability Guide.Important
: If you decide to configure HA for the NameNode, do not install hadoop-hdfs-secondarynamenode. After completing the HDFS HA software configuration, follow the installation instructions under Deploying HDFS High Availability.
- To upgrade an existing configuration, follow the instructions under Upgrading to CDH 5.
<< Ways To Install CDH 5 | Installing CDH 5 >> | |