CDH 5 Installation Guide
This CDH 5 Installation Guide is for Apache Hadoop developers and system administrators interested in Hadoop installation. It describes how to install and configure version 5 of Cloudera's Distribution Including Apache Hadoop (CDH 5), and how to deploy it on a cluster.
The guide covers the following major topics.
Before You Start:
Installation tasks:
- Install CDH 5. Start here for a new installation on a cluster.
- Deploy CDH 5. Do these tasks after installing core Hadoop.
- Install components. Install additional components after installing and deploying HDFS and MapReduce. (Components are listed below.)
Note:
To install a release earlier than the current CDH 5 release (for example if you want to add new nodes to a cluster without upgrading the cluster to the latest release), follow these instructions.
Upgrade tasks:
- Upgrade from CDH 4 to CDH 5. Use these instructions if you are currently running a CDH 4 release; or
- Upgrade from an earlier CDH 5 release. Use these instructions if you are currently running a CDH 5 release.
- Upgrade components. Upgrade all installed components after upgrading core Hadoop. (Components are listed below.)
Note: Use these instructions to migrate data from a CDH 4 cluster to a CDH 5 cluster.
CDH 5 Components
Use the following sections to install or upgrade CDH 5 components:
- Crunch Installation
- Flume Installation
- HBase Installation
- Installing and Using HCatalog
- Hive Installation
- HttpFS Installation
- Hue Installation
- Impala Installation
- Llama Installation
- Mahout Installation
- Oozie Installation
- Pig Installation
- Search Installation
- Sentry Installation
- Snappy Installation
- Spark Installation
- Sqoop 1 Installation
- Sqoop 2 Installation
- Whirr Installation
- ZooKeeper
Other Topics in this Guide
This guide also contains information on the following:
- Avro Usage
- Using the Parquet File Format with Impala, Hive, Pig, and MapReduce
- Configuring Ports for CDH 5
- Maintenance Tasks and Notes
- Java Development Kit (JDK) Installation
- Mountable HDFS
- Creating a local Yum Repository
- Using the CDH 5 Maven Repository
- Building RPMs from Source RPMs
- Getting Support
- Apache and Third-Party Licenses
<< Next Steps | What's New in CDH 5 >> | |