Replacing the non-HA JobTracker with the HA JobTracker
This section provides instructions for removing the non-HA JobTracker and installing the HA JobTracker.
The HA JobTracker cannot be installed on a node on which the non-HA JobTracker is installed, and vice versa. If the JobTracker is installed, uninstall it following the instructions below before installing the HA JobTracker. Uninstall the non-HA JobTracker whether or not you intend to install the HA JobTracker on the same node.
Removing the non-HA JobTracker
You must remove the original (non-HA) JobTracker before you install and run the HA JobTracker. First, you need to stop the JobTracker and TaskTrackers.
To stop the JobTracker and TaskTrackers:
-
Stop the TaskTrackers:
On each TaskTracker system:
$ sudo service hadoop-0.20-mapreduce-tasktracker stop
-
Stop the JobTracker:
On the JobTracker system:
$ sudo service hadoop-0.20-mapreduce-jobtracker stop
- Verify that the JobTracker and TaskTrackers have stopped:
$ ps -eaf | grep -i job $ ps -eaf | grep -i task
To remove the JobTracker:
- On Red Hat-compatible systems:
$ sudo yum remove hadoop-0.20-mapreduce-jobtracker
- On SLES systems:
$ sudo zypper remove hadoop-0.20-mapreduce-jobtracker
- On Ubuntu systems:
sudo apt-get remove hadoop-0.20-mapreduce-jobtracker
Installing the HA JobTracker
Use the following steps to install the HA JobTracker package, and optionally the ZooKeeper failover controller package (needed for automatic failover).
Step 1: Install the HA JobTracker package on two separate nodes
On each JobTracker node:
- On Red Hat-compatible systems:
$ sudo yum install hadoop-0.20-mapreduce-jobtrackerha
- On SLES systems:
$ sudo zypper install hadoop-0.20-mapreduce-jobtrackerha
- On Ubuntu systems:
sudo apt-get install hadoop-0.20-mapreduce-jobtrackerha
Step 2: (Optionally) install the failover controller package
If you intend to enable automatic failover, you need to install the failover controller package.
The instructions for automatic failover assume that you have set up a ZooKeeper cluster running on three or more nodes, and have verified its correct operation by connecting using the ZooKeeper command-line interface (CLI). See the ZooKeeper documentation for instructions on how to set up a ZooKeeper ensemble.
Install the failover controller package as follows:
On each JobTracker node:
- On Red Hat-compatible systems:
$ sudo yum install hadoop-0.20-mapreduce-zkfc
- On SLES systems:
$ sudo zypper install hadoop-0.20-mapreduce-zkfc
- On Ubuntu systems:
sudo apt-get install hadoop-0.20-mapreduce-zkfc
<< About JobTracker High Availability (HA) | Configuring JobTracker High Availability >> | |