When you configure HDP HA solution using RHEL, your Hadoop cluster comprises of the following two components:
RHEL HA cluster
Overall Hadoop cluster
Important | |
---|---|
The RHEL HA cluster is a subset of the overall Hadoop cluster. |
Typically, the overall Hadoop cluster must include the following types of machines:
The RHEL HA cluster machines. These machines must host only those master services that require HA (in this case, the NameNode and the JobTracker).
Master machines that run other master services such as Hive Server 2, HBase master, etc.
Slave machines that run slave daemons like Datanodes, TaskTracker, RegionServers etc.
Client machines.
A network that connects all the machines.
The following illustrates the topology of a sample Hadoop cluster when configured for HDP HA using RHEL: