Default cluster topology
Data Hub uses a specific cluster topology made up of three host groups: master, worker, and compute.
These host groups are defined in the cluster templates and cluster definitions used by Data Hub:
Host group | Description | Number of nodes |
---|---|---|
Master | The master host group runs the components for managing cluster resources (including Cloudera Manager), storing intermediate data (for example, HDFS), and processing tasks, as well as other master components. | 1 |
Worker | The worker host group runs the components that execute processing tasks (such as NodeManager) and that store data in HDFS (such as DataNode). | 1+ |
Compute | The compute host group can optionally be used for running data processing tasks (such as NodeManager). | 0+ |
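
The node counts in the table can be read as simple constraints: exactly one master node, at least one worker node, and any number of compute nodes, including none. The following is a minimal sketch of a check against those constraints. It is not part of Data Hub or its template format; the `HOST_GROUP_LIMITS` dictionary, the `validate_topology` function, and the lowercase group names used as keys are hypothetical names chosen for illustration.

```python
# Node-count limits taken from the table above, as (minimum, maximum).
# None means no upper bound. These names are illustrative, not a Data Hub API.
HOST_GROUP_LIMITS = {
    "master": (1, 1),      # "1"  -> exactly one node
    "worker": (1, None),   # "1+" -> one or more nodes
    "compute": (0, None),  # "0+" -> optional
}


def validate_topology(node_counts):
    """Return a list of problems; an empty list means the counts fit the table."""
    problems = []
    # Flag any host group that the table does not define.
    for group in node_counts:
        if group not in HOST_GROUP_LIMITS:
            problems.append(f"unknown host group: {group}")
    # Check each defined host group; a missing group counts as zero nodes.
    for group, (minimum, maximum) in HOST_GROUP_LIMITS.items():
        count = node_counts.get(group, 0)
        if count < minimum:
            problems.append(f"{group}: needs at least {minimum} node(s), got {count}")
        elif maximum is not None and count > maximum:
            problems.append(f"{group}: allows at most {maximum} node(s), got {count}")
    return problems
```

For example, `validate_topology({"master": 1, "worker": 3})` returns an empty list because the compute host group is optional, while a request with two master nodes or no worker nodes is reported as a problem.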