Port requirements for Replication Manager on CDP Private Cloud Base
Before you create replication policies in Replication Manager, ensure that the following ports are open and accessible on the source hosts and CDP Private Cloud Base hosts to allow communication between the source and destination Cloudera Manager servers and the HDFS, Hive, MapReduce, and YARN hosts, as required.
Service | Default Port | On-premises source hosts | Description |
---|---|---|---|
Cloudera Manager HTTP (Web UI) | 7180 | All Management Nodes (CM*) | Used for control flow. Open on specific source and destination IP address and not on all source IP addresses to communicate to the peer (source) Cloudera Manager. After you configure the source and destination clusters, the destination Cloudera Manager connects to source Cloudera Manager on port 7180/7183 during peering. |
HDFS NameNode | 8020 | All Primary Nodes | Used for data flow by HDFS and Hive/Impala replication to communicate from destination HDFS and MapReduce hosts to source HDFS NameNode(s). |
HDFS DataNode | 50010 / 9866 is used for DataNode HTTP server port. | All Secondary Nodes | Used for data flow by HDFS and Hive/Impala replication to communicate from destination HDFS and MapReduce hosts to source HDFS DataNode(s). |
NameNode WebHDFS | 9870 | Used for data flow for Apache Hadoop HttpFS service to provide HTTP access to HDFS. HttpFS has a REST HTTP API supporting all HDFS filesystem operations (both read and write). For more information, see Using HttpFS. | |
YARN Resource Manager | 8032 | All Primary Nodes | Used for data flow to access the YARN ResourceManager. For more information, see YARN Configuration Properties. |
Hive Metastore | 9083 | All Management Nodes (CM*) | Used for data flow for Hive/Impala replication to query or access Hive Metastore. For more information, see Configure metastore location and HTTP mode. |
Impala Catalog Server | 26000 | All Management Nodes (CM*) | Internal use only for data flow during Hive/Impala replication. The catalog service uses this port to communicate with the Impala daemons. |
Ranger KMS | 9292 | All Primary Nodes | Used for data flow during replication of encrypted data. For more information, see Migrating Keys. |
Kerberos KDC Server and KRB5 services | 88 | All | Used for authentication flow by Replication Manager when Kerberos authentication
is enabled on the clusters. Open the port on all the hosts on the destination cluster. |
*Cloudera Manager |