Replication Manager for CDP Private Cloud Base
Replication Manager in CDP Private Cloud Base
Support matrix for Replication Manager on CDP Private Cloud Base
Port requirements for Replication Manager on CDP Private Cloud Base
Data replication
Cloudera license requirements for Replication Manager
Replicating directories with thousands of files and subdirectories
Replication Manager log retention
Replicating from unsecure to secure clusters
Designating a replication source
Configuring a peer
Modifying peers
Configuring peers with SAML authentication
HDFS Replication
Source data
Network latency and replication
Performance and scalability limitations
HDFS replication from Sentry-enabled clusters
Guidelines for using snapshot diff-based replication
Configuring replication of HDFS data
Limiting replication hosts
Viewing replication policies
Viewing replication history
Monitoring the performance of HDFS replication policies
Hive/Impala replication
Host selection for Hive/Impala replication
Hive tables and DDL commands
Replication of parameters
Hive replication in dynamic environments
Creating a Hive/Impala replication policy
Sentry to Ranger replication for Hive replication policies
Replication of Impala and Hive User Defined Functions (UDFs)
Monitoring the performance of Hive/Impala replication policies
Enabling, disabling, or deleting a replication policy
Replicating data to Impala clusters
Enabling replication between clusters with Kerberos authentication
Ports
Considerations for realm names
HDFS, Hive, and Impala replication
Kerberos connectivity test
Copying data between a secure and an insecure cluster using DistCp and WebHDFS
Kerberos setup guidelines for DistCp between secure clusters
Replication of encrypted data
Encrypting data in transit between clusters
Security considerations
Snapshots
Cloudera Manager snapshot policies
Managing snapshot policies
Snapshots history
Orphaned snapshots
Managing HDFS snapshots
Browsing HDFS directories
Enabling and disabling HDFS snapshots
Taking and deleting HDFS snapshots
Restoring snapshots
Using snapshots with replication
Hive/Impala replication using snapshots
Use DistCp to migrate HDFS data from HDP to CDP
Using DistCp to migrate data from secure HDP to unsecure CDP
Step 1: Enabling hdfs user to run YARN jobs
Step 2: Configuration changes on the CDP cluster
Step 3: Running the DistCp job on the HDP cluster
Using DistCp to migrate data from secure HDP to secure CDP
Step 1: Configuration changes on HDP and CDP clusters
Step 2: Configuring a user to run YARN jobs on both clusters
Step 3: Running DistCp job on CDP cluster