Replication Manager for
CDP Private Cloud Base
Replication Manager Overview
Support matrix for Replication Manager on CDP Private Cloud Base
Data Replication
Cloudera License Requirements for Replication Manager
Replicating Directories with Thousands of Files and Subdirectories
Replication Manager Log Retention
Replicating from Unsecure to Secure Clusters
Designating a Replication Source
Configuring a Peer Relationship
Modifying Peers
Configuring Peers with SAML Authentication
HDFS Replication
Source Data
Network Latency and Replication
Performance and Scalability Limitations
Replication with Sentry Enabled
Guidelines for using snapshot diff-based replication
Configuring Replication of HDFS Data
Limiting Replication Hosts
Viewing Replication Policies
Viewing Replication History
Monitoring the Performance of HDFS Replications
Hive/Impala Replication
Host Selection for Hive/Impala Replication
Hive Tables and DDL Commands
Replication of Parameters
Hive Replication in Dynamic Environments
Configuring Replication of Hive/Impala Data
Sentry to Ranger Replication
Replication of Impala and Hive User Defined Functions (UDFs)
Monitoring the Performance of Hive or Impala Replications
Enabling, Disabling, or Deleting A Replication Policy
Replicating Data to Impala Clusters
Using Snapshots with Replication
Hive/Impala Replication with Snapshots
Enabling Replication Between Clusters with Kerberos Authentication
Ports
Considerations for Realm Names
HDFS, Hive, and Impala Replication
Kerberos Connectivity Test
Kerberos setup guidelines for Distcp between secure clusters (without cross-realm authentication)
Replication of Encrypted Data
Encrypting Data in Transit Between Clusters
Security Considerations
Snapshots
Cloudera Manager Snapshot Policies
Managing Snapshot Policies
Snapshots History
Orphaned Snapshots
Managing HDFS Snapshots
Browsing HDFS Directories
Enabling and Disabling HDFS Snapshots
Taking and Deleting HDFS Snapshots
Restoring Snapshots