Replication Manager

Replication Manager is a service that copies and migrates data from CDH clusters to CDP Public Cloud. It also lets you replicate data across data centers for disaster recovery. You can migrate Hive, Impala, and HDFS workloads to CDP Public Cloud. A replication can include data stored in HDFS, data stored in Hive tables, Hive metastore data, and the Impala metadata (catalog server metadata) associated with Impala tables that are registered in the Hive metastore.

You can use the Replication Manager service to replicate data from CDH clusters to CDP Public Cloud clusters that use Amazon S3 or Microsoft Azure ADLS Gen2 (ABFS) as cloud storage. To migrate data, the source cluster must run Cloudera Manager 6.3.0 or higher and CDH 5.13 or higher. For information about the support matrix, see Support matrix for Replication Manager on CDP Public Cloud.
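
As an illustration of the version requirements above, the following minimal sketch checks a pair of version strings against the stated minimums. It is not part of Replication Manager or the Cloudera Manager API; the function names (parse_version, meets_minimum, source_cluster_supported) are hypothetical, and the sketch assumes plain dotted numeric versions such as "6.3.0". The support matrix remains the authoritative source.

```python
# Hypothetical preflight check; names and structure are illustrative only.
# Minimums are taken from the text: Cloudera Manager 6.3.0+ and CDH 5.13+.
MIN_CLOUDERA_MANAGER = (6, 3, 0)
MIN_CDH = (5, 13)


def parse_version(text):
    """Turn a dotted numeric version string such as "6.3.0" into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(version, minimum):
    """Return True if the version string is at least the minimum tuple."""
    parsed = parse_version(version)
    # Pad both tuples with zeros so that "6.3" compares equal to "6.3.0".
    width = max(len(parsed), len(minimum))
    padded_version = parsed + (0,) * (width - len(parsed))
    padded_minimum = minimum + (0,) * (width - len(minimum))
    return padded_version >= padded_minimum


def source_cluster_supported(cm_version, cdh_version):
    """Check both source-cluster minimums stated in the text."""
    return (meets_minimum(cm_version, MIN_CLOUDERA_MANAGER)
            and meets_minimum(cdh_version, MIN_CDH))


if __name__ == "__main__":
    print(source_cluster_supported("6.3.0", "5.13.0"))
    print(source_cluster_supported("6.2.1", "5.16.2"))
```

Comparing integer tuples rather than raw strings avoids ordering mistakes such as treating "5.9" as newer than "5.13".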

Replication Manager supports the following features:
  • HDFS replication
  • Hive metadata replication
  • Hive external table replication
  • Table-level replication
  • Sentry to Ranger replication. To replicate Sentry policies, the source cluster must run the Sentry service on CDH 5.12 or higher, or on any CDH 6.x version. The Ranger version running on your cloud cluster must be 3.1.