Requirements while using CDH on-premise clusters

While using Replication Manager with CDH on-premise clusters, you must be aware about certain requirements.

Note the following:
  • The CDH on-premise cluster on Cloudera Manager instance must be registered on the Management Console. For more information, see Add a CDH Cluster.
  • You must upgrade to Cloudera Manager version 6.3 and above to use Replication Manager service.
  • You must plan to use CDH clusters version 5.13x and above.
  • While performing HDFS replication, you must ensure that the Replication Manager service interacts with classic cluster registered Cloudera Manager instance.
  • While performing Hive replication, you must ensure that the Replication Manager service interacts with the Data Lake Cloudera Manager instance and vice-versa.
  • For HDFS replication, you must ensure that you add an external account in the Cloudera Manager instance. You must also verify if the account has access to the bucket, where the HDFS data gets copied. For more information, see How to Configure AWS Credentials in the Cloudera Manager documentation.
  • For Hive replication, in additional to adding an external account in the Cloudera Manager instance, you must add an IAM external account with bdr as the username in the Data Lake cluster Cloudera Manager instance. For more information, see IAM Role-based Authentication in the Cloudera Manager documentation.
  • Additionally, for Hive replication, you must add classic cluster Cloudera Manager as a source in the Data Lake Cloudera Manager instance. For more information, see Designating a Replication Source in the BDR documentation.