Preparing to create Iceberg replication policies

Before you create an Iceberg replication policy, you must complete the prerequisites. Iceberg replication policies can replicate Iceberg V2 tables, created using Spark (read-only with Impala), between Cloudera Base on premises 7.1.9 or higher clusters using Cloudera Manager 7.11.3 or higher versions. Starting from Cloudera Base on premises 7.3.1, Replication Manager can also replicate V1 and V2 Iceberg tables created using Hive.

You can use one or more Iceberg replication policies to replicate a database from the source cluster to the target cluster. And, you must ensure that you replicate the database only from the source to the target to maintain a single source of truth for the database.
  • Ensure that the source cluster and target cluster versions are Cloudera Base on premises 7.1.9 or higher using Cloudera Manager version 7.11.3 or higher versions.
  • Ensure that the source and target clusters have the same Cloudera Manager major version.
  • Activate the Iceberg Replication parcel. The parcel might be included in your Cloudera Runtime distribution or in a separate distribution. For more information, contact your Cloudera account team.
  • Add the Iceberg Replication service on both the clusters.
    To add a service, go to the Cloudera Manager > Clusters > [***CLUSTER NAME***] page and click Actions > Add Service. For more information, see Adding a Service.
  • Ensure that you have the Atlas user credentials in addition to the Replication Administrator or Full Administrator roles to replicate Atlas metadata. The atlas user must also have relevant read and write permissions to the staging locations.