Before you create an Iceberg replication policy, you must complete the prerequisites.
Iceberg replication policies can replicate Iceberg V2 tables created using Spark (read-only
with Impala) between Cloudera Base on premises 7.1.9 or higher
clusters that use Cloudera Manager 7.11.3 or higher. Starting from
Cloudera Base on premises 7.3.1, Replication Manager can
also replicate V1 and V2 Iceberg tables created using Hive.
You can use one or more Iceberg replication policies to
replicate a database from the source cluster to the target cluster. To maintain a single
source of truth for the database, ensure that you replicate it in one direction only,
from the source to the target.
Ensure that the source and target clusters run Cloudera Base on premises 7.1.9 or higher and use Cloudera Manager 7.11.3 or higher.
Ensure that the source and target clusters have the same Cloudera Manager major
version.
Activate the Iceberg Replication parcel. The parcel might be included in your
Cloudera Runtime distribution or in a separate distribution. For more
information, contact your Cloudera account team.
Add the Iceberg Replication service on both
clusters.
To add a service, go to the Cloudera Manager > Clusters > [***CLUSTER NAME***] page and click Actions > Add Service. For more information, see Adding a Service.
To replicate Atlas metadata, ensure that you have the Atlas user credentials in addition to the
Replication Administrator or Full Administrator
role. The atlas user must also
have the relevant read and write permissions on the staging locations.
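The version prerequisites listed above can be checked before you create a policy. The following Python snippet is a minimal sketch, not part of Replication Manager: the function names and the dictionary shape (`runtime` and `cm` keys) are hypothetical, and it assumes that you have already collected the version strings yourself and that they are plain dotted numbers such as `7.1.9`.

```python
# Minimum versions taken from the prerequisites above.
MIN_RUNTIME = (7, 1, 9)   # Cloudera Base on premises
MIN_CM = (7, 11, 3)       # Cloudera Manager


def parse_version(text):
    """Convert a dotted version string such as '7.11.3' into a tuple of ints.

    Assumption: the string contains only numbers separated by dots.
    """
    return tuple(int(part) for part in text.split("."))


def check_version_prerequisites(source, target):
    """Return a list of problems; an empty list means the version checks pass.

    `source` and `target` are hypothetical dicts with two keys:
    'runtime' (Cloudera Base on premises version) and 'cm' (Cloudera Manager version).
    """
    problems = []
    for label, cluster in (("source", source), ("target", target)):
        if parse_version(cluster["runtime"]) < MIN_RUNTIME:
            problems.append(f"{label} cluster version {cluster['runtime']} is below 7.1.9")
        if parse_version(cluster["cm"]) < MIN_CM:
            problems.append(f"{label} Cloudera Manager version {cluster['cm']} is below 7.11.3")

    # Both clusters must use the same Cloudera Manager major version.
    if parse_version(source["cm"])[0] != parse_version(target["cm"])[0]:
        problems.append("source and target Cloudera Manager major versions differ")
    return problems


if __name__ == "__main__":
    issues = check_version_prerequisites(
        {"runtime": "7.1.9", "cm": "7.11.3"},
        {"runtime": "7.3.1", "cm": "7.13.1"},
    )
    print(issues or "Version prerequisites are met.")
```

The sketch covers only the version checks. The parcel activation, the Iceberg Replication service, and the Atlas credentials and permissions still need to be verified in Cloudera Manager.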