Replicating Iceberg tables stored in FSO buckets
Learn how to replicate Iceberg tables stored in Ozone file system optimized (FSO) buckets.
-
If you are using a secure source cluster, authenticate using the Ozone Manager (OM)
keytab.
kinit -kt /cdep/keytabs/om.keytab om
-
Using the Ozone shell on the source cluster, create a volume and bucket on the target
cluster.
ozone sh volume create [***VOLUME NAME***]
ozone sh bucket create [***VOLUME NAME***]/[***BUCKET NAME***]
-
On the source cluster, go to Cloudera Manager > [***CORE_SETTINGS***] > Configuration and add the mandatory entry for your Cloudera Base on premises version to the Cluster-wide Advanced Configuration Snippet (Safety Valve) for core-site.xml property:
For 7.1.9, add
fs.ofs.impl = org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem
For 7.3.2, add
fs.ofs.impl = org.apache.hadoop.fs.ozone.RootedOzoneFileSystem
- Save and refresh the stale configuration.
-
On the target cluster, go to Cloudera Manager > [***CORE_SETTINGS***] > Configuration and add the mandatory entry for your Cloudera Base on premises version to the Cluster-wide Advanced Configuration Snippet (Safety Valve) for core-site.xml property:
For 7.1.9, add
fs.ofs.impl = org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem
For 7.3.2, add
fs.ofs.impl = org.apache.hadoop.fs.ozone.RootedOzoneFileSystem
- Save and refresh the stale configuration.
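The version-dependent fs.ofs.impl value in the two configuration steps above can be sketched as a small shell helper. This is only an illustration: the function names ofs_impl_for_version and print_ofs_property are hypothetical, only the two versions named above are handled, and the XML output assumes you paste the property in the safety valve's XML view.

```shell
#!/bin/sh
# Hypothetical helper: map a cluster version to the fs.ofs.impl class
# listed in the steps above. Versions not named there are rejected.
ofs_impl_for_version() {
  case "$1" in
    7.1.9) echo "org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem" ;;
    7.3.2) echo "org.apache.hadoop.fs.ozone.RootedOzoneFileSystem" ;;
    *) echo "no fs.ofs.impl value documented for version: $1" >&2; return 1 ;;
  esac
}

# Hypothetical helper: print the property as a core-site.xml fragment.
print_ofs_property() {
  impl="$(ofs_impl_for_version "$1")" || return 1
  printf '<property>\n  <name>fs.ofs.impl</name>\n  <value>%s</value>\n</property>\n' "$impl"
}

# Example: print the fragment for a 7.1.9 cluster.
print_ofs_property 7.1.9
```

Run the helper once per cluster, because the source and target clusters can be on different versions and may therefore need different values.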
-
Create an Iceberg table on the source cluster.
Example:
create table tb1(id int, val int) stored by iceberg location 'ofs://[***OM SERVICE ID***]/[***VOLUME***]/[***BUCKET***]/[***KEY***]';
- Enable the ‘Iceberg on Ozone replication’ feature flag.
- Add the source cluster as a peer before creating the Iceberg replication policy.
-
Create the Iceberg replication policy by providing the following mandatory details in
the Create Iceberg replication policy wizard:
On the General tab, set the Source Storage Filter field to OZONE and configure the rest of the fields as required.
On the Advanced tab, set the Location Mapping field to:
ofs://[***SOURCE OM SERVICE ID***]/[***SOURCE VOLUME***]/[***SOURCE_BUCKET***] ---> ofs://[***TARGET OM SERVICE ID***]/[***TARGET VOLUME***]/[***TARGET_BUCKET***]
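As a rough illustration of the Location Mapping format above, the following shell sketch assembles the string from its six parts. The function names and the sample values (ozone-src, ozone-tgt, vol1, bucket1) are hypothetical placeholders, not values from a real cluster. The " ---> " separator is copied from the format shown above.

```shell
#!/bin/sh
# Hypothetical helper: build one side of the mapping,
# ofs://<OM service ID>/<volume>/<bucket>
ofs_bucket_uri() {
  printf 'ofs://%s/%s/%s' "$1" "$2" "$3"
}

# Hypothetical helper: join the source and target sides with the
# ' ---> ' separator used in the Location Mapping field.
# Arguments: source OM service ID, volume, bucket, then the same for the target.
location_mapping() {
  printf '%s ---> %s\n' "$(ofs_bucket_uri "$1" "$2" "$3")" "$(ofs_bucket_uri "$4" "$5" "$6")"
}

# Example with placeholder names.
location_mapping ozone-src vol1 bucket1 ozone-tgt vol1 bucket1
```

With the placeholder values, the helper prints ofs://ozone-src/vol1/bucket1 ---> ofs://ozone-tgt/vol1/bucket1, which you can compare with the value entered on the Advanced tab.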
