Replicating Hive nested tables

CDP Public Cloud Replication Manager does not support Hive nested tables. What do I do if there are Hive nested tables in the source cluster?

CDP Public Cloud Replication Manager does not support Hive nested tables for replication. Therefore, it is recommended that you move the nested tables to a different location in HDFS and then replicate Hive external tables. However, if this is not possible, you can perform the following steps in the given order as a workaround.

Solution

  1. Create a Hive replication policy on the target cluster. Ensure that the Additional Settings > Replication Option > Metadata only option is selected to replicate the metadata of required files and directories.
  2. Create a HDFS replication policy on the source cluster to replicate the table data.