Hive and Impala data

Hive and Impala data generally reside on HDFS. Hive and Impala replication enables you to copy your Hive metastore and data from one cluster to another. You can synchronize the Hive metastore and data on the destination cluster with the source, based on a specified replication policy.

There are two mechanisms to replicate Hive data on HDFS. The mechanism you choose depends on whether your tables are defined as managed or external.

The following table lists each table type and the replication method to use for it:

Table Type                      Replication Method
Hive 3 Managed Transactional    Hive built-in replication
Hive 3 External                 Replication Manager Hive replication policies
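
The choice described above can be sketched as a simple lookup. This is a minimal illustration only, not part of any Hive or Replication Manager API: the names TableType, REPLICATION_METHOD, and replication_method are hypothetical, and the mapping simply restates the table above.

```python
from enum import Enum


class TableType(Enum):
    # Hypothetical labels for the two table types listed in the table above.
    MANAGED_TRANSACTIONAL = "Hive 3 Managed Transactional"
    EXTERNAL = "Hive 3 External"


# Restates the table above; this mapping is illustrative, not a product API.
REPLICATION_METHOD = {
    TableType.MANAGED_TRANSACTIONAL: "Hive built-in replication",
    TableType.EXTERNAL: "Replication Manager Hive replication policies",
}


def replication_method(table_type: TableType) -> str:
    """Return the replication method listed for the given table type."""
    return REPLICATION_METHOD[table_type]


if __name__ == "__main__":
    # Print the method for every table type in the lookup.
    for table_type in TableType:
        print(f"{table_type.value}: {replication_method(table_type)}")
```

In practice you would first determine whether each table is managed or external, and then pick the corresponding method.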