Hive/Impala Replication with Snapshots

If you use Hive replication, Cloudera recommends that you make the Hive warehouse directory snapshottable.

The Hive warehouse directory resides in HDFS at the location specified by the hive.metastore.warehouse.dir property. The default location is /user/hive/warehouse.

To access the hive.metastore.warehouse.dir property, perform the following steps:
  1. Open Cloudera Manager and browse to the Hive service.
  2. Click the Configuration tab.
  3. In the Search box, type hive.metastore.warehouse.dir.

    The Hive Warehouse Directory property appears.
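
Once you know the warehouse path, you can also mark it snapshottable from the command line. The following is a minimal sketch rather than a prescribed procedure: it only assembles the standard hdfs dfsadmin -allowSnapshot command for a given path. The helper name allow_snapshot_command is hypothetical, and the sketch assumes that the hdfs client is on the PATH and that the command is run by an HDFS superuser.

```python
import shlex
from typing import List


def allow_snapshot_command(directory: str) -> List[str]:
    """Build the argv that marks an HDFS directory as snapshottable.

    Hypothetical helper for illustration; it only constructs the command
    and does not contact the cluster.
    """
    if not directory.startswith("/"):
        raise ValueError("expected an absolute HDFS path")
    return ["hdfs", "dfsadmin", "-allowSnapshot", directory]


# Default warehouse location; substitute the value shown in Cloudera Manager.
cmd = allow_snapshot_command("/user/hive/warehouse")
print(shlex.join(cmd))  # prints: hdfs dfsadmin -allowSnapshot /user/hive/warehouse
```

To execute the command rather than print it, you could pass cmd to subprocess.run(cmd, check=True) on a host with a configured HDFS client.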

If you use external tables in Hive, also make snapshottable every directory that hosts external tables stored outside the Hive warehouse directory.
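
To identify which external table directories fall outside the warehouse, a small path comparison may help. The sketch below is illustrative only: the function names and the sample locations are hypothetical, and it assumes that table locations are supplied as plain absolute HDFS paths (with any hdfs://... prefix already removed), for example as collected from DESCRIBE FORMATTED output.

```python
from posixpath import normpath


def is_under(path: str, parent: str) -> bool:
    """Return True if path equals parent or lies beneath it."""
    path, parent = normpath(path), normpath(parent)
    return path == parent or path.startswith(parent.rstrip("/") + "/")


def dirs_needing_snapshots(table_locations, warehouse_dir="/user/hive/warehouse"):
    """Return the sorted, de-duplicated locations outside the warehouse directory."""
    return sorted(
        {normpath(loc) for loc in table_locations if not is_under(loc, warehouse_dir)}
    )


# Hypothetical table locations used only for illustration.
locations = [
    "/user/hive/warehouse/sales.db/orders",  # inside the warehouse
    "/data/external/clicks",                 # outside the warehouse
    "/data/external/clicks/",                # same directory, trailing slash
    "/user/hive/warehouse_archive/logs",     # shares a prefix but is outside
]
print(dirs_needing_snapshots(locations))
# prints: ['/data/external/clicks', '/user/hive/warehouse_archive/logs']
```

The comparison appends a trailing slash to the warehouse path before matching, so a sibling directory such as /user/hive/warehouse_archive is not mistaken for a subdirectory of the warehouse.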

Similarly, if you replicate Impala tables using Hive/Impala replication, ensure that the storage locations of those tables and their associated databases are also snapshottable.
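
When database and table locations overlap, keep in mind that HDFS does not allow nested snapshottable directories, and that a snapshot of a directory covers everything beneath it. One possible approach, sketched below under those assumptions, is to reduce the collected locations to their topmost directories and make only those snapshottable. The function name and the sample paths are hypothetical.

```python
from posixpath import normpath


def topmost_directories(locations):
    """Drop every location that is already covered by another location above it."""
    roots = []
    # A parent path always sorts before its children, so one pass is enough.
    for loc in sorted({normpath(l) for l in locations}):
        covered = any(
            loc == root or loc.startswith(root.rstrip("/") + "/") for root in roots
        )
        if not covered:
            roots.append(loc)
    return roots


# Hypothetical database and table locations used only for illustration.
impala_locations = [
    "/data/impala/sales.db",
    "/data/impala/sales.db/orders",
    "/data/impala/sales.db/customers",
    "/data/impala/web.db/clicks",
]
print(topmost_directories(impala_locations))
# prints: ['/data/impala/sales.db', '/data/impala/web.db/clicks']
```

In this example the two tables under sales.db need no separate action, because making /data/impala/sales.db snapshottable already covers them.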