HBase Migration through Replication Manager

Apache HBase is a scalable, distributed, column-oriented datastore that provides real-time read/write random access to very large datasets hosted on HDFS. In CDP Operational Database or COD you use Apache HBase as a datastore with HDFS and/or S3/ABFS providing the storage infrastructure.

You can use one of the following methods to replicate HBase data based on your requirements:

Table 1. Replication methods
Methods Description When to use
Cloudera Replication Plugin for cluster versions that Replication Manager does not support. You can prepare your data for migration, then set up the replication plugin and use a snapshot to migrate your data. The following list consolidates all the minimum supported versions of source and target cluster combinations for which you can use the replication plugin to replicate HBase data. The replication plugin is compatible with all the CDP Public Cloud releases.

For information about use cases that are not supported by Replication Manager, see support matrix.

Replication Manager You can use Replication Manager to migrate HBase data that uses HBase replication through HBase replication policy. When the source cluster and target cluster meet the requirements of supported use cases. See caveats.

The following list consolidates all the minimum supported versions of source and target cluster combinations for which you can use HBase replication policies to replicate HBase data.

See support matrix for more information.