Configuring Fault Tolerance
HBase Cluster Replication for Geographic Data Distribution

HBase provides a cluster replication mechanism that keeps the state of one cluster synchronized with that of another. The source cluster's write-ahead log (WAL) is used to propagate changes to the destination cluster.

Use cases for cluster replication include the following:

  • Backup and disaster recovery

  • Data aggregation

  • Geographic data distribution, such as across multiple data centers

  • Online data ingestion combined with offline data analytics


Replication is enabled at the granularity of the column family. Before enabling replication for a column family, create the table and all column families to be replicated on the destination cluster.
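
The following is a minimal sketch of the two ideas above: changes travel by replaying the source's log, and only column families with replication enabled are shipped. It is a toy in-memory model, not HBase code. The names ReplicationSketch, Cluster, WalEdit, enableReplication, and replicate are hypothetical and do not correspond to HBase classes or shell commands, and the table and family names are invented for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Toy model only: none of these types are HBase classes.
final class ReplicationSketch {

    // One logged change, identified by table, column family, and row.
    static final class WalEdit {
        final String table, family, row, value;

        WalEdit(String table, String family, String row, String value) {
            this.table = table;
            this.family = family;
            this.row = row;
            this.value = value;
        }
    }

    // A minimal stand-in for a cluster: schema, stored cells, and a log.
    static final class Cluster {
        final Map<String, Set<String>> schema = new HashMap<>();
        final Map<String, String> cells = new HashMap<>();
        final List<WalEdit> wal = new ArrayList<>();
        // "table:family" entries whose edits are shipped to the peer.
        final Set<String> replicatedFamilies = new HashSet<>();

        void createTable(String table, String... families) {
            schema.put(table, new HashSet<>(Arrays.asList(families)));
        }

        void enableReplication(String table, String family) {
            replicatedFamilies.add(table + ":" + family);
        }

        boolean hasFamily(String table, String family) {
            Set<String> families = schema.get(table);
            return families != null && families.contains(family);
        }

        // Validate, append the edit to the log, then apply it locally.
        void put(String table, String family, String row, String value) {
            if (!hasFamily(table, family)) {
                throw new IllegalStateException("Missing " + table + ":" + family);
            }
            WalEdit edit = new WalEdit(table, family, row, value);
            wal.add(edit);
            apply(edit);
        }

        // Store the edit's value; fails if the table or family is absent.
        void apply(WalEdit e) {
            if (!hasFamily(e.table, e.family)) {
                throw new IllegalStateException("Missing " + e.table + ":" + e.family);
            }
            cells.put(e.table + "/" + e.family + "/" + e.row, e.value);
        }

        String get(String table, String family, String row) {
            return cells.get(table + "/" + family + "/" + row);
        }
    }

    // Replays the whole source log on the destination, shipping only edits
    // whose column family has replication enabled. Returns the number shipped.
    static int replicate(Cluster source, Cluster destination) {
        int shipped = 0;
        for (WalEdit e : source.wal) {
            if (source.replicatedFamilies.contains(e.table + ":" + e.family)) {
                destination.apply(e); // throws if the destination lacks the family
                shipped++;
            }
        }
        return shipped;
    }

    public static void main(String[] args) {
        Cluster source = new Cluster();
        Cluster destination = new Cluster();
        source.createTable("events", "raw", "scratch");
        destination.createTable("events", "raw"); // created before replication starts
        source.enableReplication("events", "raw");
        source.put("events", "raw", "row1", "a");
        source.put("events", "scratch", "row1", "tmp");
        int shipped = replicate(source, destination);
        System.out.println(shipped + " edit(s) shipped; raw/row1 = "
                + destination.get("events", "raw", "row1"));
    }
}
```

In this model, the edit to the scratch family stays on the source because replication was never enabled for it. Calling replicate against a destination that lacks the events table fails, which illustrates why the table and its column families are created on the destination first.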