HBase Cluster Replication for Geographic Data Distribution
HBase provides a cluster replication mechanism that keeps one cluster’s state synchronized with that of another cluster by using the write-ahead log (WAL) of the source cluster to propagate changes.
Use cases for cluster replication include the following:
Backup and disaster recovery
Geographic data distribution, such as across data centers
Online data ingestion combined with offline data analytics
Replication is enabled at the granularity of the column family. Before enabling replication for a column family, create the table and all column families to be replicated on the destination cluster.
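The order of operations described above can be sketched as an HBase shell session. This is a minimal illustration, not a complete procedure: the table name 'usertable', the column family 'cf1', the peer ID '1', and the ZooKeeper hosts are hypothetical placeholders, and the exact steps may vary with your HBase version and configuration.

```
# On the DESTINATION cluster: create the table and the column family
# that will receive replicated edits ('usertable' and 'cf1' are placeholders).
create 'usertable', 'cf1'

# On the SOURCE cluster: register the destination as a replication peer.
# The cluster key has the form <ZooKeeper quorum>:<client port>:<znode parent>;
# the hosts below are assumed values for illustration only.
add_peer '1', CLUSTER_KEY => "zk1.example.com,zk2.example.com,zk3.example.com:2181:/hbase"

# On the SOURCE cluster: enable replication for the column family
# by setting its replication scope to 1.
# (Some older versions may require disabling the table before altering it.)
alter 'usertable', {NAME => 'cf1', REPLICATION_SCOPE => '1'}

# Confirm that the peer is registered.
list_peers
```

Because replication is scoped per column family, any other column families in the same table keep the default scope of 0 and are not replicated unless they are altered in the same way.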