Synchronize table data using HashTable/SyncTable tool
The HashTable/SyncTable tool can be used for partial or entire table data synchronization, under the same or remote cluster.
- Ensure that all RegionServers/DataNodes on the source cluster is accessible by the NodeManagers on the target cluster where SyncTable job tasks will be running.
- In the case of secured clusters, the user on the target cluster who executes
the SyncTable job must be able to do the following on the HDFS and HBase
services of the source cluster:
- Authenticate: for example, using centralized authentication or cross-realm setup.
- Be authorized: having at least read permission.