Data Migration Tools and Methods
Data Migration Tools and Methods Overview
Accelerate Your Migration to CDP with Workload Manager or Workload XM
Step 1: Identify Current and Potential Issues
Identifying Workload Problems and Health Issues
Identifying Resource Contention
Identifying Rogue Users from a Workload View
Identifying Resource-Hungry Workloads
Step 2: Create an Optimization Plan
Identifying and Correcting Inefficient SQL Code
Step 3: Capture Your Existing Baselines
Identifying Performance Trends
Use Replication Manager to migrate to CDP Public Cloud
About Replication Manager
Access the Replication Manager service in CDP Public Cloud
Overview page
Classic Clusters page
Cloud Credentials page
Replication Policies page
How replication policies work
HDFS replication policy
HDFS snapshots
Requirements and benefits of HDFS snapshots
Enabling and taking snapshots in Cloudera Manager
Hive replication policy
Hive replication
Hive tables
Hive cloud replication
Table-level replication
Migrate Sentry authorization policies into Ranger
Sentry to Ranger permissions
HBase replication policy
Supported clusters for HBase replication policies
How HBase replication policies work
Methods to replicate HBase data
Using HDFS replication policies
Preparing to create an HDFS replication policy
Creating an HDFS replication policy
Manage and monitor HDFS replication policies
Using Hive replication policies
Preparing to create a Hive replication policy
Creating a Hive replication policy
Manage and monitor Hive replication policies
Using HBase replication policies
Preparing to create an HBase replication policy
Creating an HBase replication policy
Manage and monitor HBase replication policies
Monitor HBase replication policy job details
Monitor HBase RegionServer replication peer metrics in Replication Manager
Viewing HBase RegionServer replication peer metrics
Troubleshooting replication policies in CDP Public Cloud
Different methods to identify errors related to a failed replication policy
HDFS replication policy fails due to export HTTPS_PROXY environment variable
Cannot find destination clusters for HBase replication policies
HBase replication policy fails when Perform Initial Snapshot is chosen
Optimize HBase replication policy performance when replicating HBase tables with several TB of data
Partition metadata replication takes a long time to complete
Replicating Hive nested tables
Target HBase folder is deleted when HBase replication policy fails
Replicate HBase data in existing and future tables
Appendix
Support matrix for CDP Public Cloud Replication Manager
Replicate data from CDP Private Cloud Base and CDP Public Cloud source clusters
Replicate data from CDH and HDP source clusters
Register cloud credentials to use in CDP Public Cloud Replication Manager
Registering an Amazon S3 cloud account in Replication Manager
Register Azure cloud credentials in Replication Manager
Registering an ABFS cloud account in Replication Manager
Updating Azure cloud credentials in Cloudera Manager
Ports for Replication Manager on CDP Public Cloud
Ports required for HBase replication policies
Configuring SSL/TLS certificate exchange between two Cloudera Manager instances