Data Migration Tools and Methods
Data Migration Tools and Methods Overview
Accelerate Your Migration to Cloudera with Workload Manager or Workload XM
Step 1: Identify Current and Potential Issues
Identifying Workload Problems and Health Issues
Identifying Resource Contention
Identifying Rogue Users from a Workload View
Identifying Resource-Hungry Workloads
Step 2: Create an Optimization Plan
Identifying and Correcting Inefficient SQL Code
Step 3: Capture Your Existing Baselines
Identifying Performance Trends
Use Cloudera Replication Manager to migrate to Cloudera Public Cloud
About Replication Manager
Fine-grained permissions to access Cloudera Replication Manager
Providing role-based access control (RBAC) to Replication Manager users
Accessing Replication Manager UI
Access Replication Manager in Cloudera Public Cloud
Classic Clusters page
Cloud Credentials page
Replication Policies page
How replication policies work
HDFS replication policy
HDFS snapshots
Requirements and benefits of HDFS snapshots
Enabling and taking snapshots in Cloudera Manager
Hive replication policy
Hive replication
Hive tables
Hive cloud replication
Table-level replication
Migrate Sentry authorization policies into Ranger
Sentry to Ranger permissions
HBase replication policy
Supported clusters for HBase replication policies
How HBase replication policies work
Methods to replicate HBase data
Replicate HBase data simultaneously between multiple clusters
Using HDFS replication policies
Preparing to create an HDFS replication policy
Creating an HDFS replication policy
Manage and monitor HDFS replication policies
Using Hive replication policies
Preparing to create a Hive replication policy
Creating a Hive replication policy
Manage and monitor Hive replication policies
Using HBase replication policies
Preparing to create an HBase replication policy
Creating an HBase replication policy
Manage and monitor HBase replication policies
Monitor HBase replication policy job details
Creating triggers and monitoring replication-related metrics in Cloudera Manager
Monitor HBase RegionServer replication peer metrics in Replication Manager
Viewing HBase RegionServer replication peer metrics
Troubleshooting replication policies in Cloudera Replication Manager
Different methods to identify errors related to failed replication policy
Replication Policies page does not display all the replication policies
HDFS replication policy fails due to export HTTPS_PROXY environment variable
Cannot find destination clusters for HBase replication policies
HBase replication policy fails when Perform Initial Snapshot is chosen
Optimize HBase replication policy performance when replicating HBase tables with several TB of data
Partition metadata replication takes a long time to complete
Replicating Hive nested tables
Target HBase folder is deleted when HBase replication policy fails
Replicate HBase data in existing and future tables
Appendix
Support matrix for Cloudera Replication Manager
List of features supported by Cloudera Replication Manager
Replicate data from Cloudera Private Cloud Base and Cloudera Public Cloud source clusters
Replicate data from CDH and HDP source clusters
Cloud credentials to use in Cloudera Replication Manager
Registering Amazon S3 cloud account in Replication Manager
Register Azure cloud credentials in Replication Manager
Registering ABFS cloud account in Replication Manager
Updating Azure Cloud Credentials in Cloudera Manager
Registering GCP credentials to use in Replication Manager
Add IDBroker to use temporary AWS session credentials
How temporary AWS credentials for replication policies work
Authentication methods to use AWS credentials in replication policies
Adding a role instance to IDBroker in Cloudera Manager
Configuring IDBroker to use in replication policies
Adding IDBroker credentials in Cloudera Private Cloud Base
Adding and managing an IDBroker-based external account in Cloudera Manager
Ports for Cloudera Replication Manager
Ports required for HBase replication policies