Migrating Data to
CDP One
Overview
Migrating data from CDH to CDP One
Migrating HDFS and Hive data from CDH to CDP One
Migration prerequisites
Ports for Replication Manager on CDP Public Cloud
Setting up an external account
Setting up SSL/TLS certificate exchange
Cloudera license requirements for Replication Manager
Introduction to Replication Manager
Accessing the Replication Manager service
How replication policies work
Replication policy considerations
Working with cloud credentials
Adding cloud credentials
Update cloud credentials
Delete cloud credentials
HDFS data migration from CDH to CDP One
Creating a HDFS replication policy
Verifying HDFS data migration
Hive migration from CDH to CDP One
Creating a Hive replication policy
Verifying Hive data migration
Migrating Oozie workflows from CDH to CDP One
About Migrating Oozie workloads
Migration prerequisites
Setting up an external account
Migrating Hue databases from CDH to CDP One
Performing post-migration tasks
Migrating HDFS native permissions to CDP One
Extracting HDFS native permissions
Converting HDFS native permissions into Ranger HDFS policies
Transforming Ranger HDFS policies into Ranger S3 policies
Importing Ranger AWS S3 policies
Migrating workflows directly created in Oozie to CDP One
Migrating Sentry policies from CDH to CDP One
About Migrating Sentry policies
Migration prerequisites
Setting up an external account
Exporting Sentry permissions
Importing Sentry permissions into Ranger
Migrating data from HDP to CDP One
Migrating HDFS data from HDP to CDP One
Migration prerequisites
About DistCp tool
Using the DistCp tool
Unbanning hdfs user in HDP cluster
Before migrating
HDFS data migration from HDP to CDP One
Migrating HDFS native permissions to CDP One
Extracting HDFS native permissions
Converting HDFS native permissions into Ranger HDFS policies
Transforming Ranger HDFS policies into Ranger S3 policies
Importing Ranger AWS S3 policies
Migrating Ranger policies from HDP to CDP One
About Migrating Ranger policies
Migration prerequisites
Copying Policy Migration utility to the source cluster
Performing Export and Transform operations
About the export operation
Running the export operation
About the transform operation
Running the transform operation
Performing Import operation
Supported Input parameters for Export operation
Supported Input parameters for Transform operation
Migrating Hive data from HDP 2.x or HDP 3.x to CDP One
Migration prerequisites
Setting up Hive JDBC standalone JARS
Saving Hive metastore on HDP by dumping
Taking a mandatory snapshot of HDP tables
Setting up security
Installing and configuring HMS Mirror
Sample YAML configuration file
Testing the YAML and the cluster connection
HMS Mirror command summary
Migrating Hive metadata
HMS Mirror generated files
Verifying metadata migration
Migrating actual Hive data
Adjust AVRO table schema URLs
Verifying actual Hive data migration
Table locations
Fixing statistics
Changes to HDP Hive tables