Migrating from source cluster to target cluster
After registering the source and target cluster, and labeling the scanned datasets and workloads on the source cluster, you can start the migration process.
- S3 Bucket Access, Secret Key, and credential name
For more information about how to generate access and private key, see the Managing access keys documentation.
- Click Migrations on the left navigation pane.
- Click Start Your First Migration.
Select Cloudera Distributed Hadoop 5 or
Cloudera Distributed Hadoop 6 as Source
The registered source cluster is selected by default. You can select any other cluster using the drop-down menu . In case you have not registered a source cluster at this point, click New Source and complete the steps in Registering the source cluster.
CDP Public Cloud and the registered target cluster are selected by default. You can select any other cluster using the drop-down menu. In case you have not registered a source cluster at this point, click New Target and complete the steps in Registering the target cluster.
- Click Next.
- Click Next to confirm the migration path.
Provide the S3 Bucket Access Key and S3
Bucket Secret Key.
The remaining settings on the Configurations page are automatically filled out, but can be changed based on your requirements.
- Click Next.
Select one or more labels to migrate the datasets that were labelled to the
You can select if the migration should Run Now or be completed in a Scheduled Run. Run Now means that all of the datasets and workloads that were selected with the labels are going to be migrated as soon as the process starts. When choosing the Scheduled Run, you can select the start date of the migration, and set a frequency in which the migration process should proceed.
- Click Next.
Review the information on the Overview page and ensure
that the information is correct.
At this point, you can go back and change any configuration if the information is not correct.
- Click Create to save the migration plan.
- Click on the created migration plan on the Migrations page.
Click Run First Step to start the migration.
You can see the status and steps of the migration process.
The Master Table shows a read-only version of the label and the related datasets.
The Data & Metadata Migration executes the data migration of the labeled datasets with Replication Manager.The Hive SQL Migration replicates the Hive SQL queries that were fixed to be Hive complied during the Hive Workload migraton steps.
The Finalization waits until all the Replication Manager policies complete their jobs. If the label is created as a frequently scheduled migration, the Replication Manager waits only for the first jobs.