Prerequisites for inbuilt CDSW migration tool

Before migrating from Cloudera Data Science Workbench (CDSW) to Cloudera AI in Cloudera on premises, you must meet a number of prerequisites to succeed. A prerequisite for migration is the installation of Cloudera AI on your Cloudera Data Science Workbench base cluster.

The following table presents the supported migration version combinations for Cloudera Data Science Workbench and Cloudera AI:

Table 1. Supported migration versions for Cloudera Data Science Workbench and Cloudera AI
CDSW versions supported for migration Target Cloudera AI on premises version

CDSW 1.10.0- 1.10.4

Upgrade to CDSW 1.10.5 before migrating to Cloudera AI.

1.5.5 SP1 (recommended version)

1.5.5 CHF1

1.5.4 SP2 CHF1

CDSW 1.10.5

1.5.5 SP1 (recommended version)

1.5.5 CHF1

1.5.4 SP2 CHF1

Migration from CDSW, configured with LDAP, SAML, or LOCAL authentication, to Cloudera AI, is supported, the automatic migration is supported only if CDSW is running with LDAP or SAML. The migration process does not automatically migrate your authentication configurations. Therefore, setting up LDAP or SAML in CDSW before the migration is part of the migration procedure.

The migration does not migrate your CDSW endpoint connections. Therefore, post-migration instructions include setting up LDAP, SAML endpoint connections, and DNS on Cloudera AI, so you can upload them after migration to Cloudera AI.

  1. Ensure you have a CDSW 1.10.0 or higher version cluster in Cloudera. Otherwise, choose one of the following options:
    • If you have a CDSW installation in either CDH or HDP, migrate to on premises 1.5.1 or higher versions, and then migrate CDSW to Cloudera AI.
    • If you have CDSW installation earlier than 1.10.0, upgrade to CDSW 1.10.0 or higher versions.
  2. Ensure that LDAP or SAML is configured in your CDSW cluster on Cloudera. If LDAP or SAML is not yet configured, set up LDAP or SAML before the pre-migration tasks. For guidelines on setting up LDAP and SAML, see Configuring External Authentication with LDAP and SAML.
    The migration process cannot succeed without authentication.
  3. Meet the Cloudera AI software requirements for on premises, including storage, for installing Cloudera AI on Cloudera on premises 1.5.1 or higher versions. For Cloudera AI on premises software requirements, see Cloudera AI software requirements for Cloudera.
  4. Backup CDSW data. For instructions on backing up CDSW data, see Backup and Disaster Recovery for Cloudera Data Science Workbench.
  5. In CDSW, export your Grafana dashboards. For instructions on exporting Grafana dashboards, see Export and import | Grafana documentation.
  6. Take notes of the connections of endpoints in your CDSW cluster and consider your custom settings.
    You must use this information after migration to set up endpoints in your on premises cluster.
  7. Take notes of your custom settings, if you have customized your DNS configuration, to be able to customize your DNS configuration after migration.
    If you did not customize your DNS configuration, the migration tool configures DNS in your on premises cluster.
  8. Gather information about your LDAP or SAML configurations on CDSW.
    After migration, you must set up LDAP or SAML again on the Cloudera AI cluster as the LDAP or SAML configuration is not migrated.
  9. In CDSW, manually back up the custom DNS configuration for Kube-DNS, and then migrate your custom configuration to Cloudera AI.
    Cloudera AI uses the core-DNS, which is incompatible with the CDSW Kube-DNS.
  10. In Cloudera Manager, select Install and Upgrade to Cloudera on premises 1.5.4 or higher versions using the Cloudera Embedded Container Service on your CDSW cluster.
  11. During the installation of Cloudera Data Services on premises using the Cloudera Embedded Container Service, if you select Airgap, set up a network connection between CDSW and the Cloudera on premises cluster.
  12. Enable the Cloudera AI features during installation that you were using in CDSW.
    For example, enable model metrics and monitoring.
    If you do not enable the same, or similar, Cloudera AI features during installation that you were using in CDSW, you will not be able to use the Cloudera AI features.