Preparing for an upgrade

Upgrading CDP Public Cloud consists of two major steps: upgrading the Data Lake within an environment and then upgrading the attached Data Hubs. Currently the Data Hubs must run the same version of Cloudera Runtime as the Data Lake.

You should also periodically upgrade the environment (FreeIPA cluster) to ensure that you are running the latest security patches, but this is not required at the same time as upgrading the Data Lake and Data Hubs.

Data Lake upgrade workflow

Pre-upgrade tasks

  1. Review the Data Lake upgrade requirements and limitations.
  2. Carefully review the differences between the types of upgrades and what they entail.
  3. Check the Data Lake support matrix for upgrade to verify which Runtime versions you can upgrade to and from.
  4. If you have not configured your Data Lake for backup and restore, you will need to do so. The backup and restore process is integrated into the upgrade flow automatically, but successful upgrade requires that the correct IAM policies exist on the DATALAKE_ADMIN_ROLE and RANGER_AUDIT_ROLE (for backup), and the DATALAKE_ADMIN_ROLE, RANGER_AUDIT_ROLE, and LOG_ROLE (for restore).

    If your roles are not configured correctly, CDP will not be able to write the backup to the BACKUP_LOCATION_BASE path of your cloud storage.

  5. If you are performing the backup manually (as opposed to the integrated backup available during the upgrade process), you can launch the Data Lake backup from the UI or CLI. When using the CLI, you can specify to skip certain backup actions (skip HMS, Atlas metadata or Ranger audit log index backup). You can monitor the backup process using the CLI.
Data Lake upgrade tasks
  1. Perform the upgrade, either through the UI or CLI. The type of upgrade that you perform will depend on whether a new version of Runtime or only a hotfix is available. A new OS image may also be available to upgrade to.
  2. If the upgrade fails, check the logs for manual troubleshooting info. You can also recover from failed upgrades.
  3. When the upgrade succeeds, proceed to upgrading the attached Data Hubs.

Data Hub upgrade workflow

Pre-upgrade tasks

  1. Carefully review the differences between the types of upgrades and what they entail.
  2. Check that the cluster that you want to upgrade is supported.

Major/minor Runtime version upgrade tasks

  1. Backup the cluster data and CM configurations.
  2. Perform the upgrade, either through the UI or CLI. (Operational database clusters have a different process.)
  3. Complete any post-upgrade tasks for the type of cluster that you upgraded.
  4. For DE clusters, use the CM UI to add any configs that were not added during the upgrade.
  5. If the upgrade fails, check the Event log and the troubleshooting section.
  6. Complete any post-upgrade tasks for the type of cluster that you upgraded.
  7. For DE clusters, use the CM UI to add any configs that were not added during the upgrade.
  8. If the upgrade fails, check the Event log and the troubleshooting section.

Maintenance (hotfix) upgrade tasks

  1. Backup the cluster data and CM configurations.
  2. Perform the upgrade, either through the UI or CLI.
  3. If the upgrade fails, check the Event log and the troubleshooting section.

OS upgrade tasks

  1. Review the Before you begin section to verify that there is no data belonging to NiFi or NiFi Registry on the root disk of the VM. Note that during an OS upgrade, any data on the root volume (parcels, service logs, custom software) will be lost.
  2. Unlike the Data Lake upgrade, OS upgrades are not integrated in the larger upgrade flow and must be performed separately, either through the UI or CLI.

FreeIPA upgrades

When a FreeIPA upgrade is available, upgrade the FreeIPA cluster to ensure that the nodes are running the latest OS-level security patches.