Handling upgrade failures for Cloudera Data Engineering
If your upgrade of Cloudera Data Engineering fails, you can clone the service using the latest version of Cloudera Data Engineering. Learn how to handle an upgrade failure.
During a Cloudera Data Engineering upgrade, a backup is created as part of the upgrade preparation process. This procedure restores that backup to a new cluster.
The list of service backups is available in the Backup Library. To locate the Backup Library, in the left navigation menu of Cloudera Data Engineering select Administration, select Service Details, select the Maintenance tab, and select Backup Library.
To obtain the list of all available backups, run the following command in the CDP CLI:
cdp de list-backups
To obtain the list of service backups associated with a specific Cloudera environment, run:
cdp de list-backups --filter "environment(eq)[***CDP ENVIRONMENT NAME***]"
The Cloudera Data Engineering backup includes the following:
- Cloudera Data Engineering Service configurations
- Virtual cluster names
- Virtual cluster configurations
- Virtual cluster file-based resources
- Spark job definitions
- Airflow job definitions
- Spark Python-env resources
- Non file-based resources, for example, Python-venv resources and custom runtimes
- Airflow custom operators & libraries
- Logs
- Job run history
- Endpoints
Before you begin:
- Ensure that the catchup option is not enabled for any user's Airflow jobs. If the catchup option is enabled on any Airflow DAG, disable it manually before the backup starts.
- By default, the restored Cloudera Data Engineering service is assigned the name of the original backed-up service, and a new service ID is generated. To prevent the restore from failing due to a naming conflict, choose one of the following options:
  - Delete the original service that failed to upgrade during the Cloudera Data Engineering upgrade.
  - Rename the service using the --service-name option.
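Before you pick a backup to restore, it can help to confirm which backups exist for the affected environment. The following is a minimal sketch that wraps the two list-backups commands described earlier. It assumes that the filter value is the environment name substituted for the [***CDP ENVIRONMENT NAME***] placeholder. The function names build_backup_filter and list_backups are hypothetical helpers, not part of the CDP CLI.

```shell
#!/usr/bin/env bash
# Sketch only: helper functions around "cdp de list-backups".
# build_backup_filter and list_backups are hypothetical names used for illustration.

# Build the filter expression in the documented form: environment(eq)<environment name>
build_backup_filter() {
  printf 'environment(eq)%s' "$1"
}

# With no argument, list every available backup.
# With an environment name, list only the backups for that environment.
list_backups() {
  if [ -z "${1:-}" ]; then
    cdp de list-backups
  else
    cdp de list-backups --filter "$(build_backup_filter "$1")"
  fi
}
```

For example, after sourcing these functions, calling list_backups my-cdp-env (where my-cdp-env is a placeholder environment name) runs the filtered command, and calling list_backups with no argument lists all backups.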