Backing up and restoring CDW

The backup and restore procedures for AWS and Azure replace the in-place upgrade Cloudera offered for AWS environments. To get the supported Kubernetes version, you back up your old AWS or Azure environment and start up a new environment using the restoration process.

Backup and restore does not replace upgrading your Virtual Warehouse. To pick up the latest engine changes in your Virtual Warehouse, simply upgrade your Database Catalog and Virtual Warehouse in CDW. The backup and restore process saves your environment parameters, making it possible to recreate your environment with the same settings, URL, and connection strings you used in your previous environment.

You can back up and restore CDW one of the following methods:
  • automatic
  • semi-automatic
  • manual

Choosing the right method

You choose the automatic, semi-automatic, or manual method based on the conditions described below.

Automatic

Use the automatic backup and restore under one of the following conditions:
  • Your Cloudera Data Warehouse (CDW) runs in an AWS environment using the AWS restricted policy.
  • Your CDW runs in an Azure environment using CDW 1.6.3-b319 (released May 5, 2023) or later.
  • Your private CDW environment in Azure Kubernetes Service is using CDW 1.9.1-b233 (released July 26, 2024) or later.

Semi-automatic backup and restore

Use the semi-automatic procedure under any one of the following conditions:

Manual backup and restore

Use the manual backup and restore under any one of the following conditions:
  • You cannot meet conditions for using either the automatic backup and restore or the semi-automatic backup and restore.
  • You have enabled the MULTI_DEFAULT_DBC entitlement.
  • You have one, or more, custom (non-default) Database Catalogs.
  • You experienced problems using the automatic or semi-automatic backup and restore.

Ensuring a successful restoration

You must use the same Cloudera Data Warehouse version to restore files that you used to back up those files. For example, using a backup file from 1.6.2-b197 (released Feb 13, 2023) and using 1.8.4-b33 to restore files will not work. Check and note the version of your Cloudera Data Warehouse (CDW) application version before performing these tasks. The CDW application version is not the same as your cluster, Database Catalog, or Virtual Warehouse versions.

Finding the version of your CDW environment

In Cloudera Data Warehouse, select your environment, click Edit. The Environment Details includes the version.

Importance of bringing down the cluster

Backing up and restoring CDW requires bringing down the cluster to ensure successful cluster restoration. During downtime, CDW, you must prevent end-users from accessing the cluster. If downtime is not feasible due to your operational model, you can use a workaround that disables end user access instead of bringing down the cluster.

You lose any manual modification of the Kubernetes objects or configurations when you bring down the cluster. Modifications applied using the CDW UI and settings defined during creation are preserved.