Backing up and restoring CDW

The backup and restore procedures for AWS and Azure replace the in-place upgrade Cloudera offered for AWS environments. To get the supported Kubernetes version, you back up your old AWS or Azure environment and start up a new environment using the restoration process.

Backup and restore does not replace upgrading your Virtual Warehouse. To pick up the latest engine changes in your Virtual Warehouse, simply upgrade your Database Catalog and Virtual Warehouse in CDW. The backup and restore process saves your environment parameters, making it possible to recreate your environment with the same settings, URL, and connection strings you used in your previous environment.

You can back up and restore CDW one of the following methods:
  • automatic
  • semi-automatic
  • manual

Choosing the right method

You choose the automatic, semi-automatic, or manual method based on the following conditions:

Automatic

Use the automatic backup and restore under one of the following conditions:
  • Your Cloudera Data Warehouse (CDW) runs in an AWS environment using the AWS restricted policy.
  • Your CDW runs in an Azure environment using CDW 1.6.3-b319 (released May 5, 2023) or later.
  • Your private CDW environment in Azure Kubernetes Service is using CDW 1.9.1-b233 (released July 26, 2024) or later.

Semi-automatic backup and restore

Use the semi-automatic procedure under any one of the following conditions:

Manual backup and restore

Use the manual backup and restore under any one of the following conditions:
  • You cannot meet conditions for using either the automatic backup and restore or the semi-automatic backup and restore.
  • You have enabled the MULTI_DEFAULT_DBC entitlement.
  • You have one, or more, custom (non-default) Database Catalogs.
  • You experienced problems using the automatic or semi-automatic backup and restore.

Ensuring a successful restoration

To ensure a successful restoration:
  • Create a fresh backup when you plan your upgrade.
  • Ensure that the backup is of the same CDW Control Plane version with which you plan to restore the files.

For example, if your CDW environment version is 1.8.4-b33, and you wish to upgrade to 1.9.1-b233 (i.e. the CDW Control Plane version is now 1.9.1-b233), then you must create a fresh backup with the CDW Control Plane version of 1.9.1-b233 and use this backup to restore the files after reactivating the environment in CDW. Cloudera recommends that you do not use a backup created with any other/older CDW Control Plane version (1.6.2-b197, for example) during the restoration process.

Check and note the version of your CDW Control Plane (on the bottom left side of the page) before performing these tasks. The CDW Control Plane version is not the same as your environment, Database Catalog, or Virtual Warehouse versions.

Finding the version of your CDW environment

In Cloudera Data Warehouse, select your environment, click Edit. The Environment Details include the version.

Importance of bringing down the cluster

Backing up and restoring CDW requires bringing down the cluster to ensure successful cluster restoration. During downtime, CDW, you must prevent end-users from accessing the cluster. If downtime is not feasible due to your operational model, you can use a workaround that disables end-user access instead of bringing down the cluster.

You lose any manual modification of the Kubernetes objects or configurations when you bring down the cluster. Modifications applied using the CDW UI and settings defined during creation are preserved.