Automatically restoring the environment

You can automatically reactivate the entire environment using the CLI, which includes your cluster. Automatic restoration enables all settings of the environment that you backed up.

An environment-level automated restoration CLI option restores the environment, the deployed Database Catalog, Virtual Warehouse, and Data Visualization entities. If a Virtual Warehouse or a Data Visualization object is not present on the cluster, but the backup file contains it, the Virtual Warehouse or Data Visualization object will be restored to the cluster. If such an entity is already deployed, no changes or configuration updates will take place.

The CLI dw restore-cluster command can be used in the following ways:
  • Passing the environment’s Cloudera resource name (crn) will activate the cluster from the backup file and restore all the entities and database contents.
  • Passing an activated environment identifier will restore all the entities and database contents to the running environment. This is useful when you need to change activation parameters, but requires manual reactivation.
Automatic restoration consists of the following operations in the order shown here:
  • Activates the environment and waits for infrastructure creation
  • Applies the cluster services and sets up the environment
  • Creates the default Database Catalog
  • Updates the Database Catalog configuration to apply customer configuration customizations
  • Starts the Hue database restore job in the database catalog namespace asynchronously
  • Deploys the Virtual Warehouse instances
  • Deploys the Data Visualization instances
  • Starts the Data Visualization restore job in the individual namespaces asynchronously

Before you begin the restoration of Hue, if you cannot bring down the cluster, use the recommended workaround to disable end user access to the cluster endpoints. The automatic restoration process does not wait for the database operations to be finished. You must monitor the status of the jobs using the operation id to make sure the process finishes. For more information, see Monitoring Hue and Data Visualization restoration and Monitoring environment restoration.

Details about the restore process

The restore process is designed to be an idempotent process, it can be restarted as many times as you want. If the environment is activated and healthy, you can run the restore operation multiple times to restore the Virtual Warehouse and Data Visualization objects. For every restore operation, the Hue database restore will run. This operation will overwrite the Hue database contents. If a Virtual Warehouse or a Data Visualization object is not present on the cluster, but the backup file contains it, it will be restored to the cluster. In case such an entity is already deployed, no changes or configuration updates will take place.