Restoring Cloudera Data Engineering jobs from backup
You can restore Cloudera Data Engineering (CDE) jobs and associated resources from a backup ZIP file.
Before you begin
- You must have a valid backup file to restore from. For instructions on backing up CDE jobs, see Backing up Cloudera Data Engineering jobs.
- Download and configure the CDE CLI.
Steps
- Run the
cde backup restore
command to restore a backup file to the virtual cluster your CDE CLI is configured to interact with. Use the--duplicate-handling
flag to select the policy for handling duplicate jobs. The possible values are as follows:error
- Return an error if there are duplicate job names, and abort the restore operation. This is the default behavior.
rename
- If a job name in the backup conflicts with an existing job, keep the existing job and rename the restored job by appending a numerical identifier to the job name.
keep-original
- If a job name in the backup conflicts with an existing job, keep the existing job and do not restore the backed up job.
For example:
cde backup restore --local-path archive-2021-11-10T01:24:06.zip --duplicate-handling rename
--remote-path
- If you're using a remote storage, use this flag for relative remote storage path of the backup to be restored. This will restore the archive in the remote object store. The path should include a filename and be relative to dir /dex/backup/.
Example restore for remote storage:
./cde backup restore --remote-path test/archive.zip # the output is similar to list-archive
Result
Validate that the jobs and resources were restored by running
cde job list
and cde resource
list
.