Enabling File Storage Replication

File storage replication is not enabled by default. You can enable it when provisioning a new workbench, by modifying the settings of an existing workbench, or when restoring a workbench.

While provisioning a Cloudera AI Workbench

You can select the Enable File Storage Replication option when provisioning a workbench to enable file storage replication.

  1. In the Cloudera console, click the Cloudera AI tile.

    The Cloudera AI Workbenches page is displayed.

  2. Click AI Workbench Backups in the left navigation pane.
  3. Provision a new Cloudera AI Workbench as explained in the Provisioning a new Workbench section.
  4. Ensure that you select the Enable File Storage Replication option. This action automatically creates a replica of the file system in the same region, thereby enhancing data durability and availability.

When using CDP CLI to provision a workbench, set the enableFileSystemReplica value to true within the JSON configuration.

{
  "environmentName": "eng-ml-dev-env-aws",
  "workspaceName": "createReplicass",
  "disableTLS": false,
  "usePublicLoadBalancer": false,
  "privateCluster": false,
  ...
  },
  "subnetsForLoadBalancers": [],
  "skipValidation": true,
  "disableSSO": false,
  "enableFileSystemReplica": true,
  "xEntitlements": [
    "ML_FILESYSTEM_REPLICA"
  ]
}
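
As a minimal sketch of preparing that configuration, the following Python snippet sets the flag on an existing request dictionary and prints the resulting JSON. The helper name with_file_replica and the shortened sample request are illustrative assumptions and are not part of CDP CLI; only the enableFileSystemReplica key and the sample field values come from the JSON configuration above.

```python
import json
from copy import deepcopy


def with_file_replica(request: dict) -> dict:
    """Return a copy of a provisioning request with replication enabled.

    Hypothetical helper: only the enableFileSystemReplica key is taken from
    the documented JSON; all other fields are passed through unchanged.
    """
    updated = deepcopy(request)
    updated["enableFileSystemReplica"] = True
    return updated


# Illustrative subset of the provisioning fields shown in the sample above.
base_request = {
    "environmentName": "eng-ml-dev-env-aws",
    "workspaceName": "createReplicass",
    "disableTLS": False,
}

replicated_request = with_file_replica(base_request)
print(json.dumps(replicated_request, indent=2))
```

The helper copies the request instead of mutating it, so the same base configuration can still be reused for workbenches that do not need a replica.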

Enabling file storage replication in an existing workbench

You can modify the settings of an existing workbench to enable file storage replication.

  1. In the Cloudera console, click the Cloudera AI tile.

    The Cloudera AI Workbenches page is displayed.

  2. Click AI Workbench Backups in the left navigation pane.
  3. From the Actions menu of the workbench, select View Workbench Details.
  4. Select the Enable File Storage Replication option. This action automatically creates a replica of the file system in the same region, thereby enhancing data durability and availability.

The progress of the data replication can be monitored in the Replication Status field. Upon completion of the replication, the replication file system ID, last synchronization time, and replication status are displayed.

When using CDP CLI, run the create-file-replica command, which corresponds to the createFileReplica method:

cdp ml create-file-replica --workspace-crn <value> --profile int
{
  "workspaceCrn":"sample:CRN",
  "xEntitlements": [
    "ML_FILESYSTEM_REPLICA"
  ]
}
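
If you script this step, a small wrapper can assemble the command line before you run it. The sketch below is only an illustration: the function name build_create_file_replica_command is hypothetical, it uses only the --workspace-crn and --profile options that appear in the command above, and it prints the command rather than executing it. Replace sample:CRN and the profile name with your own values.

```python
import shlex
from typing import List, Optional


def build_create_file_replica_command(
    workspace_crn: str, profile: Optional[str] = None
) -> List[str]:
    """Assemble the argument list for the create-file-replica command.

    Hypothetical helper: it only arranges the options shown in the
    documented command and does not call CDP CLI itself.
    """
    if not workspace_crn:
        raise ValueError("workspace_crn must not be empty")
    argv = ["cdp", "ml", "create-file-replica", "--workspace-crn", workspace_crn]
    if profile:
        argv += ["--profile", profile]
    return argv


command = build_create_file_replica_command("sample:CRN", profile="int")
# shlex.join quotes each argument so the line can be pasted into a shell.
print(shlex.join(command))
```

Keeping the arguments as a list avoids shell-quoting problems if you later pass them to subprocess.run.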

While restoring a workbench

When restoring a workbench from a backup or snapshot, you can choose to enable file storage replication. This creates a replica of the restored file system, maintaining high availability and resilience.

  1. In the Cloudera console, click the Cloudera AI tile.

    The Cloudera AI Workbenches page is displayed.

  2. Click AI Workbench Backups in the left navigation pane.
  3. In the Workbench Backups UI, locate the workbench for which you want to enable file storage replication. You can search by workbench name or CRN. A given workbench can have multiple backups.
  4. Click Restore. The Provision Workbench from Backup window is displayed.
  5. In the Provision Workbench from Backup window, provide a name for your workbench.
  6. In Select Environment, choose your AWS environment. The Advanced Options toggle button is displayed.
  7. Toggle the Advanced Options button.
  8. Select the Enable File Storage Replication checkbox.
  9. Click Provision Workbench.

Include the "enableFileSystemReplica": true line within the newWorkspaceParameters object when restoring a workbench.

{
  "newWorkspaceParameters": {
    "environmentName": "eng-ml-dev-env-aws",
    "workspaceName": "cus_ws_2_restore",
    "disableTLS": false,
    "usePublicLoadBalancer": false,
    "privateCluster": false,
    "enableMonitoring": true,
    "enableGovernance": false,
    "enableModelMetrics": true,
    "enableFileSystemReplica": true,
    "loadBalancerIPWhitelists": [],
    "whitelistAuthorizedIPRanges": false,
    "authorizedIPRanges": [],
    "provisionK8sRequest": {
      ...
}
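
The placement of the flag matters: it belongs inside newWorkspaceParameters, not at the top level of the restore request. The following Python sketch shows one way to set it there. The function name enable_replica_on_restore and the shortened sample request are assumptions made for illustration; only the newWorkspaceParameters and enableFileSystemReplica keys come from the sample above.

```python
import json


def enable_replica_on_restore(restore_request: dict) -> dict:
    """Return a copy of a restore request with replication enabled.

    Hypothetical helper: the flag is written inside newWorkspaceParameters,
    as in the documented sample, and the input is left unmodified.
    """
    params = restore_request.get("newWorkspaceParameters")
    if not isinstance(params, dict):
        raise KeyError("restore request has no newWorkspaceParameters object")
    new_params = {**params, "enableFileSystemReplica": True}
    return {**restore_request, "newWorkspaceParameters": new_params}


# Illustrative subset of the restore fields shown in the sample above.
restore_request = {
    "newWorkspaceParameters": {
        "environmentName": "eng-ml-dev-env-aws",
        "workspaceName": "cus_ws_2_restore",
    }
}

updated_restore = enable_replica_on_restore(restore_request)
print(json.dumps(updated_restore, indent=2))
```

Raising an error when newWorkspaceParameters is missing guards against silently adding the flag at the wrong level of the request.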