Configuring File Storage Replication on AWS

File Storage Replication is designed to enhance business continuity, disaster recovery, and operational efficiency for your Cloudera AI workbench. It ensures that an up-to-date copy of your critical project files is maintained. You can manage file storage replication features using the UI or CDP CLI.

Key Features:

  • Persistent Storage: Your Cloudera AI workbench utilizes AWS EFS for persistent storage of project files.
  • Availability Zone Redundancy: Replication within the same region across different Availability Zones helps reduce single points of failure. For AWS EFS, continuous replication can be implemented across Availability Zones within the same region.
  • One-Way Continuous Replication: AWS EFS provides one-way continuous replication from a source file system to a destination file system, automatically synchronizing all data and metadata changes.
  • Read-Only Replica: During active replication, the replicated file system is available in read-only mode to prevent modifications that could conflict with the replication process.
  • Minimal Performance Impact: The replication process is designed to have minimal impact on the performance of your production file systems.
  • Simple Management: Replication can be easily configured and monitored through the UI or CDP CLI.