NFS

Cloudera Machine Learning (CML) requires NFS for storing project files and folders.

Cloudera Machine Learning (CML) requires NFS for storing project files and folders. An internal user-space NFS server can be deployed into the cluster which serves a block storage device (persistent volume) managed by the cluster’s software defined storage (SDS) system, such as Ceph, Portworx, etc. This is the recommended option for CML in Private Cloud. Alternatively, the NFS server can be external to the cluster, such as a NetApp filer which is accessible from the Private Cloud cluster nodes. Note that NFS storage should be used only for storing project files and folders, and not for any other CML data, such as PostgreSQL database, and livelog.

Note that CML does not support shared volumes, such as Portworx shared volumes, for storing project files. A read-write-once (RWO) persistent volume must be allocated to the internal NFS server (e.g., NFS server provisioner) as the persistence layer. The NFS server uses the volume to dynamically provision read-write-many (RWX) NFS volumes for the CML clients.