Network Planning for Cloudera Machine Learning on Azure

Before attempting to deploy your Azure virtual network and set up your Azure Environment and ML Workspaces, you should plan the network.

As an example, a minimum architecture to support two ML Workspaces would comprise the following:
  • An Azure virtual network. Cloudera Machine Learning can use an existing virtual network if available.
  • One subnet dedicated to the Azure NetApp Files service.
  • One subnet for each ML Workspace.
Keep the following considerations in mind when planning your network:
  • Each ML workspace requires one subnet in the virtual network.
  • Plan the CIDR addresses for each subnet so that the ranges do not overlap.
  • Each subnet should use a /26 CIDR. This should accommodate a maximum of 30 worker nodes as well as 4 infrastructure nodes for Cloudera Machine Learning.
  • When GPU support is added, you will need to recreate the virtual network. You will then need to use a /25 CIDR to accommodate a maximum of 30 GPU nodes.
  • The recommended NFS for use with Cloudera Machine Learning on Azure is Azure NetApp Files v3.
  • Subnets may not use the following reserved CIDR blocks: 10.0.0.0/16 or 10.244.0.0/16