Provisioning ML Workspaces

This topic describes how to provision ML workspaces.

The first user to access the ML workspace after it is created must have the MLAdmin role assigned. See Configuring User Access to CML for information about this resource role.
  1. Log in to the CDP web interface.
    • On Public Cloud, log in to https://console.us-west-1.cdp.cloudera.com using your corporate credentials or any other credentials that you received from your CDP administrator.
    • On Private Cloud, log in according to instructions from your CDP administrator.
  2. Click ML Workspaces.
  3. Click Provision Workspace.
  4. Fill out the following fields.
    • Workspace Name - Give the ML workspace a name. For example, user1_dev.
    • Environment - From the dropdown, select the environment where the ML workspaces must be provisioned. If you do not have any environments available to you in the dropdown, contact your CDP administrator to gain access.
  5. Switch the toggle to display Advanced Settings.
    1. CPU Settings - From the dropdown, select the CPU instance type (Small, Medium, or Large) and autoscale range to be assigned to the ML workspace.
    2. GPU Settings- From the dropdown, select the GPU instance type (Small, Medium, or Large) and autoscale range to be assigned to the ML workspace.
    3. Existing NFS - (Azure only) Enter the Mount path from the Environment creation procedure.
    4. Enable TLS - Select this checkbox if you want the workspace to use HTTPS for web communication.
    5. Enable public Internet access - When enabled, the CML workspace will be available on the public Internet. When disabled, it is assumed that connectivity is achieved through a corporate VPC.
    6. Enable Monitoring - Administrators (users with the MLAdmin role) can use a Grafana dashboard to monitor resource usage in the provisioned workspace.
    7. Enable Governance - Must be enabled to capture and view information about your ML projects, models, and builds from Apache Atlas for a given environment. If you do not select this option, then integration with Atlas won't work.
    8. Enable Model Metrics - When enabled, stores metrics in a scalable metrics store, enables you to track individual model predictions, and also track and analyze metrics using custom code.
    9. Tags - Tags added to cloud infrastructure, compute, and storage resources associated with this CML workspace.
      Note that these tags are propagated to your cloud service provider account. See Related information for links to AWS and Azure tagging strategies.
    10. Load Balancer Source Ranges - Enter a CIDR range of IP addresses allowed to access the cluster.
      • If the CML workspace is provisioned with public access, enter the allowed public IP address range.
      • If the CML workspace is provisioned with private access, enter the allowed private IP address range.
  6. Click Provision Service.
It takes approximately an hour for an ML workspace to be provisioned and installed. Once the status changes to show that the workspace has been successfully provisioned, click on the Workspace Name to go to the web application.

Note that the domain name for the provisioned workspace is randomly generated and cannot be changed.

Grant users access to this ML workspace using the instructions at Granting SSO Access to Provisioned Workspaces.