Configuring Quotas

This topic describes how to configure CPU, GPU, and memory quotas for users of a Cloudera AI Workbench.

Required Role: MLAdmin

Make sure you are assigned the MLAdmin role in Cloudera Data Platform. Only users with the MLAdmin role are logged in to Cloudera AI Workbenches with Site Administrator privileges.

There are two types of quota: Default and Custom. Default quotas apply to all users of the workbench. Custom quotas apply to individual users in the workbench, and take precedence over the default quota.
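
The precedence rule can be pictured with a minimal Python sketch. It is illustrative only and is not Cloudera AI code or an API: the `Quota` type, the `effective_quota` function, and the user names are hypothetical. The default values are the ones this topic states for the Default Quotas toggle.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class Quota:
    """Per-user resource limits (hypothetical type, for illustration only)."""
    cpu: float        # vCPU
    memory_gb: float  # GB
    gpu: int = 0


# Default quota values stated in this topic once the toggle is ON.
DEFAULT_QUOTA = Quota(cpu=2, memory_gb=8, gpu=0)


def effective_quota(username: str,
                    custom_quotas: Dict[str, Quota],
                    default_quota: Quota = DEFAULT_QUOTA) -> Quota:
    """Return the custom quota for a user if one exists, else the default."""
    return custom_quotas.get(username, default_quota)


# Hypothetical users: alice has a custom quota, bob does not.
custom = {"alice": Quota(cpu=8, memory_gb=32, gpu=1)}
print(effective_quota("alice", custom))  # the custom quota takes precedence
print(effective_quota("bob", custom))    # falls back to the default quota
```

In this sketch a custom entry replaces the default quota as a whole; whether the product merges individual fields is not stated in this topic.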

  1. Log in to the web interface.
  2. Click Cloudera AI Workbenches, then open the workbench for which you want to set quotas.
  3. Click Admin > Quotas.
  4. Switch the Default Quotas toggle to ON.
    This applies a default quota of 2 vCPU and 8 GB memory to each user in the workbench.
    If your workbench was provisioned with GPUs, a default quota of 0 GPU per user applies. If you want users to have access to GPUs, you must modify the default quotas as described in the next step.
  5. If you want to change the default quotas, click Default (per user).
    Cloudera AI displays the Edit default quota dialog box.
  6. Enter the CPU, Memory, and GPU quota values that should apply to all users of the workbench.
  7. Click Update.
  8. To add a custom quota for a specific user, click Add User.
  9. Enter the user name, and enter the quotas for CPU, Memory, and GPU.
  10. Click Add.

Enabling or modifying quotas affects only new workloads. Workloads that users have already scheduled continue to run uninterrupted, even if they exceed the new quota limits. A user who is over their limit cannot schedule any more workloads.
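
The following Python sketch shows one way to reason about this behavior. It is an assumption-laden illustration, not the actual scheduler logic: the `Resources` type and `can_schedule` function are hypothetical, and the admission rule (current usage plus the new request must fit within the quota) is assumed rather than taken from this topic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resources:
    """CPU, memory, and GPU amounts (hypothetical type, for illustration)."""
    cpu: float        # vCPU
    memory_gb: float  # GB
    gpu: int = 0


def can_schedule(in_use: Resources, requested: Resources, quota: Resources) -> bool:
    """Assumed rule: admit a new workload only if usage plus request fits the quota."""
    return (
        in_use.cpu + requested.cpu <= quota.cpu
        and in_use.memory_gb + requested.memory_gb <= quota.memory_gb
        and in_use.gpu + requested.gpu <= quota.gpu
    )


# A user is already running 4 vCPU / 16 GB when a 2 vCPU / 8 GB quota is enabled.
quota = Resources(cpu=2, memory_gb=8)
running = Resources(cpu=4, memory_gb=16)

# Nothing is evicted: 'running' is left untouched. Only new requests are checked.
print(can_schedule(running, Resources(cpu=1, memory_gb=2), quota))          # False
print(can_schedule(Resources(0, 0), Resources(cpu=1, memory_gb=2), quota))  # True
```

The check runs only when a new workload is requested, which is why existing workloads above the limit keep running while further requests from the same user are refused.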

To specify the maximum number of replicas in a model deployment, go to Site Administration > Settings > Model Deployment Settings. The default is 9 replicas, and the maximum that can be set is 199.