Monitor ML workspace and workload performance using Cloudera Observability

With Cloudera Observability, you can collect metrics from Cloudera AI and obtain detailed information about the resources used in the Cloudera AI service.

From the ML Workspace Summary dashboard, you can monitor multiple ML workspaces at the Cloudera AI service level and manage the individual workspaces. From the ML Workspace dashboard, you can monitor, optimize, and troubleshoot ML workloads such as sessions, jobs, models, and applications, categorized by the user, team, and project.

How to enable Cloudera AI feature in Cloudera Observability

To enable the Cloudera AI feature in the Cloudera Observability user interface, complete the following tasks:
  • Confirm with Cloudera Support that your account is enabled for the feature from the Cloudera side with appropriate entitlements.
  • Enable the outbound traffic. For information, see AWS outbound network access destinations.
  • Installation of Cloudera Observability components on the Cloudera AI Workbench with Cloudera AI 2.0.46 version and higher:
    • For existing workspaces:
      • If the existing workspaces are upgraded from an older version to the Cloudera AI 2.0.46 version or higher, the Cloudera Observability components are installed automatically during the upgrade.
      • If the existing workspace is already on the latest version of Cloudera AI, you must suspend the workspace, and then resume the workspace. The Cloudera Observability components are enabled automatically within 12 hours.

        For information on suspending and resuming the workspace, see Suspend and resume Cloudera AI Workbenches in Cloudera AI documentation.

    • For new workspaces created with the Cloudera AI version 2.0.46 or higher, the Cloudera Observability components are installed automatically.

      For information on creating a new workspace, see Provisioning Cloudera AI Workbenches in Cloudera AI documentation.