Using the Cloudera Observability user interface

Describes a few frequently used interface elements of Cloudera Observability that help you identify and troubleshoot your workload issues.

The following examples describe a few interface elements of Cloudera Observability that enable you to quickly identify workload problems, health issues, resource contentions, and abnormal or degraded performance problems.

These examples assume that you have logged in to the Cloudera Data Platform and accessed the Cloudera Observability web UI from the Your Enterprise Data Cloud landing page.

Identify workload problems and health issues

You can easily locate what engines are running on your clusters and what jobs and queries are failing the health tests with the Usage Analysis chart widget.

The Usage Analysis chart widget displays what engines are running on the cluster, how many jobs or queries are processed by the engine, and how many jobs or queries have failed and missed their SLAs (thresholds).

Location: This chart widget is found on the Cluster Summary page, which is accessed by selecting the cluster for analysis in the Clusters page

Identify and address resource contentions

Cloudera Observability provides the following chart widgets that help you analyze and identify resource consumption and contention problems:
  • The Resource Consumption By Services chart widget displays the CPU and memory consumption for each service across the time range you selected. Hover your mouse over the timeline, to display the percentage of CPU or memory consumed by each of the cluster's services.

    Location: The Resource Consumption By Services chart widget is found on the Cluster Summary page of a Private Cloud Base or a Data Hub cluster.



  • The Memory Utilization chart widget, displays the aggregated maximum amount of memory that is used by the queries on any node performing the processing. It helps you identify inefficient queries that are consuming the most amount of memory and decide if you need to allocate more memory to continue running your query jobs.

    Location: The Memory Utilization chart widget is found on the Impala Engine page. From the Clusters page select the workload cluster for analysis and then in the Cluster Summary Navigation Panel select Impala

  • The Resource Consumption chart widget, displays the concurrent use of CPU and memory consumption for a workload across the timeline you selected.

    Location: The Resource Consumption chart widget is found on the Impala Engine page. From the Clusters page select the workload cluster for analysis and then in the Cluster Summary Navigation Panel select Impala



Identify and address abnormal or degraded performance problems

Cloudera Observability enables you to identify and address abnormal or degraded performance problems by establishing baselines from health issues that also enable a performance comparison of your workloads. The Cloudera Observability baseline metrics measure the current performance of a job against the average performance of previous runs. They use performance data from 30 of the most recent runs of a job and require a minimum of three runs. The baseline comparisons start with the fourth run of a job.

Location: The baseline for a job or a query is found on the Baseline page, which is accessed by selecting the workload cluster for analysis and then in the Trend chart widget, select the engine for analysis, and then depending on the engine, click Total Jobs or Total Queries. From the Jobs or Queries page, select the job or query of interest and then select the Baseline tab.
The following images show some of the baseline and comparison metrics that are provided:
  • This image shows the comparison between the baseline performance metrics and the current job run:


  • To display only those metrics with performance issues, select Show only abnormal metrics:


Identify performance trends

You can identify trends as well as baselines by analyzing your engine’s or cluster’s performance trends from the Trends chart widget and the Trend tab. Where:
  • The Trends time-series chart widget, displays more detailed metrics about the processed jobs and queries and enables you to view historical trends for analysis when you select a predefined or custom time period from the Time-Range filter list.
    Location: This chart widget is found on the engine's page, which is accessed by selecting the workload cluster for analysis and then selecting the engine of interest. from the navigation panel.


  • The Trends tab, displays the job or query's instances executed during the selected time period. Depending on the engine, the Trends page displays a job's historical trend from Duration, Data Input, and Data Output histogram charts or lists the runs of the query to show how its performance changes overtime.

    Location: The Trends tab is found on the Jobs or Query's page, which is accessed by selecting the workload cluster for analysis and then selecting the engine of interest from the navigation panel. Depending on the engine chosen, from the Trends chart widget, click Total Jobs or Total Queries and then from the Jobs or Queries page, select the job or query of interest and then select the Trends tab.