When a Cloudera Observability cost center is created, detailed summary reports of the
costs and resource usage for the environment are also generated, which enable you to analyze
the costs and the cost breakdown associated with the cost center. You can view the current
and historical costs and the resource usage associated with your cost centers.
Steps on how to view the detailed summary reports associated with a cost center.
The reports visually display the tracked resource consumption and usage costs
associated with the cost center for a specific time-period that you select from the
time-range list.
Within each report you can to drill down further:
- To view by users, resource pools, jobs, queries, projects, teams, and
application types with the highest costs. Note that projects and teams are
only available in the Cloudera AI environment, and resource pools are not
applicable for Cloudera AI.
- To view the health of a job or query of interest. To optimize costs by using
the Cloudera Observability prescriptive guidance and recommendations that
enable you to improve performance and resource usage.
- Enabling the Show Resource Usage checkbox allows you
to track resource usage, including CPU, GPU, memory, and data read/write.
For example, CPU and GPU usage can be viewed in core-hours.
-
Verify that you are logged in to the Cloudera Observability on premises web UI.
-
In the URL field of a supported web browser, enter the Cloudera Observability on premises URL that
you were given by your system administrator and press
Enter.
-
When the Cloudera Observability on premises Log in page opens, enter your Cloudera Observability on premises user name and password access credentials.
-
Click Log in.
The Cloudera Observability on premises landing page opens.
-
From the Cloudera Observability Main navigation panel, select .
The
Cloudera Observability
Cost Centers page opens, which displays:
-
To display a cost center's detailed report that includes costs for each chosen
environment, do the following:
- From the Date Range and
Granularity list, select a time-period that
meets your requirements.
- From the Cost Centers page, click inside the cost
center row that requires analysis.
The cost center's report page
opens, which displays the following:
- The provisioned costs, CPU, GPU, Memory usage, and Data Read
and Data Written cost per GB for the cost center.
- Lists the environments that are associated with the cost
center and displays their provisioned costs, CPU, GPU, and
Memory usage, and data read and written cost.
- To view more details about an environment, click on the environment
name.
-
To display the users, pools, jobs, queries, projects, teams, and application
types that created the highest costs on a specific cluster of interest, do the
following:
- Click inside the cluster row that requires more analysis.
The cluster
report overview page opens, which displays the following:
- The top 500 users whose jobs created the highest costs.
- The top 500 pools whose jobs created the highest costs.
- The top 500 jobs or queries that created the highest costs.
- To gain more insights on the health of a job or query, click the name of
the job or query listed in the Top 500 Jobs panel that requires more
investigation.
The job or query's summary page opens.
- Select the Health Checks and Execution
Details tabs for more insights, and if available, read
the optimization recommendations.
- Enabling the Show Resource Usage checkbox allows
you to track resource usage, including CPU, GPU, memory, and data
read/write. For example, CPU and GPU usage can be viewed in
core-hours.
-
To view a full list of users, including their job costs, CPU, GPU, and Memory
usage, select the Users tab.
The
Users report opens, which displays the following:
- The name of the user.
- The total cost that the user incurred.
- The number of jobs that the user ran.
- The CPU, GPU, and Memory usage.
- The cost per GB for the data read and written by your
application.
- The total job costs that the Cloudera Observability
engines incurred: AI Jobs, Impala, Hive, Spark, MapReduce, and
Oozie.
-
To view a full list of pools, including their job costs, CPU, GPU, and Memory
usage, select the Pools tab.
The
Pools report opens, which displays the following:
- The name of the pool.
- The environment that the pool is associated with.
- The total cost that the pool incurred.
- The number of jobs that the pool ran.
- The CPU, GPU, and Memory usage.
- The cost per GB for the data read and written by your
application.
- The total job costs that the Cloudera Observability
incurred: AI Jobs, Impala, Hive, Spark, MapReduce, and Oozie.
-
To view the list of Jobs & Queries, their CPU, GPU, and Memory usage,
select the Jobs tab.
The
Top 500 Jobs report opens, which displays the
following:
- The name of the job or query.
- The total cost that the job or query incurred.
- The CPU, GPU, and Memory cost and resource usage.
- The cost per GB and resource usage for the data read and written by
your job or query.
- The total job costs that the Cloudera Observability
engines incurred: AI Jobs, Impala, Hive, Spark, MapReduce, and
Oozie.
-
To view the list of Projects, their CPU, GPU, and Memory usage, select the
Projects tab.
The
Projects report opens, which displays the
following:
- The name of the Project.
- The number of jobs for that project.
- The total cost that the project incurred.
- The CPU, GPU, and Memory usage.
- The cost per GB for the data read and written by your application.
-
To view the list of Teams, their CPU, GPU, and Memory usage, select the
Teams tab.
The
Teams report opens, which displays the following:
- The name of the Team.
- The number of jobs for that team.
- The total cost that the project incurred.
- The CPU, GPU, and Memory usage.
- The cost per GB for the data read and written by your application.
-
To view the list of Application types, their CPU, GPU, and Memory usage, select
the Application Types tab.
The
Application Types open, displaying the
Top Application Types, which include the engines
Impala, Hive, Spark, MapReduce, and Oozie. However, if the environment is
Cloudera AI, the
Top Application
Types will display Application, Session, Job, and Model.
- The name of the Application Type.
- The number of jobs for the application type.
- The total cost that the user incurred.
- The CPU, GPU, and Memory cost and resource usage.
- The cost per GB and resource usage for the data read and written by
your application.
- The total job costs that the Application Type incurred.