You must configure several parameters to enable the Cluster Utilization
Report.
Minimum Required Role:
Configurator (also provided by
Cluster Administrator,
Limited Cluster Administrator , and
Full Administrator)
By default, the Cluster Utilization Report displays aggregated CPU and memory
utilization for an entire cluster and for YARN and Impala utilization. You can also view
this utilization by tenants, which include Linux users and Dynamic Resource Pools. To
see utilization for a tenant, you must configure the tenant and define resource limits for
it.
Enable YARN utilization metrics collection:
-
In Cloudera Manager, select the YARN service.
-
Click the Configuration tab.
-
Use the Search function to locate the configuration properties
mentioned below.
-
In the Container Usage MapReduce Job User
property, enter a username for the MapReduce job that collects the
metrics.
The username you enter must be a Linux user on all the cluster
hosts. If you are using an Active Directory KDC, the username must
also exist in Active Directory. For secure clusters, the user must not
be banned or below the minimum user ID. You can view the list of
banned users (
banned.users
) and the minimum user ID
(
min.user.id
) by under YARN Configuration in
Cloudera Manager.
The user that is configured with the
Container Usage MapReduce Job User property
in the YARN service requires permissions to read the subdirectories
of the HDFS directory specified with the Cloudera Manager
Container Usage Metrics Directory property. The
default umask of 022 allows any user to read from that directory.
However, if a more strict umask (for example, 027) is used, then
those directories are not readable by any user. In that case the
user specified with the Container Usage MapReduce Job User property
should be added to the same group that owns the subdirectories.
For example, if the
/tmp/cmYarnContainerMetrics/20161010
subdirectory
is owned by user and group yarn:hadoop
, the user
specified in Container Usage MapReduce Job User
should be added to the hadoop group.
The
directories you specify with the Cloudera Manager Container Usage
Metrics Directory and Container Usage Output Directory
properties should not be located in encryption zones.
-
Optionally, enter the resource pool in which the container usage collection MapReduce
job runs in the Container Usage MapReduce Job Pool property.
Cloudera recommends that you dedicate a resource pool for running this MapReduce
job.
If you specify a custom resource pool, ensure that the placement rules for the cluster
allow for it. The first rule must be for resource pools to be specified at run time with
the Create pool if it does not exist option selected. Alternatively,
ensure that the pool you specify already exists. If the placement rule is not properly
configured or the resource pool does not already exist, the job may run in a different
pool.
-
You must select Enable Container Usage Metric Collection.
-
Click Save Changes.
-
Click the Actions button.
-
Select Create CM Container Usage Metrics
Dir.
-
Restart the YARN service.
Enable Impala utilization collection:
-
In Cloudera Manager, select the Impala service.
-
Click the Configuration tab.
-
Search for admission control.
-
Find the Enable Impala Admission Control and the
Enable Dynamic Resource Pools properties and enable both of
them.
-
Click Save Changes.
-
Restart the Impala service.