Metrics Reference

JobTracker Metrics

Reference information for JobTracker metrics.

In addition to the base metrics listed on this page, many aggregate metrics are available. If an entity type has parents defined, you can form every possible aggregate metric name with the pattern base_metric_across_parents, where parents is the plural form of a parent entity type.

To form the metric for an aggregate total, add the prefix total_ to the aggregate metric name.

Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".

For example, the following metric names may be valid for JobTracker:

  • alerts_rate_across_clusters
  • total_alerts_rate_across_clusters
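The naming pattern described above can be sketched in a few lines of Python. This is only an illustration of the string pattern, not part of Cloudera Manager: the helper name aggregate_metric_names is hypothetical, and the caller is assumed to supply plural forms that have already been confirmed (for example with the chart browser type-ahead), since the sketch does not attempt to pluralize entity types.

```python
def aggregate_metric_names(base_metric, parent_plurals):
    """Build candidate aggregate metric names for one base metric.

    parent_plurals must already be plural forms (e.g. "clusters").
    This hypothetical helper only applies the naming pattern; it does
    not check that Cloudera Manager actually exposes the metric.
    """
    names = []
    for plural in parent_plurals:
        across = f"{base_metric}_across_{plural}"
        names.append(across)             # aggregate across the parent type
        names.append(f"total_{across}")  # aggregate total
    return names


# Reproduces the two example names given for JobTracker.
print(aggregate_metric_names("alerts_rate", ["clusters"]))
```

The generated names are candidates only; whether a given name is valid for a role still has to be checked in the chart browser.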

Some metrics, such as alerts_rate, apply to nearly every metric context. Others apply only to a specific service or role.

Description
The number of alerts.
Unit
events per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Blacklisted Maps
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Blacklisted Reduces
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
CPU usage of the role's cgroup
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
User Space CPU usage of the role's cgroup
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Page cache usage of the role's cgroup
Unit
bytes
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Resident memory of the role's cgroup
Unit
bytes
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Swap usage of the role's cgroup
Unit
bytes
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Bytes read from all disks by the role's cgroup
Unit
bytes per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Number of read I/O operations from all disks by the role's cgroup
Unit
ios per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Bytes written to all disks by the role's cgroup
Unit
bytes per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Number of write I/O operations to all disks by the role's cgroup
Unit
ios per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Total System CPU
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Total CPU user time
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
The number of critical events.
Unit
events per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
The number of important events.
Unit
events per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
The number of informational events.
Unit
events per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Maximum number of file descriptors
Unit
file descriptors
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Open file descriptors.
Unit
file descriptors
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Percentage of Time with Bad Health
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Percentage of Time with Concerning Health
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Percentage of Time with Disabled Health
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Percentage of Time with Good Health
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Percentage of Time with Unknown Health
Unit
seconds per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Heartbeats
Unit
operations per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Jobs Completed
Unit
jobs per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Jobs Failed
Unit
jobs per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Jobs Killed
Unit
jobs per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Jobs Preparing
Unit
jobs
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Jobs Running
Unit
jobs
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Jobs Submitted
Unit
jobs per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Blocked threads
Unit
threads
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Number of garbage collections
Unit
garbage collections per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Total time spent garbage collecting.
Unit
ms per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Total amount of committed heap memory.
Unit
MB
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Total amount of used heap memory.
Unit
MB
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Maximum allowed memory.
Unit
MB
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
New threads
Unit
threads
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Total amount of committed non-heap memory.
Unit
MB
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Total amount of used non-heap memory.
Unit
MB
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Runnable threads
Unit
threads
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Terminated threads
Unit
threads
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Timed waiting threads
Unit
threads
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Total threads
Unit
threads
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Waiting threads
Unit
threads
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Logged Errors
Unit
messages per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Logged Fatals
Unit
messages per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Logged Infos
Unit
messages per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Logged Warnings
Unit
messages per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Map Slots
Unit
slots
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Maps Completed
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Maps Failed
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Maps Killed
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Maps Launched
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Maps Running
Unit
tasks
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Resident memory used
Unit
bytes
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Amount of swap memory used by this role's process.
Unit
bytes
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Virtual memory used
Unit
bytes
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Occupied Map Slots
Unit
slots
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Occupied Reduce Slots
Unit
slots
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled.
Unit
exits per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
The number of bytes read from the device
Unit
bytes per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Reduce Slots
Unit
slots
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Reduces Completed
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Reduces Failed
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Reduces Killed
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Reduces Launched
Unit
tasks per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Reduces Running
Unit
tasks
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Reserved Map Slots
Unit
slots
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Reserved Reduce Slots
Unit
slots
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
TaskTrackers
Unit
TaskTrackers
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
TaskTrackers Blacklisted
Unit
TaskTrackers
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
TaskTrackers Decommissioned
Unit
TaskTrackers
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
The number of times the role's backing process exited unexpectedly.
Unit
exits per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
For a host, the amount of time since the host was booted. For a role, the uptime of the backing process.
Unit
seconds
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
Description
Waiting Maps
Unit
tasks
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Waiting Reduces
Unit
tasks
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
Web Server Responsiveness
Unit
ms
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0)
Description
The number of bytes written to the device
Unit
bytes per second
Parents
cluster, mapreduce, rack
CDH Version
[CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
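The CDH Version values above look like interval notation. Assuming the usual convention (a square bracket includes the endpoint, a parenthesis excludes it), the following Python sketch checks whether a given release falls inside one of the listed ranges. The function names parse_ranges and covers are hypothetical, and the sketch makes no claim about what the [CM -1.0.0..CM -1.0.0] entry means; it merely parses it like any other range.

```python
import re

# One range looks like "[CDH 5.0.0..CDH 6.0.0)": bracket, product, low
# version, "..", product, high version, bracket.
_RANGE = re.compile(
    r"([\[\(])\s*(\w+)\s+(-?\d+(?:\.\d+)*)\s*\.\.\s*"
    r"(\w+)\s+(-?\d+(?:\.\d+)*)\s*([\]\)])"
)


def _parse_version(text):
    """Turn "5.16.2" into (5, 16, 2) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def parse_ranges(spec):
    """Parse a comma-separated CDH Version cell into a list of ranges."""
    ranges = []
    for match in _RANGE.finditer(spec):
        open_b, product, low, _high_product, high, close_b = match.groups()
        ranges.append({
            "product": product,
            "low": _parse_version(low),
            "high": _parse_version(high),
            "low_inclusive": open_b == "[",    # assumed: "[" includes
            "high_inclusive": close_b == "]",  # assumed: ")" excludes
        })
    return ranges


def covers(spec, product, version):
    """Return True if version of product lies in any range of spec."""
    v = _parse_version(version)
    for r in parse_ranges(spec):
        if r["product"] != product:
            continue
        above = v >= r["low"] if r["low_inclusive"] else v > r["low"]
        below = v <= r["high"] if r["high_inclusive"] else v < r["high"]
        if above and below:
            return True
    return False


cell = "[CDH 5.0.0..CDH 6.0.0)"
print(covers(cell, "CDH", "5.16.2"))  # inside the half-open range
print(covers(cell, "CDH", "6.0.0"))   # upper bound is excluded
```

Under this reading, a metric listed only with [CDH 5.0.0..CDH 6.0.0) applies to CDH 5.x releases and not to CDH 6.0.0 or later.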