JobTracker Metrics

Metric Name Description Unit CDH Version
alerts_rate The number of alerts. events per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
blacklisted_maps_rate Blacklisted Maps tasks per second [CDH 5.0.0..CDH 6.0.0)
blacklisted_reduces_rate Blacklisted Reduces tasks per second [CDH 5.0.0..CDH 6.0.0)
cgroup_cpu_system_rate CPU usage of the role's cgroup seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_cpu_user_rate User Space CPU usage of the role's cgroup seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_mem_page_cache Page cache usage of the role's cgroup bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_mem_rss Resident memory of the role's cgroup bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_mem_swap Swap usage of the role's cgroup bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_read_bytes_rate Bytes read from all disks by the role's cgroup bytes per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_read_ios_rate Number of read I/O operations from all disks by the role's cgroup ios per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_write_bytes_rate Bytes written to all disks by the role's cgroup bytes per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_write_ios_rate Number of write I/O operations to all disks by the role's cgroup ios per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cpu_system_rate Total System CPU seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cpu_user_rate Total CPU user time seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
events_critical_rate The number of critical events. events per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
events_important_rate The number of important events. events per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
events_informational_rate The number of informational events. events per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
fd_max Maximum number of file descriptors file descriptors [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
fd_open Open file descriptors. file descriptors [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_bad_rate Percentage of Time with Bad Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_concerning_rate Percentage of Time with Concerning Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_disabled_rate Percentage of Time with Disabled Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_good_rate Percentage of Time with Good Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_unknown_rate Percentage of Time with Unknown Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
heartbeats_rate Heartbeats operations per second [CDH 5.0.0..CDH 6.0.0)
jobs_completed_rate Jobs Completed jobs per second [CDH 5.0.0..CDH 6.0.0)
jobs_failed_rate Jobs Failed jobs per second [CDH 5.0.0..CDH 6.0.0)
jobs_killed_rate Jobs Killed jobs per second [CDH 5.0.0..CDH 6.0.0)
jobs_preparing Jobs Preparing jobs [CDH 5.0.0..CDH 6.0.0)
jobs_running Jobs Running jobs [CDH 5.0.0..CDH 6.0.0)
jobs_submitted_rate Jobs Submitted jobs per second [CDH 5.0.0..CDH 6.0.0)
jvm_blocked_threads Blocked threads threads [CDH 5.0.0..CDH 6.0.0)
jvm_gc_rate Number of garbage collections garbage collections per second [CDH 5.0.0..CDH 6.0.0)
jvm_gc_time_ms_rate Total time spent garbage collecting. ms per second [CDH 5.0.0..CDH 6.0.0)
jvm_heap_committed_mb Total amount of committed heap memory. MB [CDH 5.0.0..CDH 6.0.0)
jvm_heap_used_mb Total amount of used heap memory. MB [CDH 5.0.0..CDH 6.0.0)
jvm_max_memory_mb Maximum allowed memory. MB [CDH 5.0.0..CDH 6.0.0)
jvm_new_threads New threads threads [CDH 5.0.0..CDH 6.0.0)
jvm_non_heap_committed_mb Total amount of committed non-heap memory. MB [CDH 5.0.0..CDH 6.0.0)
jvm_non_heap_used_mb Total amount of used non-heap memory. MB [CDH 5.0.0..CDH 6.0.0)
jvm_runnable_threads Runnable threads threads [CDH 5.0.0..CDH 6.0.0)
jvm_terminated_threads Terminated threads threads [CDH 5.0.0..CDH 6.0.0)
jvm_timed_waiting_threads Timed waiting threads threads [CDH 5.0.0..CDH 6.0.0)
jvm_total_threads Total threads threads [CDH 5.0.0..CDH 6.0.0)
jvm_waiting_threads Waiting threads threads [CDH 5.0.0..CDH 6.0.0)
log_error_rate Logged Errors messages per second [CDH 5.0.0..CDH 6.0.0)
log_fatal_rate Logged Fatals messages per second [CDH 5.0.0..CDH 6.0.0)
log_info_rate Logged Infos messages per second [CDH 5.0.0..CDH 6.0.0)
log_warn_rate Logged Warnings messages per second [CDH 5.0.0..CDH 6.0.0)
map_slots Map Slots slots [CDH 5.0.0..CDH 6.0.0)
maps_completed_rate Maps Completed tasks per second [CDH 5.0.0..CDH 6.0.0)
maps_failed_rate Maps Failed tasks per second [CDH 5.0.0..CDH 6.0.0)
maps_killed_rate Maps Killed tasks per second [CDH 5.0.0..CDH 6.0.0)
maps_launched_rate Maps Launched tasks per second [CDH 5.0.0..CDH 6.0.0)
maps_running Maps Running tasks [CDH 5.0.0..CDH 6.0.0)
mem_rss Resident memory used bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
mem_swap Amount of swap memory used by this role's process. bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
mem_virtual Virtual memory used bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
occupied_map_slots Occupied Map Slots slots [CDH 5.0.0..CDH 6.0.0)
occupied_reduce_slots Occupied Reduce Slots slots [CDH 5.0.0..CDH 6.0.0)
oom_exits_rate The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled. exits per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
read_bytes_rate The number of bytes read from the device bytes per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
reduce_slots Reduce slots slots [CDH 5.0.0..CDH 6.0.0)
reduces_completed_rate Reduces Completed tasks per second [CDH 5.0.0..CDH 6.0.0)
reduces_failed_rate Reduces failed tasks per second [CDH 5.0.0..CDH 6.0.0)
reduces_killed_rate Reduces Killed tasks per second [CDH 5.0.0..CDH 6.0.0)
reduces_launched_rate Reduces Launched tasks per second [CDH 5.0.0..CDH 6.0.0)
reduces_running Reduces Running tasks [CDH 5.0.0..CDH 6.0.0)
reserved_map_slots Reserved Map Slots slots [CDH 5.0.0..CDH 6.0.0)
reserved_reduce_slots Reserved Reduce Slots slots [CDH 5.0.0..CDH 6.0.0)
trackers TaskTrackers TaskTrackers [CDH 5.0.0..CDH 6.0.0)
trackers_blacklisted TaskTrackers Blacklisted TaskTrackers [CDH 5.0.0..CDH 6.0.0)
trackers_decommissioned TaskTrackers Decommissioned TaskTrackers [CDH 5.0.0..CDH 6.0.0)
unexpected_exits_rate The number of times the role's backing process exited unexpectedly. exits per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
uptime For a host, the amount of time since the host was booted. For a role, the uptime of the backing process. seconds [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
waiting_maps Waiting Maps tasks [CDH 5.0.0..CDH 6.0.0)
waiting_reduces Waiting Reduces tasks [CDH 5.0.0..CDH 6.0.0)
web_metrics_collection_duration Web Server Responsiveness ms [CDH 5.0.0..CDH 6.0.0)
write_bytes_rate The number of bytes written to the device bytes per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]