NodeManager Metrics

Reference information for NodeManager Metrics

In addition to these base metrics, many aggregate metrics are available. If an entity type has parents defined, you can formulate all possible aggregate metrics using the formula base_metric_across_parents.

In addition, metrics for aggregate totals can be formed by adding the prefix total_ to the front of the metric name.

Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".

For example, the following metric names may be valid for NodeManager:

  • alerts_rate_across_clusters
  • total_alerts_rate_across_clusters

Some metrics, such as alerts_rate, apply to nearly every metric context. Others only apply to a certain service or role.

alerts_rate

Description
The number of alerts.
Unit
events per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

allocated_containers

Description
Number of containers allocated in this pool.
Unit
containers
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

allocated_memory_gb

Description
Allocated Memory
Unit
gigabytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

available_memory_gb

Description
Available Memory
Unit
gigabytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_cpu_system_rate

Description
CPU usage of the role's cgroup
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_cpu_user_rate

Description
User Space CPU usage of the role's cgroup
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_mem_page_cache

Description
Page cache usage of the role's cgroup
Unit
bytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_mem_rss

Description
Resident memory of the role's cgroup
Unit
bytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_mem_swap

Description
Swap usage of the role's cgroup
Unit
bytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_read_bytes_rate

Description
Bytes read from all disks by the role's cgroup
Unit
bytes per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_read_ios_rate

Description
Number of read I/O operations from all disks by the role's cgroup
Unit
ios per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_write_bytes_rate

Description
Bytes written to all disks by the role's cgroup
Unit
bytes per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cgroup_write_ios_rate

Description
Number of write I/O operations to all disks by the role's cgroup
Unit
ios per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

containers_completed_rate

Description
Containers Completed
Unit
containers per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

containers_failed_rate

Description
Containers Failed
Unit
containers per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

containers_initing

Description
Containers Initializing
Unit
containers
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

containers_killed_rate

Description
Containers Killed
Unit
containers per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

containers_launched_rate

Description
Containers Launched
Unit
containers per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

containers_running

Description
Containers Running
Unit
containers
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cpu_system_rate

Description
Total System CPU
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cpu_system_with_descendants_rate

Description
The total system CPU time for this process and all its descendant processes
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cpu_user_rate

Description
Total CPU user time
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

cpu_user_with_descendants_rate

Description
The total user CPU time for this process and all its descendant processes
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

events_critical_rate

Description
The number of critical events.
Unit
events per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

events_important_rate

Description
The number of important events.
Unit
events per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

events_informational_rate

Description
The number of informational events.
Unit
events per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

fd_max

Description
Maximum number of file descriptors
Unit
file descriptors
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

fd_open

Description
Open file descriptors.
Unit
file descriptors
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

gc_count_concurrent_mark_sweep_rate

Description
The number of garbage collections by the Concurrent Mark Sweep Collector.
Unit
garbage collections per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

gc_count_par_new_rate

Description
The number of garbage collections by the Parallel Collector.
Unit
garbage collections per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

gc_time_ms_concurrent_mark_sweep_rate

Description
The total time spent in garbage collections by the Concurrent Mark Sweep Collector.
Unit
ms per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

gc_time_ms_par_new_rate

Description
The total time spent in garbage collections by the Parallel Collector.
Unit
ms per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

get_container_statuses_avg_time

Description
Get Container Statuses Average Time
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

get_container_statuses_rate

Description
Get Container Statuses Operations
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

get_hadoop_groups_avg_time

Description
Average Time to get Hadoop group for the user
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

get_hadoop_groups_rate

Description
Get Hadoop User Operations
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

health_bad_rate

Description
Percentage of Time with Bad Health
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

health_concerning_rate

Description
Percentage of Time with Concerning Health
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

health_disabled_rate

Description
Percentage of Time with Disabled Health
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

health_good_rate

Description
Percentage of Time with Good Health
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

health_unknown_rate

Description
Percentage of Time with Unknown Health
Unit
seconds per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_blocked_threads

Description
Blocked threads
Unit
threads
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_gc_rate

Description
Number of garbage collections
Unit
garbage collections per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_gc_time_ms_rate

Description
Total time spent garbage collecting.
Unit
ms per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_heap_committed_mb

Description
Total amount of committed heap memory.
Unit
MB
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_heap_used_mb

Description
Total amount of used heap memory.
Unit
MB
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_max_memory_mb

Description
Maximum allowed memory.
Unit
MB
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_new_threads

Description
New threads
Unit
threads
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_non_heap_committed_mb

Description
Total amount of committed non-heap memory.
Unit
MB
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_non_heap_used_mb

Description
Total amount of used non-heap memory.
Unit
MB
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_runnable_threads

Description
Runnable threads
Unit
threads
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_terminated_threads

Description
Terminated threads
Unit
threads
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_timed_waiting_threads

Description
Timed waiting threads
Unit
threads
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

jvm_waiting_threads

Description
Waiting threads
Unit
threads
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

log_error_rate

Description
Logged Errors
Unit
messages per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

log_fatal_rate

Description
Logged Fatals
Unit
messages per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

log_info_rate

Description
Logged Infos
Unit
messages per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

log_warn_rate

Description
Logged Warnings
Unit
messages per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

login_failure_avg_time

Description
Average Failed Login Time
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

login_failure_rate

Description
Login Failures
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

login_success_avg_time

Description
Average Successful Login Time
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

login_success_rate

Description
Login Successes
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

mem_rss

Description
Resident memory used
Unit
bytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

mem_rss_with_descendants

Description
The total resident memory for this process and all its descendant processes
Unit
bytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

mem_swap

Description
Amount of swap memory used by this role's process.
Unit
bytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

mem_virtual

Description
Virtual memory used
Unit
bytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

mem_virtual_with_descendants

Description
The total virtual memory for this process and all its descendant processes
Unit
bytes
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_dropped_pub_all

Description
Dropped Metrics Updates By All Sinks
Unit
updates
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_num_active_sinks

Description
Active Metrics Sinks Count
Unit
sinks
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_num_active_sources

Description
Active Metrics Sources Count
Unit
sources
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_num_all_sinks

Description
All Metrics Sinks Count
Unit
sinks
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_num_all_sources

Description
All Metrics Sources Count
Unit
sources
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_publish_avg_time

Description
Metrics Publish Average Time
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_publish_rate

Description
Metrics Publish Operations
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_snapshot_avg_time

Description
Metrics Snapshot Average Time
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

metrics_snapshot_rate

Description
Metrics Snapshot Average Operations
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

oom_exits_rate

Description
The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled.
Unit
exits per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

read_bytes_rate

Description
The number of bytes read from the device
Unit
bytes per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_authentication_failures_rate

Description
RPC Authentication Failures
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_authentication_successes_rate

Description
RPC Authentication Successes
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_authorization_failures_rate

Description
RPC Authorization Failures
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_authorization_successes_rate

Description
RPC Authorization Successes
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_call_queue_length

Description
RPC Call Queue Length
Unit
items
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_num_open_connections

Description
Open RPC Connections
Unit
connections
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_processing_time_avg_time

Description
Average RPC Processing Time
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_processing_time_rate

Description
RPCs Processed
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_queue_time_avg_time

Description
Average RPC Queue Time
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_queue_time_rate

Description
RPCs Queued
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_received_bytes_rate

Description
RPC Received Bytes
Unit
bytes per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

rpc_sent_bytes_rate

Description
RPC Sent Bytes
Unit
bytes per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

start_containers_avg_time

Description
Start containers average time.
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

start_containers_rate

Description
Start containers operations.
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

stop_containers_avg_time

Description
Stop containers average time.
Unit
ms
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

stop_containers_rate

Description
Stop containers operations.
Unit
operations per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

unexpected_exits_rate

Description
The number of times the role's backing process exited unexpectedly.
Unit
exits per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

uptime

Description
For a host, the amount of time since the host was booted. For a role, the uptime of the backing process.
Unit
seconds
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7

write_bytes_rate

Description
The number of bytes written to the device
Unit
bytes per second
Parents
cluster, rack, yarn
CDH Version
CDH 5, CDH 6, CDH 7