NodeManager Metrics
Reference information for NodeManager Metrics
In addition to these base metrics, many aggregate metrics are available.
If an entity type has parents defined, you can formulate all possible
aggregate metrics using the formula
base_metric_across_parents
.
In addition, metrics for aggregate totals can be formed by adding the prefix
total_
to the front of the metric name.
Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".
For example, the following metric names may be valid for NodeManager:
-
alerts_rate_across_clusters
-
total_alerts_rate_across_clusters
Some metrics, such as alerts_rate
, apply to nearly every metric context. Others only apply to a
certain service or role.
alerts_rate
- Description
- The number of alerts.
- Unit
- events per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
allocated_containers
- Description
- Number of containers allocated in this pool.
- Unit
- containers
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
allocated_memory_gb
- Description
- Allocated Memory
- Unit
- gigabytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
available_memory_gb
- Description
- Available Memory
- Unit
- gigabytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_cpu_system_rate
- Description
- CPU usage of the role's cgroup
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_cpu_user_rate
- Description
- User Space CPU usage of the role's cgroup
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_mem_page_cache
- Description
- Page cache usage of the role's cgroup
- Unit
- bytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_mem_rss
- Description
- Resident memory of the role's cgroup
- Unit
- bytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_mem_swap
- Description
- Swap usage of the role's cgroup
- Unit
- bytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_read_bytes_rate
- Description
- Bytes read from all disks by the role's cgroup
- Unit
- bytes per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_read_ios_rate
- Description
- Number of read I/O operations from all disks by the role's cgroup
- Unit
- ios per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_write_bytes_rate
- Description
- Bytes written to all disks by the role's cgroup
- Unit
- bytes per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cgroup_write_ios_rate
- Description
- Number of write I/O operations to all disks by the role's cgroup
- Unit
- ios per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
containers_completed_rate
- Description
- Containers Completed
- Unit
- containers per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
containers_failed_rate
- Description
- Containers Failed
- Unit
- containers per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
containers_initing
- Description
- Containers Initializing
- Unit
- containers
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
containers_killed_rate
- Description
- Containers Killed
- Unit
- containers per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
containers_launched_rate
- Description
- Containers Launched
- Unit
- containers per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
containers_running
- Description
- Containers Running
- Unit
- containers
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cpu_system_rate
- Description
- Total System CPU
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cpu_system_with_descendants_rate
- Description
- The total system CPU time for this process and all its descendant processes
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cpu_user_rate
- Description
- Total CPU user time
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
cpu_user_with_descendants_rate
- Description
- The total user CPU time for this process and all its descendant processes
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
events_critical_rate
- Description
- The number of critical events.
- Unit
- events per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
events_important_rate
- Description
- The number of important events.
- Unit
- events per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
events_informational_rate
- Description
- The number of informational events.
- Unit
- events per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
fd_max
- Description
- Maximum number of file descriptors
- Unit
- file descriptors
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
fd_open
- Description
- Open file descriptors.
- Unit
- file descriptors
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
gc_count_concurrent_mark_sweep_rate
- Description
- The number of garbage collections by the Concurrent Mark Sweep Collector.
- Unit
- garbage collections per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
gc_count_par_new_rate
- Description
- The number of garbage collections by the Parallel Collector.
- Unit
- garbage collections per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
gc_time_ms_concurrent_mark_sweep_rate
- Description
- The total time spent in garbage collections by the Concurrent Mark Sweep Collector.
- Unit
- ms per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
gc_time_ms_par_new_rate
- Description
- The total time spent in garbage collections by the Parallel Collector.
- Unit
- ms per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
get_container_statuses_avg_time
- Description
- Get Container Statuses Average Time
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
get_container_statuses_rate
- Description
- Get Container Statuses Operations
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
get_hadoop_groups_avg_time
- Description
- Average Time to get Hadoop group for the user
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
get_hadoop_groups_rate
- Description
- Get Hadoop User Operations
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
health_bad_rate
- Description
- Percentage of Time with Bad Health
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
health_concerning_rate
- Description
- Percentage of Time with Concerning Health
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
health_disabled_rate
- Description
- Percentage of Time with Disabled Health
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
health_good_rate
- Description
- Percentage of Time with Good Health
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
health_unknown_rate
- Description
- Percentage of Time with Unknown Health
- Unit
- seconds per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_blocked_threads
- Description
- Blocked threads
- Unit
- threads
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_gc_rate
- Description
- Number of garbage collections
- Unit
- garbage collections per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_gc_time_ms_rate
- Description
- Total time spent garbage collecting.
- Unit
- ms per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_heap_committed_mb
- Description
- Total amount of committed heap memory.
- Unit
- MB
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_heap_used_mb
- Description
- Total amount of used heap memory.
- Unit
- MB
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_max_memory_mb
- Description
- Maximum allowed memory.
- Unit
- MB
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_new_threads
- Description
- New threads
- Unit
- threads
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_non_heap_committed_mb
- Description
- Total amount of committed non-heap memory.
- Unit
- MB
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_non_heap_used_mb
- Description
- Total amount of used non-heap memory.
- Unit
- MB
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_runnable_threads
- Description
- Runnable threads
- Unit
- threads
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_terminated_threads
- Description
- Terminated threads
- Unit
- threads
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_timed_waiting_threads
- Description
- Timed waiting threads
- Unit
- threads
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
jvm_waiting_threads
- Description
- Waiting threads
- Unit
- threads
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
log_error_rate
- Description
- Logged Errors
- Unit
- messages per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
log_fatal_rate
- Description
- Logged Fatals
- Unit
- messages per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
log_info_rate
- Description
- Logged Infos
- Unit
- messages per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
log_warn_rate
- Description
- Logged Warnings
- Unit
- messages per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
login_failure_avg_time
- Description
- Average Failed Login Time
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
login_failure_rate
- Description
- Login Failures
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
login_success_avg_time
- Description
- Average Successful Login Time
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
login_success_rate
- Description
- Login Successes
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
mem_rss
- Description
- Resident memory used
- Unit
- bytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
mem_rss_with_descendants
- Description
- The total resident memory for this process and all its descendant processes
- Unit
- bytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
mem_swap
- Description
- Amount of swap memory used by this role's process.
- Unit
- bytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
mem_virtual
- Description
- Virtual memory used
- Unit
- bytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
mem_virtual_with_descendants
- Description
- The total virtual memory for this process and all its descendant processes
- Unit
- bytes
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_dropped_pub_all
- Description
- Dropped Metrics Updates By All Sinks
- Unit
- updates
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_num_active_sinks
- Description
- Active Metrics Sinks Count
- Unit
- sinks
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_num_active_sources
- Description
- Active Metrics Sources Count
- Unit
- sources
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_num_all_sinks
- Description
- All Metrics Sinks Count
- Unit
- sinks
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_num_all_sources
- Description
- All Metrics Sources Count
- Unit
- sources
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_publish_avg_time
- Description
- Metrics Publish Average Time
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_publish_rate
- Description
- Metrics Publish Operations
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_snapshot_avg_time
- Description
- Metrics Snapshot Average Time
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
metrics_snapshot_rate
- Description
- Metrics Snapshot Average Operations
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
oom_exits_rate
- Description
- The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled.
- Unit
- exits per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
read_bytes_rate
- Description
- The number of bytes read from the device
- Unit
- bytes per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_authentication_failures_rate
- Description
- RPC Authentication Failures
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_authentication_successes_rate
- Description
- RPC Authentication Successes
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_authorization_failures_rate
- Description
- RPC Authorization Failures
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_authorization_successes_rate
- Description
- RPC Authorization Successes
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_call_queue_length
- Description
- RPC Call Queue Length
- Unit
- items
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_num_open_connections
- Description
- Open RPC Connections
- Unit
- connections
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_processing_time_avg_time
- Description
- Average RPC Processing Time
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_processing_time_rate
- Description
- RPCs Processed
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_queue_time_avg_time
- Description
- Average RPC Queue Time
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_queue_time_rate
- Description
- RPCs Queued
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_received_bytes_rate
- Description
- RPC Received Bytes
- Unit
- bytes per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
rpc_sent_bytes_rate
- Description
- RPC Sent Bytes
- Unit
- bytes per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
start_containers_avg_time
- Description
- Start containers average time.
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
start_containers_rate
- Description
- Start containers operations.
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
stop_containers_avg_time
- Description
- Stop containers average time.
- Unit
- ms
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
stop_containers_rate
- Description
- Stop containers operations.
- Unit
- operations per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
unexpected_exits_rate
- Description
- The number of times the role's backing process exited unexpectedly.
- Unit
- exits per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
uptime
- Description
- For a host, the amount of time since the host was booted. For a role, the uptime of the backing process.
- Unit
- seconds
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7
write_bytes_rate
- Description
- The number of bytes written to the device
- Unit
- bytes per second
- Parents
- cluster, rack, yarn
- CDH Version
- CDH 5, CDH 6, CDH 7