NodeManager Metrics
In addition to these base metrics, many aggregate metrics are available.
If an entity type has parents defined, you can formulate all possible
aggregate metrics using the formula
base_metric_across_parents
.
In addition, metrics for aggregate totals can be formed by adding the prefix
total_
to the front of the metric name.
Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".
For example, the following metric names may be valid for NodeManager:
-
alerts_rate_across_clusters
-
total_alerts_rate_across_clusters
Some metrics, such as alerts_rate
, apply to nearly every metric context. Others only apply to a
certain service or role.
Metric Name | Description | Unit | Parents | Version |
---|---|---|---|---|
alerts_rate | The number of alerts. | events per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
allocated_containers | Number of containers allocated in this pool. | containers | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
allocated_memory_gb | Allocated Memory | gigabytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
available_memory_gb | Available Memory | gigabytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_cpu_system_rate | CPU usage of the role's cgroup | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_cpu_user_rate | User Space CPU usage of the role's cgroup | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_mem_page_cache | Page cache usage of the role's cgroup | bytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_mem_rss | Resident memory of the role's cgroup | bytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_mem_swap | Swap usage of the role's cgroup | bytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_read_bytes_rate | Bytes read from all disks by the role's cgroup | bytes per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_read_ios_rate | Number of read I/O operations from all disks by the role's cgroup | ios per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_write_bytes_rate | Bytes written to all disks by the role's cgroup | bytes per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cgroup_write_ios_rate | Number of write I/O operations to all disks by the role's cgroup | ios per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
containers_completed_rate | Containers Completed | containers per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
containers_failed_rate | Containers Failed | containers per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
containers_initing | Containers Initializing | containers | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
containers_killed_rate | Containers Killed | containers per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
containers_launched_rate | Containers Launched | containers per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
containers_running | Containers Running | containers | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cpu_system_rate | Total System CPU | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cpu_system_with_descendants_rate | The total system CPU time for this process and all its descendant processes | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cpu_user_rate | Total CPU user time | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
cpu_user_with_descendants_rate | The total user CPU time for this process and all its descendant processes | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
events_critical_rate | The number of critical events. | events per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
events_important_rate | The number of important events. | events per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
events_informational_rate | The number of informational events. | events per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
fd_max | Maximum number of file descriptors | file descriptors | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
fd_open | Open file descriptors. | file descriptors | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
gc_count_concurrent_mark_sweep_rate | The number of garbage collections by the Concurrent Mark Sweep Collector. | garbage collections per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
gc_count_par_new_rate | The number of garbage collections by the Parallel Collector. | garbage collections per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
gc_time_ms_concurrent_mark_sweep_rate | The total time spent in garbage collections by the Concurrent Mark Sweep Collector. | ms per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
gc_time_ms_par_new_rate | The total time spent in garbage collections by the Parallel Collector. | ms per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
get_container_statuses_avg_time | Get Container Statuses Average Time | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
get_container_statuses_rate | Get Container Statuses Operations | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
get_hadoop_groups_avg_time | Average Time to get Hadoop group for the user | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
get_hadoop_groups_rate | Get Hadoop User Operations | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
health_bad_rate | Percentage of Time with Bad Health | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
health_concerning_rate | Percentage of Time with Concerning Health | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
health_disabled_rate | Percentage of Time with Disabled Health | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
health_good_rate | Percentage of Time with Good Health | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
health_unknown_rate | Percentage of Time with Unknown Health | seconds per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_blocked_threads | Blocked threads | threads | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_gc_rate | Number of garbage collections | garbage collections per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_gc_time_ms_rate | Total time spent garbage collecting. | ms per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_heap_committed_mb | Total amount of committed heap memory. | MB | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_heap_used_mb | Total amount of used heap memory. | MB | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_max_memory_mb | Maximum allowed memory. | MB | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_new_threads | New threads | threads | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_non_heap_committed_mb | Total amount of committed non-heap memory. | MB | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_non_heap_used_mb | Total amount of used non-heap memory. | MB | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_runnable_threads | Runnable threads | threads | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_terminated_threads | Terminated threads | threads | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_timed_waiting_threads | Timed waiting threads | threads | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
jvm_waiting_threads | Waiting threads | threads | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
log_error_rate | Logged Errors | messages per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
log_fatal_rate | Logged Fatals | messages per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
log_info_rate | Logged Infos | messages per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
log_warn_rate | Logged Warnings | messages per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
login_failure_avg_time | Average Failed Login Time | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
login_failure_rate | Login Failures | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
login_success_avg_time | Average Successful Login Time | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
login_success_rate | Login Successes | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
mem_rss | Resident memory used | bytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
mem_rss_with_descendants | The total resident memory for this process and all its descendant processes | bytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
mem_swap | Amount of swap memory used by this role's process. | bytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
mem_virtual | Virtual memory used | bytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
mem_virtual_with_descendants | The total virtual memory for this process and all its descendant processes | bytes | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_dropped_pub_all | Dropped Metrics Updates By All Sinks | updates | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_num_active_sinks | Active Metrics Sinks Count | sinks | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_num_active_sources | Active Metrics Sources Count | sources | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_num_all_sinks | All Metrics Sinks Count | sinks | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_num_all_sources | All Metrics Sources Count | sources | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_publish_avg_time | Metrics Publish Average Time | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_publish_rate | Metrics Publish Operations | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_snapshot_avg_time | Metrics Snapshot Average Time | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
metrics_snapshot_rate | Metrics Snapshot Average Operations | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
oom_exits_rate | The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled. | exits per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
read_bytes_rate | The number of bytes read from the device | bytes per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_authentication_failures_rate | RPC Authentication Failures | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_authentication_successes_rate | RPC Authentication Successes | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_authorization_failures_rate | RPC Authorization Failures | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_authorization_successes_rate | RPC Authorization Successes | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_call_queue_length | RPC Call Queue Length | items | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_num_open_connections | Open RPC Connections | connections | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_processing_time_avg_time | Average RPC Processing Time | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_processing_time_rate | RPCs Processed | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_queue_time_avg_time | Average RPC Queue Time | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_queue_time_rate | RPCs Queued | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_received_bytes_rate | RPC Received Bytes | bytes per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
rpc_sent_bytes_rate | RPC Sent Bytes | bytes per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
start_containers_avg_time | Start containers average time. | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
start_containers_rate | Start containers operations. | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
stop_containers_avg_time | Stop containers average time. | ms | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
stop_containers_rate | Stop containers operations. | operations per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
unexpected_exits_rate | The number of times the role's backing process exited unexpectedly. | exits per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
uptime | For a host, the amount of time since the host was booted. For a role, the uptime of the backing process. | seconds | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |
write_bytes_rate | The number of bytes written to the device | bytes per second | cluster, rack, yarn | CDH 5, CDH 6, Cloudera Runtime 7 |