NodeManager Metrics

| Metric Name | Description | Unit | CDH Version |
|---|---|---|---|
| alerts_rate | The number of alerts. | events per second | CDH 4, CDH 5 |
| allocated_containers | Number of containers allocated in this pool. | containers | CDH 4, CDH 5 |
| allocated_memory_gb | Allocated Memory | gigabytes | CDH 4, CDH 5 |
| available_memory_gb | Available Memory | gigabytes | CDH 4, CDH 5 |
| cgroup_cpu_system_rate | System CPU usage of the role's cgroup | seconds per second | CDH 4, CDH 5 |
| cgroup_cpu_user_rate | User Space CPU usage of the role's cgroup | seconds per second | CDH 4, CDH 5 |
| cgroup_mem_page_cache | Page cache usage of the role's cgroup | bytes | CDH 4, CDH 5 |
| cgroup_mem_rss | Resident memory of the role's cgroup | bytes | CDH 4, CDH 5 |
| cgroup_mem_swap | Swap usage of the role's cgroup | bytes | CDH 4, CDH 5 |
| cgroup_read_bytes_rate | Bytes read from all disks by the role's cgroup | bytes per second | CDH 4, CDH 5 |
| cgroup_read_ios_rate | Number of read I/O operations from all disks by the role's cgroup | ios per second | CDH 4, CDH 5 |
| cgroup_write_bytes_rate | Bytes written to all disks by the role's cgroup | bytes per second | CDH 4, CDH 5 |
| cgroup_write_ios_rate | Number of write I/O operations to all disks by the role's cgroup | ios per second | CDH 4, CDH 5 |
| containers_completed_rate | Containers Completed | containers per second | CDH 4, CDH 5 |
| containers_failed_rate | Containers Failed | containers per second | CDH 4, CDH 5 |
| containers_initing | Containers Initializing | containers | CDH 4, CDH 5 |
| containers_killed_rate | Containers Killed | containers per second | CDH 4, CDH 5 |
| containers_launched_rate | Containers Launched | containers per second | CDH 4, CDH 5 |
| containers_running | Containers Running | containers | CDH 4, CDH 5 |
| cpu_system_rate | Total System CPU | seconds per second | CDH 4, CDH 5 |
| cpu_system_with_descendants_rate | The total system CPU time for this process and all its descendant processes | seconds per second | CDH 4, CDH 5 |
| cpu_user_rate | Total CPU user time | seconds per second | CDH 4, CDH 5 |
| cpu_user_with_descendants_rate | The total user CPU time for this process and all its descendant processes | seconds per second | CDH 4, CDH 5 |
| events_critical_rate | The number of critical events. | events per second | CDH 4, CDH 5 |
| events_important_rate | The number of important events. | events per second | CDH 4, CDH 5 |
| events_informational_rate | The number of informational events. | events per second | CDH 4, CDH 5 |
| fd_max | Maximum number of file descriptors | file descriptors | CDH 4, CDH 5 |
| fd_open | Open file descriptors. | file descriptors | CDH 4, CDH 5 |
| gc_count_concurrent_mark_sweep_rate | The number of garbage collections by the Concurrent Mark Sweep Collector. | garbage collections per second | CDH 4, CDH 5 |
| gc_count_par_new_rate | The number of garbage collections by the ParNew (parallel young-generation) collector. | garbage collections per second | CDH 4, CDH 5 |
| gc_time_ms_concurrent_mark_sweep_rate | The total time spent in garbage collections by the Concurrent Mark Sweep Collector. | ms per second | CDH 4, CDH 5 |
| gc_time_ms_par_new_rate | The total time spent in garbage collections by the ParNew (parallel young-generation) collector. | ms per second | CDH 4, CDH 5 |
| get_container_status_avg_time | Get Container Status Average Time | ms | CDH 4 |
| get_container_status_rate | Get Container Status Operations | operations per second | CDH 4 |
| health_bad_rate | Percentage of Time with Bad Health | seconds per second | CDH 4, CDH 5 |
| health_concerning_rate | Percentage of Time with Concerning Health | seconds per second | CDH 4, CDH 5 |
| health_disabled_rate | Percentage of Time with Disabled Health | seconds per second | CDH 4, CDH 5 |
| health_good_rate | Percentage of Time with Good Health | seconds per second | CDH 4, CDH 5 |
| health_unknown_rate | Percentage of Time with Unknown Health | seconds per second | CDH 4, CDH 5 |
| jvm_blocked_threads | Blocked threads | threads | CDH 4, CDH 5 |
| jvm_gc_rate | Number of garbage collections | garbage collections per second | CDH 4, CDH 5 |
| jvm_gc_time_ms_rate | Total time spent garbage collecting (ms) | ms per second | CDH 4, CDH 5 |
| jvm_heap_committed_mb | Total amount of committed heap memory (MB) | MB | CDH 4, CDH 5 |
| jvm_heap_used_mb | Total amount of used heap memory (MB) | MB | CDH 4, CDH 5 |
| jvm_max_memory_mb | Maximum allowed memory (MB) | MB | CDH 4, CDH 5 |
| jvm_new_threads | New threads | threads | CDH 4, CDH 5 |
| jvm_non_heap_committed_mb | Total amount of committed non-heap memory (MB) | MB | CDH 4, CDH 5 |
| jvm_non_heap_used_mb | Total amount of used non-heap memory (MB) | MB | CDH 4, CDH 5 |
| jvm_runnable_threads | Runnable threads | threads | CDH 4, CDH 5 |
| jvm_terminated_threads | Terminated threads | threads | CDH 4, CDH 5 |
| jvm_timed_waiting_threads | Timed waiting threads | threads | CDH 4, CDH 5 |
| jvm_waiting_threads | Waiting threads | threads | CDH 4, CDH 5 |
| log_error_rate | Logged Errors | messages per second | CDH 4, CDH 5 |
| log_fatal_rate | Logged Fatals | messages per second | CDH 4, CDH 5 |
| log_info_rate | Logged Infos | messages per second | CDH 4, CDH 5 |
| log_warn_rate | Logged Warnings | messages per second | CDH 4, CDH 5 |
| login_failure_avg_time | Average Failed Login Time | ms | CDH 4, CDH 5 |
| login_failure_rate | Login Failures | operations per second | CDH 4, CDH 5 |
| login_success_avg_time | Average Successful Login Time | ms | CDH 4, CDH 5 |
| login_success_rate | Login Successes | operations per second | CDH 4, CDH 5 |
| mem_rss | Resident memory used | bytes | CDH 4, CDH 5 |
| mem_rss_with_descendants | The total resident memory for this process and all its descendant processes | bytes | CDH 4, CDH 5 |
| mem_virtual | Virtual memory used | bytes | CDH 4, CDH 5 |
| mem_virtual_with_descendants | The total virtual memory for this process and all its descendant processes | bytes | CDH 4, CDH 5 |
| metrics_dropped_pub_all | Dropped Metrics Updates By All Sinks | updates | CDH 4, CDH 5 |
| metrics_num_active_sinks | Active Metrics Sinks Count | sinks | CDH 4, CDH 5 |
| metrics_num_active_sources | Active Metrics Sources Count | sources | CDH 4, CDH 5 |
| metrics_num_all_sinks | All Metrics Sinks Count | sinks | CDH 4, CDH 5 |
| metrics_num_all_sources | All Metrics Sources Count | sources | CDH 4, CDH 5 |
| metrics_publish_avg_time | Metrics Publish Average Time | ms | CDH 4, CDH 5 |
| metrics_publish_rate | Metrics Publish Operations | operations per second | CDH 4, CDH 5 |
| metrics_snapshot_avg_time | Metrics Snapshot Average Time | ms | CDH 4, CDH 5 |
| metrics_snapshot_rate | Metrics Snapshot Operations | operations per second | CDH 4, CDH 5 |
| read_bytes_rate | The number of bytes read from the device | bytes per second | CDH 4, CDH 5 |
| rpc_authentication_failures_rate | RPC Authentication Failures | operations per second | CDH 4, CDH 5 |
| rpc_authentication_successes_rate | RPC Authentication Successes | operations per second | CDH 4, CDH 5 |
| rpc_authorization_failures_rate | RPC Authorization Failures | operations per second | CDH 4, CDH 5 |
| rpc_authorization_successes_rate | RPC Authorization Successes | operations per second | CDH 4, CDH 5 |
| rpc_call_queue_length | RPC Call Queue Length | items | CDH 4, CDH 5 |
| rpc_num_open_connections | Open RPC Connections | connections | CDH 4, CDH 5 |
| rpc_processing_time_avg_time | Average RPC Processing Time | ms | CDH 4, CDH 5 |
| rpc_processing_time_rate | RPCs Processed | operations per second | CDH 4, CDH 5 |
| rpc_queue_time_avg_time | Average RPC Queue Time | ms | CDH 4, CDH 5 |
| rpc_queue_time_rate | RPCs Queued | operations per second | CDH 4, CDH 5 |
| rpc_received_bytes_rate | RPC Received Bytes | bytes per second | CDH 4, CDH 5 |
| rpc_sent_bytes_rate | RPC Sent Bytes | bytes per second | CDH 4, CDH 5 |
| start_container_avg_time | Start Container Average Time | ms | CDH 4 |
| start_container_rate | Start Container Operations | operations per second | CDH 4 |
| stop_container_avg_time | Stop Container Average Time | ms | CDH 4 |
| stop_container_rate | Stop Container Operations | operations per second | CDH 4 |
| unexpected_exits | The number of times the role's backing process exited unexpectedly. | unexpected exits | CDH 4, CDH 5 |
| write_bytes_rate | The number of bytes written to the device | bytes per second | CDH 4, CDH 5 |
| get_container_statuses_avg_time | Get Container Statuses Average Time | ms | CDH 5 |
| get_container_statuses_rate | Get Container Statuses Operations | operations per second | CDH 5 |
| start_containers_avg_time | Start Containers Average Time | ms | CDH 5 |
| start_containers_rate | Start Containers Operations | operations per second | CDH 5 |
| stop_containers_avg_time | Stop Containers Average Time | ms | CDH 5 |
| stop_containers_rate | Stop Containers Operations | operations per second | CDH 5 |
| capacity_across_directories | Statistics, including the average, minimum, and maximum, of the Capacity metric computed across all of this entity's directories. | bytes | n/a |
| capacity_free_across_directories | Statistics, including the average, minimum, and maximum, of the Capacity Free metric computed across all of this entity's directories. | bytes | n/a |
| capacity_used_across_directories | Statistics, including the average, minimum, and maximum, of the Capacity Used metric computed across all of this entity's directories. | bytes | n/a |
| total_capacity_across_directories | The sum of the Capacity metric computed across all of this entity's directories. | bytes | n/a |
| total_capacity_free_across_directories | The sum of the Capacity Free metric computed across all of this entity's directories. | bytes | n/a |
| total_capacity_used_across_directories | The sum of the Capacity Used metric computed across all of this entity's directories. | bytes | n/a |
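
Two details of the table are easy to trip over when consuming these metrics in scripts: the container operation metrics carry different names on CDH 4 and CDH 5, and the CPU rates are reported in seconds per second. The following Python sketch illustrates one way to handle both. It is not part of Cloudera Manager; the helper names (`metric_for_version`, `cpu_cores_used`, `heap_utilization`) are hypothetical, and the pairing of each CDH 4-only metric with the similarly named CDH 5-only metric is an assumption drawn from the naming pattern, since the table does not state that mapping explicitly.

```python
# Assumption: each CDH 4-only metric in the table corresponds to the
# similarly named CDH 5-only metric (singular vs. plural container name).
CDH4_TO_CDH5 = {
    "get_container_status_avg_time": "get_container_statuses_avg_time",
    "get_container_status_rate": "get_container_statuses_rate",
    "start_container_avg_time": "start_containers_avg_time",
    "start_container_rate": "start_containers_rate",
    "stop_container_avg_time": "stop_containers_avg_time",
    "stop_container_rate": "stop_containers_rate",
}
CDH5_TO_CDH4 = {new: old for old, new in CDH4_TO_CDH5.items()}


def metric_for_version(name: str, cdh_major: int) -> str:
    """Return the metric name to use on the given CDH major version.

    Metrics listed for both versions are returned unchanged.
    """
    if cdh_major == 4:
        return CDH5_TO_CDH4.get(name, name)
    if cdh_major == 5:
        return CDH4_TO_CDH5.get(name, name)
    raise ValueError(f"unsupported CDH major version: {cdh_major}")


def cpu_cores_used(cpu_user_rate: float, cpu_system_rate: float) -> float:
    """Combine the two CPU rates (seconds per second) into cores in use.

    One CPU-second consumed per wall-clock second equals one busy core.
    """
    return cpu_user_rate + cpu_system_rate


def heap_utilization(jvm_heap_used_mb: float, jvm_max_memory_mb: float) -> float:
    """Return used heap as a fraction of the maximum allowed memory."""
    if jvm_max_memory_mb <= 0:
        raise ValueError("jvm_max_memory_mb must be positive")
    return jvm_heap_used_mb / jvm_max_memory_mb


if __name__ == "__main__":
    print(metric_for_version("start_container_rate", 5))
    print(cpu_cores_used(0.25, 0.5))
    print(heap_utilization(512, 1024))
```

The sample values passed in the `__main__` block are made up for illustration; in practice they would come from whatever monitoring query returns these metrics.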