JournalNode Metrics
In addition to these base metrics, many aggregate metrics are available.
If an entity type has parents defined, you can formulate all possible
aggregate metrics using the formula
base_metric_across_parents
.
In addition, metrics for aggregate totals can be formed by adding the prefix
total_
to the front of the metric name.
Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".
For example, the following metric names may be valid for JournalNode:
-
accept_recovery_avg_time_across_clusters
-
total_accept_recovery_avg_time_across_clusters
Some metrics, such as alerts_rate
, apply to nearly every metric context. Others only apply to a
certain service or role.
Metric Name | Description | Unit | Parents | CDH Version |
---|---|---|---|---|
accept_recovery_avg_time | Accept Recovery Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
accept_recovery_rate | Accept Recovery Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
alerts_rate | The number of alerts. | events per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
batches_written_rate | Batches Written | batches per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
batches_written_while_lagging_rate | Batches Written While Lagging | batches per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
bytes_written_rate | Bytes Written | bytes per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_cpu_system_rate | CPU usage of the role's cgroup | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_cpu_user_rate | User Space CPU usage of the role's cgroup | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_mem_page_cache | Page cache usage of the role's cgroup | bytes | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_mem_rss | Resident memory of the role's cgroup | bytes | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_mem_swap | Swap usage of the role's cgroup | bytes | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_read_bytes_rate | Bytes read from all disks by the role's cgroup | bytes per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_read_ios_rate | Number of read I/O operations from all disks by the role's cgroup | ios per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_write_bytes_rate | Bytes written to all disks by the role's cgroup | bytes per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cgroup_write_ios_rate | Number of write I/O operations to all disks by the role's cgroup | ios per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cpu_system_rate | Total System CPU | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
cpu_user_rate | Total CPU user time | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
current_lag_txns_journalnode | The number of transactions by which this JournalNode's log is lagging behind the quorum as reported by the JournalNode | transactions | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
current_lag_txns_namenode | The number of transactions by which this JournalNode's log is lagging behind the quorum as reported by the NameNode | transactions | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
events_critical_rate | The number of critical events. | events per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
events_important_rate | The number of important events. | events per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
events_informational_rate | The number of informational events. | events per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
fd_max | Maximum number of file descriptors | file descriptors | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
fd_open | Open file descriptors. | file descriptors | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
finalize_log_segment_avg_time | Finalize Log Segment Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
finalize_log_segment_rate | Finalize Log Segment Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
format_avg_time | The average time for format operations. | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
format_rate | The total number of format operations. | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
get_edit_log_manifest_avg_time | Get Edit Log Manifest Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
get_edit_log_manifest_rate | Get Edit Log Manifest Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
get_hadoop_groups_avg_time | Average Time to get Hadoop group for the user | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
get_hadoop_groups_rate | Get Hadoop User Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
get_journal_state_avg_time | Get Journal State Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
get_journal_state_rate | Get Journal State Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
health_bad_rate | Percentage of Time with Bad Health | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
health_concerning_rate | Percentage of Time with Concerning Health | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
health_disabled_rate | Percentage of Time with Disabled Health | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
health_good_rate | Percentage of Time with Good Health | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
health_unknown_rate | Percentage of Time with Unknown Health | seconds per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
is_formatted_avg_time | The average time for is formatted operations. | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
is_formatted_rate | The total number of is formatted operations. | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
journal_avg_time | Journal Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
journal_rate | Journal Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_blocked_threads | Blocked threads | threads | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_gc_rate | Number of garbage collections | garbage collections per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_gc_time_ms_rate | Total time spent garbage collecting. | ms per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_heap_committed_mb | Total amount of committed heap memory. | MB | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_heap_used_mb | Total amount of used heap memory. | MB | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_max_memory_mb | Maximum allowed memory. | MB | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_new_threads | New threads | threads | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_non_heap_committed_mb | Total amount of committed non-heap memory. | MB | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_non_heap_used_mb | Total amount of used non-heap memory. | MB | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_runnable_threads | Runnable threads | threads | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_terminated_threads | Terminated threads | threads | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_timed_waiting_threads | Timed waiting threads | threads | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_total_threads | Total threads | threads | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
jvm_waiting_threads | Waiting threads | threads | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
lag_time_millis | The amount of time by which this JournalNode's log is lagging behind the quorum | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
last_promised_epoch | Last Promised Epoch | epoch | CDH 5, CDH 6, CDH 7 | |
last_writer_epoch | Last Writer Epoch | epoch | CDH 5, CDH 6, CDH 7 | |
log_error_rate | Logged Errors | messages per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
log_fatal_rate | Logged Fatals | messages per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
log_info_rate | Logged Infos | messages per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
log_warn_rate | Logged Warnings | messages per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
login_failure_avg_time | Average Failed Login Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
login_failure_rate | Login Failures | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
login_success_avg_time | Average Successful Login Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
login_success_rate | Login Successes | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
mem_rss | Resident memory used | bytes | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
mem_swap | Amount of swap memory used by this role's process. | bytes | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
mem_virtual | Virtual memory used | bytes | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_dropped_pub_all | Dropped Metrics Updates By All Sinks | updates | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_num_active_sinks | Active Metrics Sinks Count | sinks | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_num_active_sources | Active Metrics Sources Count | sources | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_num_all_sinks | All Metrics Sinks Count | sinks | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_num_all_sources | All Metrics Sources Count | sources | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_publish_avg_time | Metrics Publish Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_publish_rate | Metrics Publish Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_snapshot_avg_time | Metrics Snapshot Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
metrics_snapshot_rate | Metrics Snapshot Average Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
new_epoch_avg_time | New Epoch Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
new_epoch_rate | New Epoch Average Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
oom_exits_rate | The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled. | exits per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
prepare_recovery_avg_time | Prepare Recovery Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
prepare_recovery_rate | Prepare Recovery Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
purge_logs_avg_time | Purge Logs Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
purge_logs_rate | Purge Logs Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
queued_edits_size | The size in bytes of the edits currently queued by the active NameNode to this JournalNode | bytes | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
read_bytes_rate | The number of bytes read from the device | bytes per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_authentication_failures_rate | RPC Authentication Failures | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_authentication_successes_rate | RPC Authentication Successes | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_authorization_failures_rate | RPC Authorization Failures | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_authorization_successes_rate | RPC Authorization Successes | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_call_queue_length | RPC Call Queue Length | items | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_num_open_connections | Open RPC Connections | connections | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_processing_time_avg_time | Average RPC Processing Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_processing_time_rate | RPCs Processed | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_queue_time_avg_time | Average RPC Queue Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_queue_time_rate | RPCs Queued | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_received_bytes_rate | RPC Received Bytes | bytes per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
rpc_sent_bytes_rate | RPC Sent Bytes | bytes per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
start_log_segment_avg_time | Start Log Segment Average Time | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
start_log_segment_rate | Start Log Segment Average Operations | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs300s50th_percentile_latency_micros | Sync Latency 5 Minutes 50% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs300s75th_percentile_latency_micros | Sync Latency 5 Minutes 75% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs300s90th_percentile_latency_micros | Sync Latency 5 Minutes 90% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs300s95th_percentile_latency_micros | Sync Latency 5 Minutes 95% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs300s99th_percentile_latency_micros | Sync Latency 5 Minutes 99% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs300s_rate | Sync Operations: Five Minute Granularity | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs3600s50th_percentile_latency_micros | Sync Latency Hour 50% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs3600s75th_percentile_latency_micros | Sync Latency Hour 75% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs3600s90th_percentile_latency_micros | Sync Latency Hour 90% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs3600s95th_percentile_latency_micros | Sync Latency Hour 95% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs3600s99th_percentile_latency_micros | Sync Latency Hour 99% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs3600s_rate | Sync Operations: One Hour Granularity | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs60s50th_percentile_latency_micros | Sync Latency Minute 50% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs60s75th_percentile_latency_micros | Sync Latency Minute 75% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs60s90th_percentile_latency_micros | Sync Latency Minute 90% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs60s95th_percentile_latency_micros | Sync Latency Minute 95% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs60s99th_percentile_latency_micros | Sync Latency Minute 99% | micros | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
syncs60s_rate | Sync Operations: One Minute Granularity | operations per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
txns_written_rate | Transactions Written | transactions per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
unexpected_exits_rate | The number of times the role's backing process exited unexpectedly. | exits per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
uptime | For a host, the amount of time since the host was booted. For a role, the uptime of the backing process. | seconds | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
web_metrics_collection_duration | Web Server Responsiveness | ms | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |
write_bytes_rate | The number of bytes written to the device | bytes per second | cluster, hdfs, rack | CDH 5, CDH 6, CDH 7 |