HDFS Metrics

Many aggregate metrics are available in addition to base metrics. If an entity type has parents defined, you can formulate all possible aggregate metrics using the formula base_metric_across_parents.

In addition, metrics for aggregate totals can be formed by adding the prefix total_ to the front of the metric name.

Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".

For example, the following metric names may be valid for HDFS:

  • alerts_rate_across_clusters
  • total_alerts_rate_across_clusters

Some metrics, such as alerts_rate, apply to nearly every metric context. Others only apply to a certain service or role.

Metric Name Description Unit Parents
alerts_rate The number of alerts. events per second cluster
block_capacity The block capacity of the NameNode blocks cluster
blocks_total Blocks total blocks cluster
blocks_with_corrupt_replicas Blocks with corrupt replicas blocks cluster
canary_duration Duration of the last or currently running canary job ms cluster
cm_time_since_last_fsimage_fetch Time since last FsImage was fetched by Cloudera Reports Manager seconds cluster
cm_time_since_last_fsimage_index Time since last FsImage was indexed By Cloudera Reports Manager seconds cluster
dfs_capacity Total configured HDFS storage capacity bytes cluster
dfs_capacity_used Storage space used by HDFS files bytes cluster
dfs_capacity_used_non_hdfs Storage space used by non-HDFS files bytes cluster
events_critical_rate The number of critical events. events per second cluster
events_important_rate The number of important events. events per second cluster
events_informational_rate The number of informational events. events per second cluster
excess_blocks The total number of excess blocks blocks cluster
expired_heartbeats The number of expired heartbeats heartbeats cluster
files_total The number of files and directories in the HDFS files cluster
health_bad_rate Percentage of Time with Bad Health seconds per second cluster
health_concerning_rate Percentage of Time with Concerning Health seconds per second cluster
health_disabled_rate Percentage of Time with Disabled Health seconds per second cluster
health_good_rate Percentage of Time with Good Health seconds per second cluster
health_unknown_rate Percentage of Time with Unknown Health seconds per second cluster
missing_blocks Missing blocks blocks cluster
pending_deletion_blocks The number of replicas pending deletion. replicas cluster
pending_replication_blocks The number of blocks with replication pending blocks cluster
scheduled_replication_blocks The number of blocks with replication currently scheduled blocks cluster
under_replicated_blocks Under-replicated blocks blocks cluster
xceivers Transceivers transceivers cluster