HDFS Metrics
Many aggregate metrics are available in addition to base metrics. If an entity type has
parents defined, you can formulate all possible aggregate metrics using the formula
base_metric_across_parents
.
In addition, metrics for aggregate totals can be formed by adding the prefix
total_
to the front of the metric name.
Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".
For example, the following metric names may be valid for HDFS:
-
alerts_rate_across_clusters
-
total_alerts_rate_across_clusters
Some metrics, such as alerts_rate
, apply to nearly every metric context. Others only apply to a
certain service or role.
Metric Name | Description | Unit | Parents |
---|---|---|---|
alerts_rate | The number of alerts. | events per second | cluster |
block_capacity | The block capacity of the NameNode | blocks | cluster |
blocks_total | Blocks total | blocks | cluster |
blocks_with_corrupt_replicas | Blocks with corrupt replicas | blocks | cluster |
canary_duration | Duration of the last or currently running canary job | ms | cluster |
cm_time_since_last_fsimage_fetch | Time since last FsImage was fetched by Cloudera Reports Manager | seconds | cluster |
cm_time_since_last_fsimage_index | Time since last FsImage was indexed By Cloudera Reports Manager | seconds | cluster |
dfs_capacity | Total configured HDFS storage capacity | bytes | cluster |
dfs_capacity_used | Storage space used by HDFS files | bytes | cluster |
dfs_capacity_used_non_hdfs | Storage space used by non-HDFS files | bytes | cluster |
events_critical_rate | The number of critical events. | events per second | cluster |
events_important_rate | The number of important events. | events per second | cluster |
events_informational_rate | The number of informational events. | events per second | cluster |
excess_blocks | The total number of excess blocks | blocks | cluster |
expired_heartbeats | The number of expired heartbeats | heartbeats | cluster |
files_total | The number of files and directories in the HDFS | files | cluster |
health_bad_rate | Percentage of Time with Bad Health | seconds per second | cluster |
health_concerning_rate | Percentage of Time with Concerning Health | seconds per second | cluster |
health_disabled_rate | Percentage of Time with Disabled Health | seconds per second | cluster |
health_good_rate | Percentage of Time with Good Health | seconds per second | cluster |
health_unknown_rate | Percentage of Time with Unknown Health | seconds per second | cluster |
missing_blocks | Missing blocks | blocks | cluster |
pending_deletion_blocks | The number of replicas pending deletion. | replicas | cluster |
pending_replication_blocks | The number of blocks with replication pending | blocks | cluster |
scheduled_replication_blocks | The number of blocks with replication currently scheduled | blocks | cluster |
under_replicated_blocks | Under-replicated blocks | blocks | cluster |
xceivers | Transceivers | transceivers | cluster |