Kafka Connect Metrics

In addition to these base metrics, many aggregate metrics are available. If an entity type has parents defined, you can formulate all possible aggregate metrics using the formula base_metric_across_parents.

In addition, metrics for aggregate totals can be formed by adding the prefix total_ to the front of the metric name.

Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".

For example, the following metric names may be valid for Kafka Connect:

  • alerts_rate_across_clusters
  • total_alerts_rate_across_clusters

Some metrics, such as alerts_rate, apply to nearly every metric context. Others only apply to a certain service or role.

Metric Name Description Unit Parents CDH Version
alerts_rate The number of alerts. events per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_cpu_system_rate CPU usage of the role's cgroup seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_cpu_user_rate User Space CPU usage of the role's cgroup seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_mem_page_cache Page cache usage of the role's cgroup bytes cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_mem_rss Resident memory of the role's cgroup bytes cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_mem_swap Swap usage of the role's cgroup bytes cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_read_bytes_rate Bytes read from all disks by the role's cgroup bytes per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_read_ios_rate Number of read I/O operations from all disks by the role's cgroup ios per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_write_bytes_rate Bytes written to all disks by the role's cgroup bytes per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
cgroup_write_ios_rate Number of write I/O operations to all disks by the role's cgroup ios per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
cpu_system_rate Total System CPU seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
cpu_user_rate Total CPU user time seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
events_critical_rate The number of critical events. events per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
events_important_rate The number of important events. events per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
events_informational_rate The number of informational events. events per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
fd_max Maximum number of file descriptors file descriptors cluster, kafka, rack CDH 5, CDH 6, CDH 7
fd_open Open file descriptors. file descriptors cluster, kafka, rack CDH 5, CDH 6, CDH 7
health_bad_rate Percentage of Time with Bad Health seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
health_concerning_rate Percentage of Time with Concerning Health seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
health_disabled_rate Percentage of Time with Disabled Health seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
health_good_rate Percentage of Time with Good Health seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
health_unknown_rate Percentage of Time with Unknown Health seconds per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_completed_rebalances_total The total number of rebalances completed by this worker. message.units.rebalances cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_connector_count The number of connectors run in this worker. message.units.connectors cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_connector_startup_attempts_total The total number of connector startups that this worker has attempted. message.units.connectors cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_connector_startup_failure_total The total number of connector starts that failed. message.units.connectors cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_epoch The epoch or generation number of this worker. epoch cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_rebalance_avg_time_ms The average time in milliseconds spent by this worker to rebalance. message.units.milliseconds cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_rebalance_max_time_ms The maximum time in milliseconds spent by this worker to rebalance. message.units.milliseconds cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_rebalancing Whether this worker is currently rebalancing. message.units.rebalancing cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_task_count The number of tasks run in this worker. message.units.connectors cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_task_startup_attempts_total The total number of task startups that this worker has attempted. message.units.connectors cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_task_startup_failure_total The total number of task starts that failed. message.units.connectors cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_task_startup_success_total The total number of task starts that succeeded. message.units.connectors cluster, kafka, rack CDH 5, CDH 6, CDH 7
kafka_connect_time_since_last_rebalance_ms The time in milliseconds since the most recent rebalance in this worker. message.units.milliseconds cluster, kafka, rack CDH 5, CDH 6, CDH 7
mem_rss Resident memory used bytes cluster, kafka, rack CDH 5, CDH 6, CDH 7
mem_swap Amount of swap memory used by this role's process. bytes cluster, kafka, rack CDH 5, CDH 6, CDH 7
mem_virtual Virtual memory used bytes cluster, kafka, rack CDH 5, CDH 6, CDH 7
oom_exits_rate The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled. exits per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
read_bytes_rate The number of bytes read from the device bytes per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
unexpected_exits_rate The number of times the role's backing process exited unexpectedly. exits per second cluster, kafka, rack CDH 5, CDH 6, CDH 7
uptime For a host, the amount of time since the host was booted. For a role, the uptime of the backing process. seconds cluster, kafka, rack CDH 5, CDH 6, CDH 7
write_bytes_rate The number of bytes written to the device bytes per second cluster, kafka, rack CDH 5, CDH 6, CDH 7