Telemetry Publisher Metrics

Metric Name Description Unit CDH Version
alerts_rate The number of alerts. events per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_cpu_system_rate CPU usage of the role's cgroup seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_cpu_user_rate User Space CPU usage of the role's cgroup seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_mem_page_cache Page cache usage of the role's cgroup bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_mem_rss Resident memory of the role's cgroup bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_mem_swap Swap usage of the role's cgroup bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_read_bytes_rate Bytes read from all disks by the role's cgroup bytes per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_read_ios_rate Number of read I/O operations from all disks by the role's cgroup ios per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_write_bytes_rate Bytes written to all disks by the role's cgroup bytes per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cgroup_write_ios_rate Number of write I/O operations to all disks by the role's cgroup ios per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cpu_system_rate Total System CPU seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cpu_user_rate Total CPU user time seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
events_critical_rate The number of critical events. events per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
events_important_rate The number of important events. events per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
events_informational_rate The number of informational events. events per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
fd_max Maximum number of file descriptors file descriptors [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
fd_open Open file descriptors. file descriptors [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_bad_rate Percentage of Time with Bad Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_concerning_rate Percentage of Time with Concerning Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_disabled_rate Percentage of Time with Disabled Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_good_rate Percentage of Time with Good Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
health_unknown_rate Percentage of Time with Unknown Health seconds per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
mem_rss Resident memory used bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
mem_swap Amount of swap memory used by this role's process. bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
mem_virtual Virtual memory used bytes [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
oom_exits_rate The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled. exits per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
read_bytes_rate The number of bytes read from the device bytes per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
unexpected_exits_rate The number of times the role's backing process exited unexpectedly. exits per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
uptime For a host, the amount of time since the host was booted. For a role, the uptime of the backing process. seconds [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
write_bytes_rate The number of bytes written to the device bytes per second [CDH 5.0.0..CDH 6.0.0), [CDH 6.0.0..CDH 7.0.0), [CDH 7.0.0..CDH 8.0.0), [CM -1.0.0..CM -1.0.0]
cm_snapshot_data_export_fail_counts_rate Data export fail count for cm-snapshot counts per second [CM -1.0.0..CM -1.0.0]
cm_snapshot_data_export_success_counts_rate Data export success count for cm-snapshot counts per second [CM -1.0.0..CM -1.0.0]
cm_snapshot_data_ingest_fail_counts_rate Data ingest fail count for cm-snapshot counts per second [CM -1.0.0..CM -1.0.0]
cm_snapshot_data_ingest_success_counts_rate Data ingest success count for cm-snapshot counts per second [CM -1.0.0..CM -1.0.0]
hive_app_data_export_fail_counts_rate Data export fail count for HIVE-app counts per second [CM -1.0.0..CM -1.0.0]
hive_app_data_export_success_counts_rate Data export success count for HIVE-app counts per second [CM -1.0.0..CM -1.0.0]
hive_app_data_ingest_fail_counts_rate Data ingest fail count for HIVE-app counts per second [CM -1.0.0..CM -1.0.0]
hive_app_data_ingest_success_counts_rate Data ingest success count for HIVE-app counts per second [CM -1.0.0..CM -1.0.0]
hive_query_audits_data_export_fail_counts_rate Data export fail count for HIVE-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
hive_query_audits_data_export_success_counts_rate Data export success count for HIVE-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
hive_query_audits_data_ingest_fail_counts_rate Data ingest fail count for HIVE-query-audits counts per second [CM -1.0.0..CM -1.0.0]
hive_query_audits_data_ingest_success_counts_rate Data ingest fail count for HIVE-query-audits counts per second [CM -1.0.0..CM -1.0.0]
hive_query_lineage_data_export_fail_counts_rate Data export fail count for HIVE-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
hive_query_lineage_data_export_success_counts_rate Data export success count for HIVE-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
hive_query_lineage_data_ingest_fail_counts_rate Data ingest fail count for HIVE-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
hive_query_lineage_data_ingest_success_counts_rate Data ingest success count for HIVE-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
hive_tez_app_data_export_fail_counts_rate Data export fail count for HIVE-tez-app counts per second [CM -1.0.0..CM -1.0.0]
hive_tez_app_data_export_success_counts_rate Data export success count for HIVE-tez-app counts per second [CM -1.0.0..CM -1.0.0]
hive_tez_app_data_ingest_fail_counts_rate Data ingest fail count for HIVE-tez-app counts per second [CM -1.0.0..CM -1.0.0]
hive_tez_app_data_ingest_success_counts_rate Data ingest success count for HIVE-tez-app counts per second [CM -1.0.0..CM -1.0.0]
hms_metastore_data_export_fail_counts_rate Data export fail count for HMS-metastore counts per second [CM -1.0.0..CM -1.0.0]
hms_metastore_data_export_success_counts_rate Data export success count for HMS-metastore counts per second [CM -1.0.0..CM -1.0.0]
hms_metastore_data_ingest_fail_counts_rate Data ingest fail count for HMS-metastore counts per second [CM -1.0.0..CM -1.0.0]
hms_metastore_data_ingest_success_counts_rate Data ingest success count for HMS-metastore counts per second [CM -1.0.0..CM -1.0.0]
impala_query_lineage_data_export_fail_counts_rate Data export fail count for IMPALA-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
impala_query_lineage_data_export_success_counts_rate Data export success count for IMPALA-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
impala_query_lineage_data_ingest_fail_counts_rate Data ingest fail count for IMPALA-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
impala_query_lineage_data_ingest_success_counts_rate Data ingest success count for IMPALA-query-lineage counts per second [CM -1.0.0..CM -1.0.0]
impala_query_profile_data_export_fail_counts_rate Data export fail count for IMPALA-query-profile counts per second [CM -1.0.0..CM -1.0.0]
impala_query_profile_data_export_success_counts_rate Data export success count for IMPALA-query-profile counts per second [CM -1.0.0..CM -1.0.0]
impala_query_profile_data_ingest_fail_counts_rate Data ingest fail count for IMPALA-query-profile counts per second [CM -1.0.0..CM -1.0.0]
impala_query_profile_data_ingest_success_counts_rate Data ingest success count for IMPALA-query-profile counts per second [CM -1.0.0..CM -1.0.0]
jvm_gc_rate Number of garbage collections garbage collections per second [CM -1.0.0..CM -1.0.0]
jvm_gc_time_ms_rate Total time spent garbage collecting. ms per second [CM -1.0.0..CM -1.0.0]
jvm_heap_committed_mb Total amount of committed heap memory. MB [CM -1.0.0..CM -1.0.0]
jvm_heap_used_mb Total amount of used heap memory. MB [CM -1.0.0..CM -1.0.0]
jvm_max_memory_mb Maximum allowed memory. MB [CM -1.0.0..CM -1.0.0]
jvm_non_heap_committed_mb Total amount of committed non-heap memory. MB [CM -1.0.0..CM -1.0.0]
jvm_non_heap_used_mb Total amount of used non-heap memory. MB [CM -1.0.0..CM -1.0.0]
oozie_workflows_data_export_fail_counts_rate Data export fail count for OOZIE-workflows counts per second [CM -1.0.0..CM -1.0.0]
oozie_workflows_data_export_success_counts_rate Data export success count for OOZIE-workflows counts per second [CM -1.0.0..CM -1.0.0]
oozie_workflows_data_ingest_fail_counts_rate Data ingest fail count for OOZIE-workflows counts per second [CM -1.0.0..CM -1.0.0]
oozie_workflows_data_ingest_success_counts_rate Data ingest success count for OOZIE-workflows counts per second [CM -1.0.0..CM -1.0.0]
spark2_on_yarn_event_log_data_export_fail_counts_rate Data export fail count for SPARK2_ON_YARN-event-log counts per second [CM -1.0.0..CM -1.0.0]
spark2_on_yarn_event_log_data_export_success_counts_rate Data export success count for SPARK2_ON_YARN-event-log counts per second [CM -1.0.0..CM -1.0.0]
spark2_on_yarn_event_log_data_ingest_fail_counts_rate Data ingest fail count for SPARK2_ON_YARN-event-log counts per second [CM -1.0.0..CM -1.0.0]
spark2_on_yarn_event_log_data_ingest_success_counts_rate Data ingest success count for SPARK2_ON_YARN-event-log counts per second [CM -1.0.0..CM -1.0.0]
spark_on_yarn_event_log_data_export_fail_counts_rate Data export fail count for SPARK_ON_YARN-event-log counts per second [CM -1.0.0..CM -1.0.0]
spark_on_yarn_event_log_data_export_success_counts_rate Data export success count for SPARK_ON_YARN-event-log counts per second [CM -1.0.0..CM -1.0.0]
spark_on_yarn_event_log_data_ingest_fail_counts_rate Data ingest fail count for SPARK_ON_YARN-event-log counts per second [CM -1.0.0..CM -1.0.0]
spark_on_yarn_event_log_data_ingest_success_counts_rate Data ingest success count for SPARK_ON_YARN-event-log counts per second [CM -1.0.0..CM -1.0.0]
spark_on_yarn_lineage_data_export_fail_counts_rate Data export fail count for SPARK_ON_YARN-lineage counts per second [CM -1.0.0..CM -1.0.0]
spark_on_yarn_lineage_data_export_success_counts_rate Data export success count for SPARK_ON_YARN-lineage counts per second [CM -1.0.0..CM -1.0.0]
spark_on_yarn_lineage_data_ingest_fail_counts_rate Data ingest fail count for SPARK_ON_YARN-lineage counts per second [CM -1.0.0..CM -1.0.0]
spark_on_yarn_lineage_data_ingest_success_counts_rate Data ingest success count for SPARK_ON_YARN-lineage counts per second [CM -1.0.0..CM -1.0.0]
telemetry_publisher_exported_data_size Total Data Exported by Telemetry Publisher since it started. Data size in bytes [CM -1.0.0..CM -1.0.0]
web_metrics_collection_duration Web Server Responsiveness ms [CM -1.0.0..CM -1.0.0]
yarn_apps_data_export_fail_counts_rate Data export fail count for YARN-apps counts per second [CM -1.0.0..CM -1.0.0]
yarn_apps_data_export_success_counts_rate Data eport success count for YARN-apps counts per second [CM -1.0.0..CM -1.0.0]
yarn_apps_data_ingest_fail_counts_rate Data ingest fail count for YARN-apps counts per second [CM -1.0.0..CM -1.0.0]
yarn_apps_data_ingest_success_counts_rate Data ingest success count for YARN-apps counts per second [CM -1.0.0..CM -1.0.0]
yarn_jhist_data_export_fail_counts_rate Data export fail count for YARN-jhist counts per second [CM -1.0.0..CM -1.0.0]
yarn_jhist_data_export_success_counts_rate Data export success count for YARN-jhist counts per second [CM -1.0.0..CM -1.0.0]
yarn_jhist_data_ingest_fail_counts_rate Data ingest fail count for YARN-jhist counts per second [CM -1.0.0..CM -1.0.0]
yarn_jhist_data_ingest_success_counts_rate Data ingest success count for YARN-jhist counts per second [CM -1.0.0..CM -1.0.0]
yarn_jobs_data_export_fail_counts_rate Data export fail count for YARN-jobs counts per second [CM -1.0.0..CM -1.0.0]
yarn_jobs_data_export_success_counts_rate Data export success count for YARN-jobs counts per second [CM -1.0.0..CM -1.0.0]
yarn_jobs_data_ingest_fail_counts_rate Data ingest fail count for YARN-jobs counts per second [CM -1.0.0..CM -1.0.0]
yarn_jobs_data_ingest_success_counts_rate Data ingest success count for YARN-jobs counts per second [CM -1.0.0..CM -1.0.0]