Cloudera Observability On-Premises cluster services health checks
Lists the ZooKeeper health check tests that are performed on your Cloudera Observability On-Premises cluster services. They provide processing performance insights, such as messaging queue bottlenecks and delays that can cause workload scheduling issues. You can find the ZooKeeper Queue and Processing Timers metric charts in the Cloudera Observability On-Premises Charts Library tab and the following Health checks on the Cloudera Observability On-Premises related cluster service's page in the Health Tests section.
Analytic Database server
Health Check | Description |
---|---|
Impala Query Processing Time | This health test raises an alert when more than 25% of the Impala Queries do
not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Pipeline server
Health Check | Description |
---|---|
Spark Event Log Processing Time | This health test raises an alert when more than 25% of the Spark Event Logs do
not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
MR Jhist Processing Time | This health test raises an alert when more than 25% of the MR Jhist payloads do
not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Hive Audit Processing Time | This health test raises an alert when more than 25% of the Hive Audit payloads
do not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Oozie Workflow Processing Time | This health test raises an alert when more than 25% of the Oozie Workflows do
not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Yarn App Processing Time | This health test raises an alert when more than 25% of the Yarn Apps do not
finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Tez Dag Event Processing Time | This health test raises an alert when more than 25% of the Tez Dag Events do
not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Hive Tez Processing Time | This health test raises an alert when more than 25% of the Hive Tez
Applications do not finish processing within the threshold's run time value. Where
the defined Concerning and Bad runtime
threshold limits are 30 and 60 seconds, respectively. It uses the
|
MR Task Log Processing Time | This health test raises an alert when more than 25% of the MR task logs do not
finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Spark Task Log Processing Time | This health test raises an alert when more than 25% of the Spark Task Logs do
not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Yarn Container Log Processing Time | This health test raises an alert when more than 25% of the Yarn Container Logs
do not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Hive HDP26 Log Processing Time | This health test raises an alert when more than 25% of the Hive HDP26 Logs do
not finish processing within the threshold's run time value. Where the defined
Concerning and Bad runtime threshold
limits are 30 and 60 seconds, respectively. It uses the
|
Admin API server
Health Check Name | Description |
---|---|
Spark Event Log Zookeeper Queue Size | This health test raises an alert when the size of the Spark Event Log ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
Spark Task Log Zookeeper Queue Size | This health test raises an alert when the size of the Spark Task Log ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
Yarn App Zookeeper Queue Size | This health test raises an alert when the size of the Yarn App ZooKeeper queue
is above the Concerning and Bad
threshold size. Where:
It uses the
|
Hive Audit Zookeeper Queue Size | This health test raises an alert when the size of the Hive Audit ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
MR Jhist Zookeeper Queue Size | This health test raises an alert when the size of the MR Jhist ZooKeeper queue
is above the Concerning and Bad
threshold size. Where:
It uses the
|
MR Task Log Zookeeper Queue Size | This health test raises an alert when the size of the MR Task Log ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
Oozie Workflow Zookeeper Queue Size | This health test raises an alert when the size of the Oozie Workflow ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
Pse Zookeeper Queue Size | This health test raises an alert when the size of the Pse ZooKeeper queue is
above the Concerning and Bad threshold
size. Where:
It uses the |
Sdx Details Zookeeper Queue Size | This health test raises an alert when the size of the Sdx Details ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
Impala Query Zookeeper Queue Size | This health test raises an alert when the size of the Impala Query ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
Yarn App Metric Zookeeper Queue Size | This health test raises an alert when the size of the Yarn App Metric ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
Hive On MR Table Zookeeper Queue Size | This health test raises an alert when the size of the Hive On MR Table
ZooKeeper queue is above the Concerning and
Bad threshold size. Where:
It uses the
|
Tez History Protobuf Zookeeper Queue Size | This health test raises an alert when the size of the Tez History Protobuf
ZooKeeper queue is above the Concerning and
Bad threshold size. Where:
It uses the
|
Hive History Protobuf Zookeeper Queue Size | This health test raises an alert when the size of the Hive History Protobuf
ZooKeeper queue is above the Concerning and
Bad threshold size. Where:
It uses the
|
Llap History Protobuf Zookeeper Queue Size | This health test raises an alert when the size of the Llap History Protobuf
ZooKeeper queue is above the Concerning and
Bad threshold size. Where:
It uses the
|
Hive HDP26 Log Zookeeper Queue Size | This health test raises an alert when the size of the Hive HDP26 Log ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|
Cluster Metrics Zookeeper Queue Size | This health test raises an alert when the size of the Cluster Metrics ZooKeeper
queue is above the Concerning and Bad
threshold size. Where:
It uses the
|