Oozie Health Tests
Oozie Server Health
This Oozie service-level health test checks for the presence of a running, healthy Oozie Server. The test returns "Bad" health if the service is running and the Oozie Server is not running. In all other cases it returns the health of the Oozie Server. A failure of this health test indicates a stopped or unhealthy Oozie Server. Check the status of the Oozie Server for more information. This test can be enabled or disabled using the Oozie Server Role Health Test Oozie service-wide monitoring setting.
Short Name: Oozie Server Health
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Oozie Server Role Health Test | When computing the overall OOZIE health, consider Oozie Server's health | oozie_oozieserver_health_enabled | true | no unit |
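The decision rule described for this test can be sketched in a few lines of Python. This is only an illustration of the rule as stated above, not the monitoring service's actual implementation: the `Health` enum, the function name, and its parameters are hypothetical, and returning `None` when the test is disabled is an assumption made for the sketch.

```python
from enum import Enum
from typing import Optional


class Health(Enum):
    """Health states named in this document (hypothetical type for the sketch)."""
    GOOD = "Good"
    CONCERNING = "Concerning"
    BAD = "Bad"


def oozie_server_health(
    service_running: bool,
    server_running: bool,
    server_health: Health,
    enabled: bool = True,  # mirrors oozie_oozieserver_health_enabled (default: true)
) -> Optional[Health]:
    """Return the service-level result for the Oozie Server Health test."""
    if not enabled:
        # Assumption: a disabled test contributes no result to overall health.
        return None
    if service_running and not server_running:
        # The service is up but its Oozie Server is not: report "Bad".
        return Health.BAD
    # In all other cases, pass through the Oozie Server's own health.
    return server_health
```

For example, `oozie_server_health(True, False, Health.GOOD)` yields `Health.BAD`, while `oozie_server_health(True, True, Health.CONCERNING)` passes `Health.CONCERNING` through unchanged.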
Oozie Servers Health
This is an Oozie service-level health test that checks that enough of the Oozie Servers in the cluster are healthy. The test returns "Concerning" health if the number of healthy Oozie Servers falls below a warning threshold, expressed as a percentage of the total number of Oozie Servers. The test returns "Bad" health if the number of healthy and "Concerning" Oozie Servers falls below a critical threshold, expressed as a percentage of the total number of Oozie Servers. For example, if this test is configured with a warning threshold of 95% and a critical threshold of 90% for a cluster of 100 Oozie Servers, this test would return "Good" health if 95 or more Oozie Servers have "Good" health. This test would return "Concerning" health if fewer than 95 Oozie Servers have "Good" health but at least 90 Oozie Servers have either "Good" or "Concerning" health. If more than 10 Oozie Servers have "Bad" health, this test would return "Bad" health. A failure of this health test indicates unhealthy Oozie Servers. Check the status of the individual Oozie Servers for more information. This test can be configured using the Healthy Oozie Server Monitoring Thresholds Oozie service-wide monitoring setting.
Short Name: Oozie Servers Health
Property Name | Description | Template Name | Default Value | Unit |
---|---|---|---|---|
Healthy Oozie Server Monitoring Thresholds | The health test thresholds for the overall Oozie Server health. The check returns "Concerning" health if the percentage of "Healthy" Oozie Servers falls below the warning threshold. The check returns "Bad" health if the total percentage of "Healthy" and "Concerning" Oozie Servers falls below the critical threshold. | oozie_servers_healthy_thresholds | critical:51.0, warning:99.0 | PERCENT |
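The threshold arithmetic for this test can be worked through with a short Python sketch. It is a reading of the rules described above, not the product's code: the function name and parameters are hypothetical, the defaults are taken from the table, and checking the critical threshold before the warning threshold is an assumption about precedence when both are breached.

```python
def oozie_servers_health(
    good: int,
    concerning: int,
    total: int,
    warning: float = 99.0,   # default warning threshold from the table (percent)
    critical: float = 51.0,  # default critical threshold from the table (percent)
) -> str:
    """Evaluate the Oozie Servers Health test from per-server health counts."""
    if total <= 0:
        raise ValueError("total must be a positive number of Oozie Servers")
    if good < 0 or concerning < 0 or good + concerning > total:
        raise ValueError("server counts are inconsistent with total")

    good_pct = 100.0 * good / total
    good_or_concerning_pct = 100.0 * (good + concerning) / total

    # Assumption: the critical check takes precedence over the warning check.
    if good_or_concerning_pct < critical:
        return "Bad"
    if good_pct < warning:
        return "Concerning"
    return "Good"
```

With the thresholds from the example (warning 95%, critical 90%, 100 Oozie Servers), `oozie_servers_health(95, 0, 100, warning=95.0, critical=90.0)` gives "Good", 92 good servers give "Concerning", and 85 good plus 4 concerning servers (11 bad) give "Bad". Under the default thresholds, a two-server cluster with one bad server sits at 50%, below the 51% critical threshold, so the sketch reports "Bad".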