Livy Health Tests

Livy Server Health

This is a Livy service-level health test that checks that enough of the Livy Servers in the cluster are healthy. The test returns "Concerning" health if the number of healthy Livy Servers falls below a warning threshold, expressed as a percentage of the total number of Livy Servers. The test returns "Bad" health if the number of healthy and "Concerning" Livy Servers falls below a critical threshold, expressed as a percentage of the total number of Livy Servers.

For example, if this test is configured with a warning threshold of 95% and a critical threshold of 90% for a cluster of 100 Livy Servers, this test would return "Good" health if 95 or more Livy Servers have "Good" health. This test would return "Concerning" health if fewer than 95 Livy Servers have "Good" health but at least 90 Livy Servers have either "Good" or "Concerning" health. If more than 10 Livy Servers have "Bad" health, this test would return "Bad" health.

A failure of this health test indicates unhealthy Livy Servers. Check the status of the individual Livy Servers for more information. This test can be configured using the Livy service-wide monitoring setting.
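The threshold logic described above can be sketched in a few lines of Python. This is only an illustration of the rules as stated in this section, not the actual implementation of the health test; the function name `livy_server_health` and its parameters are hypothetical.

```python
def livy_server_health(good, concerning, total, warning_pct, critical_pct):
    """Illustrative sketch of the service-level rule described above.

    good, concerning: counts of Livy Servers in each health state
    total: total number of Livy Servers in the cluster
    warning_pct, critical_pct: thresholds as percentages of total
    All names here are hypothetical and chosen for illustration only.
    """
    if total <= 0:
        raise ValueError("total must be a positive number of Livy Servers")

    good_pct = 100.0 * good / total
    good_or_concerning_pct = 100.0 * (good + concerning) / total

    # "Bad" when good plus concerning servers fall below the critical threshold.
    if good_or_concerning_pct < critical_pct:
        return "Bad"
    # "Concerning" when good servers alone fall below the warning threshold.
    if good_pct < warning_pct:
        return "Concerning"
    return "Good"


# The worked example from the text: 100 servers, warning 95%, critical 90%.
print(livy_server_health(95, 0, 100, 95.0, 90.0))
print(livy_server_health(92, 0, 100, 95.0, 90.0))
print(livy_server_health(85, 4, 100, 95.0, 90.0))
```

The critical check runs first so that "Bad" takes precedence whenever both thresholds are violated.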

Short Name: Livy Server Health

Healthy Livy Server Monitoring Thresholds

Description
The health test thresholds for the overall Livy Server health. The check returns "Concerning" health if the percentage of "Healthy" Livy Servers falls below the warning threshold. The check returns "Bad" health if the total percentage of "Healthy" and "Concerning" Livy Servers falls below the critical threshold.
Template Name
LIVY_LIVY_SERVER_healthy_thresholds
Default Value
critical:90.0, warning:99.0
Unit(s)
PERCENT
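
To make the default thresholds concrete, the following Python sketch converts a percentage threshold into the minimum number of Livy Servers needed to stay at or above it. It assumes that "falls below" means a strict comparison of the percentage against the threshold; the helper name `min_servers_for_threshold` is hypothetical.

```python
import math


def min_servers_for_threshold(total, threshold_pct):
    """Smallest server count whose share of total is at least threshold_pct.

    Hypothetical helper for illustration; assumes the health test compares
    the percentage of servers against the threshold with a strict "below".
    """
    return math.ceil(total * threshold_pct / 100)


# Default values from this section: critical:90.0, warning:99.0
for total in (20, 100):
    warning_min = min_servers_for_threshold(total, 99.0)
    critical_min = min_servers_for_threshold(total, 90.0)
    print(total, warning_min, critical_min)
```

Under these assumptions, a cluster of 100 Livy Servers needs at least 99 healthy servers to avoid "Concerning" health and at least 90 healthy or "Concerning" servers to avoid "Bad" health. In a cluster of 20, the 99% warning threshold means a single unhealthy server is enough to trigger "Concerning" health.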