Hive on LLAP Health Tests

Hive on LLAP HiveServer2 Health

This is a Hive on LLAP service-level health test that checks that enough of the HiveServer2s in the cluster are healthy. The test returns "Concerning" health if the number of healthy HiveServer2s falls below a warning threshold, expressed as a percentage of the total number of HiveServer2s. The test returns "Bad" health if the number of healthy and "Concerning" HiveServer2s falls below a critical threshold, expressed as a percentage of the total number of HiveServer2s.

For example, if this test is configured with a warning threshold of 95% and a critical threshold of 90% for a cluster of 100 HiveServer2s, this test would return "Good" health if 95 or more HiveServer2s have "Good" health. This test would return "Concerning" health if fewer than 95 HiveServer2s have "Good" health but at least 90 HiveServer2s have either "Good" or "Concerning" health. If more than 10 HiveServer2s have "Bad" health, this test would return "Bad" health.

A failure of this health test indicates unhealthy HiveServer2s. Check the status of the individual HiveServer2s for more information. This test can be configured using the Healthy HiveServer2 Monitoring Thresholds Hive on LLAP service-wide monitoring setting.
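The threshold rules described above can be sketched in a few lines of Python. This is only an illustration of the documented behavior, not the monitoring service's actual implementation: the function name `service_health` and its parameters are hypothetical, and the sketch assumes every role is in exactly one of the three states "Good", "Concerning", or "Bad".

```python
def service_health(good, concerning, bad, warning_pct, critical_pct):
    """Classify service-level health from per-role health counts.

    Hypothetical helper that mirrors the documented rules; roles in
    any other state (for example stopped or unknown) are not modeled.
    """
    total = good + concerning + bad
    if total == 0:
        # Assumption: the behavior with no roles is not documented here.
        raise ValueError("at least one role is required")

    pct_good = 100.0 * good / total
    pct_good_or_concerning = 100.0 * (good + concerning) / total

    # "Bad" when Good + Concerning roles fall below the critical threshold.
    if pct_good_or_concerning < critical_pct:
        return "Bad"
    # "Concerning" when Good roles alone fall below the warning threshold.
    if pct_good < warning_pct:
        return "Concerning"
    return "Good"


# The 100-role example with warning 95% and critical 90%.
print(service_health(95, 0, 5, 95.0, 90.0))
print(service_health(85, 5, 10, 95.0, 90.0))
print(service_health(80, 9, 11, 95.0, 90.0))
```

The critical check runs first so that a service with too many "Bad" roles is never reported as merely "Concerning".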

Short Name: HiveServer2 Health

Property Name: Healthy HiveServer2 Monitoring Thresholds
Description: The health test thresholds of the overall HiveServer2 health. The check returns "Concerning" health if the percentage of "Healthy" HiveServer2s falls below the warning threshold. The check is unhealthy if the total percentage of "Healthy" and "Concerning" HiveServer2s falls below the critical threshold.
Template Name: hive_llap_hiveserver2s_healthy_thresholds
Default Value: critical:51.0, warning:99.0
Unit: PERCENT
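To see what the default thresholds mean for a given deployment size, the percentages can be converted into minimum role counts. The following is a rough sketch under the assumption that a threshold is met when the percentage is not below it; the helper `min_roles_required` is hypothetical and is not part of any product API.

```python
import math
from fractions import Fraction


def min_roles_required(total, threshold_pct):
    """Smallest role count whose share of `total` is not below threshold_pct.

    Hypothetical helper; Fraction avoids floating-point rounding
    before the ceiling is taken.
    """
    return math.ceil(Fraction(str(threshold_pct)) * total / 100)


# Default thresholds listed for this property.
WARNING_PCT = 99.0
CRITICAL_PCT = 51.0

for total in (2, 3, 100):
    need_good = min_roles_required(total, WARNING_PCT)
    need_ok = min_roles_required(total, CRITICAL_PCT)
    print(f"{total} roles: {need_good} must be Good to report Good; "
          f"{need_ok} must be Good or Concerning to avoid Bad")
```

With the defaults, a three-role deployment needs all 3 roles "Good" to report "Good" health, and at least 2 roles "Good" or "Concerning" to avoid "Bad" health.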

Hive on LLAP LLAP Proxy Health

This is a Hive on LLAP service-level health test that checks that enough of the LLAP Proxies in the cluster are healthy. The test returns "Concerning" health if the number of healthy LLAP Proxies falls below a warning threshold, expressed as a percentage of the total number of LLAP Proxies. The test returns "Bad" health if the number of healthy and "Concerning" LLAP Proxies falls below a critical threshold, expressed as a percentage of the total number of LLAP Proxies.

For example, if this test is configured with a warning threshold of 95% and a critical threshold of 90% for a cluster of 100 LLAP Proxies, this test would return "Good" health if 95 or more LLAP Proxies have "Good" health. This test would return "Concerning" health if fewer than 95 LLAP Proxies have "Good" health but at least 90 LLAP Proxies have either "Good" or "Concerning" health. If more than 10 LLAP Proxies have "Bad" health, this test would return "Bad" health.

A failure of this health test indicates unhealthy LLAP Proxies. Check the status of the individual LLAP Proxies for more information. This test can be configured using the Healthy LLAP Proxy Monitoring Thresholds Hive on LLAP service-wide monitoring setting.

Short Name: LLAP Proxy Health

Property Name: Healthy LLAP Proxy Monitoring Thresholds
Description: The health test thresholds of the overall LLAP Proxy health. The check returns "Concerning" health if the percentage of "Healthy" LLAP Proxies falls below the warning threshold. The check is unhealthy if the total percentage of "Healthy" and "Concerning" LLAP Proxies falls below the critical threshold.
Template Name: hive_llap_llapproxys_healthy_thresholds
Default Value: critical:51.0, warning:99.0
Unit: PERCENT