This alert is configured for all nodes in the Hadoop cluster (Hadoop master and slave
nodes) as well as the Nagios and Ganglia monitoring servers. By default, it uses the Nagios
plugin check_ping
to find the average round trip response (RTT) time and the
packet loss percentage by pinging each cluster node.
This alert helps Ambari Web determine the number of cluster nodes that are up and down at a given time. A network outage may also result in a host down alert.
Note | |
---|---|
The |
The host is actually down
There is a network outage and the Nagios server cannot access the host