5.4.1. JobTracker process down alert

This alert is triggered if the JobTracker process cannot be confirmed to be up and listening on the network for the configured critical threshold, given in seconds. It uses the Nagios check_tcp plugin.

Potential causes
  • The JobTracker daemon is down, for example because of OutOfMemory errors or a misconfiguration of JobTracker. Using CapacityScheduler should usually prevent this issue from occurring.

  • There are hardware problems on the JobTracker host

Possible remedies
  • Log in to the JobTracker machine and verify that the JobTracker daemon is not running

  • Check the logs for errors

  • Check the status of the host itself

  • Restart JobTracker
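
Before or after restarting, it can help to reproduce by hand the kind of test the alert performs. The following is a minimal Python sketch of a TCP connect check in the spirit of check_tcp; it is not the plugin itself. The function names, the default timeout, and the host and port in the usage note are assumptions for illustration and should be replaced with the values from your own cluster configuration.

```python
import socket


def tcp_port_is_open(host: str, port: int, timeout_seconds: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout_seconds):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


def nagios_style_check(host: str, port: int, timeout_seconds: float = 5.0):
    """Return (exit_code, message) using the Nagios convention 0 = OK, 2 = CRITICAL."""
    if tcp_port_is_open(host, port, timeout_seconds):
        return 0, f"TCP OK - {host}:{port} is accepting connections"
    return 2, f"TCP CRITICAL - cannot connect to {host}:{port}"
```

For example, `nagios_style_check("jobtracker.example.com", 50030)` would probe a hypothetical JobTracker host on an assumed port; use the address and port that your JobTracker is actually configured to listen on. A CRITICAL result from the monitoring host combined with a running daemon on the JobTracker machine points toward a network or host problem rather than a crashed process.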
