5.3.4. ResourceManager RPC latency

This host-level alert is triggered if the ResourceManager operations RPC latency exceeds the configured critical threshold. Typically an increase in the RPC processing time increases the RPC queue length, causing the average queue wait time to increase for ResourceManager operations. It uses the Nagios check_rpcq_latency plugin.

 5.3.4.1. Potential causes
  • A job or an application is performing too many ResourceManager operations.

 5.3.4.2. Possible remedies
  • Review the job or the application for potential bugs causing it to perform too many ResourceManager operations.


loading table of contents...