User Guide
Also available as:
PDF

MapReduce & Tez Dashboard

The MapReduce & Tez Dashboard was created to provide key information for workloads that use MapReduce or Tez for execution.

This dashboard includes the following paragraphs:

• Top N Longest Running Jobs

• Top N Resource Intensive Jobs

• Top N Resource Wasting Jobs

• Job Distribution By Type

• Top N Data IO Users

• CPU Usage By Queue

• Job Submission Trend By Day.Hour

Most of these paragraphs have titles that are self-explanatory. A few of them are described below to provide more context:

ParagraphDescription
Top N Resource Wasting Jobs

Resource wasting is calculated by calculating the difference between the memory asked for and the memory that was actually used.

For example, if a job asks for 100 8GB containers but only uses 5GB per container, 3GB per container is considered wasted. This is calculated per job, and the top 10 are listed.

Job Submission Trend By Day.Hour

This paragraph shows the number of jobs submitted by day and hour with the notation being <day>.<hour>. For example:

• Monday.1 - 1am on Monday

• Monday.20 - 8pm on Monday

The goal of this dashboard is to identify specific job submission hotspots during the week and day. You can use this information to identify the best time to schedule resource intensive jobs to execute.