Box plot

Cloudera Data Visualization enables you to create Box Plot visuals.

A Box Plot visual displays the distribution of data values. The box represents the bulk of the data (25th percentile through 75th percentile), and whisker lines represent statistical minimum and maximum values.

The following example uses the 2012 and 2013 API Scores for California Schools. The School Performance dataset is used, which is built on a datafile that contains the California School APIs from 2012 and 2013. The range of possible scores varies from a low of 200 to a high of 1000. The statewide API performance target for all schools is 800.

For an overview of shelves that specify this visual, see Shelves for box plots.

  1. Start a new visual based on the NYC Taxi Data dataset.
    For instructions, see Creating a visual.
  2. In the VISUALS menu, find and click Box Plot.
    The shelves of the visual changed. They are now Dimension, Measure, X Trellis, Y Trellis, and Filters.

    The mandatory shelves are:

    • Dimension: you can specify one dimension for grouping the box plot
    • Measure: you can specify the measurement for box plot calculations
  3. Populate the shelves:
    1. Add the Dname (district name) field to the Dimension shelf.
    2. Add the Api13 (API obtained in 2013)field to the Measure shelf.
    3. Add the Cname (county name) field to the Filters shelf, and select a single value. In this example, Santa Clara is used.
    4. Add the Stype (school type) field to the Filters shelf, and select the value H, for Highschool.
  4. Optional: Alias the shelves.
  5. Click REFRESH VISUAL.

    The new Box Plot visual appears.

  6. Hovering over a particular box plot distribution rectangle opens a Tooltip modal that contains the statistical measures of that grouping, including the minimum, the first quartile, the median/second quartile, the third quartile, and the maximum.
  7. Click the pencil/edit icon next to the title of the visualization to enter a name for the visual.

    In this example, the title is changed to 'High School Performance - Box Plot'. You can also add a brief description of the visual as a subtitle below the title of the visualization.

  8. At the top left corner of the Dashboard Designer, click SAVE.

If you wish to generate a horizontal Box Plot visual, see the instructions in Making box plots horizontal.

To adjust the Box Plot display, check all the available settings for this visual.