Sign in to save

Bookmark this page so you can find it later.

Sign in to save

Bookmark this page so you can find it later.

A five-number summary describes a data set using the minimum, first quartile, median, third quartile, and maximum. This cheat sheet helps students organize data, find quartiles, and build box plots accurately. These skills are useful for comparing groups, spotting spread, and identifying unusual values in real-world data. The most important ideas are position, center, spread, and outliers. The median divides ordered data into two halves, while quartiles divide data into four parts. The interquartile range measures the spread of the middle half of the data using IQR=Q3Q1IQR = Q_3 - Q_1. A box plot shows these values visually using a box, a median line, whiskers, and sometimes separate outlier points.

Key Facts

  • A five-number summary is written as min,Q1,Q2,Q3,max\min, Q_1, Q_2, Q_3, \max, where Q2Q_2 is the median.
  • The range of a data set is range=maxmin\text{range} = \max - \min.
  • The interquartile range is IQR=Q3Q1IQR = Q_3 - Q_1, which measures the spread of the middle 50%50\% of the data.
  • The lower outlier fence is Q11.5(IQR)Q_1 - 1.5(IQR), and values below it are possible outliers.
  • The upper outlier fence is Q3+1.5(IQR)Q_3 + 1.5(IQR), and values above it are possible outliers.
  • In a box plot, the box runs from Q1Q_1 to Q3Q_3, and the line inside the box marks the median Q2Q_2.
  • For a modified box plot, whiskers extend to the smallest and largest non-outlier values, not necessarily to the minimum and maximum.
  • A longer box or whisker means the data are more spread out over that part of the distribution.

Vocabulary

Five-number summary
A summary of a data set using the minimum, first quartile, median, third quartile, and maximum.
Median
The middle value of an ordered data set, also called Q2Q_2.
Quartile
A value that divides ordered data into four parts with about 25%25\% of the data in each part.
Interquartile range
The spread of the middle half of the data, found by IQR=Q3Q1IQR = Q_3 - Q_1.
Box plot
A graph that displays the five-number summary with a box from Q1Q_1 to Q3Q_3, a median line, and whiskers.
Outlier
A data value that is unusually far from the rest of the data, often checked using the 1.5(IQR)1.5(IQR) rule.

Common Mistakes to Avoid

  • Not ordering the data first, which is wrong because the median and quartiles must be found from values arranged from least to greatest.
  • Including the median in both halves when the method says to exclude it, which can change Q1Q_1 and Q3Q_3 for data sets with an odd number of values.
  • Using maxmin\max - \min when asked for IQRIQR, which is wrong because IQRIQR measures only the middle 50%50\% of the data.
  • Drawing whiskers to outliers on a modified box plot, which is wrong because outliers should be plotted as separate points.
  • Thinking a box plot shows every data value, which is wrong because it summarizes position and spread rather than listing each value.

Practice Questions

  1. 1 Find the five-number summary for the data set 4,7,9,10,12,15,18,20,214, 7, 9, 10, 12, 15, 18, 20, 21.
  2. 2 For a data set with Q1=12Q_1 = 12 and Q3=28Q_3 = 28, find IQRIQR, the lower fence, and the upper fence.
  3. 3 A modified box plot has Q1=35Q_1 = 35, median =42= 42, Q3=50Q_3 = 50, minimum non-outlier =30= 30, and maximum non-outlier =63= 63. Describe the box, median line, and whiskers.
  4. 4 Two box plots have the same median, but one has a much larger IQRIQR. Explain what this means about the two data sets.