Skip to content
Mathematics · Year 11 · Data Interpretation and Statistics · Spring Term

Box Plots and Interquartile Range

Students will construct and interpret box plots to compare distributions and identify outliers using the interquartile range.

National Curriculum Attainment TargetsGCSE: Mathematics - Statistics

About This Topic

Box plots provide a visual summary of data distributions using the five-number summary: minimum, lower quartile (Q1), median, upper quartile (Q3), and maximum. Year 11 students construct box plots from raw datasets, calculate the interquartile range (IQR = Q3 - Q1) to measure spread, and identify outliers as values beyond 1.5 times the IQR from the quartiles. They compare box plots of two distributions, such as exam scores from different classes, to discuss central tendency, spread, and skewness.

This topic aligns with GCSE Statistics requirements for data interpretation. Students justify using the median over the mean for skewed data, as it resists extreme values, and distinguish IQR from range, which ignores data clustering. These skills support bivariate data analysis and hypothesis testing later in the course.

Active learning suits box plots because students manipulate real-world datasets, like sports scores or heights, to build plots collaboratively. Physical plotting on large charts or digital tools reveals patterns intuitively, while group comparisons foster discussions that clarify interpretations and build confidence in drawing statistical conclusions.

Key Questions

  1. Justify why the median is often a better measure of central tendency than the mean for skewed data.
  2. Differentiate between the range and the interquartile range as measures of spread.
  3. Compare two datasets using box plots, drawing conclusions about their distributions.

Learning Objectives

  • Calculate the interquartile range (IQR) for a given dataset.
  • Construct a box plot accurately from a five-number summary.
  • Compare and contrast the central tendency and spread of two datasets using their box plots.
  • Identify potential outliers in a dataset using the 1.5 * IQR rule.
  • Explain why the median is a more appropriate measure of central tendency than the mean for skewed data.

Before You Start

Calculating Mean, Median, and Mode

Why: Students need to be familiar with calculating the median and understanding measures of central tendency before comparing it to the mean for skewed data.

Ordering Data and Finding the Range

Why: Students must be able to order data and find the range to understand how the IQR is a more refined measure of spread.

Understanding Percentiles and Quartiles

Why: A foundational understanding of how data is divided into parts is necessary for calculating and interpreting quartiles.

Key Vocabulary

Interquartile Range (IQR)The difference between the upper quartile (Q3) and the lower quartile (Q1) of a dataset, representing the spread of the middle 50% of the data.
Box PlotA graphical representation of data that displays the five-number summary: minimum, Q1, median, Q3, and maximum.
MedianThe middle value in a dataset when the data is ordered from least to greatest. It divides the data into two equal halves.
QuartilesValues that divide a dataset into four equal parts. Q1 is the median of the lower half, and Q3 is the median of the upper half.
OutlierA data point that is significantly different from other observations in the dataset, often identified using the IQR rule.

Watch Out for These Misconceptions

Common MisconceptionThe box plot shows the shape of the full distribution like a histogram.

What to Teach Instead

Box plots summarise key values but hide frequency details within quartiles. Hands-on construction from ordered data helps students see it as a compact overview, while comparing to histograms in pairs clarifies limitations.

Common MisconceptionOutliers are always data errors to ignore.

What to Teach Instead

Outliers can be valid extremes; the IQR rule flags them for investigation. Group debates on real datasets, like exam scores, teach students to consider context before dismissal, building critical thinking.

Common MisconceptionIQR equals the range.

What to Teach Instead

IQR measures middle 50% spread, ignoring extremes, unlike full range. Plotting both on charts collaboratively shows how IQR better represents typical variation, especially in skewed data.

Active Learning Ideas

See all activities

Real-World Connections

  • Sports analysts use box plots to compare player statistics, such as batting averages or points scored per game, across different seasons or teams to identify trends and performance differences.
  • Financial advisors might use box plots to visualize the distribution of investment returns for different portfolio types, helping clients understand potential risks and rewards.
  • Medical researchers can use box plots to compare the effectiveness of different treatments by visualizing the spread of patient recovery times or symptom severity.

Assessment Ideas

Quick Check

Provide students with a dataset of exam scores. Ask them to calculate the five-number summary and then construct a box plot. Check for accuracy in calculations and plot construction.

Discussion Prompt

Present two box plots comparing the heights of Year 11 boys and girls. Ask students: 'Which group has a greater spread in heights? How do the medians compare? Based on these box plots, what can you conclude about the typical height of a boy versus a girl in this sample?'

Exit Ticket

Give students a small dataset. Ask them to calculate the IQR and identify any potential outliers. Then, ask them to write one sentence explaining why the median might be a better measure of central tendency for this specific dataset if it were skewed.

Frequently Asked Questions

Why is the median better than the mean for skewed data in box plots?
Skewed data has outliers that pull the mean toward extremes, distorting central tendency. The median splits data evenly, providing a robust measure. Students grasp this by plotting skewed sets like incomes, seeing the mean fall outside the box while median anchors it centrally. This visual contrast reinforces GCSE justification skills.
How do you calculate and use the interquartile range?
Order the data, find Q1 (median of lower half) and Q3 (median of upper half), then IQR = Q3 - Q1. Use it to assess spread of middle data and flag outliers (below Q1 - 1.5 IQR or above Q3 + 1.5 IQR). Comparing IQRs across box plots reveals distribution differences clearly for GCSE comparisons.
How can active learning help students understand box plots?
Active tasks like pair-plotting real datasets make quartiles tangible as students handle ordered lists. Group comparisons spark discussions on skewness and outliers, while whole-class builds on boards visualise IQR robustness. These approaches turn abstract stats into collaborative problem-solving, boosting retention and application for exams.
How to compare two datasets using box plots?
Align box plots for overlap in medians, boxes (IQR), and whiskers. Greater box length means wider spread; higher median shows greater central value. For GCSE, students draw conclusions like 'Dataset A is more consistent but lower achieving,' supported by evidence from plots and calculations.

Planning templates for Mathematics