Box Plots and Interquartile Range
Students will construct and interpret box plots to compare distributions and identify outliers using the interquartile range.
About This Topic
Box plots provide a visual summary of data distributions using the five-number summary: minimum, lower quartile (Q1), median, upper quartile (Q3), and maximum. Year 11 students construct box plots from raw datasets, calculate the interquartile range (IQR = Q3 - Q1) to measure spread, and identify outliers as values beyond 1.5 times the IQR from the quartiles. They compare box plots of two distributions, such as exam scores from different classes, to discuss central tendency, spread, and skewness.
This topic aligns with GCSE Statistics requirements for data interpretation. Students justify using the median over the mean for skewed data, as it resists extreme values, and distinguish IQR from range, which ignores data clustering. These skills support bivariate data analysis and hypothesis testing later in the course.
Active learning suits box plots because students manipulate real-world datasets, like sports scores or heights, to build plots collaboratively. Physical plotting on large charts or digital tools reveals patterns intuitively, while group comparisons foster discussions that clarify interpretations and build confidence in drawing statistical conclusions.
Key Questions
- Justify why the median is often a better measure of central tendency than the mean for skewed data.
- Differentiate between the range and the interquartile range as measures of spread.
- Compare two datasets using box plots, drawing conclusions about their distributions.
Learning Objectives
- Calculate the interquartile range (IQR) for a given dataset.
- Construct a box plot accurately from a five-number summary.
- Compare and contrast the central tendency and spread of two datasets using their box plots.
- Identify potential outliers in a dataset using the 1.5 * IQR rule.
- Explain why the median is a more appropriate measure of central tendency than the mean for skewed data.
Before You Start
Why: Students need to be familiar with calculating the median and understanding measures of central tendency before comparing it to the mean for skewed data.
Why: Students must be able to order data and find the range to understand how the IQR is a more refined measure of spread.
Why: A foundational understanding of how data is divided into parts is necessary for calculating and interpreting quartiles.
Key Vocabulary
| Interquartile Range (IQR) | The difference between the upper quartile (Q3) and the lower quartile (Q1) of a dataset, representing the spread of the middle 50% of the data. |
| Box Plot | A graphical representation of data that displays the five-number summary: minimum, Q1, median, Q3, and maximum. |
| Median | The middle value in a dataset when the data is ordered from least to greatest. It divides the data into two equal halves. |
| Quartiles | Values that divide a dataset into four equal parts. Q1 is the median of the lower half, and Q3 is the median of the upper half. |
| Outlier | A data point that is significantly different from other observations in the dataset, often identified using the IQR rule. |
Watch Out for These Misconceptions
Common MisconceptionThe box plot shows the shape of the full distribution like a histogram.
What to Teach Instead
Box plots summarise key values but hide frequency details within quartiles. Hands-on construction from ordered data helps students see it as a compact overview, while comparing to histograms in pairs clarifies limitations.
Common MisconceptionOutliers are always data errors to ignore.
What to Teach Instead
Outliers can be valid extremes; the IQR rule flags them for investigation. Group debates on real datasets, like exam scores, teach students to consider context before dismissal, building critical thinking.
Common MisconceptionIQR equals the range.
What to Teach Instead
IQR measures middle 50% spread, ignoring extremes, unlike full range. Plotting both on charts collaboratively shows how IQR better represents typical variation, especially in skewed data.
Active Learning Ideas
See all activitiesPair Plotting: Class Test Scores
Provide pairs with two lists of 20 test scores each. They order data, find quartiles and median, then sketch box plots side-by-side. Pairs note differences in spread and outliers before sharing with the class.
Small Group Comparison: Athlete Performances
Give small groups datasets on 100m sprint times for two teams. Groups construct box plots, calculate IQRs, and discuss which team has greater consistency. They present findings using key questions on a whiteboard.
Whole Class Outlier Hunt: Real Data Challenge
Display a large dataset on the board, such as household incomes. Class votes on potential outliers, then constructs a box plot together to verify using IQR rule. Discuss impacts on mean versus median.
Individual Interpretation: Skewed Distributions
Students receive printed box plots of skewed data like incomes or ages. Individually, they justify median use over mean and compare spreads. Follow with pair shares to refine explanations.
Real-World Connections
- Sports analysts use box plots to compare player statistics, such as batting averages or points scored per game, across different seasons or teams to identify trends and performance differences.
- Financial advisors might use box plots to visualize the distribution of investment returns for different portfolio types, helping clients understand potential risks and rewards.
- Medical researchers can use box plots to compare the effectiveness of different treatments by visualizing the spread of patient recovery times or symptom severity.
Assessment Ideas
Provide students with a dataset of exam scores. Ask them to calculate the five-number summary and then construct a box plot. Check for accuracy in calculations and plot construction.
Present two box plots comparing the heights of Year 11 boys and girls. Ask students: 'Which group has a greater spread in heights? How do the medians compare? Based on these box plots, what can you conclude about the typical height of a boy versus a girl in this sample?'
Give students a small dataset. Ask them to calculate the IQR and identify any potential outliers. Then, ask them to write one sentence explaining why the median might be a better measure of central tendency for this specific dataset if it were skewed.
Frequently Asked Questions
Why is the median better than the mean for skewed data in box plots?
How do you calculate and use the interquartile range?
How can active learning help students understand box plots?
How to compare two datasets using box plots?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Data Interpretation and Statistics
Cumulative Frequency Graphs
Students will construct and interpret cumulative frequency graphs to estimate medians and quartiles.
2 methodologies
Histograms with Equal Class Widths
Students will construct and interpret histograms for continuous data with equal class intervals.
2 methodologies
Histograms with Unequal Class Widths
Students will construct and interpret histograms where frequency density is used to represent data with unequal class intervals.
2 methodologies
Scatter Graphs and Correlation
Students will plot and interpret scatter graphs, identifying types of correlation and drawing lines of best fit.
2 methodologies