Box Plots for Comparing Data
Students will construct and interpret box plots to visually compare the distribution and spread of two or more data sets.
About This Topic
Box plots offer a clear method to summarise data distributions and compare multiple sets visually. Year 9 students calculate the median, lower quartile, upper quartile, and range from ordered data lists, then plot these on scales to show spread, centre, and potential outliers. They compare box plots side-by-side, for example, to analyse reaction times across age groups or exam scores between classes, addressing key questions on spread, quartiles, and skewness detection.
This fits KS3 Statistics in the Data Interpretation and Probability unit, extending prior mean and range work to robust summary statistics. Students build skills in precise calculation, graphical representation, and critique, preparing for GCSE data analysis tasks like interpreting real-world datasets from surveys or experiments.
Active learning suits box plots well. When students collect their own class data via quick surveys, compute quartiles collaboratively, and plot then debate comparisons in small groups, they grasp summaries' power firsthand. Peer review of plots catches errors early and deepens understanding of limitations, such as masking multimodal data.
Key Questions
- How can we use box plots to visually compare the spread of two different groups?
- Analyze what the median and quartiles represent on a box plot.
- Critique the effectiveness of box plots for identifying skewness in data.
Learning Objectives
- Calculate the median, lower quartile, and upper quartile for two or more data sets.
- Construct comparative box plots accurately on a numerical scale.
- Compare the spread and central tendency of two or more data sets using their box plots.
- Analyze the visual representation of quartiles and the interquartile range on a box plot.
- Critique the suitability of box plots for representing skewed data distributions.
Before You Start
Why: Students need to be able to find the median of a data set before they can calculate the median and quartiles for a box plot.
Why: Box plots require data to be ordered from least to greatest, a fundamental skill for calculating quartiles.
Why: The concept of range as the difference between the maximum and minimum values is a precursor to understanding spread in box plots.
Key Vocabulary
| Median | The middle value in an ordered data set. If there is an even number of data points, it is the average of the two middle values. |
| Quartiles | Values that divide an ordered data set into four equal parts. The lower quartile (Q1) is the median of the lower half, and the upper quartile (Q3) is the median of the upper half. |
| Interquartile Range (IQR) | The difference between the upper quartile (Q3) and the lower quartile (Q1). It represents the spread of the middle 50% of the data. |
| Box Plot | A graphical representation of data that shows the median, quartiles, and range of a data set using a box and whiskers. |
Watch Out for These Misconceptions
Common MisconceptionThe box plot shows every data point in the set.
What to Teach Instead
Box plots summarise quartiles and extremes, not individual values; the box captures 50% of data from Q1 to Q3. Hands-on plotting from raw data in pairs helps students see how many points cluster inside the box, clarifying the summary nature through visual construction.
Common MisconceptionThe median on a box plot is the same as the mean.
What to Teach Instead
Median is the middle value in ordered data, less affected by outliers than the mean. Group critiques of skewed data sets reveal this difference, as students compare calculated mean and median lines overlaid on plots, building robust statistic awareness.
Common MisconceptionWhiskers always extend to the absolute minimum and maximum.
What to Teach Instead
Whiskers reach the smallest and largest values excluding outliers, defined as beyond 1.5 times the interquartile range. Station activities with outlier spotting let groups debate and adjust plots collaboratively, reinforcing rules through trial and error.
Active Learning Ideas
See all activitiesPairs Survey: Heights Box Plots
Pairs survey 20 classmates' heights, order data, calculate median and quartiles, then draw box plots for boys and girls on shared graph paper. They note differences in spread and centre, presenting one key comparison to the class. Extend by identifying any outliers.
Small Groups Stations: Sports Data Comparison
Prepare four data sets on reaction times or jump heights from sports. Groups rotate through stations every 10 minutes: calculate stats, plot box plot, compare to another set, and critique skewness. Record findings on a group sheet.
Whole Class Critique: Exam Scores Challenge
Display three pairs of box plots from mock exam data on the board. Class discusses in a think-pair-share: which shows greater spread, evidence of skewness, and why one set might mislead. Vote and justify best comparison method.
Individual Plotting: Weather Data
Provide rainfall data for two UK cities over a month. Students independently order data, compute quartiles, plot box plots, and write two sentences comparing variability. Share digitally for class gallery walk.
Real-World Connections
- Sports analysts use box plots to compare the performance statistics of different players or teams, such as comparing the number of goals scored by strikers in the Premier League versus La Liga.
- Market researchers might use box plots to compare customer satisfaction scores for two different product versions, identifying which product has a more consistent positive reception.
- In education, teachers can use box plots to compare exam results between different classes or teaching methods, visualizing differences in score distribution and spread.
Assessment Ideas
Provide students with two small, ordered data sets (e.g., heights of students in two different Year 9 classes). Ask them to calculate the median, Q1, and Q3 for each set and write down the IQR for both. Check their calculations for accuracy.
Present students with two side-by-side box plots representing, for example, the daily temperatures in two cities over a month. Ask: 'Which city had a higher average temperature? Which city had more variable temperatures? Explain your reasoning using the box plot features.'
Give students a box plot that is clearly skewed. Ask them to write one sentence describing the skewness and one sentence explaining a limitation of using box plots for this type of data.
Frequently Asked Questions
How do you teach constructing box plots in Year 9?
What do box plots reveal about data skewness?
How can active learning help teach box plots?
Real-world examples of comparing data with box plots?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Data Interpretation and Probability
Scatter Graphs and Correlation
Students will construct and interpret scatter graphs, identifying types of correlation and drawing lines of best fit.
2 methodologies
Interpolation and Extrapolation
Students will use lines of best fit to make predictions, distinguishing between interpolation and extrapolation and understanding their reliability.
2 methodologies
Probability Basics: Mutually Exclusive Events
Students will calculate probabilities of single events and understand the concept of mutually exclusive events.
2 methodologies
Tree Diagrams for Independent Events
Students will use tree diagrams to represent and calculate probabilities of combined independent events.
2 methodologies
Tree Diagrams for Dependent Events
Students will use tree diagrams to represent and calculate probabilities of combined dependent events (without replacement).
2 methodologies
Venn Diagrams for Probability
Students will use Venn diagrams to represent sets and calculate probabilities of events, including 'and' and 'or' conditions.
2 methodologies