Skip to content
Mathematics · Year 9 · Statistics and Probability · Term 4

Comparing Data Distributions

Students will compare the distributions of two or more data sets using measures of central tendency, spread, and appropriate graphical representations (e.g., back-to-back stem-and-leaf plots, parallel box plots).

ACARA Content DescriptionsAC9M9ST02

About This Topic

Comparing data distributions equips students to analyze two or more data sets using measures of central tendency like mean, median, and mode, alongside spread measures such as range and interquartile range. They create and interpret graphical representations including back-to-back stem-and-leaf plots and parallel box plots. For instance, students compare pulse rates after different exercises or exam scores across classes, addressing key questions on visual comparisons and representation effectiveness.

This topic aligns with AC9M9ST02 in the Statistics and Probability unit. It strengthens skills in data interpretation, variability assessment, and critiquing displays, preparing students for real-world applications like sports analytics or market research. Understanding that distributions reveal shape, center, and spread beyond single summary statistics builds statistical reasoning.

Active learning benefits this topic greatly. Students gather their own data through surveys or measurements, then construct plots collaboratively. Such hands-on work makes abstract measures concrete, encourages peer critique of graphs, and highlights how context influences choice of representation, leading to lasting comprehension.

Key Questions

  1. How can we use measures of center and spread to compare two different data sets?
  2. Explain how parallel box plots allow for a visual comparison of data distributions.
  3. Critique the effectiveness of different graphical representations for comparing data.

Learning Objectives

  • Calculate and compare the mean, median, mode, range, and interquartile range for two or more data sets.
  • Create and interpret back-to-back stem-and-leaf plots and parallel box plots to visually compare data distributions.
  • Explain how measures of center and spread contribute to the comparison of different data sets.
  • Critique the effectiveness of different graphical representations for comparing statistical data.
  • Analyze the shape, center, and spread of data distributions to draw conclusions about two or more groups.

Before You Start

Calculating Mean, Median, Mode, and Range

Why: Students need to be proficient in calculating these basic statistical measures before comparing them across data sets.

Constructing Stem-and-Leaf Plots

Why: Understanding how to build a basic stem-and-leaf plot is foundational for creating and interpreting back-to-back versions.

Understanding Data Variability

Why: Students should have a basic grasp of what variability means in a data set before exploring specific measures of spread.

Key Vocabulary

Measures of Central TendencyStatistical measures that identify the center or typical value of a data set, including mean, median, and mode.
Measures of SpreadStatistical measures that describe the variability or dispersion of data points, such as range and interquartile range (IQR).
Back-to-Back Stem-and-Leaf PlotA graphical display that compares two data sets with common stems, where leaves for one data set extend to the left and for the other to the right.
Parallel Box PlotA graphical display that shows multiple box plots side-by-side on the same axis, allowing for direct visual comparison of their distributions.
DistributionThe way data values are spread out or arranged, characterized by its shape, center, and spread.

Watch Out for These Misconceptions

Common MisconceptionThe mean always represents the center best.

What to Teach Instead

Outliers skew the mean, while median resists this; activities with student-generated data sets let pairs add outliers and re-plot, revealing through discussion why median suits skewed distributions like income data.

Common MisconceptionBox plots display every data point.

What to Teach Instead

Box plots summarize with quartiles and extremes; small group plotting tasks from raw data help students see the five-number summary captures shape without all points, clarified in peer reviews.

Common MisconceptionIdentical means mean identical distributions.

What to Teach Instead

Spread and shape differ; whole-class surveys followed by parallel plots demonstrate this, as groups compare overlapping boxes with varying ranges, fostering critique via shared findings.

Active Learning Ideas

See all activities

Real-World Connections

  • Sports analysts compare player statistics, such as batting averages or points per game, between two teams using parallel box plots to identify performance differences.
  • Market researchers analyze customer survey data from different demographics, using measures of central tendency and spread to understand purchasing habits and preferences.
  • Medical professionals compare the effectiveness of two different treatments by analyzing patient recovery times, using graphical displays to visualize differences in outcomes.

Assessment Ideas

Quick Check

Provide students with two small data sets (e.g., test scores from two classes). Ask them to calculate the mean, median, and range for each set and write one sentence comparing the typical performance and variability of the two classes.

Discussion Prompt

Present students with a scenario comparing the heights of Year 9 boys and girls. Ask: 'How would you use a parallel box plot to visually compare these two groups? What specific features of the box plots would you look for to determine which group is generally taller and which has more variation in height?'

Exit Ticket

Give students a back-to-back stem-and-leaf plot showing the number of minutes students spent on homework for two different subjects. Ask them to identify the median time spent on each subject and write one observation about the distribution of homework times.

Frequently Asked Questions

How do parallel box plots help compare data distributions?
Parallel box plots align boxes side by side, showing medians, quartiles, and outliers at a glance. Students quickly spot differences in center, spread, and symmetry. For example, comparing test scores reveals one class has higher variability. Practice with real data builds confidence in visual analysis over tables alone.
What real data sets work well for this topic?
Use pulse rates after jogging versus resting, hand span measurements by gender, or daily steps from fitness trackers. These connect to students' lives, motivate data collection, and produce varied distributions. Provide printed sets or let classes survey peers for ownership, ensuring enough data points for robust plots.
How can active learning improve understanding of data comparisons?
Active approaches like paired data collection and group graphing make measures tangible. Students measure, plot, and debate their own sets, such as reaction times, uncovering how outliers affect means firsthand. Collaborative critiques of peers' box plots reveal representation strengths, turning passive recall into deep insight on distributions.
Why critique graphical representations?
Different graphs highlight aspects like skewness or gaps uniquely; back-to-back stems show individual values well, while box plots emphasize summaries. Student-led evaluations, such as ranking graphs for sports data, teach context matters. This skill prevents misinterpretation in media or science reports.

Planning templates for Mathematics