Skip to content
Mathematics · Grade 9 · Data, Probability, and Decision Making · Term 3

Box Plots and Outliers

Students will construct and interpret box plots, identifying quartiles and potential outliers.

Ontario Curriculum ExpectationsCCSS.MATH.CONTENT.6.SP.B.4CCSS.MATH.CONTENT.HSS.ID.A.1

About This Topic

Box plots offer a compact way to display data distributions through the five-number summary: minimum, first quartile (Q1), median, third quartile (Q3), and maximum. In Grade 9, students construct box plots from real-world datasets, such as student heights or test scores, to identify quartiles and spot potential outliers. They interpret these plots to describe data spread, central tendency, and skewness, which supports informed decision making in data strands of the Ontario curriculum.

Students compare box plots to histograms, noting that box plots emphasize summary statistics while histograms reveal shape and frequency. They justify outlier detection using the interquartile range (IQR): values below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR. This process strengthens analytical skills for probability and statistics.

Active learning shines here because students manipulate their own data to build plots, revealing how choices affect interpretations. Collaborative comparisons of plots from varied datasets clarify nuances like outlier validity, making concepts stick through peer feedback and iteration.

Key Questions

  1. Interpret the five-number summary represented in a box plot.
  2. Differentiate between a box plot and a histogram in terms of information conveyed.
  3. Justify the method for identifying outliers using the IQR.

Learning Objectives

  • Calculate the five-number summary (minimum, Q1, median, Q3, maximum) for a given dataset.
  • Construct a box plot accurately from a calculated five-number summary.
  • Identify and classify potential outliers in a dataset using the 1.5 × IQR rule.
  • Compare and contrast the information conveyed by a box plot versus a histogram for a given dataset.
  • Justify the method used to identify outliers based on the interquartile range (IQR).

Before You Start

Measures of Central Tendency (Mean, Median, Mode)

Why: Students need to understand how to calculate the median and be familiar with measures of central tendency to grasp the concept of quartiles.

Measures of Spread (Range)

Why: Understanding the range helps students conceptualize data spread, which is further detailed by the IQR in box plots.

Data Organization and Representation (Histograms, Frequency Tables)

Why: Familiarity with representing data graphically, such as with histograms, provides a foundation for understanding different visualization methods like box plots.

Key Vocabulary

Five-Number SummaryA set of five key statistics that describe a dataset: minimum, first quartile (Q1), median, third quartile (Q3), and maximum.
Interquartile Range (IQR)The difference between the third quartile (Q3) and the first quartile (Q1) of a dataset, representing the spread of the middle 50% of the data.
OutlierA data point that is significantly different from other observations in the dataset, often identified using a specific rule based on the IQR.
Box PlotA graphical representation of the five-number summary, displaying the distribution of a dataset through quartiles and indicating potential outliers.

Watch Out for These Misconceptions

Common MisconceptionOutliers are always data errors to ignore.

What to Teach Instead

Outliers may represent valid extremes, like a star athlete's score. Active group debates on real datasets help students apply the IQR rule and consider context, shifting focus from deletion to investigation.

Common MisconceptionA box plot shows data frequencies like a histogram.

What to Teach Instead

Box plots summarize quartiles and extremes, not frequencies. Hands-on construction from raw data in pairs clarifies this, as students see histograms bunch values while box plots highlight spread.

Common MisconceptionThe median in a box plot is the data average.

What to Teach Instead

Median is the middle value, not mean. Comparing calculated medians and means in small groups with skewed data reveals distinctions, building precise vocabulary through shared examples.

Active Learning Ideas

See all activities

Real-World Connections

  • Sports analysts use box plots to compare player statistics across different seasons or teams, identifying performance ranges and potential outliers that might indicate exceptional or poor performance.
  • Financial advisors might use box plots to visualize the distribution of returns for different investment portfolios, helping clients understand risk and potential extreme outcomes.
  • Public health researchers use box plots to display the distribution of patient recovery times or disease prevalence across different regions, allowing for quick identification of unusual patterns or potential outliers.

Assessment Ideas

Exit Ticket

Provide students with a dataset. Ask them to calculate the five-number summary and the IQR. Then, have them identify any potential outliers using the 1.5 × IQR rule and write one sentence explaining their findings.

Discussion Prompt

Present students with two box plots representing different datasets (e.g., test scores from two classes). Ask: 'What can you conclude about the spread and central tendency of each dataset? What are the advantages of using box plots over histograms for this comparison?'

Quick Check

Show students a pre-made box plot with a few data points marked. Ask: 'Is this point (point A) likely an outlier? Justify your answer using the IQR rule and the information shown in the box plot.'

Frequently Asked Questions

How do you identify outliers in a box plot?
Use the IQR method: subtract Q1 from Q3 to find IQR, then flag values below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR. Students practice with ordered lists, plotting points to visualize fences. This rule balances sensitivity to extremes without over-flagging normal variation in datasets like exam scores.
What is the difference between a box plot and a histogram?
Box plots display the five-number summary to show center, spread, and outliers; histograms show frequency distribution and shape. Teaching both with the same data helps students see box plots for quick summaries and histograms for detailed patterns, essential for Ontario data expectations.
How can active learning help teach box plots?
Activities like pairs building plots from class surveys make quartiles tangible as students sort and measure data themselves. Group comparisons of plots versus histograms spark discussions on strengths, while outlier hunts encourage justifying IQR use. These methods boost retention by connecting abstract stats to familiar contexts, fostering data fluency.
Why use box plots for Grade 9 data analysis?
Box plots efficiently compare multiple datasets, revealing medians and spreads at a glance, which suits decision making in probability units. Students interpret real data like regional test scores, aligning with curriculum goals for visual summaries and outlier reasoning over lengthy lists.

Planning templates for Mathematics