Skip to content
Mathematics · 9th Grade · Statistical Reasoning and Data · Weeks 10-18

Shapes of Distributions

Identifying normal, skewed, and bimodal distributions and their implications.

Common Core State StandardsCCSS.Math.Content.HSS.ID.A.3CCSS.Math.Content.HSS.ID.A.4

About This Topic

Box plots (or box-and-whisker plots) provide a visual summary of a data set's distribution based on a five-number summary: minimum, first quartile (Q1), median, third quartile (Q3), and maximum. In 9th grade, students use these plots to compare different data sets and identify variability. This topic is central to the Common Core standards for summarizing and comparing data distributions.

Students learn to use the Interquartile Range (IQR) to measure the 'spread' of the middle 50% of the data, which is a more stable measure than the full range. This topic comes alive when students can create 'human box plots' where they physically stand in a line and divide themselves into quartiles, making the abstract concept of '25% of the data' a visible reality.

Key Questions

  1. Analyze what real-world phenomena typically follow a normal distribution.
  2. Explain how the tail of a distribution influences the mean.
  3. Justify why a bimodal distribution might suggest the presence of two different groups.

Learning Objectives

  • Classify given data sets as representing normal, skewed, or bimodal distributions based on their graphical representations.
  • Explain how the position of the mean relative to the median indicates the direction and severity of skew in a distribution.
  • Analyze the characteristics of a bimodal distribution to infer the potential presence of two distinct underlying groups within the data.
  • Compare the implications of a normal distribution versus a skewed distribution for making predictions about future data points.

Before You Start

Measures of Center: Mean and Median

Why: Students need to understand how to calculate and interpret the mean and median to analyze their relationship within different distribution shapes.

Data Visualization: Histograms

Why: Students must be able to read and interpret histograms to identify the visual patterns characteristic of normal, skewed, and bimodal distributions.

Key Vocabulary

Normal DistributionA symmetrical, bell-shaped distribution where data clusters around the mean, with most values close to the mean and fewer values farther away.
SkewnessA measure of the asymmetry of a probability distribution. A distribution can be skewed left (negative skew) or skewed right (positive skew).
Bimodal DistributionA distribution with two distinct peaks, suggesting that the data set may be composed of two separate groups or populations.
MeanThe average of a data set, calculated by summing all values and dividing by the number of values. It is sensitive to outliers and extreme values.
MedianThe middle value in a data set when the data is ordered from least to greatest. It is not affected by extreme values.

Watch Out for These Misconceptions

Common MisconceptionStudents often think a longer 'whisker' or a wider 'box' means there are more data points in that section.

What to Teach Instead

Use the 'Human Box Plot.' Peer discussion helps students realize that each of the four sections contains the SAME number of people; a wider section just means those people's values are more spread out.

Common MisconceptionConfusing the median (the line in the box) with the mean.

What to Teach Instead

Have students calculate both for a skewed data set. Collaborative analysis of the box plot shows that the median is a physical 'middle' of the sorted list, which may not be the same as the 'balance point' (mean).

Active Learning Ideas

See all activities

Real-World Connections

  • Height measurements for adult males in a large population typically follow a normal distribution. This allows clothing manufacturers to predict the most common sizes needed for production.
  • Test scores for a challenging exam might show a negatively skewed distribution, with most students scoring high but a few scoring very low. This suggests the test was generally manageable but had some difficult questions.
  • Customer satisfaction survey data could reveal a bimodal distribution if there are two distinct groups of customers: those highly satisfied and those highly dissatisfied, indicating potential issues with different aspects of a product or service.

Assessment Ideas

Quick Check

Provide students with three histograms, each representing a different distribution (normal, right-skewed, left-skewed). Ask them to label each histogram with the correct distribution type and briefly explain their reasoning, referencing the shape and the relative positions of the mean and median.

Exit Ticket

Present students with a scenario: 'A study found that the number of hours students in a school spent on homework per week had two peaks, one around 3 hours and another around 8 hours.' Ask students: 1. What type of distribution does this suggest? 2. What might this tell us about the student population?

Discussion Prompt

Pose the question: 'Imagine you are analyzing the salaries of employees at a company. If the distribution is heavily skewed to the right, what does this imply about the salary structure? How would this differ if the distribution were normal?' Facilitate a class discussion on the implications for understanding typical earnings.

Frequently Asked Questions

What is the 'five-number summary'?
It consists of the Minimum, the First Quartile (Q1), the Median (Q2), the Third Quartile (Q3), and the Maximum. These five values are the foundation for drawing any box plot.
How can active learning help students understand box plots?
Active learning strategies like 'Human Box Plots' clarify the most common confusion: that the size of the box relates to the number of data points. When students see that each 'quarter' of the rope holds the same number of classmates, regardless of how far apart they are standing, the concept of 'density' vs. 'spread' becomes much clearer.
How do you calculate the Interquartile Range (IQR)?
The IQR is calculated by subtracting the first quartile from the third quartile (Q3 - Q1). It represents the range of the middle 50% of the data and is used to identify outliers.
Why are box plots useful for comparing data?
They allow you to see the center, spread, and overall shape of multiple data sets side-by-side without getting bogged down in individual numbers. They make it easy to see which group is 'higher' or 'more consistent' overall.

Planning templates for Mathematics