Box Plots and Outliers
Students will construct and interpret box plots, identifying quartiles and potential outliers.
About This Topic
Box plots offer a compact way to display data distributions through the five-number summary: minimum, first quartile (Q1), median, third quartile (Q3), and maximum. In Grade 9, students construct box plots from real-world datasets, such as student heights or test scores, to identify quartiles and spot potential outliers. They interpret these plots to describe data spread, central tendency, and skewness, which supports informed decision making in data strands of the Ontario curriculum.
Students compare box plots to histograms, noting that box plots emphasize summary statistics while histograms reveal shape and frequency. They justify outlier detection using the interquartile range (IQR): values below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR. This process strengthens analytical skills for probability and statistics.
Active learning shines here because students manipulate their own data to build plots, revealing how choices affect interpretations. Collaborative comparisons of plots from varied datasets clarify nuances like outlier validity, making concepts stick through peer feedback and iteration.
Key Questions
- Interpret the five-number summary represented in a box plot.
- Differentiate between a box plot and a histogram in terms of information conveyed.
- Justify the method for identifying outliers using the IQR.
Learning Objectives
- Calculate the five-number summary (minimum, Q1, median, Q3, maximum) for a given dataset.
- Construct a box plot accurately from a calculated five-number summary.
- Identify and classify potential outliers in a dataset using the 1.5 × IQR rule.
- Compare and contrast the information conveyed by a box plot versus a histogram for a given dataset.
- Justify the method used to identify outliers based on the interquartile range (IQR).
Before You Start
Why: Students need to understand how to calculate the median and be familiar with measures of central tendency to grasp the concept of quartiles.
Why: Understanding the range helps students conceptualize data spread, which is further detailed by the IQR in box plots.
Why: Familiarity with representing data graphically, such as with histograms, provides a foundation for understanding different visualization methods like box plots.
Key Vocabulary
| Five-Number Summary | A set of five key statistics that describe a dataset: minimum, first quartile (Q1), median, third quartile (Q3), and maximum. |
| Interquartile Range (IQR) | The difference between the third quartile (Q3) and the first quartile (Q1) of a dataset, representing the spread of the middle 50% of the data. |
| Outlier | A data point that is significantly different from other observations in the dataset, often identified using a specific rule based on the IQR. |
| Box Plot | A graphical representation of the five-number summary, displaying the distribution of a dataset through quartiles and indicating potential outliers. |
Watch Out for These Misconceptions
Common MisconceptionOutliers are always data errors to ignore.
What to Teach Instead
Outliers may represent valid extremes, like a star athlete's score. Active group debates on real datasets help students apply the IQR rule and consider context, shifting focus from deletion to investigation.
Common MisconceptionA box plot shows data frequencies like a histogram.
What to Teach Instead
Box plots summarize quartiles and extremes, not frequencies. Hands-on construction from raw data in pairs clarifies this, as students see histograms bunch values while box plots highlight spread.
Common MisconceptionThe median in a box plot is the data average.
What to Teach Instead
Median is the middle value, not mean. Comparing calculated medians and means in small groups with skewed data reveals distinctions, building precise vocabulary through shared examples.
Active Learning Ideas
See all activitiesPairs: Class Data Box Plot Build
Pairs collect heights from 20 classmates, sort data, calculate quartiles and IQR, then sketch box plots on graph paper. They mark potential outliers and discuss impacts on the plot. Switch partners to verify calculations.
Small Groups: Histogram vs Box Plot Match
Provide datasets; groups create histograms and box plots side-by-side. They list three differences in conveyed information and present one example to the class. Use digital tools like Desmos for efficiency.
Whole Class: Outlier Investigation
Display multiple box plots on the board from sports stats. Class votes on outlier status using IQR rule, then debates if outliers are errors or extremes. Tally votes and refine rule application.
Individual: Interpretation Stations
Set up stations with box plots from different contexts. Students rotate, answering prompts on spread, median comparison, and outlier justification in journals. Debrief key insights as a group.
Real-World Connections
- Sports analysts use box plots to compare player statistics across different seasons or teams, identifying performance ranges and potential outliers that might indicate exceptional or poor performance.
- Financial advisors might use box plots to visualize the distribution of returns for different investment portfolios, helping clients understand risk and potential extreme outcomes.
- Public health researchers use box plots to display the distribution of patient recovery times or disease prevalence across different regions, allowing for quick identification of unusual patterns or potential outliers.
Assessment Ideas
Provide students with a dataset. Ask them to calculate the five-number summary and the IQR. Then, have them identify any potential outliers using the 1.5 × IQR rule and write one sentence explaining their findings.
Present students with two box plots representing different datasets (e.g., test scores from two classes). Ask: 'What can you conclude about the spread and central tendency of each dataset? What are the advantages of using box plots over histograms for this comparison?'
Show students a pre-made box plot with a few data points marked. Ask: 'Is this point (point A) likely an outlier? Justify your answer using the IQR rule and the information shown in the box plot.'
Frequently Asked Questions
How do you identify outliers in a box plot?
What is the difference between a box plot and a histogram?
How can active learning help teach box plots?
Why use box plots for Grade 9 data analysis?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Data, Probability, and Decision Making
Data Collection Methods
Students will explore different methods of data collection, including surveys, observations, and experiments.
2 methodologies
Sampling Techniques and Bias
Students will identify different sampling techniques and recognize potential sources of bias in data collection.
2 methodologies
Measures of Central Tendency
Students will calculate and interpret mean, median, and mode for various data sets.
2 methodologies
Measures of Spread
Students will calculate and interpret range, interquartile range (IQR), and standard deviation (introduction) to describe data variability.
2 methodologies
Frequency Distributions and Histograms
Students will construct and interpret frequency tables and histograms for numerical data.
2 methodologies
Scatter Plots and Correlation
Students will create and interpret scatter plots, identifying positive, negative, and no correlation.
2 methodologies