Interpreting Data Displays and Outliers
Students will interpret various data displays (histograms, box plots, stem-and-leaf plots) to describe data shape, identify outliers, and draw conclusions.
About This Topic
Year 9 students interpret data displays including histograms, box plots, and stem-and-leaf plots. They describe data shapes such as symmetry, skewness, clusters, and gaps, while identifying outliers that may influence statistical measures like mean, median, and range. Drawing conclusions from these displays builds data literacy for real-world applications, from sports statistics to environmental monitoring.
This topic supports AC9M9ST01 and AC9M9ST02 in the Australian Curriculum's Statistics and Probability strand. Students address key questions about histogram shapes revealing distributions, outlier effects on interpretations, and their importance in analysis. Practicing justification strengthens reasoning skills essential for higher mathematics and data-driven decisions.
Active learning suits this content well. Students who generate their own data sets, construct displays by hand or with software, and debate outlier status in pairs connect abstract ideas to concrete experiences. Group critiques of displays encourage precise vocabulary use and reveal how small changes alter conclusions, fostering deeper understanding and retention.
Key Questions
- What does the shape of a histogram tell us about the distribution of data?
- How do outliers affect the interpretation of data and statistical measures?
- Justify the importance of identifying outliers in a data set.
Learning Objectives
- Analyze the shape of histograms (e.g., symmetric, skewed left, skewed right) to describe the distribution of a data set.
- Compare and contrast data presented in histograms, box plots, and stem-and-leaf plots to identify similarities and differences in data spread and central tendency.
- Identify potential outliers in box plots and stem-and-leaf plots using established rules, such as the 1.5 IQR rule.
- Evaluate the impact of identified outliers on measures of central tendency (mean, median) and spread (range, IQR) for a given data set.
- Justify the significance of identifying and handling outliers when drawing conclusions from statistical data.
Before You Start
Why: Students need to be able to calculate these measures of central tendency to understand how outliers affect them.
Why: Knowledge of range and quartiles is essential for interpreting box plots and identifying outliers using the IQR rule.
Why: Familiarity with basic data visualization helps students grasp the concepts behind more complex displays like histograms and stem-and-leaf plots.
Key Vocabulary
| Outlier | A data point that is significantly different from other observations in a data set. Outliers can skew statistical results and require careful consideration. |
| Histogram | A bar graph representing the frequency distribution of numerical data. The bars represent ranges of data, and their height indicates the frequency within that range. |
| Box Plot (Box and Whisker Plot) | A standardized way of displaying the distribution of data based on a five-number summary: minimum, first quartile (Q1), median, third quartile (Q3), and maximum. It visually shows the spread and potential outliers. |
| Stem-and-Leaf Plot | A method of organizing data to show the shape of the distribution. It separates each data point into a 'stem' (the leading digit or digits) and a 'leaf' (the last digit). |
| Skewness | A measure of the asymmetry of a probability distribution of a real-valued random variable about its mean. A distribution can be skewed left, skewed right, or be symmetric. |
Watch Out for These Misconceptions
Common MisconceptionOutliers are always mistakes and should be removed.
What to Teach Instead
Outliers may represent valid extreme values, like a record rainfall. Students investigate context before deciding. Peer reviews in group activities help challenge assumptions and build habits of justification.
Common MisconceptionThe shape of a histogram only shows the average value.
What to Teach Instead
Shape indicates distribution features like spread and modality, not just central tendency. Hands-on sorting of physical data cards into bins clarifies this, as students see clusters form visually.
Common MisconceptionBox plots and histograms show the same information exactly.
What to Teach Instead
Box plots summarise five-number data and outliers, while histograms show frequency distribution. Comparing both on identical data in pairs highlights differences and complementary uses.
Active Learning Ideas
See all activitiesPair Sort: Histogram Shape Identification
Provide printed histograms of familiar data like test scores or heights. Pairs match each to descriptions of shape (symmetric, skewed right, bimodal). They then justify choices and create one histogram from class data using graphing tools.
Small Group: Outlier Investigation Stations
Set up stations with box plots from real Australian datasets (e.g., rainfall, AFL scores). Groups identify outliers, calculate affected measures before and after removal, and discuss validity. Rotate stations and share findings.
Whole Class: Stem-and-Leaf Plot Challenge
Collect class data on a quick survey (e.g., minutes spent on homework). Display as stem-and-leaf plot on board. Class votes on outliers, redraws plot, and compares measures. Discuss shape implications.
Individual: Data Display Redesign
Give students a messy dataset with outliers. They choose and create two displays (e.g., histogram and box plot), annotate shapes and outliers, then write a conclusion paragraph.
Real-World Connections
- Financial analysts examine stock market data, using box plots and histograms to identify unusual trading days (outliers) that might indicate market volatility or significant news events.
- Sports statisticians analyze player performance data, like batting averages or race times. They use stem-and-leaf plots and box plots to spot exceptional performances or injuries that deviate significantly from the norm.
- Medical researchers study patient recovery times after surgery. Histograms help visualize the spread of recovery periods, while identifying outliers can highlight patients with unusually fast or slow recoveries, prompting further investigation.
Assessment Ideas
Provide students with a small data set and a pre-drawn box plot. Ask them to: 1. Identify any potential outliers shown on the box plot. 2. Calculate the Interquartile Range (IQR). 3. Explain in one sentence how an outlier might affect the mean of this data set.
Display a histogram of student test scores. Ask students to write down two observations about the shape of the distribution (e.g., 'It looks symmetric', 'Most scores are clustered between 70 and 80'). Then, ask them to identify a possible score that might be considered an outlier and explain why.
Present two different data sets with similar medians but different ranges and outlier presence. Pose the question: 'If you had to choose one data set to represent typical student performance on a recent test, which would you choose and why? Consider the impact of outliers and the overall spread of the data.'
Frequently Asked Questions
How do outliers affect statistical measures in data displays?
What does the shape of a histogram tell us about data distribution?
How can active learning help students master interpreting data displays?
Why is identifying outliers important in Year 9 statistics?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Statistics and Probability
Collecting and Representing Data
Students will review methods of data collection and various ways to represent data, including frequency tables and histograms.
2 methodologies
Measures of Central Tendency (Mean, Median, Mode)
Students will calculate and interpret the mean, median, and mode for various data sets, understanding their strengths and weaknesses.
2 methodologies
Measures of Spread (Range, IQR)
Students will calculate and interpret the range and interquartile range (IQR) as measures of data spread.
2 methodologies
Five-Point Summary and Box Plots
Students will construct five-point summaries and draw box-and-whisker plots to visually represent and compare data distributions.
2 methodologies
Comparing Data Distributions
Students will compare the distributions of two or more data sets using measures of central tendency, spread, and appropriate graphical representations (e.g., back-to-back stem-and-leaf plots, parallel box plots).
2 methodologies
Introduction to Probability
Students will review basic probability concepts, including sample space, events, and calculating theoretical probability.
2 methodologies