Making Inferences from Samples
Using data from a random sample to draw inferences about a population with an unknown characteristic of interest.
About This Topic
Comparing data distributions in Grade 7 moves students from looking at single numbers to analyzing whole sets of information. The Ontario curriculum focuses on using measures of center (mean, median) and measures of spread (mean absolute deviation) to compare two different populations. This topic is essential for making evidence-based decisions, such as comparing the performance of two sports teams or the weather patterns in two different Canadian provinces.
Students learn that the 'average' doesn't tell the whole story; the variability or 'spread' of the data is just as important. They investigate how outliers can skew the mean and why the median is often a more reliable measure for things like income or house prices. By engaging in collaborative investigations, students learn to interpret data more holistically. This topic particularly benefits from hands-on, student-centered approaches where students can manipulate data points and see the immediate effect on the distribution.
Key Questions
- Explain how a sample can be used to make predictions about an entire population.
- Evaluate the reliability of an inference based on the sampling method used.
- Construct an argument for or against a claim based on sample data.
Learning Objectives
- Explain how a random sample can represent a larger population for a specific characteristic.
- Evaluate the reliability of inferences based on different sampling methods, such as convenience or random sampling.
- Construct an argument to support or refute a claim using data collected from a sample.
- Calculate the proportion of a characteristic in a sample and use it to predict the proportion in the population.
- Compare inferences made from multiple random samples of the same population.
Before You Start
Why: Students need to be able to gather and structure data before they can analyze it for inferences.
Why: Students should be familiar with various ways to display data (e.g., bar graphs, pictographs) to help visualize sample results.
Why: Understanding how to calculate a mean from a set of data is foundational for making predictions about a population's average characteristic.
Key Vocabulary
| Inference | A conclusion reached on the basis of evidence and reasoning, often about a population based on a sample. |
| Population | The entire group of individuals or objects that you want to know something about. |
| Sample | A subset of individuals or objects selected from a population to make inferences about the whole group. |
| Random Sample | A sample where every member of the population has an equal chance of being selected, which helps reduce bias. |
| Bias | A systematic error introduced into sampling or testing by selecting or encouraging one outcome or answer over others. |
Watch Out for These Misconceptions
Common MisconceptionThe mean is always the best 'average'.
What to Teach Instead
Students often default to the mean. Using a data set with a massive outlier (like 'salaries in a room where one person is a billionaire') helps them see during a group discussion that the median often gives a better 'typical' value.
Common MisconceptionIf two sets have the same mean, they are the same.
What to Teach Instead
Students may ignore the spread. Comparing two sets like {5, 5, 5} and {0, 5, 10}, both with a mean of 5, helps them realize that the 'variability' makes the two sets very different in practice.
Active Learning Ideas
See all activitiesInquiry Circle: Reaction Time Challenge
Students use an online tool to measure their reaction times. They compare the data distribution of the 'left-hand' group vs. the 'right-hand' group, calculating mean and median for both to see if there is a significant difference.
Think-Pair-Share: The Outlier Effect
Give students a set of 'test scores' where one score is a 0. They calculate the mean and median. Then, they 'remove' the 0 and recalculate. They discuss with a partner which measure of center changed more and why.
Stations Rotation: Weather Watchers
At different stations, students analyze temperature data for two Canadian cities (e.g., Victoria and Winnipeg). They must calculate the mean and the spread (range) to explain which city has more 'predictable' weather and why.
Real-World Connections
- Market researchers use random sampling to survey a small group of consumers to understand preferences for a new product, like a new flavour of ice cream, before launching it nationwide.
- Political pollsters select random samples of voters to predict election outcomes, using the sample results to estimate the overall voting intentions across the country.
- Quality control inspectors in manufacturing plants take random samples of products from an assembly line to assess the defect rate and make inferences about the quality of the entire production batch.
Assessment Ideas
Present students with a scenario: 'A school wants to know the favourite sport of its 500 students. They survey 50 students from the Grade 7 classes only.' Ask students to identify the population and the sample, and explain one potential source of bias in this sampling method.
Provide students with a small dataset from a sample (e.g., 20 coloured marbles drawn from a bag). Ask them to calculate the proportion of each colour in their sample and then write one sentence predicting the proportion of each colour in the full bag, explaining why their prediction is reasonable.
Pose the question: 'If you wanted to know the average height of all Grade 7 students in Ontario, would it be better to measure 100 students from your own school or 100 students randomly selected from across the province? Explain your reasoning, considering the concepts of sample and population.'
Frequently Asked Questions
What is the difference between mean and median?
What does 'variability' mean in data?
How can active learning help students compare data?
When should I use the median instead of the mean?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
From the Blog
How to Write a Lesson Plan: A 7-Step Guide for Teachers
Learn how to write a lesson plan in 7 clear steps, from setting objectives to post-lesson reflection, with practical examples for every grade level.
Active Learning Strategies That Actually Work in Middle School
Evidence-based active learning strategies for middle school classrooms, from think-pair-share to structured debate, with implementation tips from real teachers.
More in Data Analysis and Statistics
Sampling Strategies
Distinguishing between biased and representative samples to ensure valid conclusions.
2 methodologies
Measures of Center: Mean, Median, Mode
Calculating and interpreting mean, median, and mode for various data sets.
2 methodologies
Measures of Variability: Range & IQR
Understanding and calculating range and interquartile range to describe data spread.
2 methodologies
Comparing Data Distributions
Using mean, median, and mean absolute deviation to compare two different populations.
2 methodologies
Visualizing Data: Box Plots
Creating and interpreting box plots to identify trends and patterns, including quartiles and outliers.
2 methodologies
Visualizing Data: Histograms
Creating and interpreting histograms to display the distribution of numerical data.
2 methodologies