Random Sampling and Bias
Understanding that statistics can be used to gain information about a population by examining a sample.
About This Topic
Once students understand the population-sample distinction, they move to the question of how a sample should be selected. Random sampling is the standard method for avoiding bias because it gives every member of the population an equal chance of being included, removing the influence of researcher preference or self-selection. This topic connects directly to the broader CCSS standard of using statistics to gain information about a population.
Students explore different sampling scenarios and evaluate whether a given method produces a random sample. They examine why voluntary response surveys (where participants choose to respond) consistently skew toward people with strong opinions, and why convenience samples (selecting whoever is easiest to reach) often leave out important parts of the population. The concept of bias is examined not as intentional wrongdoing but as a structural feature of how data is collected.
Small sample sizes introduce another layer of complexity: even a well-designed random sample may produce unreliable inferences if too few people are sampled. Active learning activities that simulate multiple small samples and compare their results give students concrete experience with sampling variability before they formalize the idea.
Key Questions
- What makes a sample representative of a population?
- Why is random sampling the best way to avoid bias in data collection?
- How much confidence can we have in an inference made from a small sample size?
Learning Objectives
- Analyze given sampling methods to identify potential sources of bias.
- Compare the results of random samples to voluntary response samples for a given population question.
- Explain why random sampling is essential for making valid inferences about a population.
- Evaluate the impact of sample size on the reliability of statistical inferences.
- Design a simple random sampling plan for a specified scenario.
Before You Start
Why: Students need to be familiar with collecting and organizing data before they can analyze different sampling methods.
Why: Understanding the concept of equal chance is foundational to grasping the principles of random sampling.
Key Vocabulary
| Population | The entire group of individuals or objects that we want to study or draw conclusions about. |
| Sample | A subset of individuals or objects selected from a population to represent the entire group. |
| Random Sampling | A method of selecting a sample where every member of the population has an equal and independent chance of being chosen. |
| Bias | A systematic error introduced into sampling or testing by selecting or encouraging any one outcome or answer over others. |
| Sampling Variability | The natural variation in results that occurs when different samples are taken from the same population. |
Watch Out for These Misconceptions
Common MisconceptionAny sample drawn without purposeful selection is random.
What to Teach Instead
Random sampling requires a systematic process that gives every member an equal chance of selection. Grabbing the first 20 people in a hallway is convenience sampling, not random sampling. Simulation activities make this distinction visceral.
Common MisconceptionIf enough people respond to a voluntary survey, the results are reliable.
What to Teach Instead
High response volume doesn't correct for selection bias. Voluntary responders systematically differ from non-responders, meaning even thousands of responses can produce a biased picture. Historical examples like the Literary Digest 1936 poll help ground this.
Active Learning Ideas
See all activitiesSimulation Game: Bag of Colored Tiles
Each group receives a bag with an unknown proportion of two colors of tiles. They draw 5-tile samples repeatedly, record results, and predict the full bag composition. Groups compare predictions and then reveal the true proportion to discuss sample size effects.
Think-Pair-Share: Is This Sample Random?
Present five sampling methods for a school-wide survey. Students individually classify each as random or biased, then discuss with a partner. Whole-class discussion focuses on the two or three cases where partners disagreed.
Formal Debate: Should We Trust This Survey?
Share a 'news story' reporting survey results based on a clearly described sample. Groups are assigned to argue either that the sample supports the conclusion or that it doesn't. After 5 minutes of prep, groups debate while the class votes on the most convincing argument.
Real-World Connections
- Political pollsters use random sampling to survey likely voters across a district or state to predict election outcomes. They must ensure their sample accurately reflects the demographic makeup of the voting population to avoid biased results.
- Market researchers for companies like Nike or Apple use random sampling to gather feedback on new product ideas or advertising campaigns. This helps them understand consumer preferences without surveying every potential customer, saving time and resources.
- Scientists conducting environmental studies, such as tracking endangered species populations in a national park, employ random sampling techniques to estimate population sizes and health. This allows them to make informed conservation decisions based on representative data.
Assessment Ideas
Present students with three scenarios: a convenience sample (e.g., surveying friends), a voluntary response sample (e.g., an online poll), and a random sample (e.g., using a random number generator to select names from a class list). Ask students to identify which is which and explain one reason why the random sample is most likely to be unbiased.
Pose the question: 'Imagine you want to know the favorite lunch item of students in your entire school. If you only survey students in the cafeteria during 7th-grade lunch, what problems might arise?' Guide students to discuss potential biases related to convenience and self-selection.
Give students a scenario where a sample of 10 students is taken from a class of 30 to represent the class. Ask them to write two sentences explaining why the results from this small sample might not perfectly match the results if all 30 students were surveyed.
Frequently Asked Questions
Why is random sampling important in statistics?
What is the difference between random sampling and convenience sampling?
How does sample size affect the accuracy of inferences?
How can active learning activities help students understand random sampling?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Probability and Statistics
Understanding Populations and Samples
Students will differentiate between populations and samples and understand the importance of representative samples.
2 methodologies
Drawing Inferences from Samples
Students will use data from a random sample to draw inferences about a population with an unknown characteristic of interest.
2 methodologies
Measures of Center: Mean, Median, Mode
Students will calculate and interpret measures of center for numerical data sets.
2 methodologies
Measures of Variability: Range and IQR
Students will calculate and interpret measures of variability (range, interquartile range) for numerical data sets.
2 methodologies
Comparing Data Sets
Using measures of center and variability to compare two numerical data distributions.
2 methodologies
Understanding Probability
Students will define probability and understand the likelihood of events.
2 methodologies