Normal Distribution and Z-Scores
Understanding the properties of the normal distribution and standardizing data using z-scores.
About This Topic
The normal distribution, recognized by its symmetric bell-shaped curve, is one of the most important models in statistics. Its shape is determined entirely by two parameters: the mean (which sets the center) and the standard deviation (which sets the spread). Many naturally occurring measurements, from human heights to SAT scores to manufacturing tolerances, approximately follow a normal distribution, making it the foundation for a large portion of inferential statistics.
Common Core standard CCSS.Math.Content.HSS.ID.A.4 requires students to use the mean and standard deviation of a data set to fit it to a normal distribution and to estimate population percentages using the Empirical Rule. Z-scores translate raw data values into a standardized scale, measuring distance from the mean in standard deviation units, which allows direct comparison of values from different normal distributions with different means and spreads.
Active learning works particularly well for the normal distribution because real data collected by the class provides immediate context for the mean, standard deviation, and z-score calculations. When students compute their own z-scores on class-collected data, standardization becomes a concrete operation on familiar numbers rather than an abstract formula applied to given values.
Key Questions
- Explain the significance of the empirical rule (68-95-99.7) for normal distributions.
- Analyze how z-scores allow for comparison of data from different normal distributions.
- Construct a normal probability plot to assess normality of a dataset.
Learning Objectives
- Calculate the z-score for a given data point within a normal distribution, interpreting its meaning in terms of standard deviations from the mean.
- Analyze the properties of a normal distribution by applying the empirical rule (68-95-99.7) to estimate probabilities of data falling within specific ranges.
- Compare and contrast data points from different normal distributions by standardizing them using z-scores.
- Construct and interpret a normal probability plot to visually assess whether a dataset approximates a normal distribution.
- Explain the relationship between the mean, standard deviation, and the shape of a normal distribution curve.
Before You Start
Why: Students need to understand how to calculate and interpret the mean as the center of a distribution.
Why: Students must be able to calculate and understand standard deviation as a measure of data dispersion before they can work with z-scores.
Why: Understanding the fundamental ideas of probability is necessary for interpreting the percentages associated with normal distributions.
Key Vocabulary
| Normal Distribution | A continuous probability distribution that is symmetric around its mean, forming a bell-shaped curve. It is defined by its mean and standard deviation. |
| Mean | The average of a dataset, representing the center of the normal distribution. It is denoted by the Greek letter mu (μ). |
| Standard Deviation | A measure of the amount of variation or dispersion in a set of values, representing the typical distance of data points from the mean. It is denoted by the Greek letter sigma (σ). |
| Z-score | A standardized score that indicates how many standard deviations a data point is from the mean of its distribution. It is calculated as z = (x - μ) / σ. |
| Empirical Rule | A rule for normal distributions stating that approximately 68% of data falls within one standard deviation of the mean, 95% within two, and 99.7% within three. |
Watch Out for These Misconceptions
Common MisconceptionAll data is normally distributed.
What to Teach Instead
Normal distributions appear frequently but not universally. Income distributions are typically right-skewed; waiting times often follow exponential distributions. Students who assume normality without checking will draw invalid conclusions. Constructing histograms and probability plots for deliberately skewed data sets during a station activity shows what non-normal data looks like and why checking the assumption matters.
Common MisconceptionA z-score of 2.0 means a raw score of 2.
What to Teach Instead
A z-score of 2.0 means the score is 2 standard deviations above the mean, not that the raw score is 2. The actual raw score depends on the specific distribution's mean and standard deviation. Working backward from z-scores to raw values using x = mu + z * sigma helps students internalize that z-scores are relative measurements, not raw values.
Active Learning Ideas
See all activitiesInquiry Circle: Class Data Fits the Curve
The class records a measurable quantity (reaction times, heights, or number of words recalled in 30 seconds), computes the class mean and standard deviation, and identifies who falls within one, two, and three standard deviations of the mean. Groups compare their class percentages to the 68-95-99.7 Empirical Rule benchmarks and discuss what deviations from the rule suggest about their data.
Think-Pair-Share: Z-Score Comparison Race
Partners are each given a score from a different test (one from a test with mean 70 and SD 8, one from a test with mean 500 and SD 100). Each computes their z-score and determines who performed better relative to their class. They then explain to each other, in plain language, what the z-score represents about their performance.
Gallery Walk: Normal Distribution Applications
Stations feature four normal distribution contexts (blood pressure readings, ACT scores, manufacturing tolerances, and annual rainfall amounts). Students calculate the proportion of the population within a given range using z-scores and a standard normal table, then interpret the answer in the scenario's context.
Stations Rotation: Assessing Normality
At one station, students construct a histogram from a given data set and visually assess normality. At a second station, they create a normal probability plot (Q-Q plot). At a third, they apply the Empirical Rule to check if approximately 68%, 95%, and 99.7% of data falls within one, two, and three standard deviations. Groups discuss which method gave them the most confidence in their assessment.
Real-World Connections
- Quality control engineers in manufacturing use normal distributions and z-scores to monitor product specifications, like the diameter of ball bearings or the fill level of soda bottles. They can quickly identify if a batch of products is outside acceptable limits by calculating z-scores for sample measurements.
- Educational testing services, such as the College Board for the SAT or ACT, use normal distributions to report scores. Z-scores allow students to compare their performance on different sections of the test, or even across different standardized tests that may have different means and standard deviations.
Assessment Ideas
Provide students with a dataset that approximately follows a normal distribution (e.g., heights of students in class). Ask them to calculate the mean and standard deviation, then find the z-score for two specific data points. Finally, ask them to explain what one of the z-scores means in context.
Present students with two scenarios: Scenario A describes a normal distribution with mean 70 and standard deviation 5, and Scenario B describes a normal distribution with mean 80 and standard deviation 10. Give them two raw scores, one from each scenario (e.g., 75 from A, 90 from B). Ask them to calculate the z-scores for both and determine which score is relatively higher within its own distribution.
Pose the question: 'Why is the empirical rule (68-95-99.7) so useful for understanding normal distributions, even though it only provides approximations?' Facilitate a discussion where students explain its role in quickly estimating probabilities and identifying outliers without complex calculations.
Frequently Asked Questions
What is the Empirical Rule (68-95-99.7 rule) and how is it used?
How do you calculate a z-score?
Why are z-scores useful for comparing data from different distributions?
How can active learning strengthen understanding of z-scores and the normal distribution?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Probability and Inferential Statistics
Review of Basic Probability and Counting Principles
Revisiting permutations, combinations, and fundamental probability rules.
2 methodologies
Conditional Probability and Bayes
Calculating the probability of events based on prior knowledge of related conditions.
2 methodologies
Random Variables and Probability Distributions
Introducing discrete and continuous random variables and their associated probability distributions.
2 methodologies
Expected Value and Standard Deviation of Random Variables
Calculating and interpreting the expected value and standard deviation for discrete random variables.
2 methodologies
Binomial Distribution
Applying the binomial distribution to model scenarios with a fixed number of independent trials.
2 methodologies
Probability Distributions
Analyzing binomial and normal distributions to determine the likelihood of outcomes.
2 methodologies