Hypothesis Testing: T-Tests
Performing t-tests for population means when the population standard deviation is unknown.
About This Topic
Hypothesis testing with t-tests is a cornerstone of AP Statistics and data literacy education at the 12th grade level in the US. When the population standard deviation is unknown, which is the norm in real research, statisticians substitute the sample standard deviation and the resulting test statistic follows a t-distribution rather than the standard normal. The t-distribution has heavier tails than the normal, reflecting the extra uncertainty introduced by estimating from sample data.
Degrees of freedom, calculated as n - 1 for one-sample tests, control how heavy those tails are. With small samples, the distribution is noticeably wider; as sample size grows, it converges toward the standard normal. Students encounter three test variants: the one-sample t-test compares a sample mean against a stated value, the two-sample t-test compares means from two independent groups, and the paired t-test analyzes within-pair differences when observations are matched or repeated measures on the same subject.
Active learning is well suited to t-test instruction because the hardest skill is not computation but scenario judgment. Students who argue about which test applies to ambiguous cases, check conditions together, and interpret p-values in context retain the reasoning process rather than just the steps.
Key Questions
- Explain why the t-distribution is used instead of the normal distribution when sigma is unknown.
- Differentiate between one-sample, two-sample, and paired t-tests.
- Analyze the impact of degrees of freedom on the shape of the t-distribution.
Learning Objectives
- Explain the rationale for using the t-distribution over the normal distribution when the population standard deviation is unknown.
- Differentiate between the hypotheses and conditions for one-sample, two-sample independent, and paired t-tests.
- Calculate the appropriate t-statistic for one-sample, two-sample independent, and paired scenarios.
- Analyze the effect of sample size and degrees of freedom on the critical values and p-values of a t-test.
- Interpret the results of a t-test in the context of a given research question, including stating conclusions in plain language.
Before You Start
Why: Students need a foundational understanding of sampling distributions and the logic of hypothesis testing before applying t-tests.
Why: Calculating sample means and sample standard deviations is fundamental to computing t-statistics.
Why: Familiarity with the normal distribution and z-scores provides a basis for understanding the t-distribution's properties and its relationship to the normal distribution.
Key Vocabulary
| t-distribution | A probability distribution that is bell-shaped and symmetric like the normal distribution, but has heavier tails. It is used for inference when the population standard deviation is unknown. |
| degrees of freedom (df) | A parameter that characterizes the shape of the t-distribution, typically related to the sample size. For a one-sample t-test, df = n - 1. |
| null hypothesis (H0) | A statement of no effect or no difference, which the t-test aims to find evidence against. |
| alternative hypothesis (Ha) | A statement that contradicts the null hypothesis, proposing that there is an effect or difference. |
| p-value | The probability of observing a test statistic as extreme as, or more extreme than, the one calculated from the sample data, assuming the null hypothesis is true. |
Watch Out for These Misconceptions
Common MisconceptionThe t-distribution and the normal distribution are separate, unrelated tools.
What to Teach Instead
As degrees of freedom increase, the t-distribution converges to the standard normal. For large enough samples, t and z critical values are nearly identical. Having students overlay both distributions in Desmos at varying df values makes this relationship concrete rather than abstract.
Common MisconceptionA paired t-test and a two-sample t-test can be used interchangeably depending on preference.
What to Teach Instead
Paired tests use within-pair differences and remove between-subject variability, making them more powerful when observations are genuinely matched or repeated. Using a two-sample test on paired data inflates the standard error and loses statistical power. Scenario card sorts where students must defend their choice help fix this distinction.
Common MisconceptionA statistically significant result means the effect is large or practically important.
What to Teach Instead
Statistical significance only tells you that the result is unlikely under the null hypothesis. A tiny, practically meaningless difference can produce a very small p-value with a large sample. Pairing t-test results with effect size calculations and context-based interpretation helps students separate statistical from practical significance.
Active Learning Ideas
See all activitiesCard Sort: Which T-Test Applies?
Groups of three receive a set of eight scenario cards and sort them into one-sample, two-sample, and paired categories. Each group writes a one-sentence justification for every card, then the class debriefs on the most contested cases. Disagreement between groups is the discussion goal, not just getting the right answer.
Think-Pair-Share: Estimating Sigma
Present a realistic research scenario where sigma is unknown and ask pairs to explain what changes when the sample standard deviation substitutes for the population value, and why that requires a different distribution. Each pair writes one sentence before the teacher formalizes the idea for the class.
Desmos Degrees-of-Freedom Gallery
Students use a Desmos t-distribution slider to observe how the curve changes at df = 3, 10, 30, and 100, then sketch and annotate each shape in their notes. They answer three comparison questions about tail area and critical values before a brief whole-class discussion on the practical significance of sample size.
Paired T-Test Lab: Before and After
Students collect a small paired dataset, such as dominant versus non-dominant hand grip strength or reaction time before and after a short warm-up, calculate the mean difference and its standard error by hand, and run the test. Each group writes a one-paragraph conclusion interpreting the p-value in plain language before comparing conclusions across groups.
Real-World Connections
- Medical researchers use paired t-tests to compare the effectiveness of a new drug by measuring blood pressure in the same patients before and after treatment.
- Quality control engineers in manufacturing might use a two-sample t-test to determine if there is a significant difference in the average length of bolts produced by two different machines.
- Social scientists conduct one-sample t-tests to investigate if the average score on a standardized test for a particular school district differs significantly from the national average.
Assessment Ideas
Provide students with a scenario describing a research question. Ask them to: 1. Identify whether a one-sample, two-sample independent, or paired t-test is most appropriate. 2. State the null and alternative hypotheses in symbols. 3. List the conditions that must be met for the chosen test.
Present students with two scenarios: one where the population standard deviation is known (use z-test) and one where it is unknown (use t-test). Ask: 'Why do we use different distributions in these cases? What is the practical implication of using the t-distribution?'
Give students a small dataset (e.g., 5 pairs of measurements). Ask them to calculate the mean difference and the sample standard deviation of the differences. Then, ask them to determine the degrees of freedom for a paired t-test on this data.
Frequently Asked Questions
Why use a t-distribution instead of the normal distribution when the population standard deviation is unknown?
What is the difference between a one-sample, two-sample, and paired t-test?
How do degrees of freedom affect the t-distribution shape?
How does active learning help students understand t-tests?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Probability and Inferential Statistics
Review of Basic Probability and Counting Principles
Revisiting permutations, combinations, and fundamental probability rules.
2 methodologies
Conditional Probability and Bayes
Calculating the probability of events based on prior knowledge of related conditions.
2 methodologies
Random Variables and Probability Distributions
Introducing discrete and continuous random variables and their associated probability distributions.
2 methodologies
Expected Value and Standard Deviation of Random Variables
Calculating and interpreting the expected value and standard deviation for discrete random variables.
2 methodologies
Binomial Distribution
Applying the binomial distribution to model scenarios with a fixed number of independent trials.
2 methodologies
Normal Distribution and Z-Scores
Understanding the properties of the normal distribution and standardizing data using z-scores.
2 methodologies