Skip to content
Mathematics · 9th Grade · Statistical Reasoning and Data · Weeks 10-18

Causation vs. Correlation

Distinguishing between situations where correlation implies causation and where it does not.

Common Core State StandardsCCSS.Math.Content.HSS.ID.C.9

About This Topic

The distinction between correlation and causation is one of the most practically important concepts in 9th grade statistics, and one of the most frequently violated in US media, health reporting, and political discourse. Students who can reliably make this distinction have a foundational tool for evaluating claims they will encounter throughout their lives. The CCSS standard HSS.ID.C.9 asks students to apply this distinction using specific data examples, not just as an abstract principle to state.

The mechanism that explains most spurious correlations is the confounding variable: a third variable that causally influences both of the variables being compared, making them appear related when they are not directly connected. Teaching students to ask what else might explain this connection is the key habit of mind this topic builds.

Active learning is particularly effective here because compelling examples exist everywhere and students genuinely enjoy arguing about them. Structured debate or structured academic controversy, where students argue both sides of a causal claim using real data, develops the analytical skills that make the distinction durable rather than a formula recalled on a test.

Key Questions

  1. Differentiate between correlation and causation with real-world examples.
  2. Analyze common pitfalls in assuming causation from correlation.
  3. Construct an argument for or against a causal link based on given data.

Learning Objectives

  • Analyze provided datasets to identify potential correlations between two variables.
  • Evaluate common media claims for causal links, identifying confounding variables or alternative explanations.
  • Construct a written argument, supported by data, for or against a causal relationship between two specific phenomena.
  • Compare and contrast the definitions of correlation and causation using real-world scenarios.

Before You Start

Introduction to Data Analysis and Graphing

Why: Students need to be able to read and interpret basic graphs and data tables to identify patterns and relationships between variables.

Measures of Central Tendency (Mean, Median, Mode)

Why: Understanding basic statistical measures helps students interpret the data they will use to explore correlations.

Key Vocabulary

CorrelationA statistical measure that describes the extent to which two variables change together. A strong correlation means that as one variable changes, the other tends to change in a predictable way.
CausationThe relationship between cause and effect, where one event (the cause) directly produces another event (the effect).
Confounding VariableA variable that influences both the dependent variable and independent variable, causing a spurious association. It is an 'extra' variable that is not accounted for.
Spurious CorrelationA correlation between two variables that appears to be related but is actually due to coincidence or a third, unobserved variable.

Watch Out for These Misconceptions

Common MisconceptionA strong correlation coefficient (r close to 1 or -1) proves that one variable causes the other.

What to Teach Instead

Correlation strength measures how consistently variables move together, not why. Presenting well-known spurious high correlations, such as the correlation between per capita cheese consumption and deaths from bedsheet tangling, makes this viscerally clear. Group laughter and analysis of why the correlation exists without any causal connection is an effective and memorable correction.

Common MisconceptionIf a study is published or reported in the news, its causal claims have been verified.

What to Teach Instead

Observational studies can establish correlation but cannot verify causation. Causal claims require controlled experiments with random assignment. Students who learn to ask whether a study was experimental or observational have a powerful and practical filter for evaluating media claims. Partner analysis of real news summaries builds this habit efficiently.

Common MisconceptionCommon sense reliably tells us when correlation implies causation.

What to Teach Instead

Many spurious correlations feel intuitively plausible until the confounding variable is identified. Students who rely on intuition rather than systematic analysis are vulnerable to motivated reasoning. Presenting counterintuitive examples where the intuitive causal story is demonstrably wrong builds appropriate epistemic humility about gut-level causal judgments.

Active Learning Ideas

See all activities

Real-World Connections

  • Medical researchers often observe correlations between lifestyle factors and disease rates. For example, a correlation between ice cream sales and drowning incidents might be observed, but both are likely caused by a third factor: hot weather.
  • Economists analyze data for correlations between economic indicators like unemployment rates and consumer spending. They must be careful to distinguish correlation from causation when proposing policy changes.
  • Marketers might see a correlation between advertising campaigns and product sales. However, they need to consider other factors like seasonality, competitor actions, or economic conditions before concluding the ads caused the sales increase.

Assessment Ideas

Quick Check

Present students with three scenarios: 1) A clear causal link (e.g., hitting a light switch and the light turning on). 2) A strong correlation with a likely confounding variable (e.g., number of firefighters at a fire and the amount of damage). 3) A spurious correlation (e.g., number of pirates and global warming). Ask students to label each as causation, correlation with confounding variable, or spurious correlation, and briefly explain their reasoning for the latter two.

Discussion Prompt

Pose the question: 'If two things are correlated, does that mean one causes the other?' Facilitate a class discussion using student-generated examples. Prompt students to ask: 'What else could be causing this?' or 'Is there another explanation?'

Exit Ticket

Provide students with a news headline that implies causation from correlation (e.g., 'Study Shows Coffee Drinkers Live Longer'). Ask them to write one sentence explaining why this headline might be misleading and suggest one question they would ask to investigate further.

Frequently Asked Questions

What is the difference between correlation and causation?
Correlation means two variables tend to change together, but says nothing about why. Causation means one variable directly produces a change in the other through an identifiable mechanism. Establishing causation requires controlled experimental evidence with random assignment. Observational data, regardless of how strong the correlation, can only show that two variables move together, not that one forces the other to change.
What is a confounding variable?
A confounding variable is a third variable causally related to both variables being studied, making them appear correlated even though they have no direct influence on each other. Ambient temperature influences both ice cream sales and drowning rates, creating a correlation between ice cream and drowning that is entirely driven by temperature rather than any real connection between the two activities.
How does active learning help students distinguish correlation from causation?
Arguing both sides of a causal claim in a structured format forces students to think more rigorously than simply labeling an example as correlation or causation. When students must defend a position they do not initially hold, they discover weaknesses in their own reasoning. Group analysis of real news headlines makes the skill feel immediately applicable rather than a classroom exercise detached from real life.
How do scientists establish causation rather than just correlation?
Randomized controlled experiments are the gold standard. Researchers randomly assign participants to treatment and control groups, which eliminates confounding variables because both groups are statistically equivalent on all other factors. When only the treatment differs and outcomes diverge, causation is supported. Many important questions in education, medicine, and social science cannot be studied this way, which is why causal claims from observational data always require more scrutiny.

Planning templates for Mathematics