Skip to content
Mathematics · Year 11 · Data Interpretation and Statistics · Spring Term

Scatter Graphs and Correlation

Students will plot and interpret scatter graphs, identifying types of correlation and drawing lines of best fit.

National Curriculum Attainment TargetsGCSE: Mathematics - Statistics

About This Topic

Scatter graphs show the relationship between two variables by plotting data points on a coordinate grid. Year 11 students plot points from bivariate data sets, such as heights and weights or exam scores and study hours, then identify positive correlation where points trend upward, negative where they trend downward, and no correlation where points scatter randomly. They draw lines of best fit by eye, ensuring the line passes close to most points, and use these to estimate missing values or predict trends.

This topic aligns with GCSE Statistics requirements for data interpretation, building skills in pattern recognition and critical analysis. Students learn that correlation measures association strength but does not prove causation, a key distinction applied to real-world claims like ice cream sales and drowning rates both rising in summer. Practising with varied data sets strengthens their ability to question data reliability and context.

Active learning suits scatter graphs because students construct meaning from their own or peer-collected data. When they gather measurements in class, plot collaboratively, and debate lines of best fit, misconceptions surface naturally. Group critiques refine judgments, making abstract statistical concepts concrete and relevant to decision-making.

Key Questions

  1. Differentiate between positive, negative, and no correlation in scatter graphs.
  2. Explain why correlation does not imply causation.
  3. Predict future values using a line of best fit, assessing the reliability of the prediction.

Learning Objectives

  • Analyze bivariate data sets to identify and classify the type of correlation present in a scatter graph.
  • Evaluate the strength and direction of correlation from a scatter graph, distinguishing between strong, moderate, and weak relationships.
  • Construct a line of best fit on a scatter graph by eye, justifying its placement relative to the data points.
  • Predict future values or trends using a line of best fit, and critique the reliability of these predictions based on the data's spread and the prediction's distance from the plotted data.
  • Explain the difference between correlation and causation, providing a reasoned example to support the explanation.

Before You Start

Plotting Coordinates

Why: Students must be able to accurately plot points on a Cartesian grid to create scatter graphs.

Understanding Variables

Why: Students need to differentiate between independent and dependent variables to correctly label axes and interpret relationships.

Key Vocabulary

Bivariate DataA data set consisting of two variables for each individual observation, used to investigate relationships.
CorrelationA statistical measure that describes the extent to which two variables change together. It can be positive, negative, or none.
Line of Best FitA straight line drawn on a scatter graph that best represents the trend in the data, minimizing the distance between the line and the data points.
CausationThe relationship where one event is the result of another event; correlation does not imply causation.

Watch Out for These Misconceptions

Common MisconceptionA strong correlation always means one variable causes the other.

What to Teach Instead

Correlation shows association, not causation; third factors often explain links. Group debates on real examples like cricket scores and rainfall help students articulate alternatives. Peer challenges expose flawed reasoning quickly.

Common MisconceptionThe line of best fit must pass through every data point.

What to Teach Instead

Lines balance points above and below for best average fit. Collaborative plotting activities let students test and adjust lines together, seeing how outliers affect balance. Visual feedback from class graphs corrects over-precision.

Common MisconceptionNo correlation means the variables have zero relationship.

What to Teach Instead

Weak scatter still shows no linear link, though non-linear ones may exist. Sorting activities with varied plots train eyes to distinguish strengths. Student-led examples from daily life reinforce nuanced interpretation.

Active Learning Ideas

See all activities

Real-World Connections

  • Market researchers use scatter graphs to analyze the relationship between advertising spend and product sales, helping to predict future sales based on marketing budgets.
  • Environmental scientists plot data on scatter graphs to investigate links between pollution levels and respiratory illnesses in specific urban areas, informing public health initiatives.
  • Financial analysts examine scatter graphs to see if there is a correlation between a company's profit margins and its stock price, aiding investment decisions.

Assessment Ideas

Exit Ticket

Provide students with a scatter graph showing a clear positive correlation. Ask them to: 1. Describe the correlation in one sentence. 2. Draw a line of best fit. 3. Predict the value of the dependent variable when the independent variable is X (a value within the range of the data).

Discussion Prompt

Present students with two scenarios: Scenario A: Ice cream sales increase as the temperature rises. Scenario B: The number of shark attacks increases as ice cream sales rise. Ask: 'Which scenario shows correlation, and which might show causation? Explain your reasoning, focusing on the role of a third variable.'

Quick Check

Display three scatter graphs: one with positive correlation, one with negative, and one with no correlation. Ask students to hold up fingers corresponding to the type of correlation (e.g., 1 for positive, 2 for negative, 3 for none) for each graph shown.

Frequently Asked Questions

How do I teach students to draw lines of best fit accurately?
Guide students to plot data first, then sketch lines that minimise distances from points equally above and below. Use transparent overlays for practice or digital tools like GeoGebra for instant feedback. Emphasise judgement over perfection; follow with peer reviews where groups critique each other's lines against criteria like balance and steepness.
What real-world examples work best for correlation types?
Use heights vs arm spans for positive, temperature vs ice cream sales inversely for negative, and shoe sizes vs favourite colours for none. Provide raw data tables for plotting. Connect to GCSE exam styles by including outliers, prompting discussions on data quality and prediction limits.
How can active learning improve understanding of scatter graphs?
Active methods like collecting class data for heights and reaction times make plotting personal and engaging. Small group sorts of correlation cards build pattern recognition through talk. Prediction challenges with lines of best fit encourage testing ideas, reducing misconceptions as students defend choices and refine via feedback.
Why is distinguishing correlation from causation important in GCSE Maths?
Examiners test this to develop critical thinking; students must explain why linked variables like exam stress and headaches do not prove cause. Activities debating spurious pairs, such as UK rainfall and foreign chocolate prices, clarify lurking variables. This skill transfers to evaluating news claims and supports higher mark bands in interpretation questions.

Planning templates for Mathematics