Skip to content
Mathematics · Grade 8 · Patterns in Data · Term 3

Using Linear Models for Prediction

Using the equation of a linear model to solve problems in the context of bivariate measurement data.

Ontario Curriculum Expectations8.SP.A.3

About This Topic

In Grade 8 mathematics, students use the equation of a line of best fit to make predictions from bivariate measurement data. They interpret scatter plots of paired data, such as arm span versus height or temperature versus ice melt rates, to derive linear models. Predictions occur both within the data range for reliable estimates and outside it through extrapolation. Students critique these predictions by considering model fit, data spread, and context-specific limitations, aligning with Ontario Curriculum expectations for data analysis.

This topic, from the Patterns in Data unit, strengthens skills in interpreting relationships between variables. Students learn to assess correlation strength, identify outliers, and recognize when linearity fails, such as in curved patterns. These practices build statistical literacy for real applications like predicting sales from advertising spend or crop yields from rainfall.

Active learning benefits this topic greatly because students engage directly with data they collect. Plotting their measurements, calculating equations collaboratively, and testing predictions against new data make abstract concepts concrete. Group critiques of extrapolation scenarios highlight risks, such as assuming linear trends persist indefinitely, fostering deeper understanding and confidence in data-driven decisions.

Key Questions

  1. Predict values using a line of best fit for values within and outside the data range.
  2. Critique the reliability of predictions made using a line of best fit for values outside our data range.
  3. Analyze the limitations of using linear models to make predictions.

Learning Objectives

  • Calculate the predicted value of a dependent variable using the equation of a line of best fit for a given independent variable value.
  • Critique the reliability of predictions made using a line of best fit, distinguishing between interpolation and extrapolation.
  • Analyze the limitations of linear models in representing real-world bivariate data, identifying scenarios where linearity is inappropriate.
  • Compare predictions made from a line of best fit to actual data points, evaluating the accuracy of the model.
  • Explain the meaning of the slope and y-intercept in the context of a specific bivariate data set and its linear model.

Before You Start

Representing Data on Scatter Plots

Why: Students need to be able to plot and interpret scatter plots to understand the visual representation of bivariate data before fitting a line.

Determining the Equation of a Line

Why: Students must be able to find and use the equation of a line (y = mx + b) to make predictions.

Key Vocabulary

Line of best fitA straight line drawn on a scatter plot that best represents the general trend of the data points. It minimizes the overall distance between the line and the points.
InterpolationEstimating a value within the range of the observed data points. Predictions made through interpolation are generally more reliable.
ExtrapolationEstimating a value outside the range of the observed data points. Predictions made through extrapolation can be less reliable as the trend may not continue.
Bivariate dataA set of data consisting of two variables for each individual or event. These pairs of values are often plotted on a scatter plot.

Watch Out for These Misconceptions

Common MisconceptionThe line of best fit passes through all data points.

What to Teach Instead

Lines of best fit minimize overall distance to points but rarely hit every one. Hands-on plotting activities let students see residuals firsthand, adjusting lines to balance errors and grasp the least-squares concept through trial and error.

Common MisconceptionPredictions outside the data range are as reliable as those inside.

What to Teach Instead

Extrapolation assumes the linear pattern continues, which often fails due to changing conditions. Group debates on scenarios reveal this, as students test predictions with new data and note divergences, building caution in real applications.

Common MisconceptionAny two variables with a line of best fit show causation.

What to Teach Instead

Correlation indicates association, not cause. Collaborative data hunts pairing unrelated variables, like shoe size and math scores, prompt discussions that clarify this, strengthening critical analysis.

Active Learning Ideas

See all activities

Real-World Connections

  • Meteorologists use linear models to predict future temperatures based on historical data, helping to issue weather advisories and plan for seasonal changes.
  • Financial analysts may use linear models to forecast sales figures based on advertising spending, aiding businesses in budget allocation and marketing strategy.
  • Urban planners might use linear models to predict population growth in specific neighborhoods based on current trends, informing decisions about infrastructure development and resource management.

Assessment Ideas

Quick Check

Provide students with a scatter plot and the equation of the line of best fit. Ask them to calculate the predicted value for a given data point within the range and one outside the range. Then, ask: 'Which prediction do you trust more and why?'

Discussion Prompt

Present a scenario where a linear model is used to predict something unrealistic, like predicting a person's lifespan based solely on their height. Facilitate a class discussion using questions like: 'What are the limitations of this linear model? What other factors might influence lifespan? Why is extrapolation sometimes dangerous?'

Exit Ticket

Give students a scatter plot showing the relationship between hours studied and test scores, along with the line of best fit equation. Ask them to write one sentence explaining what the slope of the line means in this context. Then, ask them to predict the score for someone who studied 10 hours and state whether this is interpolation or extrapolation.

Frequently Asked Questions

How do students derive and use the equation of a line of best fit?
Start with a scatter plot of bivariate data. Students estimate the line visually or use tools like graphing calculators to find y = mx + b. They substitute x-values to predict y, practicing with contexts like predicting travel time from distance. Reinforce by having them verify predictions against actual data points.
What are the limitations of extrapolating with linear models?
Linear models assume constant relationships that may break outside observed ranges due to factors like saturation or thresholds. For example, predicting plant growth beyond tested light levels ignores biological limits. Teach critique by comparing extrapolated predictions to real-world deviations, emphasizing context and data quality.
How can active learning help students understand prediction reliability?
Active approaches like collecting class data on height and reach, then plotting and predicting, make reliability tangible. Students see tight clusters yield better predictions inside ranges but spot extrapolation risks when testing extremes. Pair shares and whole-class debates build consensus on limitations, turning passive calculation into critical evaluation.
How to assess student understanding of linear model predictions?
Use tasks requiring predictions with justifications, such as 'Predict sales at 10 ads; explain reliability.' Rubrics score equation accuracy, range awareness, and critique depth. Portfolios of student-generated models from personal data show growth in applying concepts across contexts.

Planning templates for Mathematics

Using Linear Models for Prediction | Grade 8 Mathematics Lesson Plan | Flip Education