Skip to content
Mathematics · Secondary 4 · Statistics and Probability · Semester 2

Lines of Best Fit and Estimation

Students will draw lines of best fit by eye on scatter diagrams and use them to make estimations and predictions.

MOE Syllabus OutcomesMOE: Statistics and Probability - S4

About This Topic

Lines of best fit guide students in capturing trends within scatter diagrams from bivariate data, such as advertising spend versus sales or temperature versus ice cream sales. They practice plotting points accurately, then sketching a straight line by eye that balances distances from points above and below it. From this line, students interpolate values within the data range and extrapolate for predictions, while noting limitations like unreliable far extrapolations.

This topic fits squarely in the Secondary 4 MOE Statistics and Probability unit, extending scatter plot analysis toward correlation and regression foundations. Students address key challenges: selecting lines that represent trends fairly, recognizing outlier impacts that pull the line, and evaluating prediction appropriateness based on data spread and linearity. These skills sharpen data literacy for real applications in business, science, and policy.

Active learning excels for this topic since students collect and plot their own class data, debate line placements collaboratively, and verify predictions with fresh measurements. Hands-on plotting reveals judgment nuances, group negotiations build consensus on 'best' fits, and real-world testing exposes limitations concretely.

Key Questions

  1. How do we draw a line of best fit that accurately represents the trend in a scatter diagram?
  2. When is it appropriate to use a line of best fit to make predictions, and what are its limitations?
  3. How can outliers affect the position of a line of best fit?

Learning Objectives

  • Draw a line of best fit by eye on a given scatter diagram to represent the general trend of bivariate data.
  • Calculate estimated y-values for given x-values using a drawn line of best fit, interpolating within the data range.
  • Predict y-values for given x-values by extrapolating from a drawn line of best fit, identifying potential inaccuracies.
  • Evaluate the impact of outliers on the position and accuracy of a line of best fit.
  • Critique the appropriateness of using a line of best fit for predictions based on the linearity and spread of the data.

Before You Start

Plotting Points on a Cartesian Plane

Why: Students must be able to accurately plot coordinate pairs to construct scatter diagrams.

Interpreting Graphs and Data Tables

Why: Students need to understand how to read and extract information from graphical representations of data.

Understanding Variables (Independent and Dependent)

Why: Students should be able to identify which variable is being manipulated or observed (independent) and which is expected to change in response (dependent).

Key Vocabulary

Scatter DiagramA graph that displays the relationship between two sets of data, plotted as points. Each point represents a pair of values for the two variables.
Line of Best FitA straight line drawn on a scatter diagram that best represents the general trend of the data points. It aims to pass through the middle of the data, with points scattered roughly equally above and below it.
InterpolationEstimating a value within the range of the observed data points using a line of best fit. This is generally more reliable than extrapolation.
ExtrapolationEstimating a value outside the range of the observed data points using a line of best fit. This can be unreliable, especially for values far from the original data.
OutlierA data point that is significantly different from other data points in the set. Outliers can disproportionately influence the position of a line of best fit.

Watch Out for These Misconceptions

Common MisconceptionThe line of best fit must pass through most or all data points exactly.

What to Teach Instead

The line minimizes overall deviation from points, balancing those above and below. Pairs critiquing each other's lines during plotting activities help students see that no single point dictates position, fostering a balanced trend view.

Common MisconceptionExtrapolations beyond the data range are as reliable as interpolations within it.

What to Teach Instead

Trends may curve or break outside observed ranges, reducing accuracy. Group prediction challenges with follow-up measurements demonstrate discrepancies, helping students qualify predictions cautiously.

Common MisconceptionOutliers should always be removed before drawing the line.

What to Teach Instead

Valid outliers influence the trend; removal needs justification. Small group debates on outlier inclusion clarify their pull on the line, building judgment skills through evidence discussion.

Active Learning Ideas

See all activities

Real-World Connections

  • Economists use scatter diagrams and lines of best fit to analyze relationships between economic indicators, such as unemployment rates and inflation, to make forecasts for government policy.
  • Environmental scientists might plot temperature data against the number of ice cream sales in a coastal town to predict demand during peak tourist seasons, helping businesses manage inventory.
  • Market researchers analyze advertising expenditure versus product sales data to estimate the potential impact of future marketing campaigns on revenue.

Assessment Ideas

Quick Check

Provide students with a scatter diagram showing a clear linear trend and a few outliers. Ask them to draw a line of best fit by eye. Then, ask them to estimate the y-value for a given x-value within the data range and predict the y-value for an x-value outside the data range. Check their drawings and calculations for reasonableness.

Discussion Prompt

Present students with two scatter diagrams: one with a strong linear trend and one with a weak, scattered trend. Ask: 'Which diagram is more appropriate for drawing a line of best fit and making predictions? Explain your reasoning, considering the linearity and spread of the data.' Facilitate a class discussion on why some data sets are better suited for this type of analysis.

Exit Ticket

Give students a scatter diagram with one obvious outlier. Ask them to: 1. Draw a line of best fit that accounts for the outlier. 2. Draw a second line of best fit that minimizes the outlier's influence. 3. Write one sentence explaining how the outlier affected the position of the line.

Frequently Asked Questions

How do students draw a line of best fit accurately by eye?
Instruct students to plot points precisely first, then sketch a line with roughly equal points on each side and minimal average distance. Practice with familiar data like heights and shoe sizes builds intuition. Emphasize it need not touch points but capture the central tendency; peer reviews refine subjective judgments over multiple trials.
What are the limitations of lines of best fit for predictions?
Lines assume linear trends, which fail for non-linear data or beyond observed ranges where patterns shift. Outliers skew results, and correlation does not imply causation. Teach by testing class predictions against new data, revealing errors and prompting qualifiers like confidence intervals in discussions.
How do outliers affect the position of a line of best fit?
Outliers exert disproportionate pull due to greater distance, shifting the line toward them unless balanced by cluster weight. Students learn this through activities comparing lines with and without outliers, measuring deviations. This highlights the need to assess outlier validity before fitting.
How can active learning help students master lines of best fit?
Active approaches like collecting class data for plotting engage students kinesthetically, making trends personal. Pair debates on line positions develop justification skills, while whole-class prediction relays test extrapolations against reality. These reveal subjectivity and limitations experientially, outperforming passive worksheets by boosting retention and critical data sense.

Planning templates for Mathematics