Skip to content
Mathematics · Secondary 4 · Statistics and Probability · Semester 2

Scatter Diagrams and Correlation

Students will construct and interpret scatter diagrams to identify relationships between two variables.

MOE Syllabus OutcomesMOE: Statistics and Probability - S4

About This Topic

Scatter diagrams plot paired data points on a coordinate plane to show relationships between two variables. Secondary 4 students construct these from bivariate datasets, such as advertising spend versus sales revenue. They identify positive correlation when points slope upward from left to right, negative correlation when they slope downward, and no correlation when points scatter randomly. The strength of correlation emerges visually: points clustered tightly near a straight line signal strong correlation, while wide spreads indicate weak correlation.

This topic anchors the Statistics and Probability unit in the MOE curriculum, building on prior data skills and preparing for advanced inference. Students interpret trends to predict general outcomes, like forecasting sales from past data, which sharpens analytical thinking for real-world applications in business and science. Class discussions reveal how outliers affect perceptions of strength.

Active learning suits this topic well. When students gather their own data, plot it collaboratively, and debate interpretations, they grasp nuances that static examples miss. Hands-on plotting reinforces construction accuracy, while group critiques build confidence in visual analysis.

Key Questions

  1. Differentiate between positive, negative, and no correlation in a scatter diagram.
  2. Analyze how the strength of a correlation is visually represented in a scatter plot.
  3. Predict the general trend between two variables based on a scatter diagram.

Learning Objectives

  • Construct scatter diagrams from bivariate data sets, plotting at least 20 data points accurately.
  • Analyze scatter diagrams to classify the type of correlation (positive, negative, or none) present between two variables.
  • Evaluate the strength of a correlation by visually assessing the dispersion of points around a potential line of best fit.
  • Predict the general trend of one variable based on the observed correlation with another variable in a given scatter diagram.
  • Critique the potential influence of outliers on the perceived strength and direction of correlation in a scatter diagram.

Before You Start

Coordinate Geometry

Why: Students need to be proficient in plotting points on a Cartesian plane to construct scatter diagrams.

Data Representation (Bar Graphs, Pie Charts)

Why: Familiarity with representing data visually prepares students for understanding the purpose and interpretation of graphical displays like scatter diagrams.

Mean, Median, Mode

Why: Understanding basic statistical measures helps students contextualize the central tendency and spread of data, which is relevant when discussing correlation strength.

Key Vocabulary

Bivariate DataA set of data that consists of paired measurements for two different variables, such as height and weight for individuals.
CorrelationA statistical measure that describes the extent to which two variables change together, indicating a relationship between them.
Positive CorrelationA relationship where as one variable increases, the other variable tends to increase as well, shown by points generally sloping upwards.
Negative CorrelationA relationship where as one variable increases, the other variable tends to decrease, shown by points generally sloping downwards.
No CorrelationA lack of a discernible linear relationship between two variables, where the data points appear randomly scattered.
OutlierA data point that differs significantly from other observations, which can disproportionately affect statistical analyses.

Watch Out for These Misconceptions

Common MisconceptionCorrelation always means causation.

What to Teach Instead

Students often assume a strong scatter plot proves one variable causes the other. Hands-on activities with spurious examples, like ice cream sales and shark attacks, prompt discussions that separate association from cause. Peer reviews of datasets highlight confounding factors.

Common MisconceptionAll strong correlations form perfect straight lines.

What to Teach Instead

Real data rarely aligns perfectly, yet students expect it. Group plotting of messy datasets shows how clusters near a trend line define strength. Adjustments during collaboration clarify that imperfect lines still convey reliable trends.

Common MisconceptionNegative correlation means values are opposites.

What to Teach Instead

Learners confuse it with exact inverses, like 1 minus the other value. Comparing plots in small groups reveals it's about opposite directions, not magnitudes. Visual manipulations build accurate mental models.

Active Learning Ideas

See all activities

Real-World Connections

  • Market researchers use scatter diagrams to explore relationships between advertising expenditure and product sales. For example, a company might plot monthly ad spend against units sold to see if increased advertising leads to higher sales.
  • Environmental scientists analyze data on industrial emissions and air quality readings using scatter plots. They might investigate if higher levels of certain pollutants correlate with increased respiratory illnesses in affected regions.
  • Financial analysts examine stock market data, plotting the price of one company's stock against another's. This helps them identify potential diversification strategies or understand how related companies' performances move together.

Assessment Ideas

Exit Ticket

Provide students with a pre-made scatter diagram showing a clear positive, negative, or no correlation. Ask them to write: 1. The type of correlation shown. 2. One sentence explaining what this correlation means for the two variables. 3. One word describing the strength of the correlation (e.g., strong, weak, moderate).

Quick Check

Present students with two different scatter diagrams. Ask them to hold up one finger for positive correlation, two fingers for negative correlation, and three fingers for no correlation. Then, ask them to point towards the ceiling if the correlation is strong and towards the floor if it is weak.

Discussion Prompt

Show a scatter diagram with a clear outlier. Ask: 'How does this single point affect our understanding of the relationship between the two variables? If we removed this outlier, would the correlation become stronger or weaker? Why?' Facilitate a brief class discussion on the impact of outliers.

Frequently Asked Questions

How do you identify positive, negative, and zero correlation in scatter diagrams?
Positive correlation shows points trending up from left to right, like study time and scores. Negative trends down, such as price and demand. Zero appears as random scatter with no direction. Teach by having students sketch quick examples from daily life, then plot class data to spot patterns firsthand. Strength comes from point clustering near an imaginary line.
What does the strength of correlation look like visually?
Strong correlation clusters points tightly along a line; moderate spreads somewhat; weak scatters broadly but hints at trend. No correlation lacks any pattern. Use layered activities: start with ideal lines, add noise progressively. Students measure spreads with rulers on plots to quantify visually, deepening interpretation skills for MOE assessments.
How can active learning help students master scatter diagrams?
Active approaches like collecting class data on heights and arm spans, then plotting in pairs, make abstract correlations concrete. Small group sorts of example plots spark debates on strength, correcting misconceptions through evidence. Whole-class predictions from real datasets build predictive confidence. These methods outperform lectures by engaging multiple senses and promoting collaborative reasoning, aligning with MOE's emphasis on inquiry.
How to predict trends from scatter diagrams?
Draw a line of best fit through most points, ignoring outliers, then extend it for predictions. For positive correlation, higher x means higher y. Practice with sports stats: plot training hours versus race times, predict for new values. Group challenges refine estimates, as peers challenge assumptions and improve accuracy for exam-style questions.

Planning templates for Mathematics