Mathematics · Year 9 · Statistics and Probability · Term 4

Collecting and Representing Data

Students will review methods of data collection and various ways to represent data, including frequency tables and histograms.

ACARA Content DescriptionsAC9M9ST01

About This Topic

Bivariate data and scatter plots introduce Year 9 students to the world of statistical relationships. Unlike univariate data, which looks at one variable at a time, bivariate data explores how two variables might be linked, such as height and arm span, or study time and exam scores. Students learn to construct scatter plots, identify the direction and strength of correlations, and draw 'lines of best fit' to make predictions.

In the Australian Curriculum, this unit emphasizes the critical thinking skill of distinguishing between correlation and causation. This is a vital life skill This topic comes alive when students can collect their own data from the class or use real-world Australian datasets (like temperature and ice cream sales). Students grasp this concept faster through collaborative data-gathering and peer-led 'investigations' where they must explain the relationships they find.

Key Questions

Compare different methods of data collection and their suitability for various research questions.
Explain how to choose the most appropriate graph to represent a given data set.
Critique common misrepresentations of data in graphs and charts.

Learning Objectives

Compare the suitability of different data collection methods, such as surveys, experiments, and observations, for specific research questions.
Explain the criteria for selecting the most appropriate graph type, including frequency tables, histograms, bar charts, and pie charts, to represent a given data set.
Critique common graphical misrepresentations, such as misleading scales or selective data presentation, identifying how they can distort interpretation.
Construct frequency tables and histograms accurately from raw data, ensuring correct labeling and intervals.
Analyze data presented in frequency tables and histograms to identify patterns, trends, and key features.

Before You Start

Collecting and Organizing Data

Why: Students need prior experience with basic data collection and organization to build upon for this topic.

Introduction to Data Representation

Why: Familiarity with simpler graph types like bar charts and pie charts is necessary before moving to histograms.

Key Vocabulary

Frequency Table	A table that lists data values and the number of times each value occurs, often grouped into intervals for continuous data.
Histogram	A graphical representation of the distribution of numerical data, where the data is grouped into bins or intervals and represented by bars.
Data Collection Method	A systematic procedure for gathering information, such as surveys, interviews, observations, or experiments.
Class Interval	A range of values in a frequency table or histogram that groups data points together.
Frequency	The number of times a particular data value or range of values appears in a data set.

Watch Out for These Misconceptions

Common MisconceptionStudents often believe that correlation always means one thing causes the other.

What to Teach Instead

This is the most famous error in statistics. Using 'spurious correlations' (like the shark attack example) in a structured debate helps them see that two things can be related by a third factor. Active discussion is the best way to break this logical habit.

Common MisconceptionThinking the line of best fit must go through the origin or connect the first and last points.

What to Teach Instead

Students often treat it like a connect-the-dots puzzle. Using a piece of string on a scatter plot allows them to 'see' the line that minimises the distance to all points. Peer-checking their 'string lines' helps them understand the concept of an average trend.

Active Learning Ideas

See all activities

Inquiry Circle: The Leonardo da Vinci Challenge

Students work in groups to measure each other's height and arm span. They plot the class data on a large scatter plot to see if the 'Vitruvian Man' theory (that they are equal) holds true. This involves data collection, plotting, and identifying correlation.

60 min·Whole Class

Formal Debate: Correlation vs. Causation

Give students 'silly' correlations (e.g., as ice cream sales rise, so do shark attacks). Groups must debate whether one causes the other or if there is a 'hidden variable' (like summer heat). This builds essential critical thinking skills for interpreting data.

30 min·Small Groups

Gallery Walk: The Line of Best Fit

Display several scatter plots without lines. Students move in pairs to place a piece of string on each plot where they think the 'line of best fit' should go, then justify their choice based on the balance of points. This builds an intuitive sense of trend lines.

35 min·Pairs

Real-World Connections

Market researchers use surveys and frequency tables to understand consumer preferences for new products, helping companies like Woolworths decide which items to stock.
Urban planners analyze traffic flow data, often collected through sensors and observations, and represent it in histograms to identify peak times and plan infrastructure improvements for cities like Melbourne.
Public health officials use data from hospitals and surveys to create charts and graphs that track disease outbreaks, informing decisions about public health campaigns and resource allocation.

Assessment Ideas

Quick Check

Provide students with a short scenario describing a research question (e.g., 'Investigating the most popular sports played by Year 9 students'). Ask them to write down the most suitable data collection method and justify their choice in one sentence.

Exit Ticket

Present students with a small data set (e.g., heights of 10 students). Ask them to construct a frequency table with appropriate intervals and then draw a histogram based on that table. Check for correct interval grouping and bar representation.

Discussion Prompt

Show students two different graphs representing the same data set, one of which is misleading (e.g., a broken y-axis scale). Ask: 'Which graph do you think presents the data more accurately and why? What makes the other graph misleading?'

Frequently Asked Questions

What is the difference between positive and negative correlation?

In a positive correlation, both variables move in the same direction (as one goes up, the other goes up). In a negative correlation, they move in opposite directions (as one goes up, the other goes down). For example, as exercise increases, body fat often decreases, that's a negative correlation.

What does a 'strong' correlation look like on a scatter plot?

A strong correlation means the points are clustered very closely around a line. If the points are scattered all over the place with no clear pattern, there is 'no correlation' or a very 'weak' one.

How do I draw a good line of best fit?

A good line of best fit should follow the general 'path' of the points. You want to have roughly the same number of points above the line as below it, and it should follow the direction (slope) of the data cluster.

How can active learning help students understand bivariate data?

Active learning, like the 'Leonardo da Vinci Challenge', makes data collection personal and meaningful. When students are plotting their own measurements, they are more invested in the results. Collaborative tasks that challenge 'causation' also force them to think like real statisticians, questioning the 'why' behind the numbers rather than just blindly following a trend line.

Planning templates for Mathematics

Lesson Plan

5E Model

The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.

Unit Planner

Math Unit

Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.

Rubric

Math Rubric

Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.

From the Blog

How to Use Town Hall in Your Classroom

A step-by-step guide to running a classroom town hall simulation, from designing stakeholder role cards to the debrief that makes learning stick.

More in Statistics and Probability

Measures of Central Tendency (Mean, Median, Mode)

Students will calculate and interpret the mean, median, and mode for various data sets, understanding their strengths and weaknesses.

2 methodologies

Measures of Spread (Range, IQR)

Students will calculate and interpret the range and interquartile range (IQR) as measures of data spread.

2 methodologies

Five-Point Summary and Box Plots

Students will construct five-point summaries and draw box-and-whisker plots to visually represent and compare data distributions.

2 methodologies

Comparing Data Distributions

Students will compare the distributions of two or more data sets using measures of central tendency, spread, and appropriate graphical representations (e.g., back-to-back stem-and-leaf plots, parallel box plots).

2 methodologies

Interpreting Data Displays and Outliers

Students will interpret various data displays (histograms, box plots, stem-and-leaf plots) to describe data shape, identify outliers, and draw conclusions.

2 methodologies

Introduction to Probability

Students will review basic probability concepts, including sample space, events, and calculating theoretical probability.

2 methodologies