Skip to content
Mathematics · Year 9 · Statistics and Probability · Term 4

Collecting and Representing Data

Students will review methods of data collection and various ways to represent data, including frequency tables and histograms.

ACARA Content DescriptionsAC9M9ST01

About This Topic

Bivariate data and scatter plots introduce Year 9 students to the world of statistical relationships. Unlike univariate data, which looks at one variable at a time, bivariate data explores how two variables might be linked, such as height and arm span, or study time and exam scores. Students learn to construct scatter plots, identify the direction and strength of correlations, and draw 'lines of best fit' to make predictions.

In the Australian Curriculum, this unit emphasizes the critical thinking skill of distinguishing between correlation and causation. This is a vital life skill This topic comes alive when students can collect their own data from the class or use real-world Australian datasets (like temperature and ice cream sales). Students grasp this concept faster through collaborative data-gathering and peer-led 'investigations' where they must explain the relationships they find.

Key Questions

  1. Compare different methods of data collection and their suitability for various research questions.
  2. Explain how to choose the most appropriate graph to represent a given data set.
  3. Critique common misrepresentations of data in graphs and charts.

Learning Objectives

  • Compare the suitability of different data collection methods, such as surveys, experiments, and observations, for specific research questions.
  • Explain the criteria for selecting the most appropriate graph type, including frequency tables, histograms, bar charts, and pie charts, to represent a given data set.
  • Critique common graphical misrepresentations, such as misleading scales or selective data presentation, identifying how they can distort interpretation.
  • Construct frequency tables and histograms accurately from raw data, ensuring correct labeling and intervals.
  • Analyze data presented in frequency tables and histograms to identify patterns, trends, and key features.

Before You Start

Collecting and Organizing Data

Why: Students need prior experience with basic data collection and organization to build upon for this topic.

Introduction to Data Representation

Why: Familiarity with simpler graph types like bar charts and pie charts is necessary before moving to histograms.

Key Vocabulary

Frequency TableA table that lists data values and the number of times each value occurs, often grouped into intervals for continuous data.
HistogramA graphical representation of the distribution of numerical data, where the data is grouped into bins or intervals and represented by bars.
Data Collection MethodA systematic procedure for gathering information, such as surveys, interviews, observations, or experiments.
Class IntervalA range of values in a frequency table or histogram that groups data points together.
FrequencyThe number of times a particular data value or range of values appears in a data set.

Watch Out for These Misconceptions

Common MisconceptionStudents often believe that correlation always means one thing causes the other.

What to Teach Instead

This is the most famous error in statistics. Using 'spurious correlations' (like the shark attack example) in a structured debate helps them see that two things can be related by a third factor. Active discussion is the best way to break this logical habit.

Common MisconceptionThinking the line of best fit must go through the origin or connect the first and last points.

What to Teach Instead

Students often treat it like a connect-the-dots puzzle. Using a piece of string on a scatter plot allows them to 'see' the line that minimises the distance to all points. Peer-checking their 'string lines' helps them understand the concept of an average trend.

Active Learning Ideas

See all activities

Real-World Connections

  • Market researchers use surveys and frequency tables to understand consumer preferences for new products, helping companies like Woolworths decide which items to stock.
  • Urban planners analyze traffic flow data, often collected through sensors and observations, and represent it in histograms to identify peak times and plan infrastructure improvements for cities like Melbourne.
  • Public health officials use data from hospitals and surveys to create charts and graphs that track disease outbreaks, informing decisions about public health campaigns and resource allocation.

Assessment Ideas

Quick Check

Provide students with a short scenario describing a research question (e.g., 'Investigating the most popular sports played by Year 9 students'). Ask them to write down the most suitable data collection method and justify their choice in one sentence.

Exit Ticket

Present students with a small data set (e.g., heights of 10 students). Ask them to construct a frequency table with appropriate intervals and then draw a histogram based on that table. Check for correct interval grouping and bar representation.

Discussion Prompt

Show students two different graphs representing the same data set, one of which is misleading (e.g., a broken y-axis scale). Ask: 'Which graph do you think presents the data more accurately and why? What makes the other graph misleading?'

Frequently Asked Questions

What is the difference between positive and negative correlation?
In a positive correlation, both variables move in the same direction (as one goes up, the other goes up). In a negative correlation, they move in opposite directions (as one goes up, the other goes down). For example, as exercise increases, body fat often decreases, that's a negative correlation.
What does a 'strong' correlation look like on a scatter plot?
A strong correlation means the points are clustered very closely around a line. If the points are scattered all over the place with no clear pattern, there is 'no correlation' or a very 'weak' one.
How do I draw a good line of best fit?
A good line of best fit should follow the general 'path' of the points. You want to have roughly the same number of points above the line as below it, and it should follow the direction (slope) of the data cluster.
How can active learning help students understand bivariate data?
Active learning, like the 'Leonardo da Vinci Challenge', makes data collection personal and meaningful. When students are plotting their own measurements, they are more invested in the results. Collaborative tasks that challenge 'causation' also force them to think like real statisticians, questioning the 'why' behind the numbers rather than just blindly following a trend line.

Planning templates for Mathematics