Collecting and Representing Data
Students will review methods of data collection and various ways to represent data, including frequency tables and histograms.
About This Topic
Bivariate data and scatter plots introduce Year 9 students to the world of statistical relationships. Unlike univariate data, which looks at one variable at a time, bivariate data explores how two variables might be linked, such as height and arm span, or study time and exam scores. Students learn to construct scatter plots, identify the direction and strength of correlations, and draw 'lines of best fit' to make predictions.
In the Australian Curriculum, this unit emphasizes the critical thinking skill of distinguishing between correlation and causation. This is a vital life skill This topic comes alive when students can collect their own data from the class or use real-world Australian datasets (like temperature and ice cream sales). Students grasp this concept faster through collaborative data-gathering and peer-led 'investigations' where they must explain the relationships they find.
Key Questions
- Compare different methods of data collection and their suitability for various research questions.
- Explain how to choose the most appropriate graph to represent a given data set.
- Critique common misrepresentations of data in graphs and charts.
Learning Objectives
- Compare the suitability of different data collection methods, such as surveys, experiments, and observations, for specific research questions.
- Explain the criteria for selecting the most appropriate graph type, including frequency tables, histograms, bar charts, and pie charts, to represent a given data set.
- Critique common graphical misrepresentations, such as misleading scales or selective data presentation, identifying how they can distort interpretation.
- Construct frequency tables and histograms accurately from raw data, ensuring correct labeling and intervals.
- Analyze data presented in frequency tables and histograms to identify patterns, trends, and key features.
Before You Start
Why: Students need prior experience with basic data collection and organization to build upon for this topic.
Why: Familiarity with simpler graph types like bar charts and pie charts is necessary before moving to histograms.
Key Vocabulary
| Frequency Table | A table that lists data values and the number of times each value occurs, often grouped into intervals for continuous data. |
| Histogram | A graphical representation of the distribution of numerical data, where the data is grouped into bins or intervals and represented by bars. |
| Data Collection Method | A systematic procedure for gathering information, such as surveys, interviews, observations, or experiments. |
| Class Interval | A range of values in a frequency table or histogram that groups data points together. |
| Frequency | The number of times a particular data value or range of values appears in a data set. |
Watch Out for These Misconceptions
Common MisconceptionStudents often believe that correlation always means one thing causes the other.
What to Teach Instead
This is the most famous error in statistics. Using 'spurious correlations' (like the shark attack example) in a structured debate helps them see that two things can be related by a third factor. Active discussion is the best way to break this logical habit.
Common MisconceptionThinking the line of best fit must go through the origin or connect the first and last points.
What to Teach Instead
Students often treat it like a connect-the-dots puzzle. Using a piece of string on a scatter plot allows them to 'see' the line that minimises the distance to all points. Peer-checking their 'string lines' helps them understand the concept of an average trend.
Active Learning Ideas
See all activitiesInquiry Circle: The Leonardo da Vinci Challenge
Students work in groups to measure each other's height and arm span. They plot the class data on a large scatter plot to see if the 'Vitruvian Man' theory (that they are equal) holds true. This involves data collection, plotting, and identifying correlation.
Formal Debate: Correlation vs. Causation
Give students 'silly' correlations (e.g., as ice cream sales rise, so do shark attacks). Groups must debate whether one causes the other or if there is a 'hidden variable' (like summer heat). This builds essential critical thinking skills for interpreting data.
Gallery Walk: The Line of Best Fit
Display several scatter plots without lines. Students move in pairs to place a piece of string on each plot where they think the 'line of best fit' should go, then justify their choice based on the balance of points. This builds an intuitive sense of trend lines.
Real-World Connections
- Market researchers use surveys and frequency tables to understand consumer preferences for new products, helping companies like Woolworths decide which items to stock.
- Urban planners analyze traffic flow data, often collected through sensors and observations, and represent it in histograms to identify peak times and plan infrastructure improvements for cities like Melbourne.
- Public health officials use data from hospitals and surveys to create charts and graphs that track disease outbreaks, informing decisions about public health campaigns and resource allocation.
Assessment Ideas
Provide students with a short scenario describing a research question (e.g., 'Investigating the most popular sports played by Year 9 students'). Ask them to write down the most suitable data collection method and justify their choice in one sentence.
Present students with a small data set (e.g., heights of 10 students). Ask them to construct a frequency table with appropriate intervals and then draw a histogram based on that table. Check for correct interval grouping and bar representation.
Show students two different graphs representing the same data set, one of which is misleading (e.g., a broken y-axis scale). Ask: 'Which graph do you think presents the data more accurately and why? What makes the other graph misleading?'
Frequently Asked Questions
What is the difference between positive and negative correlation?
What does a 'strong' correlation look like on a scatter plot?
How do I draw a good line of best fit?
How can active learning help students understand bivariate data?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Statistics and Probability
Measures of Central Tendency (Mean, Median, Mode)
Students will calculate and interpret the mean, median, and mode for various data sets, understanding their strengths and weaknesses.
2 methodologies
Measures of Spread (Range, IQR)
Students will calculate and interpret the range and interquartile range (IQR) as measures of data spread.
2 methodologies
Five-Point Summary and Box Plots
Students will construct five-point summaries and draw box-and-whisker plots to visually represent and compare data distributions.
2 methodologies
Comparing Data Distributions
Students will compare the distributions of two or more data sets using measures of central tendency, spread, and appropriate graphical representations (e.g., back-to-back stem-and-leaf plots, parallel box plots).
2 methodologies
Interpreting Data Displays and Outliers
Students will interpret various data displays (histograms, box plots, stem-and-leaf plots) to describe data shape, identify outliers, and draw conclusions.
2 methodologies
Introduction to Probability
Students will review basic probability concepts, including sample space, events, and calculating theoretical probability.
2 methodologies