Skip to content
Computer Science · Grade 9 · Data and Digital Representation · Term 2

Introduction to Data Analysis

Students will explore basic techniques for analyzing data to identify trends, patterns, and insights.

Ontario Curriculum ExpectationsCS.HS.DA.4CS.HS.S.2

About This Topic

Compression and storage are about the physical and logical limits of digital information. In the Ontario Grade 9 curriculum, students learn how we can shrink large files to save space and speed up transmission over the internet. This involves understanding the difference between lossless compression (where no data is lost) and lossy compression (where some quality is sacrificed for a smaller size).

This topic is vital for the Computer Environments and Systems strand, as it explains why some images look 'pixelated' or why streaming video sometimes buffers. It also touches on the environmental impact of data centers, which require massive amounts of energy to store the world's growing data. Students grasp this concept faster through hands-on modeling where they attempt to 'compress' a physical message using a shared codebook.

Key Questions

  1. Explain the purpose of data analysis in decision-making.
  2. Analyze simple datasets to identify key observations.
  3. Differentiate between qualitative and quantitative data analysis approaches.

Learning Objectives

  • Explain the purpose of data analysis in informing decisions for businesses or research.
  • Analyze a provided dataset to identify at least two key trends or patterns.
  • Differentiate between qualitative and quantitative data analysis methods by providing an example of each.
  • Calculate basic statistical measures such as mean, median, or mode from a simple dataset.

Before You Start

Introduction to Data and Digital Representation

Why: Students need a foundational understanding of what data is and how it is represented digitally before they can analyze it.

Basic Spreadsheet Operations

Why: Familiarity with organizing data in rows and columns using tools like Google Sheets or Microsoft Excel is helpful for hands-on analysis activities.

Key Vocabulary

Data AnalysisThe process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
Quantitative DataNumerical data that can be measured or counted, such as age, temperature, or the number of website visitors.
Qualitative DataDescriptive data that cannot be measured numerically, often gathered through observations, interviews, or open-ended survey questions, such as customer feedback or interview transcripts.
TrendA general direction in which something is developing or changing over time, often visualized in charts or graphs.
PatternA discernible regularity or sequence in data that can help in understanding relationships or making predictions.

Watch Out for These Misconceptions

Common MisconceptionLossy compression is always bad.

What to Teach Instead

Lossy compression is essential for the modern internet; without it, streaming video would be impossible. Peer comparisons of audio files help students realize that the human ear often can't tell the difference at certain levels.

Common MisconceptionCompressing a file multiple times will keep making it smaller.

What to Teach Instead

There is a limit to how much data can be removed. Hands-on activities where students try to 're-compress' an already compressed message help them see that eventually, there is no more redundancy to remove.

Active Learning Ideas

See all activities

Real-World Connections

  • Retail companies like Loblaws analyze sales data to identify popular products, optimize inventory, and plan seasonal promotions, influencing what products are stocked and where they are placed in stores.
  • Public health officials analyze disease outbreak data to track the spread of illnesses like influenza, identify high-risk populations, and allocate resources for vaccination campaigns or public health advisories.
  • Sports analysts examine player statistics and game outcomes to identify successful strategies, evaluate player performance, and inform coaching decisions for teams in leagues like the NHL.

Assessment Ideas

Exit Ticket

Provide students with a small table of data (e.g., student scores on a quiz). Ask them to: 1. Calculate the mean score. 2. Identify one trend or pattern they observe in the data. 3. State whether the data is primarily quantitative or qualitative.

Discussion Prompt

Pose the question: 'Imagine you are analyzing customer feedback for a new video game. What types of questions would you ask to gather quantitative data, and what types of questions would yield qualitative data? How might analyzing both types of data help the game developers?'

Quick Check

Present students with a scenario, such as a city council wanting to understand traffic patterns. Ask them to identify: 1. What is the primary goal of the data analysis? 2. What kind of data (quantitative or qualitative) might be useful? 3. What is one specific piece of information they hope to gain from the analysis?

Frequently Asked Questions

What is the difference between lossy and lossless compression?
Lossless compression reduces file size without losing any original data (like a .zip file). Lossy compression removes 'unnecessary' data to achieve much smaller sizes, but the quality is slightly lowered (like a .jpg or .mp3).
Why do we need to compress files?
Compression saves storage space on our devices and allows data to travel faster over the internet. This is especially important for people in rural or northern Canadian communities with limited bandwidth.
How can active learning help students understand compression?
Active learning turns a mathematical concept into a game of patterns. By asking students to find and replace repeating patterns in text or images, they discover the logic of compression algorithms (like Run-Length Encoding) for themselves.
Does compression affect the environment?
Yes. Smaller files require less energy to transmit and store. By using efficient compression, we can reduce the carbon footprint of the massive data centers that power the internet.