Skip to content
Computing · Year 9 · Computer Systems and Architecture · Spring Term

Representing Characters (ASCII/Unicode)

Students will understand how characters are encoded using ASCII and Unicode, and the need for different standards.

National Curriculum Attainment TargetsKS3: Computing - Data RepresentationKS3: Computing - Binary and Digitisation

About This Topic

Students explore how computers store and display text through character encoding standards like ASCII and Unicode. ASCII, a 7-bit system, assigns binary codes to 128 characters, primarily for English letters, numbers, and symbols. This foundation helps Year 9 pupils grasp that all data, including text, converts to binary for processing and storage in computer systems.

Unicode addresses ASCII's limitations by supporting over a million characters across languages, emojis, and symbols using variable-length encoding like UTF-8. Pupils compare how ASCII fails with non-English text, causing mojibake or display errors, while Unicode ensures consistent representation worldwide. This topic aligns with KS3 data representation, building skills in binary digitisation and understanding architecture dependencies.

Hands-on activities benefit this abstract topic by turning binary patterns into visible characters. When students manually convert text to ASCII binary or test Unicode in different files, they see encoding in action, connect theory to real software behaviour, and debug issues collaboratively.

Key Questions

  1. Explain the necessity of character encoding standards like ASCII and Unicode.
  2. Compare the limitations of ASCII with the advantages of Unicode.
  3. Analyze how different character encodings can lead to display issues if not handled correctly.

Learning Objectives

  • Explain the fundamental reason for character encoding standards in computing.
  • Compare and contrast the character set limitations of ASCII with the expanded capabilities of Unicode.
  • Analyze how mismatched character encodings can cause data corruption and display errors.
  • Identify the specific binary representations for common English characters using ASCII.
  • Demonstrate how a single character can have multiple representations in different Unicode encodings (e.g., UTF-8, UTF-16).

Before You Start

Introduction to Binary Numbers

Why: Students need a foundational understanding of how decimal numbers are converted to and from binary to grasp character encoding.

Computer Data Storage

Why: Understanding that computers store all information, including text, as binary digits is essential before exploring how characters are represented.

Key Vocabulary

Character EncodingA system that assigns a unique numerical value to each character, allowing computers to store and process text data.
ASCIIAn early character encoding standard using 7 bits to represent 128 characters, primarily for English letters, numbers, and basic symbols.
UnicodeA universal character encoding standard designed to represent characters from virtually all writing systems, emojis, and symbols, using variable-length encoding.
UTF-8A common variable-width Unicode encoding that uses 1 to 4 bytes per character, widely used on the internet for its efficiency and compatibility.
MojibakeGarbled text resulting from incorrect character encoding or decoding, where characters are displayed as a sequence of unrelated symbols.

Watch Out for These Misconceptions

Common MisconceptionASCII handles all characters universally.

What to Teach Instead

ASCII limits to 128 basic symbols, excluding accents or scripts. Hands-on file conversions reveal garbled outputs for other languages, prompting students to explore Unicode solutions through peer debugging sessions.

Common MisconceptionUnicode stores characters as pictures or images.

What to Teach Instead

Unicode uses binary codes like ASCII, not images. Activity with hex editors shows code patterns for emojis, helping students discard visual misconceptions via direct binary inspection.

Common MisconceptionCharacter encoding does not affect binary storage.

What to Teach Instead

All text becomes binary via encoding. Encoding challenges where groups flip bits and observe text changes clarify this, building accurate mental models through experimentation.

Active Learning Ideas

See all activities

Real-World Connections

  • International software development teams must agree on Unicode (often UTF-8) to ensure applications display text correctly across different languages, from French accents in Paris to Japanese Kanji in Tokyo.
  • Web developers use UTF-8 encoding for websites to ensure that characters like the Euro symbol (€) or emojis (😊) render consistently for users worldwide, regardless of their operating system or browser.
  • Archivists digitizing historical documents from various cultures must carefully select appropriate Unicode encodings to preserve the original characters accurately, preventing data loss or misinterpretation over time.

Assessment Ideas

Exit Ticket

Provide students with a short sentence containing both English letters and a common symbol (e.g., 'Year 9 is cool!'). Ask them to: 1. Write the ASCII binary code for the first three letters. 2. Explain why Unicode would be necessary if the sentence included a character like 'é'.

Quick Check

Display a short text file that has been saved with an incorrect encoding (e.g., a UTF-8 file opened as Latin-1). Ask students: 'What do you see happening to the text? What is the likely cause of this display issue?'

Discussion Prompt

Pose this question to small groups: 'Imagine you are designing a new global messaging app. What character encoding standard would you choose, and why? What potential problems might you still encounter, even with your chosen standard?'

Frequently Asked Questions

How do I explain ASCII and Unicode to Year 9 students?
Start with everyday text on screens, then show binary charts for 'A' as 01000001 in ASCII. Demonstrate Unicode by typing emojis and checking file properties. Use simple comparisons: ASCII for English basics, Unicode for global use. Relate to real apps like messaging fails with wrong encoding to keep it relevant.
What causes text display problems in computing?
Mismatches between encoding standards, like saving in UTF-8 but opening as ASCII, produce mojibake. Teach by having students create and swap files intentionally mismatched. This reveals how computers interpret binary differently, stressing consistent standards in data handling across systems.
How can active learning help teach character encoding?
Active tasks like manual ASCII conversion or testing Unicode files make binary abstractness concrete. Pairs encoding names see 7-bit limits firsthand, while group mojibake hunts foster debugging skills. These build confidence in data representation, as students experience errors and fixes collaboratively, deepening KS3 understanding.
Why compare ASCII limitations with Unicode advantages?
ASCII's 128 characters suit basic English but fail globally, while Unicode's vast set supports diversity. Classroom demos with non-English text garbling in ASCII highlight needs. Pupils analyze via charts, preparing for architecture units where encoding impacts storage and transmission efficiency.