What is the difference between assessment for learning and assessment of learning?

Assessment of learning (summative) measures what students have achieved at the end of a unit or course, a final exam, a grade, a report card. Assessment for learning (formative) happens during instruction and is designed to change what happens next. The goal is not to record a score but to generate information that teachers and students act on immediately.

Does assessment for learning replace grades?

No. AfL and summative grading serve different purposes and coexist in most classrooms. AfL generates moment-to-moment feedback that guides instruction; grades report achievement against a standard. Many schools use AfL extensively during a unit and then assign a single summative grade at the end. The two systems are complementary, not competing.

How much time does assessment for learning add to lesson planning?

Done well, AfL reduces reteaching time by catching misconceptions early, so the net cost is often negative. Exit tickets take two to three minutes to design and five minutes to review. Think-pair-share and cold-calling require no extra planning at all, they are embedded in delivery. The upfront investment is in writing clear learning intentions and success criteria, which good lesson planning already requires.

Can assessment for learning work in large classes?

Yes. Whole-class techniques such as mini-whiteboards, thumbs-up/thumbs-down signals, traffic-light cards, and digital polling tools (Mentimeter, Kahoot, Google Forms) give teachers a rapid read on the whole room without individual marking. Gallery walks and chalk-talk produce written artifacts that can be scanned quickly for patterns.

What does the research say about the size of AfL's effect on learning?

Paul Black and Dylan Wiliam's 1998 review of 250 studies found effect sizes between 0.4 and 0.7 standard deviations, translating to roughly 2 months of additional learning per year. John Hattie's 2009 Visible Learning synthesis places formative evaluation at an effect size of 0.90, among the highest of any instructional intervention.

Assessment for Learning (AfL) - Teaching Wiki

Definition

Assessment for Learning (AfL) is the practice of using evidence of student understanding, gathered continuously during instruction, to adjust teaching and learning in real time. The purpose is not to measure or record , it is to generate actionable information that both teachers and students use to close the gap between current performance and the learning goal.

The Assessment Reform Group (1999) defined AfL as "the process of seeking and interpreting evidence for use by learners and their teachers to decide where the learners are in their learning, where they need to go, and how best to get there." Three questions sit at the core of that definition: Where is the learner now? Where are they going? What is the best next step? Every AfL strategy answers at least one of them.

AfL is closely related to formative assessment but carries a stronger emphasis on student agency. Where formative assessment can describe any low-stakes check, AfL specifically requires that the evidence collected be shared with and acted upon by students themselves, not just logged by the teacher.

Historical Context

The modern framework for AfL emerged from a 1998 review by Paul Black and Dylan Wiliam, then at King's College London. Their paper "Inside the Black Box: Raising Standards Through Classroom Assessment" synthesized 250 studies published between 1988 and 1997 and concluded that formative assessment , when implemented with high-quality feedback and student involvement, produced effect sizes of 0.4 to 0.7 standard deviations. That translated, in their words, to moving a student from the 50th percentile to roughly the 65th to 75th percentile.

The phrase "assessment for learning" was popularized by the Assessment Reform Group (ARG), a UK research consortium active from 1989 to 2010 that included Black, Wiliam, Mary James, and Bethan Marshall, among others. Their 1999 publication "Assessment for Learning: Beyond the Black Box" named the concept, distinguished it from summative assessment, and set out ten principles that schools could use as a framework.

Black and Wiliam followed with "Assessment and Classroom Learning" (1998, Assessment in Education) and the practitioner-facing "Working Inside the Black Box" (2002), which introduced specific, usable strategies: questioning techniques, feedback that moves learning forward, sharing learning goals with students, and peer and self-assessment. By 2004, AfL had been adopted as official policy in England, Scotland, and Wales, and had spread to Australia, New Zealand, Canada, and Scandinavia.

The intellectual roots run deeper than the 1990s. Benjamin Bloom's mastery learning model (1968) showed that students who received formative checks and corrective feedback before moving to new material achieved at dramatically higher levels. Vygotsky's zone of proximal development (1978) provided a theoretical foundation: effective instruction must target the gap between what a learner can do independently and what they can do with support. AfL is, in practice, the mechanism for finding and closing that gap continuously.

Key Principles

Sharing Learning Intentions and Success Criteria

Students learn more effectively when they know what they are supposed to learn and what good work looks like. Sharing learning intentions ("By the end of this lesson, you will be able to...") is not the same as announcing an activity. Success criteria describe observable evidence of understanding: "You can explain why the water cycle moves faster in tropical regions using at least two weather variables." When students hold this standard themselves, they can monitor their own progress instead of guessing at the teacher's expectations.

Classroom Questioning That Generates Evidence

Low-order recall questions ("What year did the French Revolution begin?") confirm memory but reveal nothing about understanding. AfL requires questions that surface reasoning: "Why do you think the Third Estate was more volatile than the Second?" Techniques such as wait time (minimum three seconds after posing a question), random cold-calling, and no-hands-up policies ensure that evidence comes from all students, not only those who volunteer. The goal is diagnostic data, not performance.

Feedback That Moves Learning Forward

Effective feedback within AfL tells students specifically what they have done well, what needs improving, and how to improve it. Research by Kluger and DeNisi (1996), covering 2,500 experiments, found that feedback focused on the task and the next step consistently raised performance; feedback focused on the person (grades, praise, ego) often depressed it. Feedback in education functions as instruction, not evaluation. Comments such as "Your argument is clear, but you've only cited one source , find one more that supports your claim from a different angle" give the student a concrete action.

Peer Assessment

Students who assess each other's work consolidate their own understanding of the success criteria while generating feedback for a classmate. Dylan Wiliam emphasizes that peer assessment requires explicit training, students must learn to give specific, task-focused feedback rather than generic praise or criticism. The social dimension has an additional benefit: feedback from a peer is often received more readily than feedback from a teacher, because it carries less evaluative weight.

Self-Assessment and Self-Regulation

Self-assessment is the highest-leverage component of AfL because it builds the habit of monitoring understanding independently of the teacher. Techniques include traffic-light self-rating (red: I don't understand; amber: I'm uncertain; green: I understand), reflective journals, and structured self-evaluation against success criteria. Over time, self-assessment develops metacognitive awareness, students who can accurately judge their own understanding are better positioned to regulate their own learning.

Classroom Application

Exit Tickets (All Grade Levels)

An exit ticket is a brief written response to a targeted question, completed in the last three to five minutes of class and submitted before students leave. A science teacher might ask: "Draw and label the water cycle. Circle the stage you are least confident about." The teacher reviews the tickets before the next lesson, sorts them into three piles (solid understanding, partial understanding, misconception), and adjusts the next lesson's opening accordingly. Exit tickets cost almost no instructional time and provide more diagnostic information than end-of-unit tests because they arrive while correction is still possible.

Think-pair-share is typically described as an engagement technique, but it functions as a formative assessment tool when used deliberately. During the "share" phase, the teacher listens not for correct answers but for the range of reasoning across the room. A history teacher running think-pair-share on the causes of World War I will hear five to eight distinct explanations in four minutes , enough to know whether students are confusing proximate and structural causes, whether they are drawing on the sources assigned, and which pairs need direct intervention before moving forward.

Gallery Walk for Real-Time Diagnosis

A gallery walk posts student work or problem sets around the room and has students rotate to read, respond, and build on each other's thinking. For the teacher, it creates a distributed display of understanding that can be scanned in minutes. A mathematics teacher who posts six different student approaches to the same algebra problem can use the gallery walk to open a whole-class discussion about why three approaches work and three do not, without singling out individual students. This surfaces misconceptions at scale, in a low-stakes format.

Chalk-Talk for Written Formative Evidence

Chalk-talk is a silent, written discussion in which students respond to a central prompt posted on chart paper or a whiteboard. Because all contributions are visible, the teacher can read the room's collective understanding at a glance and add targeted follow-up questions directly onto the paper. Unlike verbal discussion, chalk-talk produces a permanent artifact that can be photographed and reviewed. It works especially well with topics where students hesitate to speak aloud (controversial issues, areas where they fear being wrong) and with classes where a few dominant voices tend to crowd out quieter students.

Research Evidence

Black and Wiliam's foundational 1998 review established the evidence base: across 250 studies, classrooms that implemented formative assessment practices consistently outperformed control classrooms by 0.4 to 0.7 standard deviations. The review was notable for its scope and for drawing from diverse national contexts, grade levels, and subject areas.

John Hattie's Visible Learning project (2009), a meta-analysis of over 800 meta-analyses covering 80 million students, ranks formative evaluation at an effect size of 0.90 , well above the 0.40 threshold Hattie identifies as "hinge point" for a year's expected growth. Feedback specifically scores 0.73. These are among the highest effect sizes of any instructional intervention, including technology integration, ability grouping, and extended school hours.

A 2011 study by Ruiz-Primo and Furtak (University of Colorado) observed middle school science teachers and coded their questioning behavior against student learning outcomes on pre- and post-tests. Teachers who used informal formative assessment, eliciting student thinking, recognizing the evidence, and using it to respond, produced significantly greater gains than those who did not, even controlling for prior student knowledge.

Research by Cowie and Bell (1999, published in Assessment in Education) distinguished planned from interactive AfL. Planned AfL involves deliberate instruments (exit tickets, pre-assessments). Interactive AfL happens spontaneously in dialogue, a teacher hearing confusion in a student's question and adjusting mid-explanation. Both produce learning gains, but interactive AfL is harder to train and sustains itself only when teachers have deep content knowledge and strong relationships with students.

The honest limitation: much of the AfL research relies on teacher self-report or short-term outcome measures. Long-term retention studies are scarcer. Some meta-analyses conflate high-quality formative feedback with low-stakes quizzing, which inflates effect sizes. The evidence for AfL's core mechanisms is strong; the evidence for specific implementation protocols is more variable.

Common Misconceptions

Connection to Active Learning

AfL and active learning are mutually reinforcing systems. Active learning generates the observable evidence that AfL requires; AfL gives teachers a principled way to respond to what active learning reveals.

Think-pair-share exemplifies this relationship. The technique forces every student to construct a response before hearing the teacher's explanation, which surfaces prior knowledge and misconceptions that would otherwise remain invisible. A teacher who listens during the pair phase and selectively amplifies certain responses during share is practicing interactive AfL , using the evidence to shape the direction of whole-class discussion in real time.

Chalk-talk produces a written record of collective thinking that functions as a formative artifact. Unlike a verbal discussion, the teacher can review the full range of student responses simultaneously, identify patterns in misunderstanding, and design a targeted follow-up sequence. The silence of chalk-talk also ensures that the quietest students in the room contribute evidence, a persistent problem with verbal AfL techniques that favor confident, fast responders.

Gallery walks turn student work into publicly visible data. When students post their reasoning and peers annotate it, the teacher gains a distributed picture of class understanding without one-on-one conferencing. The resulting artifacts can inform not only the next lesson but also which students need small-group intervention and which are ready for extension.

At a deeper level, active learning and AfL share a common premise: students are not passive recipients of instruction but active constructors of understanding. AfL makes that construction visible; active learning creates the conditions in which it happens.

For further reading on the feedback dimension of AfL, see Feedback in Education. For the student-facing component, see Self-Assessment.

Sources

Black, P., & Wiliam, D. (1998). Assessment and classroom learning. Assessment in Education: Principles, Policy & Practice, 5(1), 7–74.
Black, P., & Wiliam, D. (1998). Inside the Black Box: Raising Standards Through Classroom Assessment. King's College London School of Education.
Assessment Reform Group. (1999). Assessment for Learning: Beyond the Black Box. University of Cambridge School of Education.
Hattie, J. (2009). Visible Learning: A Synthesis of Over 800 Meta-Analyses Relating to Achievement. Routledge.

Assessment for Learning (AfL)

Definition

Historical Context

Key Principles

Sharing Learning Intentions and Success Criteria

Classroom Questioning That Generates Evidence

Feedback That Moves Learning Forward

Peer Assessment

Self-Assessment and Self-Regulation

Classroom Application

Exit Tickets (All Grade Levels)

Gallery Walk for Real-Time Diagnosis

Chalk-Talk for Written Formative Evidence

Research Evidence

Common Misconceptions

Connection to Active Learning

Sources

Frequently Asked Questions

Related Concepts

Related Articles

What is Formative Assessment? A Guide to Real-Time Student Growth

25+ Formative Assessment Strategies to Transform K-12 Student Learning

50+ Creative Exit Ticket Ideas: Formative Assessment Strategies for K-12

Related Methodologies

Think-Pair-Share

Chalk Talk

Gallery Walk

Definition

Historical Context

Key Principles

Sharing Learning Intentions and Success Criteria

Classroom Questioning That Generates Evidence

Feedback That Moves Learning Forward

Peer Assessment

Self-Assessment and Self-Regulation

Classroom Application

Exit Tickets (All Grade Levels)

Think-Pair-Share as an AfL Engine

Gallery Walk for Real-Time Diagnosis

Chalk-Talk for Written Formative Evidence

Research Evidence

Common Misconceptions

Connection to Active Learning

Sources

Frequently Asked Questions

Related Concepts

Related Articles

What is Formative Assessment? A Guide to Real-Time Student Growth

25+ Formative Assessment Strategies to Transform K-12 Student Learning

50+ Creative Exit Ticket Ideas: Formative Assessment Strategies for K-12

Related Methodologies

Think-Pair-Share

Chalk Talk

Gallery Walk