What is the difference between assessment for learning and assessment of learning?

Assessment of learning (summative) measures what students have achieved at the end of a unit or course: a board exam, a terminal examination, a report card. Assessment for learning (formative) happens during instruction and is designed to change what happens next. The goal is not to record a mark but to generate information that teachers and students act on immediately.

Does assessment for learning replace marks and grades?

No. AfL and summative grading serve different purposes and coexist in most classrooms. AfL generates moment-to-moment feedback that guides instruction; marks report achievement against a standard. Many schools use AfL extensively during a unit and then assign a single summative grade at the end, much as CBSE schools use Periodic Tests and subject enrichment activities alongside the end-of-year board examination. The two systems are complementary, not competing.

How much time does assessment for learning add to lesson planning?

Done well, AfL reduces reteaching time by catching misconceptions early, so the net cost is often negative. Exit tickets take two to three minutes to design and five minutes to review. Think-pair-share and cold-calling require no extra planning at all; they are embedded in delivery. The upfront investment is in writing clear learning intentions and success criteria, which good lesson planning already requires.

Can assessment for learning work in large classes?

Yes. India's classrooms often have 40–60 students, and whole-class techniques are well-suited to this scale. Mini-whiteboards or slates, thumbs-up/thumbs-down signals, traffic-light cards, and digital polling tools (Google Forms, Mentimeter, Quizizz) give teachers a rapid read on the whole room without individual marking. Gallery walks and chalk-talk produce written artifacts that can be scanned quickly for patterns.

What does the research say about the size of AfL's effect on learning?

Paul Black and Dylan Wiliam's 1998 review of 250 studies found effect sizes between 0.4 and 0.7 standard deviations, translating to roughly 2 months of additional learning per year. John Hattie's 2009 Visible Learning synthesis places formative evaluation at an effect size of 0.90, among the highest of any instructional intervention.

Assessment for Learning (AfL) - Teaching Wiki

Definition

Assessment for Learning (AfL) is the practice of using evidence of student understanding, gathered continuously during instruction, to adjust teaching and learning in real time. The purpose is not to measure or record; it is to generate actionable information that both teachers and students use to close the gap between current performance and the learning goal.

The Assessment Reform Group (1999) defined AfL as "the process of seeking and interpreting evidence for use by learners and their teachers to decide where the learners are in their learning, where they need to go, and how best to get there." Three questions sit at the core of that definition: Where is the learner now? Where are they going? What is the best next step? Every AfL strategy answers at least one of them.

AfL is closely related to formative assessment but carries a stronger emphasis on student agency. Where formative assessment can describe any low-stakes check, AfL specifically requires that the evidence collected be shared with and acted upon by students themselves, not just logged by the teacher.

In the Indian context, AfL connects directly to the continuous and comprehensive evaluation (CCE) framework that CBSE introduced and to the spirit behind NCERT's competency-based assessment guidelines, both of which emphasise using classroom evidence to support learning rather than merely sorting students.

Historical Context

The modern framework for AfL emerged from a 1998 review by Paul Black and Dylan Wiliam, then at King's College London. Their paper "Inside the Black Box: Raising Standards Through Classroom Assessment" synthesised 250 studies published between 1988 and 1997 and concluded that formative assessment, when implemented with high-quality feedback and student involvement, produced effect sizes of 0.4 to 0.7 standard deviations. That translated, in their words, to moving a student from the 50th percentile to roughly the 65th to 75th percentile.
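
The percentile figures can be checked from the effect sizes if test scores are assumed to be normally distributed. The following is a minimal Python sketch of that conversion; the function name is illustrative, and the normality assumption is a simplification introduced here, not a description of how Black and Wiliam computed their figures.

```python
from statistics import NormalDist


def percentile_after_gain(effect_size: float, start_percentile: float = 50.0) -> float:
    """Percentile reached after a gain of `effect_size` standard deviations.

    Assumes scores follow a normal distribution (an assumption made for
    this sketch only).
    """
    std_normal = NormalDist()  # mean 0, standard deviation 1
    start_z = std_normal.inv_cdf(start_percentile / 100)
    return 100 * std_normal.cdf(start_z + effect_size)


for d in (0.4, 0.7):
    print(f"effect size {d}: 50th -> {percentile_after_gain(d):.1f}th percentile")
# effect size 0.4: 50th -> 65.5th percentile
# effect size 0.7: 50th -> 75.8th percentile
```

Under that assumption, the 0.4 to 0.7 range lands at about the 66th to 76th percentile, in line with the rounded figures quoted above.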

The phrase "assessment for learning" was popularised by the Assessment Reform Group (ARG), a UK research consortium active from 1989 to 2010. Their 1999 publication "Assessment for Learning: Beyond the Black Box" named the concept, distinguished it from summative assessment, and set out ten principles that schools could use as a framework.

Black and Wiliam followed with "Working Inside the Black Box" (2002), which introduced specific, usable strategies: questioning techniques, feedback that moves learning forward, sharing learning goals with students, and peer and self-assessment. By 2004, AfL had been adopted as official policy in England, Scotland, and Wales, and had spread to Australia, New Zealand, Canada, and Scandinavia. In India, the National Curriculum Framework (NCF 2005) and later the National Education Policy (NEP 2020) both reflect AfL principles in their calls for reduced rote learning, competency-based progression, and holistic assessment.

The intellectual roots run deeper than the 1990s. Benjamin Bloom's mastery learning model (1968) showed that students who received formative checks and corrective feedback before moving to new material achieved at dramatically higher levels. Vygotsky's zone of proximal development (1978) provided a theoretical foundation: effective instruction must target the gap between what a learner can do independently and what they can do with support. AfL is, in practice, the mechanism for finding and closing that gap continuously.

Key Principles

Sharing Learning Intentions and Success Criteria

Students learn more effectively when they know what they are supposed to learn and what good work looks like. Sharing learning intentions ("By the end of this lesson, you will be able to...") is not the same as writing the chapter title on the board. Success criteria describe observable evidence of understanding: "You can explain why monsoon rainfall varies across India using at least two geographical factors." When students hold this standard themselves, they can monitor their own progress instead of guessing at the teacher's expectations.

In CBSE and NCERT-aligned classrooms, success criteria can be anchored to the competency indicators published in NCERT's learning outcomes documents, making the link between daily lessons and official standards visible to students.

Classroom Questioning That Generates Evidence

Low-order recall questions ("What is the capital of Maharashtra?") confirm memory but reveal nothing about understanding. AfL requires questions that surface reasoning: "Why do you think the Deccan Plateau experiences less rainfall than the Western Ghats?" Techniques such as wait time (minimum three seconds after posing a question), random cold-calling, and no-hands-up policies ensure that evidence comes from all students, not only those who volunteer. The goal is diagnostic data, not performance.

Feedback That Moves Learning Forward

Effective feedback within AfL tells students specifically what they have done well, what needs improving, and how to improve it. Kluger and DeNisi's (1996) meta-analysis of 607 effect sizes from 131 studies found that feedback directing attention to the task and the next step tended to raise performance, while feedback directing attention to the person (marks, praise, ego) often depressed it; more than a third of the feedback interventions they analysed lowered performance. Feedback in education functions as instruction, not evaluation. A comment such as "Your explanation of photosynthesis is accurate, but you haven't yet connected it to the role of chlorophyll. Add one sentence that makes that link" gives the student a concrete action.

Peer Assessment

Students who assess each other's work consolidate their own understanding of the success criteria while generating feedback for a classmate. Dylan Wiliam emphasises that peer assessment requires explicit training: students must learn to give specific, task-focused feedback rather than generic praise or criticism. In Indian classrooms, where students may be reluctant to critique a peer openly, structured peer assessment formats (written checklists, rating rubrics) reduce social friction and make the process feel fair.

Self-Assessment and Self-Regulation

Self-assessment is the highest-leverage component of AfL because it builds the habit of monitoring understanding independently of the teacher. Techniques include traffic-light self-rating (red: I don't understand; amber: I'm uncertain; green: I understand), reflective journals, and structured self-evaluation against success criteria. Over time, self-assessment develops metacognitive awareness: students who can accurately judge their own understanding are better positioned to regulate their own learning, a skill of particular value when preparing for high-stakes board examinations.

Classroom Application

Exit Tickets (Classes 1–12)

An exit ticket is a brief written response to a targeted question, completed in the last three to five minutes of class and submitted before students leave. A Class 9 science teacher might ask: "Draw and label the water cycle. Circle the stage you are least confident about." The teacher reviews the tickets before the next lesson, sorts them into three groups (solid understanding, partial understanding, misconception), and adjusts the next lesson's opening accordingly. Exit tickets cost almost no instructional time and provide more diagnostic information than unit tests because they arrive while correction is still possible.
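
For teachers who record exit tickets digitally, the three-group sort described above can be sketched in a few lines of Python. Everything in this sketch is hypothetical: the `ExitTicket` fields, the group labels, and the sample tickets are invented for illustration, and the group assigned to each ticket is still the teacher's own judgement.

```python
from collections import Counter
from dataclasses import dataclass

# The three review groups named in the text above.
GROUPS = ("solid", "partial", "misconception")


@dataclass
class ExitTicket:
    student: str                # hypothetical student label
    group: str                  # teacher's judgement: one of GROUPS
    least_confident_stage: str  # the stage the student circled


def triage(tickets):
    """Sort tickets into the three groups and tally the circled stages."""
    by_group = {group: [] for group in GROUPS}
    circled = Counter()
    for ticket in tickets:
        if ticket.group not in by_group:
            raise ValueError(f"unknown group: {ticket.group!r}")
        by_group[ticket.group].append(ticket.student)
        circled[ticket.least_confident_stage] += 1
    return by_group, circled


# Made-up tickets for illustration only.
tickets = [
    ExitTicket("Student A", "solid", "condensation"),
    ExitTicket("Student B", "partial", "transpiration"),
    ExitTicket("Student C", "misconception", "transpiration"),
    ExitTicket("Student D", "partial", "condensation"),
    ExitTicket("Student E", "partial", "transpiration"),
]

by_group, circled = triage(tickets)
for group in GROUPS:
    print(f"{group}: {len(by_group[group])} student(s)")
stage, count = circled.most_common(1)[0]
print(f"Reopen with: {stage} (circled by {count})")
```

The tally of circled stages suggests where the next lesson's opening might start; the group lists indicate which students may need a follow-up conversation.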

Think-Pair-Share as an AfL Engine

Think-pair-share is typically described as an engagement technique, but it functions as a formative assessment tool when used deliberately. During the "share" phase, the teacher listens not for correct answers but for the range of reasoning across the room. A Class 10 social science teacher running think-pair-share on the causes of the 1857 uprising will hear five to eight distinct explanations in four minutes, enough to know whether students are confusing immediate triggers with long-term causes, whether they are drawing on the NCERT chapter, and which pairs need direct intervention before moving forward.

Gallery Walk for Real-Time Diagnosis

A gallery walk posts student work or problem sets around the room and has students rotate to read, respond, and build on each other's thinking. For the teacher, it creates a distributed display of understanding that can be scanned in minutes. A Class 8 mathematics teacher who posts six different student approaches to the same linear equation can use the gallery walk to open a whole-class discussion about why three approaches work and three do not, without singling out individual students. This surfaces misconceptions at scale, in a low-stakes format.

Chalk-Talk for Written Formative Evidence

Chalk-talk is a silent, written discussion in which students respond to a central prompt posted on the board or chart paper. Because all contributions are visible, the teacher can read the room's collective understanding at a glance and add targeted follow-up questions directly on the paper. Unlike verbal discussion, chalk-talk produces a permanent artifact that can be photographed and reviewed. It works especially well in Indian classrooms where students may hesitate to speak aloud on topics they find difficult, and in classes where a few confident voices tend to dominate question-and-answer sessions.

Research Evidence

Black and Wiliam's foundational 1998 review established the evidence base: across 250 studies, classrooms that implemented formative assessment practices consistently outperformed control classrooms by 0.4 to 0.7 standard deviations. The review was notable for its scope and for drawing from diverse national contexts, grade levels, and subject areas.
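
An effect size of this kind is a difference in group means expressed in standard-deviation units. The Python sketch below shows one common way to compute it, Cohen's d with a pooled standard deviation. It is offered as an illustration of the arithmetic only: the studies in the review did not necessarily use this exact formula, and the scores are invented.

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(treatment, control):
    """Standardised mean difference using the pooled sample standard deviation."""
    n1, n2 = len(treatment), len(control)
    pooled_variance = (
        (n1 - 1) * variance(treatment) + (n2 - 1) * variance(control)
    ) / (n1 + n2 - 2)
    return (mean(treatment) - mean(control)) / sqrt(pooled_variance)


# Invented post-test scores, for illustration only.
afl_class = [2, 4, 6]
control_class = [1, 3, 5]
print(cohens_d(afl_class, control_class))  # 0.5
```

Here the two groups differ by one mark on average and share a standard deviation of two marks, so the effect size is 0.5 standard deviations.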

John Hattie's Visible Learning project (2009), a synthesis of over 800 meta-analyses covering 80 million students, ranks formative evaluation at an effect size of 0.90, well above the 0.40 threshold Hattie identifies as the "hinge point" for a year's expected growth. Feedback specifically scores 0.73. These are among the highest effect sizes Hattie reports, ahead of interventions such as technology integration, ability grouping, and extended school hours.

A 2011 study by Ruiz-Primo and Furtak (University of Colorado) observed middle school science teachers and coded their questioning behaviour against student learning outcomes on pre- and post-tests. Teachers who used informal formative assessment (eliciting student thinking, recognising the evidence, and using it to respond) produced significantly greater gains than those who did not, even controlling for prior student knowledge.

Research by Cowie and Bell (1999, published in Assessment in Education) distinguished planned from interactive AfL. Planned AfL involves deliberate instruments (exit tickets, pre-assessments). Interactive AfL happens spontaneously in dialogue, as when a teacher hears confusion in a student's question and adjusts mid-explanation. Both produce learning gains, but interactive AfL is harder to train and sustains itself only when teachers have deep content knowledge and strong relationships with students.

The honest limitation: much of the AfL research relies on teacher self-report or short-term outcome measures. Long-term retention studies are scarcer. Some meta-analyses conflate high-quality formative feedback with low-stakes quizzing, which inflates effect sizes. The evidence for AfL's core mechanisms is strong; the evidence for specific implementation protocols is more variable.

Common Misconceptions

Connection to Active Learning

AfL and active learning are mutually reinforcing systems. Active learning generates the observable evidence that AfL requires; AfL gives teachers a principled way to respond to what active learning reveals.

Think-pair-share exemplifies this relationship. The technique forces every student to construct a response before hearing the teacher's explanation, which surfaces prior knowledge and misconceptions that would otherwise remain invisible. A teacher who listens during the pair phase and selectively amplifies certain responses during share is practising interactive AfL, using the evidence to shape the direction of whole-class discussion in real time.

Chalk-talk produces a written record of collective thinking that functions as a formative artifact. Unlike a verbal discussion, the teacher can review the full range of student responses simultaneously, identify patterns in misunderstanding, and design a targeted follow-up sequence. The silence of chalk-talk also ensures that quieter students contribute evidence, which addresses a persistent weakness of verbal AfL techniques: they tend to favour confident, fast responders.

Gallery walks turn student work into publicly visible data. When students post their reasoning and peers annotate it, the teacher gains a distributed picture of class understanding without one-on-one conferencing. The resulting artifacts can inform not only the next lesson but also which students need small-group intervention and which are ready for extension.

At a deeper level, active learning and AfL share a common premise: students are not passive recipients of instruction but active constructors of understanding. AfL makes that construction visible; active learning creates the conditions in which it happens.

For further reading on the feedback dimension of AfL, see Feedback in Education. For the student-facing component, see Self-Assessment.

Sources

Black, P., & Wiliam, D. (1998). Assessment and classroom learning. Assessment in Education: Principles, Policy & Practice, 5(1), 7–74.
Black, P., & Wiliam, D. (1998). Inside the Black Box: Raising Standards Through Classroom Assessment. King's College London School of Education.
Assessment Reform Group. (1999). Assessment for Learning: Beyond the Black Box. University of Cambridge School of Education.
Hattie, J. (2009). Visible Learning: A Synthesis of Over 800 Meta-Analyses Relating to Achievement. Routledge.
