Definition
Teaching strategies are the purposeful instructional techniques a teacher selects and applies to help students acquire, process, and retain knowledge. They are the concrete, moment-to-moment decisions about how content is presented, practiced, and assessed inside a classroom. A strategy is not a philosophy or a curriculum; it is a repeatable action with a specific learning function, whether that function is to activate prior knowledge, build conceptual understanding, check for comprehension, or promote transfer.
The term encompasses a broad range: explicit explanation, collaborative discussion, retrieval practice, graphic organizers, peer teaching, worked examples, and dozens more. What separates a strategy from a generic classroom activity is intentionality. Teachers who deploy strategies deliberately know what cognitive work the technique is designed to produce and can articulate why they chose it for this particular student, content, and moment.
Effective teaching is not the performance of charisma. It is the disciplined application of strategies matched to learning goals, adjusted in real time based on evidence of student understanding. The field of educational psychology has spent a century identifying which techniques reliably accelerate learning and which are popular but ineffective. The gap between those two categories is larger than most practitioners expect.
Historical Context
The systematic study of teaching strategies as a distinct field began in earnest with Edward Thorndike's early twentieth-century work on learning transfer. Thorndike's 1923 publication The Psychology of Arithmetic was among the first attempts to derive classroom technique directly from laboratory findings on how humans learn, rather than from tradition or intuition.
The mid-twentieth century brought two divergent traditions. Behaviorist researchers, particularly B.F. Skinner, developed programmed instruction and mastery learning frameworks in the 1950s and 1960s, which gave teachers tools for sequencing practice and immediate feedback. Meanwhile, cognitive psychologists following Jerome Bruner argued in The Process of Education (1960) that discovery learning and inductive reasoning were more durable approaches, particularly for developing conceptual understanding.
Lev Vygotsky's work, translated and widely disseminated in the West from the 1970s onward, shifted the conversation again. His concept of the Zone of Proximal Development established that strategies are most powerful when calibrated to what a learner can nearly but not yet do independently. This framing made explicit instruction, collaborative work, and gradual release not competing philosophies but sequential stages in a coherent instructional arc.
The most comprehensive synthesis of strategy effectiveness came from John Hattie's Visible Learning (2009), which aggregated findings from more than 800 meta-analyses covering 80 million students. Hattie ranked 138 educational influences by effect size, giving practitioners the first large-scale evidence base for comparing strategies against each other, rather than against no instruction at all. Hattie's work, alongside contemporaneous research by Robert Marzano and colleagues at McREL, established the modern vocabulary of evidence-based teaching strategies.
Key Principles
Alignment to Learning Goals
A strategy's value is always relative to the intended outcome. Retrieval practice is the right tool for consolidating factual knowledge; Socratic discussion is the right tool for building analytical reasoning. Selecting a strategy because it worked in a previous lesson, or because it is engaging, without checking whether it serves the current goal, reliably produces busy students who are not learning.
Grant Wiggins and Jay McTighe's Understanding by Design framework (1998) formalized this principle as "backward design": identify the learning goal first, determine what evidence of learning looks like, then select instructional strategies that produce that evidence. This sequence resists the tendency to teach activities rather than outcomes.
Cognitive Load Management
Strategies must account for the limits of working memory. John Sweller's cognitive load theory, developed through the 1980s and 1990s, demonstrated that working memory can hold approximately four chunks of information simultaneously before performance degrades. Strategies that introduce too many new elements at once, or that use unnecessarily complex formats, consume cognitive resources that should go toward learning.
Practical implications include using worked examples before problem-solving (which reduces extraneous load), chunking complex content into smaller units, and ensuring that collaborative strategies like jigsaws or station rotations have clear, simple enough mechanics that the procedure itself does not compete with the content.
Feedback Loops
No strategy functions well without feedback, for the teacher or the student. John Hattie and Helen Timperley's 2007 synthesis in Review of Educational Research identified feedback as the single highest-leverage instructional variable, with an average effect size of 0.79. Feedback operates in both directions: students need to know where they are relative to the goal, and teachers need to know whether the strategy is producing understanding.
Strategies that build in natural feedback moments, exit tickets, cold calling, whiteboards, think-alouds, peer review, are more effective not because of the technique itself but because they generate information that enables immediate adjustment.
Gradual Release of Responsibility
Douglas Fisher and Nancy Frey's Gradual Release of Responsibility model (2008) captures a principle embedded in most effective instructional sequences: move from teacher-modeled to collaboratively practiced to independently applied. Strategies deployed in isolation, without this sequence, tend to either scaffold too heavily (students succeed but cannot transfer) or release too quickly (students fail and disengage).
The model does not prescribe a rigid order. Some lessons begin with student exploration before explicit instruction. What matters is that the teacher consciously tracks who currently holds the cognitive responsibility and adjusts accordingly.
Metacognitive Awareness
Research by Ann Brown (1987) and subsequent investigators established that students who are explicitly taught to monitor their own understanding, plan their approaches to tasks, and evaluate their own work substantially outperform those who receive only content instruction. Strategies that build metacognitive awareness, such as self-explanation prompts, learning journals, and structured reflection, add durable value beyond the immediate lesson.
Classroom Application
Elementary: Retrieval Practice Through Low-Stakes Quizzing
A third-grade teacher teaching multiplication facts uses retrieval practice as a daily five-minute warm-up. Students write answers to a set of 15 problems from memory, then immediately check against an answer key. The teacher tracks class-wide error patterns by scanning completed sheets in two minutes. Problems that produce widespread errors appear again the next day; mastered facts cycle out.
This approach is directly supported by Roediger and Karpicke's 2006 research in Psychological Science demonstrating that retrieval practice produces stronger long-term retention than re-studying the same material. The low stakes (no grade attached) keep anxiety from interfering with performance.
Middle School: Think-Pair-Share for Conceptual Discussion
In a seventh-grade history class exploring the causes of World War I, the teacher poses a complex prompt: "Which single factor was most responsible for escalating the assassination of Franz Ferdinand into a global war?" Students have 90 seconds to write their position before pairing with a neighbor to compare reasoning. The teacher then cold-calls three pairs, pressing for evidence, not opinion.
Think-pair-share functions here not as an engagement technique but as a thinking structure. The individual writing step forces every student to commit to a position before social influence shapes it. The discussion surface is now populated by genuine cognitive diversity rather than the responses of the three fastest hands.
High School: Jigsaw for Complex Texts
An eleventh-grade English teacher assigns four critical essays on Beloved by Toni Morrison. Rather than assigning all four to all students, she uses the jigsaw method: four home groups of four students each, where each student becomes the "expert" on one essay. Students first meet in expert groups, discussing and annotating their assigned text in depth. They then return to their home groups and teach the essay to their peers.
The instructional logic is compression. Teaching a text to a peer requires a fundamentally different kind of understanding than reading it. Students in jigsaw conditions are forced into the highest levels of Bloom's taxonomy, synthesis and evaluation, to fulfill their expert role.
Mixed Grades: Learning Stations for Differentiated Practice
Station rotations, or stations, allow a teacher to deliver three different levels of practice simultaneously while running a targeted small-group intervention at a fourth station. A sixth-grade math teacher designs three practice stations (procedural fluency, conceptual application, and extension problem-solving) and pulls a rotating group of four students for direct re-teaching while the rest rotate.
This strategy operationalizes differentiated instruction without requiring three separate lesson plans. The differentiation lives inside the station design.
Research Evidence
John Hattie's Visible Learning (2009) established a reference point for comparing strategies by effect size against an average of 0.40 (roughly one year of standard schooling). Strategies that consistently exceed this threshold include classroom discussion (0.82), feedback (0.79), spaced practice (0.71), and metacognitive strategies (0.69). Strategies that consistently underperform include learning styles matching (0.17) and discovery learning without structure (0.21).
Robert Marzano, Debra Pickering, and Jane Pollock's Classroom Instruction That Works (2001) synthesized McREL's database of instructional research to identify nine high-yield categories, including similarities and differences, summarizing, nonlinguistic representations, and cooperative learning. Their analysis found effect sizes for cooperative learning averaging 0.73 across multiple studies.
Peter Liljedahl's large-scale observational research, published in Building Thinking Classrooms (2020), examined 400 teachers over 15 years and found that most common classroom practices, including taking notes from a board, sitting in rows, and beginning lessons with a review, consistently produce "non-thinking" student behavior. His most effective substitution was vertical non-permanent surfaces (whiteboards on walls) paired with visibly random group formation, which consistently increased on-task, high-cognitive-demand behavior within two weeks.
Limitations exist. Most strategy research measures short-term outcomes in controlled conditions. Effect sizes from meta-analyses aggregate across diverse study designs and populations, which can obscure whether a strategy works for a specific context. Hattie himself has been criticized by statisticians including Arne Kohn and John O'Neil for methodological choices in his aggregation approach. Teachers are right to treat effect size rankings as a starting point for professional judgment, not a substitution for it.
Common Misconceptions
Misconception: More strategies in a lesson means more engagement and better learning. Strategy variety serves a purpose when different techniques address different phases of the learning cycle. Stacking multiple strategies to maintain novelty, a common interpretation of "student engagement," fragments cognitive work and increases the extraneous load students carry. A lesson with two well-sequenced strategies typically outperforms a lesson with five loosely connected ones. Engagement is a product of appropriate challenge and clear purpose, not activity turnover.
Misconception: Effective strategies work the same way for all learners. The "learning styles" hypothesis, the idea that students are visual, auditory, or kinesthetic learners and should be taught accordingly, has been tested repeatedly and consistently failed to show that matching instruction to a preferred style improves outcomes. (Pashler et al., Psychological Science in the Public Interest, 2008). The research-supported alternative is differentiated instruction based on readiness, interest, and learning profile, which is meaningfully different from style-matching. Multiple representations of content (visual, verbal, physical) benefit all learners, not because of their "type" but because dual coding and retrieval in multiple formats strengthens memory for everyone.
Misconception: A strategy that worked last year will work the same way this year. Strategies interact with the specific group dynamic, prior knowledge distribution, and classroom culture of a given class. A discussion protocol that produced rigorous analysis with one cohort can produce superficial compliance with another if the norms are not explicitly built. Teachers who treat strategies as fixed recipes rather than adaptable frameworks often become frustrated when reliable tools stop producing results. The strategy is a starting structure; the teacher's real-time reading of the room determines whether it functions.
Connection to Active Learning
Teaching strategies sit at the operational core of active learning. Active learning, as defined by Bonwell and Eison (1991) in their landmark ASHE-ERIC report, requires students to do more than passively receive information; they must talk, write, read, reflect, or solve problems in ways that make their thinking visible. Strategies are the specific mechanisms that produce that activity.
Think-pair-share is perhaps the most extensively documented active learning strategy in classroom research. Its structure, individual think time, paired dialogue, shared synthesis, mirrors the processing cycle that cognitive psychology associates with deep encoding. It is particularly powerful for conceptual content where diverse perspectives expose the inadequacy of surface-level answers.
Jigsaw, developed by Elliot Aronson at the University of Texas in 1971, addresses a structural problem with passive learning at scale: in a class of 30, most students cannot get enough cognitive practice with complex material in a single period. By making each student responsible for a piece of content that others need, jigsaw guarantees that every student engages at a depth that passive reading does not produce.
Stations connect directly to scaffolding by allowing the teacher to engineer different levels of support into the physical environment. A student who needs concrete manipulatives works at a station equipped with them; a student ready for abstract application moves to a station designed for that level. The teacher is freed from whole-class pacing constraints to deliver targeted instruction where it is most needed.
The common thread is that strategies are not neutral delivery mechanisms. They are structures that determine where thinking happens, who does it, and at what level of complexity. Choosing a strategy is an act of instructional engineering, not decoration.
Sources
-
Hattie, J. (2009). Visible Learning: A Synthesis of Over 800 Meta-Analyses Relating to Achievement. Routledge.
-
Roediger, H. L., & Karpicke, J. D. (2006). Test-enhanced learning: Taking memory tests improves long-term retention. Psychological Science, 17(3), 249–255.
-
Sweller, J. (1988). Cognitive load during problem solving: Effects on learning. Cognitive Science, 12(2), 257–285.
-
Marzano, R. J., Pickering, D. J., & Pollock, J. E. (2001). Classroom Instruction That Works: Research-Based Strategies for Increasing Student Achievement. ASCD.