Evidence-Based Learning Techniques Ranked
Learning techniques ranked by scientific evidence — practice testing, spaced repetition, interleaving, and more. Effect sizes, use cases, and how to combine them.
Students spend thousands of hours studying over their academic careers. Most of that time goes to techniques that research has repeatedly shown produce poor long-term retention: rereading textbooks, highlighting passages, copying notes, and cramming the night before exams. Meanwhile, techniques with the strongest scientific support — practice testing, spaced repetition, interleaved practice — are used least frequently. This is not because students are lazy or unintelligent. It is because the techniques that feel most effective are not the techniques that actually work, and nobody taught them the difference.
In 2013, John Dunlosky and colleagues published a landmark review evaluating ten common learning techniques by evidence quality. Their findings shocked educators: the methods students prefer rank lowest, and the methods they avoid rank highest. Since then, hundreds of additional studies have confirmed, refined, and extended these rankings. This guide presents the definitive evidence-based ranking of learning techniques — not based on popularity, tradition, or what feels productive, but on decades of cognitive psychology research measuring actual long-term retention and transfer.
Every technique in this guide is ranked by scientific evidence, explained with its underlying mechanism, and paired with practical implementation guidance. You will learn which techniques to prioritize, which to abandon, how to combine techniques for maximum effect, and how to build a personal learning system grounded in research rather than habit. Whether you are a student, professional, or lifelong learner, this ranking gives you the evidence to study smarter — not harder.
How Techniques Are Ranked
This ranking evaluates learning techniques by the quality and consistency of scientific evidence supporting their effectiveness for long-term retention and transfer — not by popularity, ease of use, or subjective feeling of productivity.
Ranking Criteria
- Evidence quality: Number of studies, sample sizes, experimental rigor (randomized controlled trials vs correlational)
- Effect consistency: Whether results replicate across materials, age groups, and laboratories
- Effect size: Magnitude of improvement over control conditions (measured by Cohen's d, Hedges' g, or percentage improvement)
- Retention interval: Whether benefits persist days, weeks, and months after learning — not just immediate post-study
- Transfer: Whether benefits extend to new materials and contexts beyond what was directly studied
- Practical utility: Whether the technique can be implemented by learners without specialized equipment or extensive training
Tier System
| Tier | Evidence Level | Recommendation |
|---|---|---|
| S | Very strong — hundreds of replications | Use always — foundation of every study system |
| A | Strong — consistent moderate-to-large effects | Use regularly — core supplementary techniques |
| B | Moderate — reliable but context-dependent | Use strategically — for specific material types |
| C | Low-moderate — limited or narrow evidence | Use with caution — not as primary methods |
| D | Low — weak or negative evidence | Avoid as primary — replace with higher-tier techniques |
What This Ranking Does Not Measure
Evidence rankings measure retention and transfer — not enjoyment, speed of initial encoding, or subjective confidence. High-tier techniques often feel harder and less productive during study while producing dramatically better long-term results. Low-tier techniques often feel easy and productive while producing poor retention. This feeling-reality gap is why evidence-based rankings matter — they override misleading subjective experience.
The Dunlosky Review: Foundation Study
John Dunlosky, Katherine Rawson, Elizabeth Marsh, Mitchell Nathan, and Daniel Willingham published "Improving Students' Learning With Effective Learning Techniques" in Psychological Science in the Public Interest (2013). This review remains the most cited and comprehensive evaluation of learning techniques available.
What They Evaluated
Ten learning techniques commonly used by students: elaborative interrogation, self-explanation, summarization, highlighting, keyword mnemonic, imagery for text, rereading, practice testing, distributed practice, and interleaved practice. Each was rated on utility (high, moderate, low) based on evidence quality, breadth of applicability, and ease of implementation.
Key Findings
| Technique | Dunlosky Rating | Current Tier |
|---|---|---|
| Practice testing | High utility | S |
| Distributed practice (spacing) | High utility | S |
| Interleaved practice | Moderate utility | A (upgraded with subsequent research) |
| Elaborative interrogation | Moderate utility | A |
| Self-explanation | Moderate utility | A |
| Keyword mnemonic | Low utility | C |
| Imagery for text | Low utility | C |
| Summarization | Low utility | C |
| Highlighting | Low utility | D |
| Rereading | Low utility | D |
Why the Review Matters
Before Dunlosky et al., learning technique recommendations were based on tradition, teacher preference, and intuition. After the review, educators and learners had an evidence-based framework for strategy selection. The review's most important finding was not which techniques work — it was the gap between what students do (rereading, highlighting) and what works (testing, spacing). This guide extends the Dunlosky framework with subsequent research (2013–2025) that has strengthened some ratings and added new techniques.
Tier S: Highest Evidence (Use Always)
Tier S techniques have the strongest, most replicated evidence base. Every effective study system should be built on these two foundations. If you implement nothing else, implement these.
#1 Practice Testing (Active Recall)
Tier: S | Dunlosky Rating: High | Effect Size: d = 0.50–0.80 | Evidence: 500+ studies
What It Is
Practice testing — also called retrieval practice or active recall — involves testing yourself on material without looking at notes. Flashcards, practice problems, free recall writes, practice exams, and self-quizzing are all forms of practice testing. The act of retrieving information from memory strengthens the memory trace more than restudying the same information.
Why It Works
- Testing effect: Retrieval itself is a learning event — each successful recall strengthens the neural pathway to that memory
- Desirable difficulty: Retrieval is harder than recognition, producing stronger encoding (Bjork)
- Metacognitive calibration: Testing reveals what you actually know vs what you think you know
- Exam transfer: Practice testing mirrors exam demands (retrieval under pressure) better than any other technique
- Error detection: Failed retrievals identify specific gaps for targeted review
Key Research
Roediger and Karpicke (2006): students who tested themselves remembered 80% after one week; students who restudied remembered 36%. Same study time, more than double the retention. Rowland (2014) meta-analysis of 217 studies confirmed the testing effect with an overall effect size of d = 0.50. Adesope et al. (2010) meta-analysis found testing effects across age groups, materials, and retention intervals.
How to Implement
- Flashcards: Daily review with Problemory Flashcards or Anki — attempt recall before revealing answer
- Free recall: After every study session, close materials and write everything remembered (5–10 min)
- Practice problems: Solve without reference — cover solutions, attempt first
- Practice exams: Weekly timed tests under exam conditions
- Read-recite-review: Read a section, close book, recite from memory, check gaps
- Pre-testing: Attempt questions before learning the material — even failed attempts improve subsequent learning
When to Use
Always. Practice testing should be part of every study session for every subject. Minimum: one retrieval event per session. Optimal: retrieval bookends every session (begin and end with testing).
Deep dive: active recall guide → | retrieval practice →
#2 Distributed Practice (Spaced Repetition)
Tier: S | Dunlosky Rating: High | Effect Size: d = 0.40–0.70 | Evidence: 300+ studies
What It Is
Distributed practice — spacing study sessions over time rather than massing them in one block — produces substantially better long-term retention. Reviewing material at expanding intervals (1 day, 3 days, 7 days, 14 days, 30 days) leverages the spacing effect to build durable memories with minimum total study time.
Why It Works
- Spacing effect: Gaps between sessions allow memory consolidation and create desirable difficulty at each retrieval
- Forgetting and relearning: Partial forgetting before review forces deeper re-encoding than immediate restudy
- Context variation: Spaced sessions occur in different mental states, creating multiple retrieval cues
- Consolidation time: Sleep-dependent memory consolidation occurs between spaced sessions
Key Research
Cepeda et al. (2006) meta-analysis of 317 experiments: spaced practice produced large effect sizes across all retention intervals tested. Cepeda et al. (2008) identified optimal gap as 10–20% of desired retention interval. Carpenter et al. (2012) demonstrated spacing benefits in real classroom settings, not just laboratories.
How to Implement
- Flashcard SRS: Automated spacing via Problemory Flashcards or Anki — review due cards daily
- Manual schedule: Review new material at 1, 3, 7, 14, 30 days after initial learning
- Leitner box: Physical 5-box system with expanding review intervals
- Calendar blocking: Schedule chapter reviews at expanding intervals on calendar
When to Use
Always, in combination with practice testing. Spacing without retrieval (rereading at intervals) is less effective. Retrieval with spacing is the gold standard combination. Start spacing from day one of learning — not before exams.
Deep dive: spaced repetition guide → | review schedule →
Tier A: Strong Evidence (Use Regularly)
Tier A techniques have strong evidence supporting their effectiveness, often with large effect sizes in specific contexts. Use them regularly as supplements to Tier S foundations.
#3 Interleaved Practice
Tier: A | Dunlosky Rating: Moderate (upgraded) | Effect Size: d = 0.40–0.60 | Evidence: 100+ studies
What It Is
Interleaved practice mixes different topics, problem types, or skills within a single study session rather than studying one topic exhaustively before moving to the next (blocked practice). Instead of AAA BBB CCC, interleaving produces ABC ABC ABC.
Why It Works
- Discrimination: Mixing similar concepts forces you to identify which strategy or knowledge applies — critical for exams
- Desirable difficulty: Interleaving feels harder during practice, producing stronger retention
- Exam simulation: Real exams mix topics randomly — interleaved practice mirrors this
- Prevents context-dependent learning: Blocked practice creates "I can do this because I just studied it" without genuine mastery
Key Research
Rohrer and Taylor (2007): interleaved math problems produced 72% accuracy vs 38% for blocked practice after one week. Brunmeyer and MacDonald (2020) meta-analysis confirmed interleaving benefits across domains. Dunlosky rated it moderate in 2013; subsequent research supports upgrading to Tier A.
How to Implement
- Mix problem types from multiple chapters in each practice session
- Use one interleaved flashcard deck (not separate decks per chapter)
- Take practice tests that mix all covered topics
- After initial blocked learning of a new topic, switch to interleaved review
Deep dive: interleaving guide →
#4 Elaborative Interrogation
Tier: A | Dunlosky Rating: Moderate | Effect Size: d = 0.30–0.50 | Evidence: 80+ studies
What It Is
Elaborative interrogation involves asking "why" and "how" questions about facts and concepts during learning. "Why does this make sense?" "Why is this true?" "How does this relate to what I already know?" Forcing explanatory answers produces deeper processing than passive acceptance of facts.
Why It Works
Elaborative interrogation forces connection between new information and existing knowledge — building associative networks that provide multiple retrieval pathways. It also reveals comprehension gaps immediately — if you cannot explain why, you do not understand.
How to Implement
- For every fact learned, ask "Why is this true?" and answer before moving on
- Add "Why?" to flashcard backs alongside definitions
- During reading, pause after each section and ask "Why does the author claim this?"
- Best for factual material with logical relationships — less effective for arbitrary facts
#5 Self-Explanation
Tier: A | Dunlosky Rating: Moderate | Effect Size: d = 0.30–0.60 | Evidence: 60+ studies
What It Is
Self-explanation requires generating explanations of material in your own words as you study — not copying or paraphrasing, but explaining how concepts work, why steps in a procedure are taken, and how parts relate to the whole.
Why It Works
Chi et al. (1994) found that students who self-explained during problem-solving learned twice as much as students who did not — even when the self-explainers took longer. Self-explanation forces generative processing, comprehension monitoring, and gap identification simultaneously.
How to Implement
- After each paragraph or problem step, explain it aloud or in writing in your own words
- Use the Feynman technique — explain as if teaching a beginner (Feynman guide →)
- Write margin notes that explain, not copy
- Best for procedural material (math, science, coding) and conceptual material
Tier B: Moderate Evidence (Use Strategically)
Tier B techniques have reliable evidence in specific contexts but are not universally applicable. Use them strategically for appropriate material types.
#6 Dual Coding
Tier: B | Effect Size: d = 0.20–0.40 | Evidence: 40+ studies
What It Is
Dual coding combines verbal and visual processing of the same information — creating diagrams, mind maps, or sketches alongside text notes. Paivio's dual coding theory proposes that verbal and visual information are processed through separate channels, and combining them creates redundant retrieval pathways.
How to Implement
- Draw diagrams of processes described in text
- Create mind maps connecting concepts radially
- Sketch structures, cycles, and systems from memory
- Best for spatial, structural, and process-oriented material
- Must be generative (you create the visual) — not passive viewing of others' diagrams
#7 Concrete Examples
Tier: B | Effect Size: d = 0.30–0.50 | Evidence: 30+ studies
What It Is
Providing or generating concrete examples of abstract concepts accelerates understanding and retention. Abstract definitions are harder to encode than definitions paired with specific, vivid examples — especially self-generated ones.
How to Implement
- For every abstract concept, write or find two concrete examples
- Include examples on flashcard backs — not just definitions
- Generate original examples (stronger than reading provided ones)
- Best for abstract concepts in math, philosophy, law, and science
#8 Retrieval Plus Elaboration
Tier: B | Effect Size: d = 0.40–0.60 | Evidence: 30+ studies
What It Is
Combining retrieval practice with elaborative processing — retrieve a fact, then explain why it is true, connect it to other knowledge, or generate an example. This combination produces larger effects than either technique alone.
How to Implement
- Flashcard front: term. Back: definition + why it matters + example + connection to related concept
- After free recall, elaborate on each recalled item — do not just list, explain
- After practice problems, explain why the solution method works, not just how
#9 Sleep and Memory Consolidation
Tier: B | Effect Size: d = 0.30–0.50 | Evidence: 100+ studies
What It Is
Sleep — particularly slow-wave and REM sleep — consolidates memories encoded during waking study. Studying before sleep and getting adequate sleep (7–8 hours) significantly improves retention compared to equal study time with sleep deprivation.
Key Research
Walker and Stickgold: sleep after learning improves retention by 20–40%. Gais and Born: sleep after vocabulary learning produces 30–50% better retention. Sleep deprivation reduces learning capacity by up to 40%.
How to Implement
- Review flashcards before bed — sleep consolidates what you studied last
- Protect 7–8 hours of sleep during study periods — non-negotiable
- Schedule difficult new learning in the morning when well-rested
- Never pull all-nighters — material studied while sleep-deprived is poorly encoded AND poorly consolidated
Deep dive: sleep and memory →
Tier C: Low-Moderate Evidence (Use With Caution)
Tier C techniques show benefits in narrow contexts but lack the broad, robust evidence of higher tiers. Use as supplements, not foundations.
#10 Summarization
Tier: C | Dunlosky Rating: Low | Effect Size: d = 0.10–0.30 | Evidence: 40+ studies
What It Is
Writing summaries of read material — condensing chapters, lectures, or articles into shorter form. Dunlosky rated summarization low because benefits depend heavily on skill level and are inconsistent across studies.
When It Helps
Summarization produces learning when it requires deep processing — writing in your own words, identifying main points, connecting ideas. It fails when it becomes copying or when summaries are never reviewed via retrieval.
How to Upgrade Summarization
Do not summarize and stop — summarize, then test yourself on the summary without looking. Convert summary points into flashcards. Use Cornell note-taking format with summary section on left and questions on right.
#11 Keyword Mnemonic
Tier: C | Dunlosky Rating: Low | Effect Size: d = 0.30–0.50 (narrow) | Evidence: 30+ studies
What It Is
Creating phonetic links between foreign words and familiar words, paired with vivid imagery. Effective for paired associate learning (foreign vocabulary) but limited generalizability to other material types.
When It Helps
Language vocabulary acquisition, medical terminology, lists of unrelated items. Less effective for conceptual understanding, procedural skills, or interconnected knowledge. Best used as an encoding accelerator paired with spaced repetition for retention.
Deep dive: mnemonic techniques → | memory palace →
#12 Imagery for Text
Tier: C | Dunlosky Rating: Low | Effect Size: d = 0.10–0.30 | Evidence: 20+ studies
What It Is
Creating mental images while reading text to enhance encoding. Dunlosky rated it low because effects are inconsistent and depend on individual imagery ability and material type.
When It Helps
Concrete, visualizable content (narratives, descriptions, spatial information). Less effective for abstract concepts, mathematical reasoning, or procedural content. Vivid, absurd images work better than literal ones.
Tier D: Low Evidence (Avoid as Primary Methods)
Tier D techniques are widely used but poorly supported by evidence. They may aid initial comprehension but do not produce durable retention. Replace with Tier S and A techniques.
#13 Rereading
Tier: D | Dunlosky Rating: Low | Effect Size: d = 0.05–0.15 | Evidence: 50+ studies (consistently weak)
What It Is
Rereading textbooks, notes, or articles — the most commonly used study technique worldwide. Dunlosky rated it low utility despite its popularity.
Why It Fails
- Creates fluency illusion — material feels familiar, producing false confidence
- No retrieval practice — information flows in but is never pulled out
- Diminishing returns — first read captures most benefit; subsequent reads add little
- Attention decline — eyes move on rereads but processing stops
Replacement
Read once, then switch to read-recite-review or flashcards. One retrieval cycle beats three rereads (study mistakes →).
#14 Highlighting and Underlining
Tier: D | Dunlosky Rating: Low | Effect Size: d = 0.00–0.10 | Evidence: 30+ studies (consistently null)
What It Is
Marking text with highlighter or underlining key passages — the second most popular study technique after rereading.
Why It Fails
- Shallow processing — marking words without understanding concepts
- Over-highlighting — 30–50% of text highlighted means nothing is distinguished
- No retrieval — reviewing highlights is rereading with color, not testing
- False completion — highlighted pages feel "done" without learning occurring
Replacement
Replace highlighting with margin notes in your own words + flashcard creation for key concepts. If you highlight, follow up with retrieval practice on highlighted content only.
Effect Sizes: How Much Each Technique Helps
Effect sizes quantify the magnitude of improvement. Cohen's d interpretation: 0.2 = small, 0.5 = medium, 0.8 = large. In learning research, even d = 0.30 represents meaningful improvement that can change exam outcomes.
| Rank | Technique | Tier | Effect Size (d) | Retention Improvement |
|---|---|---|---|---|
| 1 | Practice testing | S | 0.50–0.80 | 50–100% better than restudy |
| 2 | Spaced repetition | S | 0.40–0.70 | 40–70% better than massed |
| 3 | Interleaved practice | A | 0.40–0.60 | 40–90% better than blocked |
| 4 | Self-explanation | A | 0.30–0.60 | 30–60% better than reading |
| 5 | Elaborative interrogation | A | 0.30–0.50 | 30–50% better than reading |
| 6 | Retrieval + elaboration | B | 0.40–0.60 | Combined exceeds either alone |
| 7 | Concrete examples | B | 0.30–0.50 | Abstract concepts only |
| 8 | Dual coding | B | 0.20–0.40 | Visual/spatial material |
| 9 | Sleep consolidation | B | 0.30–0.50 | 20–40% vs sleep deprivation |
| 10 | Keyword mnemonic | C | 0.30–0.50 | Paired-associate only |
| 11 | Summarization | C | 0.10–0.30 | Skill-dependent, inconsistent |
| 12 | Imagery for text | C | 0.10–0.30 | Concrete content only |
| 13 | Rereading | D | 0.05–0.15 | Negligible beyond first read |
| 14 | Highlighting | D | 0.00–0.10 | Null effect on test performance |
What Effect Sizes Mean in Practice
A student scoring 60% using rereading might score 75–85% using practice testing with the same study time. A student scoring 50% using blocked practice might score 70–80% using interleaved practice. These are not marginal improvements — they are the difference between failing and passing, or between passing and excelling. The techniques at the top of this ranking do not require more time — they require different methods.
Combining Techniques for Maximum Effect
Individual techniques produce significant effects. Combined techniques produce multiplicative effects — the whole exceeds the sum of parts.
The Gold Standard Combination
Practice testing + Spaced repetition — the two Tier S techniques combined. Flashcards with SRS automate both: each review is a retrieval event (testing) scheduled at optimal intervals (spacing). This single combination outperforms any other technique or combination in the research literature.
High-Impact Combinations
| Combination | Techniques Combined | Best For |
|---|---|---|
| Gold standard | Practice testing + Spacing | All material — foundation of every system |
| Deep encoding | Practice testing + Self-explanation | Conceptual and procedural material |
| Exam preparation | Practice testing + Interleaving + Spacing | Mixed-topic exams |
| Abstract concepts | Elaborative interrogation + Concrete examples + Testing | Philosophy, law, advanced science |
| Visual material | Dual coding + Practice testing + Spacing | Anatomy, geography, engineering |
| Vocabulary | Keyword mnemonic + Spaced flashcards + Production practice | Language learning, terminology |
| Maximum retention | Testing + Spacing + Interleaving + Self-explanation + Sleep | High-stakes comprehensive exams |
Technique Stacking Protocol
For each study session, stack techniques in this order:
- Plan (metacognition): Set goal, choose strategy, define success check
- Learn: Initial exposure with self-explanation and elaborative interrogation
- Encode: Create flashcards with examples and connections (dual coding where appropriate)
- Retrieve: Practice testing — flashcards, free recall, problems
- Space: Schedule review at expanding intervals
- Interleave: Mix with other topics in subsequent sessions
- Evaluate (metacognition): Assess retention, adjust strategy
- Consolidate: Pre-sleep review + adequate sleep
Best Techniques by Learning Goal
Exam Preparation (High-Stakes Tests)
- Practice testing (daily flashcards + weekly mock exams)
- Spaced repetition (from day one, not just before exam)
- Interleaved practice (mixed-topic sessions and tests)
- Self-explanation (for conceptual understanding)
- Sleep consolidation (7–8 hours, pre-sleep review)
Long-Term Retention (Professional Knowledge)
- Spaced repetition (daily maintenance review indefinitely)
- Practice testing (regular self-assessment)
- Elaborative interrogation (connect to professional experience)
- Concrete examples (from real work contexts)
Language Learning
- Spaced repetition flashcards (vocabulary + grammar)
- Practice testing (production — L1 to L2 direction)
- Keyword mnemonics (initial encoding of difficult words)
- Concrete examples (sentences in context)
- Interleaved practice (mix vocabulary, grammar, reading, listening)
Skill Acquisition (Math, Coding, Procedures)
- Interleaved practice (mixed problem types)
- Practice testing (solve without reference)
- Self-explanation (explain each step and why)
- Spaced repetition (review problem types at intervals)
Reading Comprehension and Retention
- Self-explanation (explain each section in own words)
- Elaborative interrogation (ask why after each claim)
- Practice testing (free recall after each chapter)
- Spaced repetition (flashcards for key concepts)
Best Techniques by Subject
| Subject | Primary Technique | Secondary Techniques | Avoid |
|---|---|---|---|
| Mathematics | Interleaved problem practice | Self-explanation, spaced review of problem types | Rereading worked examples |
| Biology | Spaced flashcards | Dual coding (diagrams), self-explanation of processes | Highlighting terminology lists |
| Chemistry | Practice testing (reactions, mechanisms) | Concrete examples, interleaved problem types | Rereading reaction lists |
| Physics | Interleaved problem solving | Self-explanation of concepts, concrete examples | Rereading formula sheets |
| History | Spaced flashcards + elaborative interrogation | Self-explanation (cause-effect), concrete examples | Highlighting timelines |
| Law | Practice testing (case application) | Elaborative interrogation, concrete examples | Rereading case summaries |
| Medicine | Spaced flashcards (Anki) | Interleaved practice questions, self-explanation | Rereading textbooks |
| Languages | Spaced flashcards (production) | Keyword mnemonics, concrete sentence examples | Rereading word lists |
| Programming | Interleaved coding problems | Self-explanation of algorithms, practice testing | Rereading code examples |
| Literature | Self-explanation + elaborative interrogation | Practice testing (essay outlines from memory) | Highlighting passages |
Best Techniques by Timeline
One Week Before Exam
Practice testing (2–3 mock exams) + spaced flashcard review (high-failure cards) + interleaved problem practice. No new content. No rereading.
One Month Before Exam
Daily spaced flashcards + weekly mock tests + interleaved practice + error log analysis. Reduce new content intake. Prioritize retrieval over acquisition.
Full Semester (3–6 Months)
Daily flashcards from day one + weekly practice tests + interleaved sessions + self-explanation during initial learning + monthly retention audits. Build the full technique stack progressively.
Long-Term (Years)
Daily spaced repetition maintenance + periodic practice testing + elaborative connections to new experience + annual knowledge audit. Tier S techniques indefinitely; others as needed.
What Students Do vs What Research Says
The gap between student behavior and research recommendations is the single largest untapped opportunity in education.
| Rank by Usage | Student Behavior | Research Tier | Rank by Evidence |
|---|---|---|---|
| 1 (most used) | Rereading | D | 13–14 (worst) |
| 2 | Highlighting | D | 14 (worst) |
| 3 | Reviewing notes | D | 13 (near worst) |
| 4 | Creating summaries | C | 10–11 |
| 5 | Cramming | D (massed) | N/A (anti-technique) |
| ... | ... | ... | ... |
| Rarely used | Self-testing | S | 1 (best) |
| Rarely used | Spaced review | S | 2 (best) |
| Rarely used | Interleaved practice | A | 3 (third best) |
Students rank techniques in almost perfect inverse order to the evidence. The techniques used most (rereading, highlighting) are the least effective. The techniques used least (self-testing, spacing) are the most effective. Closing this gap — replacing Tier D habits with Tier S habits — produces larger performance improvements than increasing study hours.
Building Your Evidence-Based Study System
Translating this ranking into a personal study system requires replacing low-tier habits with high-tier ones progressively — not all at once.
Phase 1: Foundation (Week 1–2)
- Install daily flashcard review (Tier S: testing + spacing)
- Replace rereading with read-recite-review (Tier S: testing)
- Stop highlighting as a standalone method (eliminate Tier D)
- Track retention rate to verify improvement
Phase 2: Expansion (Week 3–4)
- Add weekly practice test (Tier S: testing)
- Begin interleaved practice sessions (Tier A)
- Add self-explanation to every study session (Tier A)
- Start elaborative interrogation for factual material (Tier A)
Phase 3: Optimization (Week 5–8)
- Add dual coding for visual/spatial material (Tier B)
- Implement pre-sleep review + sleep protection (Tier B)
- Build error log with strategy evaluation (metacognition)
- Monthly retention audit and technique effectiveness review
Phase 4: Mastery (Week 9+)
- Full technique stack running automatically
- Strategy selection based on material type (automatic)
- Calibration within 10% on practice tests
- Personal evidence-based strategy profile documented
The Daily Evidence-Based Study System
Morning (25 minutes)
- Spaced flashcard review — all due cards (Tier S: testing + spacing) — 20 min
- Free recall warm-up — write yesterday's key points from memory (Tier S: testing) — 5 min
Study Block (45–90 minutes)
- Pre-study plan — goal, strategy, success check (metacognition) — 3 min
- Learn new material with self-explanation and elaborative interrogation (Tier A) — 25 min
- Create flashcards for new concepts with examples (Tier S + B) — 10 min
- Interleaved practice problems from current and previous topics (Tier A) — 20 min
- End-of-session free recall — close all materials, write key points (Tier S) — 5 min
- Post-session evaluation — three questions (metacognition) — 3 min
Evening (10 minutes)
- Pre-sleep flashcard review — today's failed cards (Tier S + B: sleep consolidation) — 5 min
- Learning journal entry — what worked, what did not (metacognition) — 5 min
Weekly (Sunday, 60 minutes)
- Cumulative practice test under timed conditions (Tier S) — 30 min
- Error analysis and categorization (metacognition) — 15 min
- Calibration review — predicted vs actual scores — 5 min
- Next week planning — priorities based on error data — 10 min
Implementation Mistakes
1. Adding Tier S Without Removing Tier D
Adding flashcards while continuing to reread and highlight. Total time increases but Tier D activities dilute Tier S benefits. Replace, do not accumulate.
2. Using Tier S Techniques Passively
Flipping flashcards without attempting recall first. Taking practice tests while peeking at notes. The retrieval attempt IS the learning event — passive flipping produces Tier D results from Tier S tools.
3. Spacing Without Testing
Reviewing material at intervals by rereading. Spacing rereading is better than massed rereading but far worse than spaced retrieval. Always pair spacing with testing.
4. Abandoning Tier A Techniques Because They Feel Hard
Interleaving and self-explanation feel slower and more difficult than rereading. This difficulty IS the learning — abandoning effective techniques because they feel ineffective is the fluency illusion in action.
5. Ignoring Context Dependencies
Keyword mnemonics for math problems. Highlighting for procedural skills. Every technique has optimal contexts — using techniques outside their evidence base produces disappointment and abandonment.
Emerging Techniques and Future Research
Learning science continues to evolve. Several emerging areas may change future rankings.
Desirable Difficulties Framework (Bjork)
Robert Bjork's desirable difficulties research suggests that conditions making learning feel harder (spacing, interleaving, generation, variation) produce better long-term retention. This framework explains WHY top-tier techniques work and predicts that future high-evidence techniques will share the property of introducing productive difficulty.
Generation Effect
Self-generated content (writing your own examples, creating your own questions, drawing your own diagrams) produces better retention than passively receiving the same content. Effect sizes comparable to practice testing for some materials. Likely future Tier A technique.
Pre-Testing Effect
Attempting to answer questions before learning the material improves subsequent learning — even when initial answers are wrong. Growing evidence base suggests pre-testing as a standalone Tier A technique for initial exposure.
AI-Assisted Learning
AI tools for generating practice questions, flashcards, and explanations show promise but lack long-term retention studies. Current recommendation: use AI to generate retrieval materials (practice questions, flashcards), not as a replacement for retrieval itself. AI-generated summaries used passively likely rank Tier D (AI for learning →).
Exercise and Cognitive Enhancement
Aerobic exercise before or after learning improves encoding and consolidation. Effect sizes moderate (d = 0.20–0.40). Best treated as a biological support for Tier S techniques rather than a standalone learning technique (exercise and cognition →).
Additional Techniques Beyond the Dunlosky Review
Research since 2013 has evaluated additional techniques not included in the original review. Here is how they rank based on subsequent evidence.
Memory Palace (Method of Loci) — Tier B+
Effect Size: d = 0.40–0.60 (for ordered lists) | Evidence: 40+ studies
The method of loci — placing vivid mental images in familiar spatial locations — produces large effects for ordered information (lists, sequences, speeches). Memory athletes use this technique exclusively. For academic learning, it works best for: ordered lists (periodic table groups, taxonomy classifications), sequential processes (legal arguments, historical timelines), and batch vocabulary encoding. Limitation: high initial encoding effort, requires transfer to spaced repetition for long-term maintenance. Best used as an encoding accelerator paired with flashcard review, not as a standalone retention system.
Deep dive: memory palace guide → | science of memory palaces →
Chunking — Tier B
Effect Size: d = 0.30–0.50 | Evidence: 30+ studies
Organizing individual items into meaningful groups (chunks) expands effective working memory capacity. Phone numbers (555-123-4567 vs 5551234567), chess positions (meaningful patterns vs individual pieces), and acronyms (HOMES for Great Lakes) all leverage chunking. For learning: group related vocabulary into thematic clusters, organize formulas by topic, create acronyms for list items. Chunking is most effective when chunks are meaningful (not arbitrary groupings) and when learners create their own chunks rather than receiving pre-made ones.
Deep dive: chunking guide →
Generation Effect — Tier A- (Emerging)
Effect Size: d = 0.40–0.60 | Evidence: 50+ studies
Information that learners generate themselves (examples, questions, summaries, diagrams) is remembered better than information passively received — even when the generated content is imperfect. Creating your own practice questions, writing your own examples, drawing your own diagrams, and formulating your own flashcards all leverage the generation effect. This overlaps with self-explanation and practice testing but emphasizes the creative/generative component specifically.
Varied Practice — Tier A- (Emerging)
Effect Size: d = 0.30–0.50 | Evidence: 30+ studies
Practicing skills in varied contexts (different problem formats, different settings, different conditions) produces better transfer to novel situations than practicing in identical conditions. Related to interleaving but focuses on contextual variation rather than topic mixing. For motor skills: practice in varied environments. For cognitive skills: solve problems in multiple formats (multiple choice, essay, application, teaching).
Retrieval-Induced Forgetting (and How to Avoid It)
Practicing retrieval of some items can temporarily suppress related unpracticed items — a phenomenon called retrieval-induced forgetting. Practical implication: cumulative review must cover ALL material, not just recently studied topics. Daily flashcard review handles this automatically (all due cards appear). Weekly cumulative tests prevent selective review that leaves early material vulnerable.
How Learning Research Works
Understanding how techniques are evaluated helps you assess claims and distinguish evidence-based recommendations from marketing.
The Gold Standard: Randomized Controlled Trials
Participants randomly assigned to technique A or technique B (or control). Same material, same time, same conditions — only the technique differs. Retention tested after a delay (typically 1 day to 1 week). The technique producing higher delayed retention wins. Most Tier S and A evidence comes from this design.
Meta-Analyses
Statistical combination of results from dozens or hundreds of individual studies. Meta-analyses provide the most reliable effect size estimates because they aggregate across laboratories, materials, and populations. The Dunlosky review, Cepeda spacing meta-analysis, and Rowland testing meta-analysis are examples.
Effect Size Interpretation
- d = 0.20: Small effect — noticeable but modest improvement
- d = 0.50: Medium effect — meaningful improvement that changes outcomes
- d = 0.80: Large effect — dramatic improvement, often difference between pass and fail
- d = 1.00+: Very large — rare in learning research, usually comparing effective vs completely ineffective methods
Red Flags in Learning Claims
- "Brain training" apps claiming to improve general intelligence (see: brain training myths →)
- Techniques validated only by immediate post-study tests (not delayed retention)
- Claims based on single studies without replication
- "Learning styles" (visual, auditory, kinesthetic) — no evidence that matching instruction to preferred style improves learning
- Products claiming to bypass the need for effortful retrieval
- Techniques that feel easy and promise fast results — effective learning requires desirable difficulty
Popular Learning Myths vs Evidence
| Myth | Evidence Says | What to Do Instead |
|---|---|---|
| "Reread until you know it" | Rereading creates fluency illusion, not retrieval | Read once, then test yourself |
| "Highlight the important parts" | Highlighting does not improve test performance | Write margin notes + create flashcards |
| "Cram the night before" | Massed practice produces rapid decay | Daily spaced review from day one |
| "Study one topic at a time" | Blocked practice produces poor transfer | Interleave topics within sessions |
| "I'm a visual learner" | Learning styles hypothesis not supported | Use evidence-based techniques for all |
| "More hours = more learning" | Method matters more than duration | Same hours with Tier S techniques |
| "Review notes before exams" | Passive review produces poor retention | Practice tests + flashcard review |
| "Copy notes neatly to learn them" | Copying is shallow processing | Self-explain in own words + flashcards |
| "Brain training games improve memory" | Transfer to real learning is not demonstrated | Practice testing on actual material |
| "Listening to lectures twice is enough" | Passive re-exposure adds minimal retention | Lecture + self-explanation + flashcards |
Detailed Implementation Guide by Technique
Implementing Practice Testing: Step by Step
- Choose format: Flashcards for facts, free recall for concepts, practice problems for procedures, mock exams for comprehensive assessment
- Attempt before checking: Always retrieve before looking at answers — the attempt IS the learning event
- Log failures: Every failed retrieval goes into error log or gets shorter flashcard interval
- Schedule: Minimum one retrieval event per study session; optimal: bookend every session with retrieval
- Increase difficulty: Progress from cued recall (flashcards) to free recall (blank page) to application (problems, essays)
- Track: Retention rate over time — should be 85%+ on mature cards
Implementing Spaced Repetition: Step by Step
- Choose system: Anki/Problemory for 200+ items; manual calendar for under 200
- Create cards on learning day: One concept per card, with example sentence
- Review daily: Complete all due cards before adding new ones
- New card rate: 10–20/day for sustainable growth; reduce if review exceeds 30 min
- Failed cards: Reset to short interval; re-review in same session
- Maintenance: Continue daily review indefinitely — never stop
Implementing Interleaved Practice: Step by Step
- Initial learning: First exposure can be blocked (one new topic at a time)
- Switch to interleaving: From second session onward, mix all previously studied topics
- Flashcards: One mixed deck — not separate decks per chapter
- Problems: Mixed problem sets from all chapters — never single-chapter problem sets for review
- Expect difficulty: Interleaved sessions feel harder — this confirms it is working
- Test interleaved: All practice tests should mix topics
Common Implementation Timeline
Most learners see measurable retention improvement within 1–2 weeks of switching from Tier D to Tier S techniques. Full habit formation and calibration accuracy typically develop over 4–8 weeks. Exam score improvements become visible within 4–6 weeks of consistent evidence-based practice. The investment is front-loaded — the first two weeks require conscious effort to override habitual passive methods. After that, Tier S techniques become automatic and require less willpower than the passive methods they replaced. The research is clear, the results are reproducible, and the techniques are available to you starting today.
Evidence in Action: Case Studies
Case 1: Medical Student — Anki Transformation
Before: Reread textbooks, highlighted notes, scored 65% on practice questions.
Intervention: Created Anki deck from day one of medical school. Daily 30-min review. Replaced all rereading with flashcard retrieval.
After: Step 1 score: 245 (90th percentile). Daily review time: 30 min. Total deck: 12,000 cards at 88% retention. Study hours decreased; scores increased.
Case 2: Language Learner — Tier S Only
Before: Reread vocabulary lists, watched videos passively. 500 words learned over 6 months with ~30% retention.
Intervention: Switched to daily spaced flashcards (20 words/day) with production cards and sentence examples.
After: 3,000 words in 6 months at 85% retention. Speaking ability matched vocabulary depth for the first time.
Case 3: Engineering Student — Interleaving Discovery
Before: Blocked problem practice (all Chapter 3, then all Chapter 4). Felt confident per chapter. Mixed exam score: 58%.
Intervention: Switched to interleaved problem sets mixing all chapters.
After: Next mixed exam: 79%. Interleaved practice felt harder during study but produced dramatically better exam performance.
Case 4: Professional Certification — Technique Stack
Before: Reread study guide twice, highlighted key sections. Practice exam: 62%.
Intervention: Full technique stack — daily flashcards, weekly mock exams, interleaved practice questions, self-explanation of missed concepts, pre-sleep review.
After: Passed certification on first attempt. Final practice exam: 87%. Total study time: same hours, different methods.
Technique Selection Decision Guide
Use this guide to select the optimal technique for any learning situation.
Step 1: What Type of Material?
- Discrete facts (definitions, dates, terms, formulas) → Flashcards (Tier S) + Spacing (Tier S)
- Concepts (theories, mechanisms, principles) → Self-explanation (Tier A) + Testing (Tier S)
- Procedures (problem-solving, calculations, algorithms) → Interleaved practice (Tier A) + Testing (Tier S)
- Application (essays, case analysis, clinical reasoning) → Practice testing (Tier S) + Concrete examples (Tier B)
- Lists/sequences (ordered items, steps, classifications) → Memory palace (Tier B+) + Flashcards (Tier S)
- Vocabulary (foreign words, terminology) → Spaced flashcards (Tier S) + Keyword mnemonics (Tier C)
Step 2: What Is Your Timeline?
- Exam in 1 week: Testing (Tier S) + Spacing (Tier S) — focus on retrieval of existing material
- Exam in 1 month: Add interleaving (Tier A) + weekly mock tests
- Exam in 3+ months: Full technique stack — build progressively from Tier S upward
- Long-term retention: Daily spacing (Tier S) indefinitely + periodic testing
Step 3: What Is Your Current Method?
- If rereading: Replace with read-recite-review (Tier S) — highest priority change
- If highlighting: Replace with margin notes + flashcards (Tier S) — highest priority change
- If cramming: Install daily spaced review (Tier S) — start today
- If already using flashcards: Add interleaving (Tier A) and weekly mock tests (Tier S)
- If already using full stack: Optimize — track retention, adjust intervals, refine card quality
Sample Week Using Evidence-Based Techniques
| Day | Tier S | Tier A | Tier B/Meta | Total Time |
|---|---|---|---|---|
| Monday | Flashcard review (20 min) + free recall (5 min) | Self-explanation during new lecture content | Pre-study plan (3 min) | ~60 min study |
| Tuesday | Flashcard review (20 min) + practice problems (15 min) | Interleaved problems (mixed chapters) | Post-session evaluation (5 min) | ~65 min study |
| Wednesday | Flashcard review (20 min) + free recall (5 min) | Elaborative interrogation on new reading | Dual coding — draw process diagram | ~55 min study |
| Thursday | Flashcard review (20 min) + practice problems (15 min) | Interleaved problems + self-explanation | Pre-sleep review (5 min) | ~65 min study |
| Friday | Flashcard review (20 min) + sectional test (20 min) | Interleaved review of week's topics | Error log update (10 min) | ~70 min study |
| Saturday | Full mock exam (60 min) + flashcard review (20 min) | Interleaved error re-attempt | Exam wrapper (10 min) | ~90 min study |
| Sunday | Flashcard review (20 min) + cumulative free recall (15 min) | Strategy evaluation | Weekly plan + calibration review (15 min) | ~50 min study |
The Neuroscience Behind Top-Ranked Techniques
Understanding why Tier S and A techniques work at the neural level reinforces commitment when they feel difficult during practice.
Practice Testing and Neural Pathways
Each retrieval attempt activates the hippocampus and prefrontal cortex simultaneously — the hippocampus locates the memory trace while the prefrontal cortex executes controlled retrieval. Successful retrieval triggers dopamine release in the nucleus accumbens, reinforcing the neural pathway. Failed retrieval followed by correct feedback produces even stronger encoding than successful retrieval alone — the error signals the brain to prioritize that trace. This is why pre-testing (attempting before learning) and errorful retrieval produce such large effects.
Spacing and Synaptic Consolidation
Synaptic connections strengthen during consolidation periods between study sessions — not during study itself. Massed practice does not allow consolidation between exposures. Spaced practice allows protein synthesis, dendritic spine growth, and systems consolidation to occur between sessions, producing physically stronger memory traces. Sleep between spaced sessions amplifies this effect further (neuroplasticity →).
Interleaving and Neural Discrimination
Blocked practice activates the same neural circuits repeatedly — strengthening one pathway but not building discrimination between similar pathways. Interleaved practice activates different circuits in rapid succession, forcing the prefrontal cortex to select the appropriate pathway for each item. This selection process builds the discrimination ability that mixed exams require — explaining why interleaved learners outperform blocked learners on mixed tests despite equal total practice time.
Why Tier D Techniques Fail Neurologically
Rereading and highlighting primarily activate visual processing areas without triggering hippocampal retrieval circuits. Information passes through awareness without being consolidated into retrievable long-term memory. The fluency felt during rereading is perceptual fluency (ease of processing the text) misattributed as memory fluency (ability to retrieve the content) — two distinct neural phenomena that feel identical subjectively.
Evidence-Based Techniques for Specific Exams
Medical Licensing (USMLE, MCAT, PLAB)
Primary stack: Anki spaced repetition (10,000–15,000 cards) + UWorld/practice question interleaving + weekly timed practice blocks.
Evidence basis: Top scorers universally use spaced flashcards + interleaved question practice. Rereading textbooks correlates with lower scores in retrospective surveys.
Daily minimum: 30 min flashcard review + 30 min practice questions. No rereading.
Law (Bar Exam, LSAT)
Primary stack: Spaced flashcards for rules and definitions + interleaved practice essays + self-explanation of case reasoning.
Evidence basis: Bar passage correlates with practice question volume and spaced review, not outline rereading.
Daily minimum: 20 min flashcards + 45 min practice questions/essays.
Graduate Admissions (GRE, GMAT, LSAT)
Primary stack: Vocabulary flashcards with spacing + interleaved quant/verbal practice + weekly full mock tests.
Evidence basis: Vocabulary retention requires daily spaced review (Tier S). Quant improvement requires interleaved problem types (Tier A).
Daily minimum: 20 min vocabulary flashcards + 30 min interleaved practice + weekly mock.
Professional Certifications (CPA, PMP, CISSP)
Primary stack: Spaced flashcards for domain knowledge + interleaved practice exams + error log analysis.
Evidence basis: Professional exam pass rates correlate with practice test volume, not study guide rereading.
Daily minimum: 20 min flashcards + 30 min practice questions.
Language Proficiency (TOEFL, IELTS, DELE)
Primary stack: Vocabulary flashcards with production cards + interleaved skill practice (reading, listening, writing, speaking) + concrete sentence examples.
Evidence basis: Vocabulary depth (spaced flashcards) predicts all four skill scores. Passive immersion without retrieval produces recognition without production.
Daily minimum: 20 min flashcards + 15 min production practice.
For Teachers and Parents: Evidence-Based Recommendations
If you guide others' learning, these evidence-based recommendations produce the largest impact with the smallest intervention.
Top 5 Recommendations for Educators
- Require retrieval in class: Start lectures with a quiz on previous material (Tier S). Low-stakes, frequent, cumulative.
- Assign spaced review, not rereading: "Review flashcards for 15 minutes" not "Reread Chapter 5."
- Design interleaved assessments: Mix topics on all quizzes and exams — never single-topic tests after initial learning.
- Teach technique explicitly: Most students do not know rereading is ineffective. One 30-minute lesson on evidence-based techniques changes behavior.
- Model metacognition: Think aloud during problem-solving. Ask "How do you know you understand this?" not "Do you understand?"
Top 5 Recommendations for Parents
- Ask "Can you explain it to me?" not "Did you study?" — retrieval practice in disguise
- Provide flashcard tools: Problemory Flashcards or Anki — remove the barrier to Tier S techniques
- Protect sleep: 7–8 hours non-negotiable — sleep deprivation destroys Tier S effectiveness
- Celebrate retrieval success: Praise correct recall, not hours spent at desk
- Do not reward rereading: "You read the chapter three times!" reinforces Tier D behavior
A Brief History of Learning Technique Research
The evidence ranking in this guide represents over a century of scientific inquiry into how humans learn and remember.
- 1885: Hermann Ebbinghaus publishes forgetting curve research — first systematic study of memory retention over time, establishing the foundation for spaced repetition
- 1917: Arthur Gates demonstrates that recitation (active recall) produces better retention than reading — early evidence for the testing effect
- 1978: Thomas Bjork introduces "desirable difficulties" framework — explaining why effective learning feels hard
- 1979: John Flavell coins "metacognition" — establishing the study of thinking about thinking
- 1994: Michelene Chi demonstrates self-explanation effect — students who explain learn twice as much
- 2006: Roediger and Karpicke publish landmark testing effect study — 80% vs 36% retention
- 2007: Rohrer and Taylor demonstrate interleaving superiority — 72% vs 38% on delayed tests
- 2008: Cepeda et al. meta-analysis confirms spacing effect across 317 experiments
- 2013: Dunlosky et al. publish comprehensive technique review — the foundation of modern evidence-based study recommendations
- 2014: Rowland meta-analysis confirms testing effect across 217 studies (d = 0.50)
- 2020: Brunmeyer and MacDonald meta-analysis confirms interleaving benefits across domains
Tools for Evidence-Based Learning
| Tier | Technique | Tool |
|---|---|---|
| S | Practice testing + Spacing | Problemory Flashcards, Anki |
| S | Practice testing | Practice exams, Problemory Score Tracker |
| A | Interleaved practice | Mixed flashcard decks, mixed problem sets |
| A | Self-explanation | Blank paper, Feynman technique |
| B | Dual coding | Drawing tools, mind mapping apps |
| B | Keyword mnemonics | Problemory Mnemonic Generator |
| B | Memory palace | Problemory Memory Palace |
| Meta | Metacognitive tracking | Score Tracker, learning journal |
Practical Exercises
Exercise 1: Personal Technique Audit
List every study method you used last week. Rank each against this evidence ranking. Calculate what percentage of your study time went to Tier S/A vs Tier C/D. Target: 80%+ on Tier S/A within 30 days.
Exercise 2: Tier Comparison Test
Take one topic. Study half using rereading (Tier D). Study half using flashcards (Tier S). Test both 48 hours later with free recall. Quantify the tier difference with your own data.
Exercise 3: Technique Stack Session
Execute one full study session using the complete technique stack (plan → learn with self-explanation → flashcards → retrieve → evaluate). Compare retention to a typical passive session on similar material.
Exercise 4: 30-Day Tier S Challenge
For 30 days, use only Tier S techniques (practice testing + spaced repetition). No rereading, no highlighting. Track retention rate weekly. Document improvement over the month.
Exercise 5: Combination Experiment
Study three similar topics with three different combinations: (A) testing alone, (B) spacing alone, (C) testing + spacing + interleaving. Test all three after one week. Identify your optimal combination.
FAQ
What are the most evidence-based learning techniques?
Practice testing (active recall) and distributed practice (spaced repetition) have the strongest evidence — both rated high utility by Dunlosky et al. and confirmed by hundreds of subsequent studies. Interleaved practice, self-explanation, and elaborative interrogation rank next with strong evidence.
Is rereading ever useful?
The first read of material is necessary for initial exposure. Beyond the first read, rereading produces negligible additional retention (d = 0.05–0.15). Replace second and subsequent reads with retrieval practice — one read-recite cycle outperforms three rereads.
Why do students use ineffective techniques?
Ineffective techniques (rereading, highlighting) feel productive because they create fluency — material feels familiar. Effective techniques (testing, spacing) feel harder because they reveal gaps. Students choose based on feeling, not evidence. Education rarely teaches technique selection explicitly.
Can I combine multiple techniques?
Yes — combining techniques produces larger effects than any single technique. The gold standard combination is practice testing + spaced repetition (flashcards with SRS). Add interleaving, self-explanation, and sleep consolidation for maximum effect.
How long until I see results from switching techniques?
Measurable retention improvement within 1–2 weeks of switching from Tier D to Tier S techniques. Full calibration and habit formation takes 4–8 weeks. Exam score improvements typically visible within 4–6 weeks.
Are flashcards the best study method?
Flashcards automate the two highest-evidence techniques (practice testing + spaced repetition) — making them the most efficient single tool. However, flashcards alone may insufficient for procedural skills (need interleaved problem practice) and application tasks (need timed practice tests). Use flashcards as the foundation, supplemented by other Tier A techniques as needed.
What technique has the largest effect size?
Practice testing (d = 0.50–0.80) and spaced repetition (d = 0.40–0.70) have the largest consistently replicated effect sizes. Interleaved practice shows large effects (d = 0.40–0.60) in direct comparisons with blocked practice. Combined techniques produce even larger practical improvements.
Do these rankings apply to all ages?
Yes. The testing effect, spacing effect, and interleaving effect replicate across age groups from elementary school through adulthood. Implementation may differ (younger learners need more scaffolding for self-explanation), but the relative rankings hold.
How does this ranking compare to the original Dunlosky review?
This guide extends the 2013 Dunlosky review with subsequent research (2013–2025). Interleaving has been upgraded from moderate to Tier A based on meta-analytic evidence. Generation effect and varied practice are added as emerging Tier A techniques. Memory palace and chunking are included as additional techniques. The core finding is unchanged: practice testing and spaced repetition remain the highest-evidence techniques.
Should I stop using all Tier C and D techniques immediately?
Stop Tier D techniques (rereading, highlighting) as primary methods immediately — replace with Tier S equivalents. Tier C techniques (summarization, keyword mnemonics, imagery) can supplement Tier S/A when used in appropriate contexts — keyword mnemonics for vocabulary encoding, summarization followed by self-testing. Never use Tier C/D as your primary or only study method.
What if my teacher requires highlighting or summary assignments?
Complete the assignment, then add retrieval practice on the same material. Highlight, then create flashcards from highlighted content. Summarize, then test yourself on the summary without looking. The assignment becomes Tier B/C input; your added retrieval practice provides the Tier S learning.
Quick Reference: Technique Selection at a Glance
Use this quick reference when you need to choose a technique immediately.
| Situation | Use This | Tier | Avoid This |
|---|---|---|---|
| Learning new vocabulary | Spaced flashcards + production practice | S | Rereading word lists |
| Preparing for any exam | Daily flashcards + weekly mock tests | S | Cramming + rereading |
| Understanding a concept | Self-explanation + Feynman technique | A | Rereading the explanation |
| Solving math/science problems | Interleaved practice problems | A | Blocked single-topic sets |
| Remembering a list | Memory palace + spaced flashcards | B+/S | Rereading the list |
| Reading a textbook chapter | Read-recite-review + flashcard creation | S | Highlighting + rereading |
| Reviewing before exam | Flashcard review + practice test | S | Rereading all notes |
| Material feels easy during study | Switch to interleaving or self-testing | A/S | Continuing passive review |
| Cannot explain a concept simply | Elaborative interrogation + examples | A/B | Rereading until it "clicks" |
| Long-term professional knowledge | Daily spaced review indefinitely | S | Cramming before assessments |
The pattern is consistent across every situation: retrieval beats recognition, spacing beats massing, and active beats passive. When in doubt, test yourself on spaced intervals. That single principle captures the essence of every Tier S and A technique in this ranking — and it is the most important takeaway from decades of learning science research.
Key Takeaways
- Practice testing and spaced repetition are Tier S — the highest-evidence techniques, confirmed by 500+ and 300+ studies respectively
- Students use techniques in almost perfect inverse order to the evidence — rereading and highlighting (worst) are most popular; testing and spacing (best) are least popular
- Effect sizes for top techniques (d = 0.50–0.80) represent 50–100% retention improvement over common methods — with the same study time
- Interleaved practice, self-explanation, and elaborative interrogation are Tier A — strong evidence, use regularly
- Rereading and highlighting are Tier D — avoid as primary methods; replace with active recall and flashcards
- Combining techniques (testing + spacing + interleaving) produces multiplicative effects exceeding any single technique
- Effective techniques feel harder during study — the difficulty IS the learning (desirable difficulties)
- Build your system progressively: install Tier S first, add Tier A over weeks, eliminate Tier D immediately
- Every subject benefits from Tier S techniques; specific Tier A/B techniques vary by material type
- The ranking is based on retention and transfer — not enjoyment, speed, or subjective confidence
- When in doubt, test yourself on spaced intervals — that single principle captures the essence of all top-ranked evidence-based learning techniques
Replace one Tier D habit with one Tier S habit today. Open Problemory's Flashcards Trainer, create cards for your current material, and review them tomorrow. Track your retention in the Score Tracker. The evidence is clear — the techniques at the top of this ranking work. The only question is whether you will use them. Start with practice testing and spaced repetition today. Everything else builds from there.
Ready to Try Flashcards Trainer?
Put your knowledge into practice with our interactive flashcards trainer.
Start TrainingRelated Articles
How Active Recall Works and Why It Beats Rereading
Active recall strengthens memory by forcing retrieval, not recognition. Learn how it works, why it beats rereading, and how to use it in your studies today.
Read MoreThe Complete Guide to Spaced Repetition
Spaced repetition fights the forgetting curve by reviewing material at optimal intervals. Learn the science, schedules, tools, and how to build a system that works.
Read MoreHow to Memorize Anything Faster
Learn how to memorize anything faster using active recall, spaced repetition, mnemonics, and chunking. Practical strategies for exams, languages, and daily life.
Read More