Skip to main content
Learning Science

Evidence-Based Learning Techniques Ranked

Learning techniques ranked by scientific evidence — practice testing, spaced repetition, interleaving, and more. Effect sizes, use cases, and how to combine them.

15/7/2025
54 min read

Students spend thousands of hours studying over their academic careers. Most of that time goes to techniques that research has repeatedly shown produce poor long-term retention: rereading textbooks, highlighting passages, copying notes, and cramming the night before exams. Meanwhile, techniques with the strongest scientific support — practice testing, spaced repetition, interleaved practice — are used least frequently. This is not because students are lazy or unintelligent. It is because the techniques that feel most effective are not the techniques that actually work, and nobody taught them the difference.

In 2013, John Dunlosky and colleagues published a landmark review evaluating ten common learning techniques by evidence quality. Their findings shocked educators: the methods students prefer rank lowest, and the methods they avoid rank highest. Since then, hundreds of additional studies have confirmed, refined, and extended these rankings. This guide presents the definitive evidence-based ranking of learning techniques — not based on popularity, tradition, or what feels productive, but on decades of cognitive psychology research measuring actual long-term retention and transfer.

Every technique in this guide is ranked by scientific evidence, explained with its underlying mechanism, and paired with practical implementation guidance. You will learn which techniques to prioritize, which to abandon, how to combine techniques for maximum effect, and how to build a personal learning system grounded in research rather than habit. Whether you are a student, professional, or lifelong learner, this ranking gives you the evidence to study smarter — not harder.

How Techniques Are Ranked

This ranking evaluates learning techniques by the quality and consistency of scientific evidence supporting their effectiveness for long-term retention and transfer — not by popularity, ease of use, or subjective feeling of productivity.

Ranking Criteria

  • Evidence quality: Number of studies, sample sizes, experimental rigor (randomized controlled trials vs correlational)
  • Effect consistency: Whether results replicate across materials, age groups, and laboratories
  • Effect size: Magnitude of improvement over control conditions (measured by Cohen's d, Hedges' g, or percentage improvement)
  • Retention interval: Whether benefits persist days, weeks, and months after learning — not just immediate post-study
  • Transfer: Whether benefits extend to new materials and contexts beyond what was directly studied
  • Practical utility: Whether the technique can be implemented by learners without specialized equipment or extensive training

Tier System

TierEvidence LevelRecommendation
SVery strong — hundreds of replicationsUse always — foundation of every study system
AStrong — consistent moderate-to-large effectsUse regularly — core supplementary techniques
BModerate — reliable but context-dependentUse strategically — for specific material types
CLow-moderate — limited or narrow evidenceUse with caution — not as primary methods
DLow — weak or negative evidenceAvoid as primary — replace with higher-tier techniques

What This Ranking Does Not Measure

Evidence rankings measure retention and transfer — not enjoyment, speed of initial encoding, or subjective confidence. High-tier techniques often feel harder and less productive during study while producing dramatically better long-term results. Low-tier techniques often feel easy and productive while producing poor retention. This feeling-reality gap is why evidence-based rankings matter — they override misleading subjective experience.

Evidence-based learning techniques ranked by scientific research for long-term retention
Practice testing and spaced repetition rank highest in evidence — rereading and highlighting rank lowest.

The Dunlosky Review: Foundation Study

John Dunlosky, Katherine Rawson, Elizabeth Marsh, Mitchell Nathan, and Daniel Willingham published "Improving Students' Learning With Effective Learning Techniques" in Psychological Science in the Public Interest (2013). This review remains the most cited and comprehensive evaluation of learning techniques available.

What They Evaluated

Ten learning techniques commonly used by students: elaborative interrogation, self-explanation, summarization, highlighting, keyword mnemonic, imagery for text, rereading, practice testing, distributed practice, and interleaved practice. Each was rated on utility (high, moderate, low) based on evidence quality, breadth of applicability, and ease of implementation.

Key Findings

TechniqueDunlosky RatingCurrent Tier
Practice testingHigh utilityS
Distributed practice (spacing)High utilityS
Interleaved practiceModerate utilityA (upgraded with subsequent research)
Elaborative interrogationModerate utilityA
Self-explanationModerate utilityA
Keyword mnemonicLow utilityC
Imagery for textLow utilityC
SummarizationLow utilityC
HighlightingLow utilityD
RereadingLow utilityD

Why the Review Matters

Before Dunlosky et al., learning technique recommendations were based on tradition, teacher preference, and intuition. After the review, educators and learners had an evidence-based framework for strategy selection. The review's most important finding was not which techniques work — it was the gap between what students do (rereading, highlighting) and what works (testing, spacing). This guide extends the Dunlosky framework with subsequent research (2013–2025) that has strengthened some ratings and added new techniques.

Tier S: Highest Evidence (Use Always)

Tier S techniques have the strongest, most replicated evidence base. Every effective study system should be built on these two foundations. If you implement nothing else, implement these.

#1 Practice Testing (Active Recall)

Tier: S | Dunlosky Rating: High | Effect Size: d = 0.50–0.80 | Evidence: 500+ studies

What It Is

Practice testing — also called retrieval practice or active recall — involves testing yourself on material without looking at notes. Flashcards, practice problems, free recall writes, practice exams, and self-quizzing are all forms of practice testing. The act of retrieving information from memory strengthens the memory trace more than restudying the same information.

Why It Works

  • Testing effect: Retrieval itself is a learning event — each successful recall strengthens the neural pathway to that memory
  • Desirable difficulty: Retrieval is harder than recognition, producing stronger encoding (Bjork)
  • Metacognitive calibration: Testing reveals what you actually know vs what you think you know
  • Exam transfer: Practice testing mirrors exam demands (retrieval under pressure) better than any other technique
  • Error detection: Failed retrievals identify specific gaps for targeted review

Key Research

Roediger and Karpicke (2006): students who tested themselves remembered 80% after one week; students who restudied remembered 36%. Same study time, more than double the retention. Rowland (2014) meta-analysis of 217 studies confirmed the testing effect with an overall effect size of d = 0.50. Adesope et al. (2010) meta-analysis found testing effects across age groups, materials, and retention intervals.

How to Implement

  • Flashcards: Daily review with Problemory Flashcards or Anki — attempt recall before revealing answer
  • Free recall: After every study session, close materials and write everything remembered (5–10 min)
  • Practice problems: Solve without reference — cover solutions, attempt first
  • Practice exams: Weekly timed tests under exam conditions
  • Read-recite-review: Read a section, close book, recite from memory, check gaps
  • Pre-testing: Attempt questions before learning the material — even failed attempts improve subsequent learning

When to Use

Always. Practice testing should be part of every study session for every subject. Minimum: one retrieval event per session. Optimal: retrieval bookends every session (begin and end with testing).

Deep dive: active recall guide → | retrieval practice →

#2 Distributed Practice (Spaced Repetition)

Tier: S | Dunlosky Rating: High | Effect Size: d = 0.40–0.70 | Evidence: 300+ studies

What It Is

Distributed practice — spacing study sessions over time rather than massing them in one block — produces substantially better long-term retention. Reviewing material at expanding intervals (1 day, 3 days, 7 days, 14 days, 30 days) leverages the spacing effect to build durable memories with minimum total study time.

Why It Works

  • Spacing effect: Gaps between sessions allow memory consolidation and create desirable difficulty at each retrieval
  • Forgetting and relearning: Partial forgetting before review forces deeper re-encoding than immediate restudy
  • Context variation: Spaced sessions occur in different mental states, creating multiple retrieval cues
  • Consolidation time: Sleep-dependent memory consolidation occurs between spaced sessions

Key Research

Cepeda et al. (2006) meta-analysis of 317 experiments: spaced practice produced large effect sizes across all retention intervals tested. Cepeda et al. (2008) identified optimal gap as 10–20% of desired retention interval. Carpenter et al. (2012) demonstrated spacing benefits in real classroom settings, not just laboratories.

How to Implement

  • Flashcard SRS: Automated spacing via Problemory Flashcards or Anki — review due cards daily
  • Manual schedule: Review new material at 1, 3, 7, 14, 30 days after initial learning
  • Leitner box: Physical 5-box system with expanding review intervals
  • Calendar blocking: Schedule chapter reviews at expanding intervals on calendar

When to Use

Always, in combination with practice testing. Spacing without retrieval (rereading at intervals) is less effective. Retrieval with spacing is the gold standard combination. Start spacing from day one of learning — not before exams.

Deep dive: spaced repetition guide → | review schedule →

Tier A: Strong Evidence (Use Regularly)

Tier A techniques have strong evidence supporting their effectiveness, often with large effect sizes in specific contexts. Use them regularly as supplements to Tier S foundations.

#3 Interleaved Practice

Tier: A | Dunlosky Rating: Moderate (upgraded) | Effect Size: d = 0.40–0.60 | Evidence: 100+ studies

What It Is

Interleaved practice mixes different topics, problem types, or skills within a single study session rather than studying one topic exhaustively before moving to the next (blocked practice). Instead of AAA BBB CCC, interleaving produces ABC ABC ABC.

Why It Works

  • Discrimination: Mixing similar concepts forces you to identify which strategy or knowledge applies — critical for exams
  • Desirable difficulty: Interleaving feels harder during practice, producing stronger retention
  • Exam simulation: Real exams mix topics randomly — interleaved practice mirrors this
  • Prevents context-dependent learning: Blocked practice creates "I can do this because I just studied it" without genuine mastery

Key Research

Rohrer and Taylor (2007): interleaved math problems produced 72% accuracy vs 38% for blocked practice after one week. Brunmeyer and MacDonald (2020) meta-analysis confirmed interleaving benefits across domains. Dunlosky rated it moderate in 2013; subsequent research supports upgrading to Tier A.

How to Implement

  • Mix problem types from multiple chapters in each practice session
  • Use one interleaved flashcard deck (not separate decks per chapter)
  • Take practice tests that mix all covered topics
  • After initial blocked learning of a new topic, switch to interleaved review

Deep dive: interleaving guide →

#4 Elaborative Interrogation

Tier: A | Dunlosky Rating: Moderate | Effect Size: d = 0.30–0.50 | Evidence: 80+ studies

What It Is

Elaborative interrogation involves asking "why" and "how" questions about facts and concepts during learning. "Why does this make sense?" "Why is this true?" "How does this relate to what I already know?" Forcing explanatory answers produces deeper processing than passive acceptance of facts.

Why It Works

Elaborative interrogation forces connection between new information and existing knowledge — building associative networks that provide multiple retrieval pathways. It also reveals comprehension gaps immediately — if you cannot explain why, you do not understand.

How to Implement

  • For every fact learned, ask "Why is this true?" and answer before moving on
  • Add "Why?" to flashcard backs alongside definitions
  • During reading, pause after each section and ask "Why does the author claim this?"
  • Best for factual material with logical relationships — less effective for arbitrary facts

#5 Self-Explanation

Tier: A | Dunlosky Rating: Moderate | Effect Size: d = 0.30–0.60 | Evidence: 60+ studies

What It Is

Self-explanation requires generating explanations of material in your own words as you study — not copying or paraphrasing, but explaining how concepts work, why steps in a procedure are taken, and how parts relate to the whole.

Why It Works

Chi et al. (1994) found that students who self-explained during problem-solving learned twice as much as students who did not — even when the self-explainers took longer. Self-explanation forces generative processing, comprehension monitoring, and gap identification simultaneously.

How to Implement

  • After each paragraph or problem step, explain it aloud or in writing in your own words
  • Use the Feynman technique — explain as if teaching a beginner (Feynman guide →)
  • Write margin notes that explain, not copy
  • Best for procedural material (math, science, coding) and conceptual material

Tier B: Moderate Evidence (Use Strategically)

Tier B techniques have reliable evidence in specific contexts but are not universally applicable. Use them strategically for appropriate material types.

#6 Dual Coding

Tier: B | Effect Size: d = 0.20–0.40 | Evidence: 40+ studies

What It Is

Dual coding combines verbal and visual processing of the same information — creating diagrams, mind maps, or sketches alongside text notes. Paivio's dual coding theory proposes that verbal and visual information are processed through separate channels, and combining them creates redundant retrieval pathways.

How to Implement

  • Draw diagrams of processes described in text
  • Create mind maps connecting concepts radially
  • Sketch structures, cycles, and systems from memory
  • Best for spatial, structural, and process-oriented material
  • Must be generative (you create the visual) — not passive viewing of others' diagrams

#7 Concrete Examples

Tier: B | Effect Size: d = 0.30–0.50 | Evidence: 30+ studies

What It Is

Providing or generating concrete examples of abstract concepts accelerates understanding and retention. Abstract definitions are harder to encode than definitions paired with specific, vivid examples — especially self-generated ones.

How to Implement

  • For every abstract concept, write or find two concrete examples
  • Include examples on flashcard backs — not just definitions
  • Generate original examples (stronger than reading provided ones)
  • Best for abstract concepts in math, philosophy, law, and science

#8 Retrieval Plus Elaboration

Tier: B | Effect Size: d = 0.40–0.60 | Evidence: 30+ studies

What It Is

Combining retrieval practice with elaborative processing — retrieve a fact, then explain why it is true, connect it to other knowledge, or generate an example. This combination produces larger effects than either technique alone.

How to Implement

  • Flashcard front: term. Back: definition + why it matters + example + connection to related concept
  • After free recall, elaborate on each recalled item — do not just list, explain
  • After practice problems, explain why the solution method works, not just how

#9 Sleep and Memory Consolidation

Tier: B | Effect Size: d = 0.30–0.50 | Evidence: 100+ studies

What It Is

Sleep — particularly slow-wave and REM sleep — consolidates memories encoded during waking study. Studying before sleep and getting adequate sleep (7–8 hours) significantly improves retention compared to equal study time with sleep deprivation.

Key Research

Walker and Stickgold: sleep after learning improves retention by 20–40%. Gais and Born: sleep after vocabulary learning produces 30–50% better retention. Sleep deprivation reduces learning capacity by up to 40%.

How to Implement

  • Review flashcards before bed — sleep consolidates what you studied last
  • Protect 7–8 hours of sleep during study periods — non-negotiable
  • Schedule difficult new learning in the morning when well-rested
  • Never pull all-nighters — material studied while sleep-deprived is poorly encoded AND poorly consolidated

Deep dive: sleep and memory →

Tier C: Low-Moderate Evidence (Use With Caution)

Tier C techniques show benefits in narrow contexts but lack the broad, robust evidence of higher tiers. Use as supplements, not foundations.

#10 Summarization

Tier: C | Dunlosky Rating: Low | Effect Size: d = 0.10–0.30 | Evidence: 40+ studies

What It Is

Writing summaries of read material — condensing chapters, lectures, or articles into shorter form. Dunlosky rated summarization low because benefits depend heavily on skill level and are inconsistent across studies.

When It Helps

Summarization produces learning when it requires deep processing — writing in your own words, identifying main points, connecting ideas. It fails when it becomes copying or when summaries are never reviewed via retrieval.

How to Upgrade Summarization

Do not summarize and stop — summarize, then test yourself on the summary without looking. Convert summary points into flashcards. Use Cornell note-taking format with summary section on left and questions on right.

#11 Keyword Mnemonic

Tier: C | Dunlosky Rating: Low | Effect Size: d = 0.30–0.50 (narrow) | Evidence: 30+ studies

What It Is

Creating phonetic links between foreign words and familiar words, paired with vivid imagery. Effective for paired associate learning (foreign vocabulary) but limited generalizability to other material types.

When It Helps

Language vocabulary acquisition, medical terminology, lists of unrelated items. Less effective for conceptual understanding, procedural skills, or interconnected knowledge. Best used as an encoding accelerator paired with spaced repetition for retention.

Deep dive: mnemonic techniques → | memory palace →

#12 Imagery for Text

Tier: C | Dunlosky Rating: Low | Effect Size: d = 0.10–0.30 | Evidence: 20+ studies

What It Is

Creating mental images while reading text to enhance encoding. Dunlosky rated it low because effects are inconsistent and depend on individual imagery ability and material type.

When It Helps

Concrete, visualizable content (narratives, descriptions, spatial information). Less effective for abstract concepts, mathematical reasoning, or procedural content. Vivid, absurd images work better than literal ones.

Tier D: Low Evidence (Avoid as Primary Methods)

Tier D techniques are widely used but poorly supported by evidence. They may aid initial comprehension but do not produce durable retention. Replace with Tier S and A techniques.

#13 Rereading

Tier: D | Dunlosky Rating: Low | Effect Size: d = 0.05–0.15 | Evidence: 50+ studies (consistently weak)

What It Is

Rereading textbooks, notes, or articles — the most commonly used study technique worldwide. Dunlosky rated it low utility despite its popularity.

Why It Fails

  • Creates fluency illusion — material feels familiar, producing false confidence
  • No retrieval practice — information flows in but is never pulled out
  • Diminishing returns — first read captures most benefit; subsequent reads add little
  • Attention decline — eyes move on rereads but processing stops

Replacement

Read once, then switch to read-recite-review or flashcards. One retrieval cycle beats three rereads (study mistakes →).

#14 Highlighting and Underlining

Tier: D | Dunlosky Rating: Low | Effect Size: d = 0.00–0.10 | Evidence: 30+ studies (consistently null)

What It Is

Marking text with highlighter or underlining key passages — the second most popular study technique after rereading.

Why It Fails

  • Shallow processing — marking words without understanding concepts
  • Over-highlighting — 30–50% of text highlighted means nothing is distinguished
  • No retrieval — reviewing highlights is rereading with color, not testing
  • False completion — highlighted pages feel "done" without learning occurring

Replacement

Replace highlighting with margin notes in your own words + flashcard creation for key concepts. If you highlight, follow up with retrieval practice on highlighted content only.

Effect Sizes: How Much Each Technique Helps

Effect sizes quantify the magnitude of improvement. Cohen's d interpretation: 0.2 = small, 0.5 = medium, 0.8 = large. In learning research, even d = 0.30 represents meaningful improvement that can change exam outcomes.

RankTechniqueTierEffect Size (d)Retention Improvement
1Practice testingS0.50–0.8050–100% better than restudy
2Spaced repetitionS0.40–0.7040–70% better than massed
3Interleaved practiceA0.40–0.6040–90% better than blocked
4Self-explanationA0.30–0.6030–60% better than reading
5Elaborative interrogationA0.30–0.5030–50% better than reading
6Retrieval + elaborationB0.40–0.60Combined exceeds either alone
7Concrete examplesB0.30–0.50Abstract concepts only
8Dual codingB0.20–0.40Visual/spatial material
9Sleep consolidationB0.30–0.5020–40% vs sleep deprivation
10Keyword mnemonicC0.30–0.50Paired-associate only
11SummarizationC0.10–0.30Skill-dependent, inconsistent
12Imagery for textC0.10–0.30Concrete content only
13RereadingD0.05–0.15Negligible beyond first read
14HighlightingD0.00–0.10Null effect on test performance

What Effect Sizes Mean in Practice

A student scoring 60% using rereading might score 75–85% using practice testing with the same study time. A student scoring 50% using blocked practice might score 70–80% using interleaved practice. These are not marginal improvements — they are the difference between failing and passing, or between passing and excelling. The techniques at the top of this ranking do not require more time — they require different methods.

Student applying top-ranked evidence-based learning techniques including active recall and spaced repetition
Switching from Tier D to Tier S techniques can improve retention by 50–100% with the same study time.

Combining Techniques for Maximum Effect

Individual techniques produce significant effects. Combined techniques produce multiplicative effects — the whole exceeds the sum of parts.

The Gold Standard Combination

Practice testing + Spaced repetition — the two Tier S techniques combined. Flashcards with SRS automate both: each review is a retrieval event (testing) scheduled at optimal intervals (spacing). This single combination outperforms any other technique or combination in the research literature.

High-Impact Combinations

CombinationTechniques CombinedBest For
Gold standardPractice testing + SpacingAll material — foundation of every system
Deep encodingPractice testing + Self-explanationConceptual and procedural material
Exam preparationPractice testing + Interleaving + SpacingMixed-topic exams
Abstract conceptsElaborative interrogation + Concrete examples + TestingPhilosophy, law, advanced science
Visual materialDual coding + Practice testing + SpacingAnatomy, geography, engineering
VocabularyKeyword mnemonic + Spaced flashcards + Production practiceLanguage learning, terminology
Maximum retentionTesting + Spacing + Interleaving + Self-explanation + SleepHigh-stakes comprehensive exams

Technique Stacking Protocol

For each study session, stack techniques in this order:

  1. Plan (metacognition): Set goal, choose strategy, define success check
  2. Learn: Initial exposure with self-explanation and elaborative interrogation
  3. Encode: Create flashcards with examples and connections (dual coding where appropriate)
  4. Retrieve: Practice testing — flashcards, free recall, problems
  5. Space: Schedule review at expanding intervals
  6. Interleave: Mix with other topics in subsequent sessions
  7. Evaluate (metacognition): Assess retention, adjust strategy
  8. Consolidate: Pre-sleep review + adequate sleep

Best Techniques by Learning Goal

Exam Preparation (High-Stakes Tests)

  1. Practice testing (daily flashcards + weekly mock exams)
  2. Spaced repetition (from day one, not just before exam)
  3. Interleaved practice (mixed-topic sessions and tests)
  4. Self-explanation (for conceptual understanding)
  5. Sleep consolidation (7–8 hours, pre-sleep review)

Long-Term Retention (Professional Knowledge)

  1. Spaced repetition (daily maintenance review indefinitely)
  2. Practice testing (regular self-assessment)
  3. Elaborative interrogation (connect to professional experience)
  4. Concrete examples (from real work contexts)

Language Learning

  1. Spaced repetition flashcards (vocabulary + grammar)
  2. Practice testing (production — L1 to L2 direction)
  3. Keyword mnemonics (initial encoding of difficult words)
  4. Concrete examples (sentences in context)
  5. Interleaved practice (mix vocabulary, grammar, reading, listening)

Skill Acquisition (Math, Coding, Procedures)

  1. Interleaved practice (mixed problem types)
  2. Practice testing (solve without reference)
  3. Self-explanation (explain each step and why)
  4. Spaced repetition (review problem types at intervals)

Reading Comprehension and Retention

  1. Self-explanation (explain each section in own words)
  2. Elaborative interrogation (ask why after each claim)
  3. Practice testing (free recall after each chapter)
  4. Spaced repetition (flashcards for key concepts)

Best Techniques by Subject

SubjectPrimary TechniqueSecondary TechniquesAvoid
MathematicsInterleaved problem practiceSelf-explanation, spaced review of problem typesRereading worked examples
BiologySpaced flashcardsDual coding (diagrams), self-explanation of processesHighlighting terminology lists
ChemistryPractice testing (reactions, mechanisms)Concrete examples, interleaved problem typesRereading reaction lists
PhysicsInterleaved problem solvingSelf-explanation of concepts, concrete examplesRereading formula sheets
HistorySpaced flashcards + elaborative interrogationSelf-explanation (cause-effect), concrete examplesHighlighting timelines
LawPractice testing (case application)Elaborative interrogation, concrete examplesRereading case summaries
MedicineSpaced flashcards (Anki)Interleaved practice questions, self-explanationRereading textbooks
LanguagesSpaced flashcards (production)Keyword mnemonics, concrete sentence examplesRereading word lists
ProgrammingInterleaved coding problemsSelf-explanation of algorithms, practice testingRereading code examples
LiteratureSelf-explanation + elaborative interrogationPractice testing (essay outlines from memory)Highlighting passages

Best Techniques by Timeline

One Week Before Exam

Practice testing (2–3 mock exams) + spaced flashcard review (high-failure cards) + interleaved problem practice. No new content. No rereading.

One Month Before Exam

Daily spaced flashcards + weekly mock tests + interleaved practice + error log analysis. Reduce new content intake. Prioritize retrieval over acquisition.

Full Semester (3–6 Months)

Daily flashcards from day one + weekly practice tests + interleaved sessions + self-explanation during initial learning + monthly retention audits. Build the full technique stack progressively.

Long-Term (Years)

Daily spaced repetition maintenance + periodic practice testing + elaborative connections to new experience + annual knowledge audit. Tier S techniques indefinitely; others as needed.

What Students Do vs What Research Says

The gap between student behavior and research recommendations is the single largest untapped opportunity in education.

Rank by UsageStudent BehaviorResearch TierRank by Evidence
1 (most used)RereadingD13–14 (worst)
2HighlightingD14 (worst)
3Reviewing notesD13 (near worst)
4Creating summariesC10–11
5CrammingD (massed)N/A (anti-technique)
............
Rarely usedSelf-testingS1 (best)
Rarely usedSpaced reviewS2 (best)
Rarely usedInterleaved practiceA3 (third best)

Students rank techniques in almost perfect inverse order to the evidence. The techniques used most (rereading, highlighting) are the least effective. The techniques used least (self-testing, spacing) are the most effective. Closing this gap — replacing Tier D habits with Tier S habits — produces larger performance improvements than increasing study hours.

Building Your Evidence-Based Study System

Translating this ranking into a personal study system requires replacing low-tier habits with high-tier ones progressively — not all at once.

Phase 1: Foundation (Week 1–2)

  • Install daily flashcard review (Tier S: testing + spacing)
  • Replace rereading with read-recite-review (Tier S: testing)
  • Stop highlighting as a standalone method (eliminate Tier D)
  • Track retention rate to verify improvement

Phase 2: Expansion (Week 3–4)

  • Add weekly practice test (Tier S: testing)
  • Begin interleaved practice sessions (Tier A)
  • Add self-explanation to every study session (Tier A)
  • Start elaborative interrogation for factual material (Tier A)

Phase 3: Optimization (Week 5–8)

  • Add dual coding for visual/spatial material (Tier B)
  • Implement pre-sleep review + sleep protection (Tier B)
  • Build error log with strategy evaluation (metacognition)
  • Monthly retention audit and technique effectiveness review

Phase 4: Mastery (Week 9+)

  • Full technique stack running automatically
  • Strategy selection based on material type (automatic)
  • Calibration within 10% on practice tests
  • Personal evidence-based strategy profile documented

The Daily Evidence-Based Study System

Morning (25 minutes)

  1. Spaced flashcard review — all due cards (Tier S: testing + spacing) — 20 min
  2. Free recall warm-up — write yesterday's key points from memory (Tier S: testing) — 5 min

Study Block (45–90 minutes)

  1. Pre-study plan — goal, strategy, success check (metacognition) — 3 min
  2. Learn new material with self-explanation and elaborative interrogation (Tier A) — 25 min
  3. Create flashcards for new concepts with examples (Tier S + B) — 10 min
  4. Interleaved practice problems from current and previous topics (Tier A) — 20 min
  5. End-of-session free recall — close all materials, write key points (Tier S) — 5 min
  6. Post-session evaluation — three questions (metacognition) — 3 min

Evening (10 minutes)

  1. Pre-sleep flashcard review — today's failed cards (Tier S + B: sleep consolidation) — 5 min
  2. Learning journal entry — what worked, what did not (metacognition) — 5 min

Weekly (Sunday, 60 minutes)

  1. Cumulative practice test under timed conditions (Tier S) — 30 min
  2. Error analysis and categorization (metacognition) — 15 min
  3. Calibration review — predicted vs actual scores — 5 min
  4. Next week planning — priorities based on error data — 10 min

Implementation Mistakes

1. Adding Tier S Without Removing Tier D

Adding flashcards while continuing to reread and highlight. Total time increases but Tier D activities dilute Tier S benefits. Replace, do not accumulate.

2. Using Tier S Techniques Passively

Flipping flashcards without attempting recall first. Taking practice tests while peeking at notes. The retrieval attempt IS the learning event — passive flipping produces Tier D results from Tier S tools.

3. Spacing Without Testing

Reviewing material at intervals by rereading. Spacing rereading is better than massed rereading but far worse than spaced retrieval. Always pair spacing with testing.

4. Abandoning Tier A Techniques Because They Feel Hard

Interleaving and self-explanation feel slower and more difficult than rereading. This difficulty IS the learning — abandoning effective techniques because they feel ineffective is the fluency illusion in action.

5. Ignoring Context Dependencies

Keyword mnemonics for math problems. Highlighting for procedural skills. Every technique has optimal contexts — using techniques outside their evidence base produces disappointment and abandonment.

Emerging Techniques and Future Research

Learning science continues to evolve. Several emerging areas may change future rankings.

Desirable Difficulties Framework (Bjork)

Robert Bjork's desirable difficulties research suggests that conditions making learning feel harder (spacing, interleaving, generation, variation) produce better long-term retention. This framework explains WHY top-tier techniques work and predicts that future high-evidence techniques will share the property of introducing productive difficulty.

Generation Effect

Self-generated content (writing your own examples, creating your own questions, drawing your own diagrams) produces better retention than passively receiving the same content. Effect sizes comparable to practice testing for some materials. Likely future Tier A technique.

Pre-Testing Effect

Attempting to answer questions before learning the material improves subsequent learning — even when initial answers are wrong. Growing evidence base suggests pre-testing as a standalone Tier A technique for initial exposure.

AI-Assisted Learning

AI tools for generating practice questions, flashcards, and explanations show promise but lack long-term retention studies. Current recommendation: use AI to generate retrieval materials (practice questions, flashcards), not as a replacement for retrieval itself. AI-generated summaries used passively likely rank Tier D (AI for learning →).

Exercise and Cognitive Enhancement

Aerobic exercise before or after learning improves encoding and consolidation. Effect sizes moderate (d = 0.20–0.40). Best treated as a biological support for Tier S techniques rather than a standalone learning technique (exercise and cognition →).

Additional Techniques Beyond the Dunlosky Review

Research since 2013 has evaluated additional techniques not included in the original review. Here is how they rank based on subsequent evidence.

Memory Palace (Method of Loci) — Tier B+

Effect Size: d = 0.40–0.60 (for ordered lists) | Evidence: 40+ studies

The method of loci — placing vivid mental images in familiar spatial locations — produces large effects for ordered information (lists, sequences, speeches). Memory athletes use this technique exclusively. For academic learning, it works best for: ordered lists (periodic table groups, taxonomy classifications), sequential processes (legal arguments, historical timelines), and batch vocabulary encoding. Limitation: high initial encoding effort, requires transfer to spaced repetition for long-term maintenance. Best used as an encoding accelerator paired with flashcard review, not as a standalone retention system.

Deep dive: memory palace guide → | science of memory palaces →

Chunking — Tier B

Effect Size: d = 0.30–0.50 | Evidence: 30+ studies

Organizing individual items into meaningful groups (chunks) expands effective working memory capacity. Phone numbers (555-123-4567 vs 5551234567), chess positions (meaningful patterns vs individual pieces), and acronyms (HOMES for Great Lakes) all leverage chunking. For learning: group related vocabulary into thematic clusters, organize formulas by topic, create acronyms for list items. Chunking is most effective when chunks are meaningful (not arbitrary groupings) and when learners create their own chunks rather than receiving pre-made ones.

Deep dive: chunking guide →

Generation Effect — Tier A- (Emerging)

Effect Size: d = 0.40–0.60 | Evidence: 50+ studies

Information that learners generate themselves (examples, questions, summaries, diagrams) is remembered better than information passively received — even when the generated content is imperfect. Creating your own practice questions, writing your own examples, drawing your own diagrams, and formulating your own flashcards all leverage the generation effect. This overlaps with self-explanation and practice testing but emphasizes the creative/generative component specifically.

Varied Practice — Tier A- (Emerging)

Effect Size: d = 0.30–0.50 | Evidence: 30+ studies

Practicing skills in varied contexts (different problem formats, different settings, different conditions) produces better transfer to novel situations than practicing in identical conditions. Related to interleaving but focuses on contextual variation rather than topic mixing. For motor skills: practice in varied environments. For cognitive skills: solve problems in multiple formats (multiple choice, essay, application, teaching).

Retrieval-Induced Forgetting (and How to Avoid It)

Practicing retrieval of some items can temporarily suppress related unpracticed items — a phenomenon called retrieval-induced forgetting. Practical implication: cumulative review must cover ALL material, not just recently studied topics. Daily flashcard review handles this automatically (all due cards appear). Weekly cumulative tests prevent selective review that leaves early material vulnerable.

How Learning Research Works

Understanding how techniques are evaluated helps you assess claims and distinguish evidence-based recommendations from marketing.

The Gold Standard: Randomized Controlled Trials

Participants randomly assigned to technique A or technique B (or control). Same material, same time, same conditions — only the technique differs. Retention tested after a delay (typically 1 day to 1 week). The technique producing higher delayed retention wins. Most Tier S and A evidence comes from this design.

Meta-Analyses

Statistical combination of results from dozens or hundreds of individual studies. Meta-analyses provide the most reliable effect size estimates because they aggregate across laboratories, materials, and populations. The Dunlosky review, Cepeda spacing meta-analysis, and Rowland testing meta-analysis are examples.

Effect Size Interpretation

  • d = 0.20: Small effect — noticeable but modest improvement
  • d = 0.50: Medium effect — meaningful improvement that changes outcomes
  • d = 0.80: Large effect — dramatic improvement, often difference between pass and fail
  • d = 1.00+: Very large — rare in learning research, usually comparing effective vs completely ineffective methods

Red Flags in Learning Claims

  • "Brain training" apps claiming to improve general intelligence (see: brain training myths →)
  • Techniques validated only by immediate post-study tests (not delayed retention)
  • Claims based on single studies without replication
  • "Learning styles" (visual, auditory, kinesthetic) — no evidence that matching instruction to preferred style improves learning
  • Products claiming to bypass the need for effortful retrieval
  • Techniques that feel easy and promise fast results — effective learning requires desirable difficulty

Popular Learning Myths vs Evidence

MythEvidence SaysWhat to Do Instead
"Reread until you know it"Rereading creates fluency illusion, not retrievalRead once, then test yourself
"Highlight the important parts"Highlighting does not improve test performanceWrite margin notes + create flashcards
"Cram the night before"Massed practice produces rapid decayDaily spaced review from day one
"Study one topic at a time"Blocked practice produces poor transferInterleave topics within sessions
"I'm a visual learner"Learning styles hypothesis not supportedUse evidence-based techniques for all
"More hours = more learning"Method matters more than durationSame hours with Tier S techniques
"Review notes before exams"Passive review produces poor retentionPractice tests + flashcard review
"Copy notes neatly to learn them"Copying is shallow processingSelf-explain in own words + flashcards
"Brain training games improve memory"Transfer to real learning is not demonstratedPractice testing on actual material
"Listening to lectures twice is enough"Passive re-exposure adds minimal retentionLecture + self-explanation + flashcards

Detailed Implementation Guide by Technique

Implementing Practice Testing: Step by Step

  1. Choose format: Flashcards for facts, free recall for concepts, practice problems for procedures, mock exams for comprehensive assessment
  2. Attempt before checking: Always retrieve before looking at answers — the attempt IS the learning event
  3. Log failures: Every failed retrieval goes into error log or gets shorter flashcard interval
  4. Schedule: Minimum one retrieval event per study session; optimal: bookend every session with retrieval
  5. Increase difficulty: Progress from cued recall (flashcards) to free recall (blank page) to application (problems, essays)
  6. Track: Retention rate over time — should be 85%+ on mature cards

Implementing Spaced Repetition: Step by Step

  1. Choose system: Anki/Problemory for 200+ items; manual calendar for under 200
  2. Create cards on learning day: One concept per card, with example sentence
  3. Review daily: Complete all due cards before adding new ones
  4. New card rate: 10–20/day for sustainable growth; reduce if review exceeds 30 min
  5. Failed cards: Reset to short interval; re-review in same session
  6. Maintenance: Continue daily review indefinitely — never stop

Implementing Interleaved Practice: Step by Step

  1. Initial learning: First exposure can be blocked (one new topic at a time)
  2. Switch to interleaving: From second session onward, mix all previously studied topics
  3. Flashcards: One mixed deck — not separate decks per chapter
  4. Problems: Mixed problem sets from all chapters — never single-chapter problem sets for review
  5. Expect difficulty: Interleaved sessions feel harder — this confirms it is working
  6. Test interleaved: All practice tests should mix topics

Common Implementation Timeline

Most learners see measurable retention improvement within 1–2 weeks of switching from Tier D to Tier S techniques. Full habit formation and calibration accuracy typically develop over 4–8 weeks. Exam score improvements become visible within 4–6 weeks of consistent evidence-based practice. The investment is front-loaded — the first two weeks require conscious effort to override habitual passive methods. After that, Tier S techniques become automatic and require less willpower than the passive methods they replaced. The research is clear, the results are reproducible, and the techniques are available to you starting today.

Evidence in Action: Case Studies

Case 1: Medical Student — Anki Transformation

Before: Reread textbooks, highlighted notes, scored 65% on practice questions.
Intervention: Created Anki deck from day one of medical school. Daily 30-min review. Replaced all rereading with flashcard retrieval.
After: Step 1 score: 245 (90th percentile). Daily review time: 30 min. Total deck: 12,000 cards at 88% retention. Study hours decreased; scores increased.

Case 2: Language Learner — Tier S Only

Before: Reread vocabulary lists, watched videos passively. 500 words learned over 6 months with ~30% retention.
Intervention: Switched to daily spaced flashcards (20 words/day) with production cards and sentence examples.
After: 3,000 words in 6 months at 85% retention. Speaking ability matched vocabulary depth for the first time.

Case 3: Engineering Student — Interleaving Discovery

Before: Blocked problem practice (all Chapter 3, then all Chapter 4). Felt confident per chapter. Mixed exam score: 58%.
Intervention: Switched to interleaved problem sets mixing all chapters.
After: Next mixed exam: 79%. Interleaved practice felt harder during study but produced dramatically better exam performance.

Case 4: Professional Certification — Technique Stack

Before: Reread study guide twice, highlighted key sections. Practice exam: 62%.
Intervention: Full technique stack — daily flashcards, weekly mock exams, interleaved practice questions, self-explanation of missed concepts, pre-sleep review.
After: Passed certification on first attempt. Final practice exam: 87%. Total study time: same hours, different methods.

Technique Selection Decision Guide

Use this guide to select the optimal technique for any learning situation.

Step 1: What Type of Material?

  • Discrete facts (definitions, dates, terms, formulas) → Flashcards (Tier S) + Spacing (Tier S)
  • Concepts (theories, mechanisms, principles) → Self-explanation (Tier A) + Testing (Tier S)
  • Procedures (problem-solving, calculations, algorithms) → Interleaved practice (Tier A) + Testing (Tier S)
  • Application (essays, case analysis, clinical reasoning) → Practice testing (Tier S) + Concrete examples (Tier B)
  • Lists/sequences (ordered items, steps, classifications) → Memory palace (Tier B+) + Flashcards (Tier S)
  • Vocabulary (foreign words, terminology) → Spaced flashcards (Tier S) + Keyword mnemonics (Tier C)

Step 2: What Is Your Timeline?

  • Exam in 1 week: Testing (Tier S) + Spacing (Tier S) — focus on retrieval of existing material
  • Exam in 1 month: Add interleaving (Tier A) + weekly mock tests
  • Exam in 3+ months: Full technique stack — build progressively from Tier S upward
  • Long-term retention: Daily spacing (Tier S) indefinitely + periodic testing

Step 3: What Is Your Current Method?

  • If rereading: Replace with read-recite-review (Tier S) — highest priority change
  • If highlighting: Replace with margin notes + flashcards (Tier S) — highest priority change
  • If cramming: Install daily spaced review (Tier S) — start today
  • If already using flashcards: Add interleaving (Tier A) and weekly mock tests (Tier S)
  • If already using full stack: Optimize — track retention, adjust intervals, refine card quality

Sample Week Using Evidence-Based Techniques

DayTier STier ATier B/MetaTotal Time
MondayFlashcard review (20 min) + free recall (5 min)Self-explanation during new lecture contentPre-study plan (3 min)~60 min study
TuesdayFlashcard review (20 min) + practice problems (15 min)Interleaved problems (mixed chapters)Post-session evaluation (5 min)~65 min study
WednesdayFlashcard review (20 min) + free recall (5 min)Elaborative interrogation on new readingDual coding — draw process diagram~55 min study
ThursdayFlashcard review (20 min) + practice problems (15 min)Interleaved problems + self-explanationPre-sleep review (5 min)~65 min study
FridayFlashcard review (20 min) + sectional test (20 min)Interleaved review of week's topicsError log update (10 min)~70 min study
SaturdayFull mock exam (60 min) + flashcard review (20 min)Interleaved error re-attemptExam wrapper (10 min)~90 min study
SundayFlashcard review (20 min) + cumulative free recall (15 min)Strategy evaluationWeekly plan + calibration review (15 min)~50 min study

The Neuroscience Behind Top-Ranked Techniques

Understanding why Tier S and A techniques work at the neural level reinforces commitment when they feel difficult during practice.

Practice Testing and Neural Pathways

Each retrieval attempt activates the hippocampus and prefrontal cortex simultaneously — the hippocampus locates the memory trace while the prefrontal cortex executes controlled retrieval. Successful retrieval triggers dopamine release in the nucleus accumbens, reinforcing the neural pathway. Failed retrieval followed by correct feedback produces even stronger encoding than successful retrieval alone — the error signals the brain to prioritize that trace. This is why pre-testing (attempting before learning) and errorful retrieval produce such large effects.

Spacing and Synaptic Consolidation

Synaptic connections strengthen during consolidation periods between study sessions — not during study itself. Massed practice does not allow consolidation between exposures. Spaced practice allows protein synthesis, dendritic spine growth, and systems consolidation to occur between sessions, producing physically stronger memory traces. Sleep between spaced sessions amplifies this effect further (neuroplasticity →).

Interleaving and Neural Discrimination

Blocked practice activates the same neural circuits repeatedly — strengthening one pathway but not building discrimination between similar pathways. Interleaved practice activates different circuits in rapid succession, forcing the prefrontal cortex to select the appropriate pathway for each item. This selection process builds the discrimination ability that mixed exams require — explaining why interleaved learners outperform blocked learners on mixed tests despite equal total practice time.

Why Tier D Techniques Fail Neurologically

Rereading and highlighting primarily activate visual processing areas without triggering hippocampal retrieval circuits. Information passes through awareness without being consolidated into retrievable long-term memory. The fluency felt during rereading is perceptual fluency (ease of processing the text) misattributed as memory fluency (ability to retrieve the content) — two distinct neural phenomena that feel identical subjectively.

Evidence-Based Techniques for Specific Exams

Medical Licensing (USMLE, MCAT, PLAB)

Primary stack: Anki spaced repetition (10,000–15,000 cards) + UWorld/practice question interleaving + weekly timed practice blocks.
Evidence basis: Top scorers universally use spaced flashcards + interleaved question practice. Rereading textbooks correlates with lower scores in retrospective surveys.
Daily minimum: 30 min flashcard review + 30 min practice questions. No rereading.

Law (Bar Exam, LSAT)

Primary stack: Spaced flashcards for rules and definitions + interleaved practice essays + self-explanation of case reasoning.
Evidence basis: Bar passage correlates with practice question volume and spaced review, not outline rereading.
Daily minimum: 20 min flashcards + 45 min practice questions/essays.

Graduate Admissions (GRE, GMAT, LSAT)

Primary stack: Vocabulary flashcards with spacing + interleaved quant/verbal practice + weekly full mock tests.
Evidence basis: Vocabulary retention requires daily spaced review (Tier S). Quant improvement requires interleaved problem types (Tier A).
Daily minimum: 20 min vocabulary flashcards + 30 min interleaved practice + weekly mock.

Professional Certifications (CPA, PMP, CISSP)

Primary stack: Spaced flashcards for domain knowledge + interleaved practice exams + error log analysis.
Evidence basis: Professional exam pass rates correlate with practice test volume, not study guide rereading.
Daily minimum: 20 min flashcards + 30 min practice questions.

Language Proficiency (TOEFL, IELTS, DELE)

Primary stack: Vocabulary flashcards with production cards + interleaved skill practice (reading, listening, writing, speaking) + concrete sentence examples.
Evidence basis: Vocabulary depth (spaced flashcards) predicts all four skill scores. Passive immersion without retrieval produces recognition without production.
Daily minimum: 20 min flashcards + 15 min production practice.

For Teachers and Parents: Evidence-Based Recommendations

If you guide others' learning, these evidence-based recommendations produce the largest impact with the smallest intervention.

Top 5 Recommendations for Educators

  1. Require retrieval in class: Start lectures with a quiz on previous material (Tier S). Low-stakes, frequent, cumulative.
  2. Assign spaced review, not rereading: "Review flashcards for 15 minutes" not "Reread Chapter 5."
  3. Design interleaved assessments: Mix topics on all quizzes and exams — never single-topic tests after initial learning.
  4. Teach technique explicitly: Most students do not know rereading is ineffective. One 30-minute lesson on evidence-based techniques changes behavior.
  5. Model metacognition: Think aloud during problem-solving. Ask "How do you know you understand this?" not "Do you understand?"

Top 5 Recommendations for Parents

  1. Ask "Can you explain it to me?" not "Did you study?" — retrieval practice in disguise
  2. Provide flashcard tools: Problemory Flashcards or Anki — remove the barrier to Tier S techniques
  3. Protect sleep: 7–8 hours non-negotiable — sleep deprivation destroys Tier S effectiveness
  4. Celebrate retrieval success: Praise correct recall, not hours spent at desk
  5. Do not reward rereading: "You read the chapter three times!" reinforces Tier D behavior

A Brief History of Learning Technique Research

The evidence ranking in this guide represents over a century of scientific inquiry into how humans learn and remember.

  • 1885: Hermann Ebbinghaus publishes forgetting curve research — first systematic study of memory retention over time, establishing the foundation for spaced repetition
  • 1917: Arthur Gates demonstrates that recitation (active recall) produces better retention than reading — early evidence for the testing effect
  • 1978: Thomas Bjork introduces "desirable difficulties" framework — explaining why effective learning feels hard
  • 1979: John Flavell coins "metacognition" — establishing the study of thinking about thinking
  • 1994: Michelene Chi demonstrates self-explanation effect — students who explain learn twice as much
  • 2006: Roediger and Karpicke publish landmark testing effect study — 80% vs 36% retention
  • 2007: Rohrer and Taylor demonstrate interleaving superiority — 72% vs 38% on delayed tests
  • 2008: Cepeda et al. meta-analysis confirms spacing effect across 317 experiments
  • 2013: Dunlosky et al. publish comprehensive technique review — the foundation of modern evidence-based study recommendations
  • 2014: Rowland meta-analysis confirms testing effect across 217 studies (d = 0.50)
  • 2020: Brunmeyer and MacDonald meta-analysis confirms interleaving benefits across domains

Tools for Evidence-Based Learning

TierTechniqueTool
SPractice testing + SpacingProblemory Flashcards, Anki
SPractice testingPractice exams, Problemory Score Tracker
AInterleaved practiceMixed flashcard decks, mixed problem sets
ASelf-explanationBlank paper, Feynman technique
BDual codingDrawing tools, mind mapping apps
BKeyword mnemonicsProblemory Mnemonic Generator
BMemory palaceProblemory Memory Palace
MetaMetacognitive trackingScore Tracker, learning journal

Practical Exercises

Exercise 1: Personal Technique Audit

List every study method you used last week. Rank each against this evidence ranking. Calculate what percentage of your study time went to Tier S/A vs Tier C/D. Target: 80%+ on Tier S/A within 30 days.

Exercise 2: Tier Comparison Test

Take one topic. Study half using rereading (Tier D). Study half using flashcards (Tier S). Test both 48 hours later with free recall. Quantify the tier difference with your own data.

Exercise 3: Technique Stack Session

Execute one full study session using the complete technique stack (plan → learn with self-explanation → flashcards → retrieve → evaluate). Compare retention to a typical passive session on similar material.

Exercise 4: 30-Day Tier S Challenge

For 30 days, use only Tier S techniques (practice testing + spaced repetition). No rereading, no highlighting. Track retention rate weekly. Document improvement over the month.

Exercise 5: Combination Experiment

Study three similar topics with three different combinations: (A) testing alone, (B) spacing alone, (C) testing + spacing + interleaving. Test all three after one week. Identify your optimal combination.

FAQ

What are the most evidence-based learning techniques?

Practice testing (active recall) and distributed practice (spaced repetition) have the strongest evidence — both rated high utility by Dunlosky et al. and confirmed by hundreds of subsequent studies. Interleaved practice, self-explanation, and elaborative interrogation rank next with strong evidence.

Is rereading ever useful?

The first read of material is necessary for initial exposure. Beyond the first read, rereading produces negligible additional retention (d = 0.05–0.15). Replace second and subsequent reads with retrieval practice — one read-recite cycle outperforms three rereads.

Why do students use ineffective techniques?

Ineffective techniques (rereading, highlighting) feel productive because they create fluency — material feels familiar. Effective techniques (testing, spacing) feel harder because they reveal gaps. Students choose based on feeling, not evidence. Education rarely teaches technique selection explicitly.

Can I combine multiple techniques?

Yes — combining techniques produces larger effects than any single technique. The gold standard combination is practice testing + spaced repetition (flashcards with SRS). Add interleaving, self-explanation, and sleep consolidation for maximum effect.

How long until I see results from switching techniques?

Measurable retention improvement within 1–2 weeks of switching from Tier D to Tier S techniques. Full calibration and habit formation takes 4–8 weeks. Exam score improvements typically visible within 4–6 weeks.

Are flashcards the best study method?

Flashcards automate the two highest-evidence techniques (practice testing + spaced repetition) — making them the most efficient single tool. However, flashcards alone may insufficient for procedural skills (need interleaved problem practice) and application tasks (need timed practice tests). Use flashcards as the foundation, supplemented by other Tier A techniques as needed.

What technique has the largest effect size?

Practice testing (d = 0.50–0.80) and spaced repetition (d = 0.40–0.70) have the largest consistently replicated effect sizes. Interleaved practice shows large effects (d = 0.40–0.60) in direct comparisons with blocked practice. Combined techniques produce even larger practical improvements.

Do these rankings apply to all ages?

Yes. The testing effect, spacing effect, and interleaving effect replicate across age groups from elementary school through adulthood. Implementation may differ (younger learners need more scaffolding for self-explanation), but the relative rankings hold.

How does this ranking compare to the original Dunlosky review?

This guide extends the 2013 Dunlosky review with subsequent research (2013–2025). Interleaving has been upgraded from moderate to Tier A based on meta-analytic evidence. Generation effect and varied practice are added as emerging Tier A techniques. Memory palace and chunking are included as additional techniques. The core finding is unchanged: practice testing and spaced repetition remain the highest-evidence techniques.

Should I stop using all Tier C and D techniques immediately?

Stop Tier D techniques (rereading, highlighting) as primary methods immediately — replace with Tier S equivalents. Tier C techniques (summarization, keyword mnemonics, imagery) can supplement Tier S/A when used in appropriate contexts — keyword mnemonics for vocabulary encoding, summarization followed by self-testing. Never use Tier C/D as your primary or only study method.

What if my teacher requires highlighting or summary assignments?

Complete the assignment, then add retrieval practice on the same material. Highlight, then create flashcards from highlighted content. Summarize, then test yourself on the summary without looking. The assignment becomes Tier B/C input; your added retrieval practice provides the Tier S learning.

Quick Reference: Technique Selection at a Glance

Use this quick reference when you need to choose a technique immediately.

SituationUse ThisTierAvoid This
Learning new vocabularySpaced flashcards + production practiceSRereading word lists
Preparing for any examDaily flashcards + weekly mock testsSCramming + rereading
Understanding a conceptSelf-explanation + Feynman techniqueARereading the explanation
Solving math/science problemsInterleaved practice problemsABlocked single-topic sets
Remembering a listMemory palace + spaced flashcardsB+/SRereading the list
Reading a textbook chapterRead-recite-review + flashcard creationSHighlighting + rereading
Reviewing before examFlashcard review + practice testSRereading all notes
Material feels easy during studySwitch to interleaving or self-testingA/SContinuing passive review
Cannot explain a concept simplyElaborative interrogation + examplesA/BRereading until it "clicks"
Long-term professional knowledgeDaily spaced review indefinitelySCramming before assessments

The pattern is consistent across every situation: retrieval beats recognition, spacing beats massing, and active beats passive. When in doubt, test yourself on spaced intervals. That single principle captures the essence of every Tier S and A technique in this ranking — and it is the most important takeaway from decades of learning science research.

Key Takeaways

  1. Practice testing and spaced repetition are Tier S — the highest-evidence techniques, confirmed by 500+ and 300+ studies respectively
  2. Students use techniques in almost perfect inverse order to the evidence — rereading and highlighting (worst) are most popular; testing and spacing (best) are least popular
  3. Effect sizes for top techniques (d = 0.50–0.80) represent 50–100% retention improvement over common methods — with the same study time
  4. Interleaved practice, self-explanation, and elaborative interrogation are Tier A — strong evidence, use regularly
  5. Rereading and highlighting are Tier D — avoid as primary methods; replace with active recall and flashcards
  6. Combining techniques (testing + spacing + interleaving) produces multiplicative effects exceeding any single technique
  7. Effective techniques feel harder during study — the difficulty IS the learning (desirable difficulties)
  8. Build your system progressively: install Tier S first, add Tier A over weeks, eliminate Tier D immediately
  9. Every subject benefits from Tier S techniques; specific Tier A/B techniques vary by material type
  10. The ranking is based on retention and transfer — not enjoyment, speed, or subjective confidence
  11. When in doubt, test yourself on spaced intervals — that single principle captures the essence of all top-ranked evidence-based learning techniques

Replace one Tier D habit with one Tier S habit today. Open Problemory's Flashcards Trainer, create cards for your current material, and review them tomorrow. Track your retention in the Score Tracker. The evidence is clear — the techniques at the top of this ranking work. The only question is whether you will use them. Start with practice testing and spaced repetition today. Everything else builds from there.

Ready to Try Flashcards Trainer?

Put your knowledge into practice with our interactive flashcards trainer.

Start Training

Related Articles

Learning Science

How Active Recall Works and Why It Beats Rereading

Active recall strengthens memory by forcing retrieval, not recognition. Learn how it works, why it beats rereading, and how to use it in your studies today.

Read More
Learning Science

The Complete Guide to Spaced Repetition

Spaced repetition fights the forgetting curve by reviewing material at optimal intervals. Learn the science, schedules, tools, and how to build a system that works.

Read More
Learning Science

How to Memorize Anything Faster

Learn how to memorize anything faster using active recall, spaced repetition, mnemonics, and chunking. Practical strategies for exams, languages, and daily life.

Read More