A practical 9-step guide to conducting a meta-analysis, from defining your research question to publishing your results. No coding required.
A meta-analysis is a statistical method that combines the quantitative results of multiple independent studies addressing the same research question into a single pooled estimate. It sits at the top of the evidence hierarchy in evidence-based medicine, above individual randomized controlled trials (RCTs), cohort studies, and expert opinions.
Meta-analysis is not the same as a systematic review. A systematic review is the broader research process of identifying, evaluating, and synthesizing all relevant evidence on a topic. A meta-analysis is the statistical technique used within a systematic review to quantitatively combine results. You can have a systematic review without a meta-analysis (a narrative synthesis), but you should never perform a meta-analysis without a rigorous systematic review underpinning it.
| Level | Evidence Type | Strength |
|---|---|---|
| 1 | Systematic reviews and meta-analyses of RCTs | Highest |
| 2 | Individual randomized controlled trials | High |
| 3 | Cohort studies | Moderate |
| 4 | Case-control studies | Moderate-Low |
| 5 | Case series / Case reports | Low |
| 6 | Expert opinion / Editorials | Lowest |
Every meta-analysis begins with a clearly formulated research question. The PICO framework is the gold standard for structuring clinical questions:
| Element | Meaning | Example |
|---|---|---|
| P (Population) | Who are the patients or participants? | Adults with type 2 diabetes mellitus |
| I (Intervention) | What treatment or exposure is being studied? | GLP-1 receptor agonists (semaglutide, liraglutide) |
| C (Comparator) | What is the control or alternative? | Placebo or standard care |
| O (Outcome) | What outcome are you measuring? | Change in HbA1c, body weight, adverse events |
Your PICO question directly determines your eligibility criteria. Before searching the literature, define precisely which study designs, populations, interventions, comparators, outcomes, publication dates, and languages are eligible -- and which are not.
Before starting your search, register your protocol on PROSPERO (International Prospective Register of Systematic Reviews). Registration demonstrates that your review was planned before results were known, reducing the risk of outcome reporting bias. PROSPERO registration is free and increasingly required by journals.
A comprehensive, reproducible search strategy is the backbone of any meta-analysis. The goal is high sensitivity (recall) -- it is better to retrieve too many irrelevant articles than to miss relevant ones.
At minimum, search three databases: PubMed/MEDLINE, Embase, and the Cochrane Central Register of Controlled Trials (CENTRAL).
Depending on your topic, also consider: Web of Science, Scopus, PsycINFO (psychology), CINAHL (nursing), ClinicalTrials.gov (unpublished trial data), and conference proceedings.
Translate each PICO element into search terms. Combine synonyms with OR and PICO elements with AND:
("diabetes mellitus, type 2"[MeSH] OR "type 2 diabetes" OR "T2DM") AND ("GLP-1 receptor agonists"[MeSH] OR "glucagon-like peptide-1" OR "semaglutide" OR "liraglutide" OR "dulaglutide") AND ("randomized controlled trial"[pt] OR "controlled clinical trial"[pt])
Record the exact search string, database, date of search, and number of results for each database. PRISMA 2020 requires this level of transparency, and reviewers will ask for it.
Following the PRISMA 2020 flow diagram, study selection proceeds in distinct phases:
After searching multiple databases, you will have duplicate records. Use reference management software (Zotero, EndNote, or MetaReview's built-in deduplication) to identify and remove duplicates. Typically, 20-40% of combined results are duplicates.
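Although the tools above handle this for you, the mechanics are simple: collapse records that share a stable identifier. A minimal Python sketch, assuming hypothetical records with `doi` and `title` fields:

```python
import re

# Deduplication sketch: match on DOI when present, otherwise on a
# normalized title (lowercase, punctuation and spaces stripped).
def normalize_title(title: str) -> str:
    return re.sub(r"[^a-z0-9]", "", title.lower())

def deduplicate(records):
    seen, unique = set(), []
    for rec in records:
        key = rec.get("doi") or normalize_title(rec["title"])
        if key not in seen:
            seen.add(key)
            unique.append(rec)
    return unique

records = [
    {"doi": "10.1000/xyz1", "title": "Semaglutide in T2DM"},
    {"doi": "10.1000/xyz1", "title": "Semaglutide in T2DM."},   # duplicate DOI
    {"doi": None, "title": "Liraglutide and weight loss"},
    {"doi": None, "title": "Liraglutide and Weight Loss"},      # duplicate title
]
print(len(deduplicate(records)))  # 2 unique records remain
```

Real reference managers use fuzzier matching (author, year, journal), but DOI-plus-normalized-title catches the large majority of duplicates.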
Rapidly screen each unique record based on its title and abstract against your inclusion criteria. At this stage, be inclusive -- if in doubt, keep it for full-text review. Two independent reviewers should screen all records separately.
Retrieve the full text of all potentially eligible articles. Read each one carefully against your complete inclusion/exclusion criteria. Record the specific reason for excluding each article (PRISMA 2020 requirement).
Calculate Cohen's kappa coefficient to quantify agreement between the two reviewers:
| Kappa Value | Level of Agreement |
|---|---|
| < 0.20 | Poor |
| 0.21 - 0.40 | Fair |
| 0.41 - 0.60 | Moderate |
| 0.61 - 0.80 | Substantial |
| 0.81 - 1.00 | Almost perfect |
Disagreements should be resolved through discussion or by consulting a third reviewer. Aim for kappa ≥ 0.80.
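Kappa is easy to verify by hand or in a few lines of Python from the 2x2 agreement table. The screening counts below are hypothetical:

```python
# Cohen's kappa from two reviewers' include/exclude decisions.
# a = both include, b = reviewer 1 only, c = reviewer 2 only,
# d = both exclude (illustrative counts).
def cohens_kappa(a: int, b: int, c: int, d: int) -> float:
    n = a + b + c + d
    p_observed = (a + d) / n
    # Agreement expected by chance, from the marginal proportions
    p_include = ((a + b) / n) * ((a + c) / n)
    p_exclude = ((c + d) / n) * ((b + d) / n)
    p_expected = p_include + p_exclude
    return (p_observed - p_expected) / (1 - p_expected)

kappa = cohens_kappa(a=40, b=5, c=3, d=152)
print(round(kappa, 3))  # ~0.883 -- "almost perfect" agreement
```

Note that raw percent agreement here is 96%, yet kappa is lower because most records are easy exclusions that both reviewers would reject by chance.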
Data extraction is where you systematically pull the quantitative and qualitative information needed from each included study. Accuracy here is critical -- errors in data extraction propagate directly into your meta-analysis results.
Your extraction form should capture:
| Outcome Type | Data to Extract | Example |
|---|---|---|
| Binary (dichotomous) | Events and total N, for both intervention and control groups | Deaths: 15/200 (treatment) vs 30/198 (control) |
| Continuous | Mean, standard deviation (SD), and N for both groups | HbA1c change: -1.2 (SD 0.8, n=150) vs -0.4 (SD 0.7, n=148) |
| Time-to-event (survival) | Hazard ratio (HR), 95% CI, or data to reconstruct them | HR = 0.72 (95% CI: 0.58-0.89) |
Two reviewers should independently extract data from every study. After extraction, compare the results and resolve any discrepancies. This catches transcription errors, misread tables, and misinterpreted outcome definitions. Studies have shown that single-reviewer extraction has an error rate of 10-30%.
The effect size measure you choose determines how results are combined and interpreted. Choosing the wrong effect size is one of the most common mistakes in meta-analysis. Here is a decision framework:
Is your outcome binary (yes/no), continuous (numerical), or time-to-event? The answer narrows your choice to one or two candidates:
| Effect Size | Data Type | When to Use | Null Value | Interpretation Example |
|---|---|---|---|---|
| OR (Odds Ratio) | Binary | Case-control studies; logistic regression outputs | 1.0 | OR = 2.5: The odds of the event are 2.5 times higher in the intervention group |
| RR (Risk Ratio) | Binary | RCTs and cohort studies (preferred over OR) | 1.0 | RR = 0.70: 30% relative risk reduction in the intervention group |
| MD (Mean Difference) | Continuous | Same outcome scale across all studies | 0 | MD = -5.3 mmHg: Blood pressure is 5.3 mmHg lower in the intervention group |
| SMD (Standardized Mean Difference) | Continuous | Different scales measuring the same construct | 0 | SMD = -0.50: A medium effect favoring the intervention (Cohen's conventions) |
| HR (Hazard Ratio) | Time-to-event | Survival analysis, Cox regression data | 1.0 | HR = 0.65: 35% reduction in the instantaneous hazard of the event |
For a deeper dive into effect size selection, including formulas and conversion methods, see our dedicated guide: Choosing Effect Sizes: OR, RR, MD, SMD Guide.
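As a worked example, the binary-outcome counts from the extraction table earlier (15/200 deaths in treatment vs 30/198 in control) yield the following RR and OR, with 95% CIs computed on the log scale using the standard variance formulas:

```python
import math

# RR and OR with 95% CIs from a 2x2 table (illustrative counts).
e1, n1 = 15, 200   # events / total, intervention
e2, n2 = 30, 198   # events / total, control

rr = (e1 / n1) / (e2 / n2)
se_log_rr = math.sqrt(1/e1 - 1/n1 + 1/e2 - 1/n2)   # SE of log(RR)
ci_rr = (math.exp(math.log(rr) - 1.96 * se_log_rr),
         math.exp(math.log(rr) + 1.96 * se_log_rr))

odds1, odds2 = e1 / (n1 - e1), e2 / (n2 - e2)
or_ = odds1 / odds2
se_log_or = math.sqrt(1/e1 + 1/(n1 - e1) + 1/e2 + 1/(n2 - e2))
ci_or = (math.exp(math.log(or_) - 1.96 * se_log_or),
         math.exp(math.log(or_) + 1.96 * se_log_or))

print(f"RR = {rr:.2f} (95% CI {ci_rr[0]:.2f}-{ci_rr[1]:.2f})")
print(f"OR = {or_:.2f} (95% CI {ci_or[0]:.2f}-{ci_or[1]:.2f})")
```

Notice that the OR (~0.45) is more extreme than the RR (~0.50): with event rates this common, treating an OR as if it were an RR overstates the effect.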
This is the computational core of your meta-analysis. Three key decisions must be made: the analytical model, heterogeneity assessment, and how to visualize results.
| Feature | Fixed-Effect Model | Random-Effects Model |
|---|---|---|
| Assumption | All studies estimate the same single true effect | Each study estimates its own true effect; these effects follow a distribution |
| Source of variation | Within-study sampling error only | Within-study error + between-study variance (τ²) |
| Weighting | Based on study precision (inverse variance) | Adjusted weights that account for between-study heterogeneity |
| Confidence intervals | Narrower (can be falsely precise if heterogeneity exists) | Wider (more conservative, typically more realistic) |
| When to use | Studies are clinically and methodologically homogeneous; I² < 25% | Studies differ in populations, settings, or methods (most real-world scenarios) |
Heterogeneity refers to variability in study results beyond what is expected from sampling error alone. Three key statistics quantify it:
| Statistic | What It Measures | Interpretation |
|---|---|---|
| I² | Percentage of total variability due to true heterogeneity | 0-25% low, 25-50% moderate, 50-75% substantial, >75% considerable |
| Cochran's Q | Whether observed differences in results are compatible with chance alone | p < 0.10 suggests significant heterogeneity (uses a liberal threshold because the test has low power) |
| τ² (tau-squared) | Absolute between-study variance | Expressed in the same units as the effect size squared; larger values mean more heterogeneity |
The forest plot is the signature visualization of a meta-analysis: each study appears as a square (its effect estimate, sized by its weight) on a horizontal line (its 95% confidence interval), a vertical reference line marks the null effect, and a diamond at the bottom shows the pooled estimate.
When heterogeneity is substantial (I² > 50%), subgroup analysis can help identify sources. Divide studies into groups based on pre-specified characteristics:
Use the Q-between test (interaction test) to determine if the effect truly differs between subgroups (p < 0.05).
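The Q-between statistic is the total Q minus the within-subgroup Q values, on (number of subgroups - 1) degrees of freedom. A sketch for two subgroups (so df = 1, where the chi-square tail probability reduces to `erfc`); the effect sizes below are illustrative:

```python
import math

# Q-between (interaction) test for two pre-specified subgroups.
# y = log effect sizes, v = variances (made-up values).
def fixed_pool(y, v):
    w = [1 / vi for vi in v]
    pooled = sum(wi * yi for wi, yi in zip(w, y)) / sum(w)
    q = sum(wi * (yi - pooled) ** 2 for wi, yi in zip(w, y))
    return pooled, q

group_a = ([-0.70, -0.55, -0.45], [0.090, 0.110, 0.040])
group_b = ([-0.10, -0.15, -0.05], [0.025, 0.060, 0.050])

all_y = group_a[0] + group_b[0]
all_v = group_a[1] + group_b[1]

_, q_total = fixed_pool(all_y, all_v)
_, q_a = fixed_pool(*group_a)
_, q_b = fixed_pool(*group_b)

# Q_between = Q_total - sum of within-subgroup Q; df = subgroups - 1
q_between = q_total - (q_a + q_b)
# Chi-square survival function for df = 1: P(X > x) = erfc(sqrt(x/2))
p_between = math.erfc(math.sqrt(q_between / 2))
print(f"Q_between = {q_between:.2f}, p = {p_between:.4f}")
```

Here p < 0.05, so the two subgroups' effects genuinely differ; with more than two subgroups you would need a general chi-square distribution function (e.g. from scipy) rather than `erfc`.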
Publication bias occurs because studies with positive or statistically significant results are more likely to be published than those with null or negative findings. This means the available literature may overestimate the true effect, and your meta-analysis could inherit that bias.
A funnel plot graphs each study's effect size (x-axis) against its precision, typically standard error (y-axis, inverted). In the absence of publication bias, the points form a symmetric inverted funnel: large, precise studies cluster near the top around the pooled effect, while smaller studies scatter progressively wider toward the bottom.
Asymmetry in the funnel plot -- typically a gap in the bottom-right or bottom-left corner -- suggests that small studies with unfavorable results may be missing.
| Test | Method | When to Use | Significance Threshold |
|---|---|---|---|
| Egger's test | Linear regression of effect size on standard error | Continuous outcomes (MD, SMD); works well with ≥10 studies | p < 0.10 |
| Begg's test | Rank correlation between effect size and variance | Binary outcomes (OR, RR); less powerful than Egger's | p < 0.10 |
| Peters' test | Regression of effect size on inverse of total sample size | Binary outcomes; less affected by mathematical coupling than Egger's | p < 0.10 |
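Egger's test is just a linear regression of the standardized effect (effect / SE) on precision (1 / SE), where a non-zero intercept indicates asymmetry. A sketch with illustrative data, reporting the intercept and its t statistic (the p-value, from a t distribution with n - 2 df, is omitted to keep this standard-library only):

```python
import math

# Egger's regression test sketch. Small studies (large SE) here show
# larger effects, so the funnel is asymmetric. Data are illustrative.
y  = [-0.80, -0.60, -0.45, -0.35, -0.30, -0.28, -0.25, -0.22, -0.20, -0.18]
se = [ 0.40,  0.35,  0.28,  0.22,  0.18,  0.15,  0.12,  0.10,  0.08,  0.07]

z = [yi / si for yi, si in zip(y, se)]   # standardized effects
x = [1 / si for si in se]                # precisions

n = len(y)
mx, mz = sum(x) / n, sum(z) / n
sxx = sum((xi - mx) ** 2 for xi in x)
sxz = sum((xi - mx) * (zi - mz) for xi, zi in zip(x, z))
slope = sxz / sxx
intercept = mz - slope * mx

# Residual variance and the intercept's standard error (OLS formulas)
resid = [zi - (intercept + slope * xi) for xi, zi in zip(x, z)]
s2 = sum(r ** 2 for r in resid) / (n - 2)
se_intercept = math.sqrt(s2 * (1 / n + mx ** 2 / sxx))

t = intercept / se_intercept   # compare to t distribution, df = n - 2
print(f"intercept = {intercept:.3f} (SE {se_intercept:.3f}), t = {t:.2f}")
```

An intercept well away from zero, as in this fabricated example, is the regression analogue of the missing-corner asymmetry you would see in the funnel plot.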
If publication bias is detected, the trim-and-fill method provides an adjusted estimate. It works by trimming the asymmetric small studies, re-estimating the center of the funnel, then filling in mirror-image counterparts of the trimmed studies and recomputing the pooled effect.
The adjusted estimate shows what the pooled effect might be if publication bias were absent. A large shift from the original estimate is concerning.
Sensitivity analysis tests the robustness of your meta-analysis results. The question it answers: "Would the conclusions change if we made different analytical decisions?"
The most common approach is leave-one-out analysis: remove one study at a time, re-run the meta-analysis on the remaining studies, and record how the pooled estimate and its confidence interval change.
If removing any single study causes the pooled effect to change direction (e.g., from significant to non-significant, or from favoring intervention to favoring control), that study is influential and must be discussed explicitly.
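The leave-one-out procedure is a short loop over the included studies. A fixed-effect sketch with hypothetical study names and log-scale effect sizes:

```python
import math

# Leave-one-out sensitivity analysis: recompute the fixed-effect
# pooled estimate with each study removed in turn.
# Study names, effect sizes (log scale), and variances are made up.
studies = {
    "Smith 2019":  (-0.70, 0.090),
    "Lee 2020":    (-0.45, 0.040),
    "Garcia 2021": (-0.10, 0.025),
    "Chen 2022":   (-0.55, 0.110),
    "Patel 2023":  (-0.30, 0.060),
}

def pool(data):
    """Inverse-variance fixed-effect pooled estimate and its SE."""
    w = [1 / v for _, v in data]
    est = sum(wi * y for wi, (y, _) in zip(w, data)) / sum(w)
    return est, math.sqrt(1 / sum(w))

full_est, full_se = pool(list(studies.values()))
print(f"All studies: {full_est:.3f}")

for name in studies:
    rest = [val for key, val in studies.items() if key != name]
    est, _ = pool(rest)
    print(f"Without {name}: {est:.3f} (shift {est - full_est:+.3f})")
```

In this fabricated dataset, dropping the most precise study shifts the pooled estimate far more than dropping any other, which is exactly the kind of influential study the paragraph above says must be discussed.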
The PRISMA 2020 (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) statement provides a 27-item checklist for transparent reporting. Most medical journals require PRISMA compliance.
Here is a standard template for reporting your primary meta-analysis result:
| Section | Key Items to Report |
|---|---|
| Title | Identify the report as a systematic review, meta-analysis, or both |
| Registration | Registration number and registry name (e.g., PROSPERO CRD42025xxxxx) |
| Search strategy | Full search strings for all databases (typically in a supplementary file) |
| Study selection | PRISMA flow diagram with numbers at each stage |
| Effect measures | Specify effect measure (OR, RR, MD, SMD, HR) and why it was chosen |
| Synthesis methods | Model (fixed/random), software used, method for pooling |
| Certainty assessment | GRADE framework for overall quality of evidence (optional but recommended) |
For a complete PRISMA 2020 flow diagram guide, see: PRISMA 2020 Flow Diagram Guide.
Choosing the right software can make or break your meta-analysis experience. Here is an honest comparison of the main options available today:
| Feature | MetaReview | RevMan (Cochrane) | R (meta/metafor) | Stata | Covidence |
|---|---|---|---|---|---|
| Price | Free | Free (Cochrane authors) / Paid | Free | Paid ($$$) | Paid ($$) |
| Installation | None (browser-based) | Desktop download required | Install R + packages | Desktop license | None (browser-based) |
| Coding required | No | No | Yes (R scripts) | Yes (do-files) | No |
| Effect sizes | OR, RR, MD, SMD | OR, RR, MD, SMD | All types + custom | All types + custom | No statistical analysis |
| Forest plot | Yes (SVG, publication-quality) | Yes | Yes (customizable) | Yes (customizable) | No |
| Funnel plot | Yes | Yes | Yes | Yes | No |
| Subgroup analysis | Yes | Yes | Yes | Yes | No |
| Sensitivity analysis | Leave-one-out | Limited | Full suite | Full suite | No |
| Literature search | Built-in PubMed search | Cochrane Library | No | No | Import only |
| AI screening | Yes (LLM-powered) | No | No | No | No |
| PDF data extraction | Yes (AI-powered) | No | No | No | No |
| Auto-generated results text | Yes | No | No | No | No |
| Best for | Researchers who want an all-in-one free tool | Cochrane review authors | Statisticians who want full control | Biostatisticians with Stata access | Screening and collaboration only |
For a detailed feature-by-feature comparison, see: Meta-Analysis Software Comparison.
After reviewing thousands of published meta-analyses and their peer review feedback, these are the most frequent errors that lead to rejection or revision requests:
Combining OR from one study with RR from another without proper conversion produces meaningless pooled estimates. Always convert to a common metric or recalculate from raw data.
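When a study reports only an OR but you are pooling RRs, one commonly used conversion is the Zhang & Yu approximation, which requires an estimate of the baseline (control-group) risk:

```python
# Zhang & Yu approximation: RR ~= OR / (1 - p0 + p0 * OR),
# where p0 is the control-group risk. Values are illustrative.
def or_to_rr(odds_ratio: float, baseline_risk: float) -> float:
    return odds_ratio / (1 - baseline_risk + baseline_risk * odds_ratio)

# With a common outcome (30% baseline risk), an OR of 2.5
# corresponds to a much smaller RR (~1.72)...
print(or_to_rr(2.5, 0.30))
# ...but with a rare outcome (1% baseline risk), OR and RR nearly
# coincide (~2.46), which is why the distinction matters most for
# common outcomes.
print(or_to_rr(2.5, 0.01))
```

When raw event counts are available, recalculating the RR directly from the 2x2 table is preferable to any conversion formula.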
Reporting a pooled effect with I² = 85% and no attempt to explore or explain the heterogeneity is a red flag for reviewers. High heterogeneity demands subgroup analysis, meta-regression, or a narrative approach.
Excluding studies without pre-specified, transparent criteria is scientific misconduct. Every exclusion must be documented with a clear reason. This is why protocol registration on PROSPERO matters.
Without prospective registration, reviewers cannot verify that your methods, outcomes, and analyses were not changed after seeing the results. PROSPERO registration takes 30 minutes and prevents months of reviewer questions.
If studies come from different populations, settings, and time periods, a fixed-effect model will underestimate the uncertainty. When in doubt, use random-effects.
Searching only PubMed is not sufficient. Cochrane recommends at least three databases. Missing Embase alone can mean missing 20-30% of relevant studies.
Failing to perform and report sensitivity analysis (at minimum, leave-one-out) leaves your conclusions unverified. Reviewers expect to see evidence that results are robust.
Subgroup analyses not specified in the protocol should be explicitly labeled as exploratory. Treating data-driven subgroups as definitive findings is misleading.
Formal tests (Egger's, Begg's) lack statistical power with fewer than 10 studies. Acknowledge this limitation rather than claiming "no publication bias detected" based on an underpowered test.
For observational studies, always prefer the most adjusted (multivariable) estimates. Unadjusted estimates may be confounded and produce biased pooled results.
MetaReview is a free online tool. Go from data entry to a publication-quality forest plot in under 5 minutes. No installation, no coding, no cost.
A systematic review is the entire process of systematically identifying, evaluating, and synthesizing all relevant research on a topic. It follows a structured protocol with explicit inclusion/exclusion criteria. A meta-analysis is specifically the statistical method used within a systematic review to quantitatively pool results from multiple studies into a single effect estimate. You can conduct a systematic review without a meta-analysis (presenting a narrative synthesis), but a meta-analysis should always be embedded within a systematic review framework. Think of systematic review as the research method and meta-analysis as the statistical technique.
There is no absolute minimum, but practical considerations matter. With 2 studies, you can technically compute a pooled estimate, but the result will be driven almost entirely by sample size differences and provides limited insight. With 5 or more studies, heterogeneity statistics (I², Q) become more meaningful. With 10 or more studies, you can reliably perform publication bias tests (Egger's, Begg's) and funnel plot analysis. Most reviewers consider 5 studies a reasonable minimum for a credible meta-analysis, and will accept fewer only if the topic is narrow and the studies are high-quality.
MetaReview is a completely free, browser-based meta-analysis tool that requires no installation, no account, and no coding knowledge. It supports OR, RR, MD, and SMD effect sizes, fixed and random-effects models, forest plots, funnel plots, subgroup analysis, leave-one-out sensitivity analysis, and auto-generated results paragraphs. Other free options include the R statistical language with the "meta" and "metafor" packages, which are powerful but require programming skills. RevMan is free for Cochrane review authors but requires desktop installation. OpenMeta-Analyst is another free option but is no longer actively maintained.
A forest plot displays each study as a row. The square represents the study's effect estimate (e.g., OR, RR, or MD), with the square size proportional to the study's weight. The horizontal line through the square is the 95% confidence interval. The diamond at the bottom represents the pooled (combined) effect. A vertical reference line shows the null effect (1.0 for ratio measures like OR/RR, or 0 for difference measures like MD/SMD). If a study's confidence interval crosses this null line, that study alone did not find a statistically significant effect. If the diamond does not touch the null line, the pooled result is statistically significant.
I² tells you what percentage of the observed variation across study results is due to genuine differences between studies (true heterogeneity) rather than random sampling variation. An I² of 0% means all variation is due to chance; an I² of 75% means three-quarters of the observed variability reflects true differences in underlying effects. The Cochrane Handbook provides rough benchmarks: 0-40% might not be important, 30-60% may represent moderate heterogeneity, 50-90% may represent substantial heterogeneity, and 75-100% indicates considerable heterogeneity. When I² is high, explore sources through subgroup analysis or meta-regression rather than simply reporting the pooled estimate.
Yes. Point-and-click tools like MetaReview are designed for researchers who do not have programming or advanced biostatistics training. You enter your extracted data (event counts, sample sizes, means, standard deviations), select your effect size type and model, and the tool computes everything: pooled estimates, confidence intervals, heterogeneity statistics, forest plots, funnel plots, and sensitivity analyses. That said, understanding what these statistics mean and how to interpret them is essential for writing a defensible manuscript. We recommend reading the relevant chapters of the Cochrane Handbook for Systematic Reviews even if you use a no-code tool.
A realistic timeline for a focused meta-analysis is 3 to 12 months from protocol registration to manuscript submission. Protocol development and PROSPERO registration takes 1-2 weeks. The literature search typically takes 1-3 weeks. Screening can take 2-8 weeks depending on volume (tools like MetaReview's AI screening can compress this significantly). Data extraction takes 2-6 weeks for 15-30 studies. Quality assessment takes 1-2 weeks. Statistical analysis and figure generation can be done in 1-3 days using the right tools. Writing the manuscript takes 2-4 weeks. Peer review and revisions add another 2-6 months. The most common bottleneck is screening and data extraction, which together account for roughly half of the total time.
PRISMA stands for Preferred Reporting Items for Systematic Reviews and Meta-Analyses. The PRISMA 2020 update consists of a 27-item checklist covering everything that should be reported in a systematic review or meta-analysis: title, abstract, rationale, objectives, protocol registration, eligibility criteria, information sources, search strategy, selection process, data extraction, effect measures, synthesis methods, risk of bias, results of syntheses, reporting biases, certainty of evidence, and conclusions. It also includes a standardized flow diagram template. Most biomedical journals require authors to submit a completed PRISMA checklist alongside their manuscript. The checklist is freely available at prisma-statement.org.