MetaReview

Hazard Ratio Meta-Analysis Guide: HR Extraction, Calculation & Forest Plot

From Kaplan-Meier curves to forest plots. A complete guide to pooling hazard ratios for survival outcomes.

What Is Hazard Ratio Meta-Analysis?

In medical research, many clinical outcomes are time-to-event data: the time from treatment initiation to disease progression, death, or another endpoint. These data are analyzed using survival analysis methods.

When multiple studies report the effect of the same intervention on a survival outcome, meta-analysis pools them to obtain a more precise overall estimate. The core effect measure for survival data meta-analysis is the Hazard Ratio (HR).

What Is a Hazard Ratio?

HR is the ratio of instantaneous event rates (hazards) between two groups. It comes from the Cox proportional hazards regression model and accounts for follow-up time and censoring. HR = 0.70 means that at any given time point, the intervention group's event rate is 70% of the control group's — a 30% risk reduction.

When Do You Need HR Meta-Analysis?

HR vs. RR: Don't Confuse Them

FeatureHR (Hazard Ratio)RR (Risk Ratio)
Data sourceCox proportional hazards / survival analysis2×2 table / cumulative incidence
Accounts for timeYes, considers when events occur during follow-upNo, only cumulative event counts at a fixed time point
Handles censoringYes, correctly handles loss to follow-up and study endNo, censored patients are typically excluded or simply handled
Null value11
Typical useOncology OS/PFS, cardiovascular MACERCT binary outcomes (cure rate, mortality at a fixed time)
Key distinction: HR is based on event rates across the entire follow-up period, while RR is based on cumulative event rates at a fixed time point. They cannot be mixed in the same meta-analysis.

How to Extract HR Data from Papers

Extracting HR data is the most critical and error-prone step in survival meta-analysis. Depending on what the paper reports, there are three extraction scenarios.

Scenario 1: Paper Reports HR + 95% CI (Most Common, Easiest)

This is the ideal case. Most high-quality RCTs and cohort studies directly report the HR and 95% CI from Cox regression in the text or tables.

Example: Paper reports "HR = 0.72, 95% CI: 0.58–0.90." Record these three numbers:

MetaReview handles all subsequent log transformation and SE calculations automatically.

Extraction priority: Results section text > Tables (typically in multivariable analysis tables) > Figure legends > Supplementary material. If multiple models report HR, prefer the adjusted (multivariable) HR, as it controls for confounders.

Scenario 2: Paper Reports HR + p-value (No CI)

Some older papers or conference abstracts report only the HR and p-value without a 95% CI. You can back-calculate the SE from the p-value:

  1. Convert the p-value to a z-score (e.g., p = 0.05 → z = 1.96, p = 0.01 → z = 2.576)
  2. Calculate SE: SE[log(HR)] = |log(HR)| / z
  3. Reconstruct 95% CI: CI = exp(log(HR) ± 1.96 × SE)
SE[log(HR)] = |log(HR)| / z
where z is the normal distribution quantile corresponding to the p-value

Example: Paper reports HR = 0.65, p = 0.003.

  1. p = 0.003 (two-sided) → z = 2.968
  2. log(0.65) = −0.431
  3. SE = |−0.431| / 2.968 = 0.145
  4. 95% CI = exp(−0.431 ± 1.96 × 0.145) = (0.489, 0.863)
Caution: If the paper only reports "p < 0.05," using z = 1.96 will overestimate the SE (underestimate precision). This is a conservative approach — acceptable but not ideal. Contact the authors for exact data whenever possible.

Scenario 3: Paper Only Shows Kaplan-Meier Curves (No HR Reported)

This is the most challenging scenario. Some older publications or studies with negative results only show KM curves without reporting HR.

Use the Tierney et al. (2007) method to estimate HR from KM curves:

  1. Digitize the KM curve using a tool like WebPlotDigitizer to extract survival probability coordinates at each time point
  2. Combine with the number-at-risk table (usually displayed below the KM plot)
  3. Reconstruct event and censoring counts for each time interval
  4. Estimate the HR from the reconstructed approximate individual patient data (IPD)

Reference: Tierney JF, Stewart LA, Ghersi D, Burdett S, Sydes MR. Practical methods for incorporating summary time-to-event data into meta-analysis. Trials. 2007;8:16.

Tip: Tierney's method has a companion Excel calculator available in the paper's supplementary materials. If the KM plot lacks a number-at-risk table, estimation accuracy drops significantly. Consider listing "unable to extract HR" as an exclusion criterion.

Data Extraction Template

FieldDescriptionExample
First AuthorStudy's first authorWang
YearPublication year2023
Study DesignRCT / Cohort / OtherRCT
InterventionExperimental group treatmentImmunotherapy + chemo
ControlControl group treatmentChemotherapy alone
OutcomeOS / PFS / DFS etc.OS
HRHazard Ratio0.72
CI Lower95% CI lower bound0.58
CI Upper95% CI upper bound0.90
HR TypeAdjusted / UnadjustedAdjusted
Data SourceDirectly reported / p-value derived / KM extractedDirectly reported

Statistical Principles of HR Meta-Analysis

Understanding the statistical foundation helps you correctly perform the analysis and identify errors. The key point: HRs are pooled on the log scale, not the original scale.

Why Log Transformation?

HR values range from (0, +∞) and have a right-skewed distribution. For example, HR = 0.5 (risk halved) and HR = 2.0 (risk doubled) represent effects of equal magnitude but opposite direction, yet they are unequally distant from the null value of 1 (distances: 0.5 vs. 1.0).

After log transformation:

Calculating the Standard Error

When you know the HR and 95% CI:

SE[log(HR)] = (log(CIupper) − log(CIlower)) / 3.92

Where 3.92 = 2 × 1.96, because 95% CI = log(HR) ± 1.96 × SE, so CI width = 2 × 1.96 × SE.

Example: HR = 0.72, 95% CI (0.58, 0.90):

  1. log(HR) = log(0.72) = −0.329
  2. log(0.90) = −0.105
  3. log(0.58) = −0.545
  4. SE = (−0.105 − (−0.545)) / 3.92 = 0.440 / 3.92 = 0.112

Inverse-Variance Weighted Pooling

Meta-analysis uses the inverse-variance method to pool log(HR) values across studies:

Weight wi = 1 / SEi²

Pooled log(HR) = ∑(wi × log(HRi)) / ∑wi

Studies with smaller SE (typically larger sample size, more events) receive greater weight. After pooling, the combined log(HR) and its CI are back-transformed via exp():

Pooled HR = exp(pooled log(HR))
Pooled 95% CI = exp(pooled log(HR) ± 1.96 × SEpooled)
Good news: MetaReview handles all these calculations automatically. You only need to enter the original HR and 95% CI — the tool performs log transformation, SE calculation, inverse-variance weighting, and back-transformation behind the scenes.

Fixed-Effect vs. Random-Effects Model

Decision guide: Use random-effects when I² > 50% or Q-test p < 0.10.

Forest Plot Interpretation for HR

The forest plot is the most important visual output of any meta-analysis. HR forest plots share the same structure as OR/RR plots but have specific interpretation details.

Basic Elements

ElementMeaning
Square (each row)Point estimate of a single study's HR. Square size = study weight (smaller SE = larger square).
Horizontal lines95% CI of that study's HR. Shorter line = more precise estimate.
Vertical dashed line (x = 1)Null line. HR = 1 means no difference between groups.
Diamond (bottom)Pooled HR. Diamond center = pooled point estimate; diamond width = pooled 95% CI.

Direction Interpretation

HR forest plots typically use a log-scale x-axis. Key judgments:

Forest Plot Reading Example: Study A: ----[====]---- HR = 0.65 (0.48, 0.88) CI entirely left of 1 → Significantly favours treatment Study B: -----[==]-----| HR = 0.89 (0.71, 1.12) CI crosses null line 1 → Not statistically significant Study C: --[======]--- HR = 0.58 (0.39, 0.86) Large square → High weight study Pooled: <====> HR = 0.71 (0.59, 0.85) Diamond left of 1 → Pooled result significantly favours treatment
Publication tip: Always label the forest plot sides: left = "Favours treatment," right = "Favours control." This is recommended by the Cochrane Handbook and is frequently checked by peer reviewers.

Heterogeneity Indicators

If I² is very high (>75%): The pooled HR may be misleading. Prioritize subgroup analysis or meta-regression to identify the source of heterogeneity (e.g., different tumor types, treatment regimens, or follow-up durations).

Step-by-Step: HR Meta-Analysis in MetaReview

Step 1: Select Effect Measure

Open MetaReview and select "Hazard Ratio" from the effect measure dropdown at the top. The data entry area automatically switches to HR-specific input fields.

Step 2: Enter Study Information

For each row, fill in:

Step 3: Enter HR and Confidence Interval

MetaReview automatically performs log(HR) transformation and SE calculation in the background.

Step 4: Run the Analysis

After entering all study data, click "Run Meta-Analysis". Results are computed within seconds.

Step 5: Review Results

  1. Forest Plot — Displays each study's HR, 95% CI, weight, and pooled result. X-axis uses log scale with the null line at HR = 1.
  2. Funnel Plot — Assesses publication bias. A symmetric funnel shape suggests no obvious bias; asymmetry suggests possible small-study effects.
  3. Sensitivity Analysis (Leave-one-out) — Recalculates the pooled HR after removing each study one at a time. If removing a study substantially changes the result, that study has high influence.
  4. Heterogeneity Statistics — I², Q-test p-value, and τ².

Keyboard Shortcuts for Faster Data Entry

Batch entry tip: Organize all study data in Excel first (Study, Year, HR, CI Lower, CI Upper), then copy and paste row by row. MetaReview also supports pasting tabular data from Excel or Google Sheets.

Common Pitfalls and How to Avoid Them

Pitfall 1: Mixing HR with OR/RR

This is the most serious methodological error. HR comes from survival analysis (Cox regression) and accounts for time and censoring. OR comes from logistic regression, RR from frequency tables — they only consider cumulative events at a fixed time point. Their mathematical foundations are entirely different.

Solution: Specify in your inclusion criteria that "studies must report HR or provide data from which HR can be calculated." Analyze studies reporting only OR separately.

Pitfall 2: Mixing Adjusted and Unadjusted HRs

The same paper may report both unadjusted (univariate) and adjusted (multivariable) HRs. They can differ substantially.

Solution: Pre-specify in your protocol which type of HR to extract. Prefer adjusted HRs because they control for confounders. Maintain consistency across all included studies. If mixing is unavoidable, conduct a sensitivity analysis.

Pitfall 3: Ignoring the Proportional Hazards Assumption

The Cox model assumes the HR remains constant throughout follow-up (proportional hazards assumption). If KM curves cross, this assumption is violated and a single HR cannot accurately describe the treatment effect.

How to check:

Solution: Discuss the PH limitation in your Discussion section. If multiple studies show crossing curves, consider restricted mean survival time (RMST) as an alternative effect measure.

Pitfall 4: Incorrect Subgroup HR Extraction

Papers often report subgroup HRs (by tumor type, age group, PD-L1 expression, etc.) in forest plot format.

Pitfall 5: Ignoring Follow-Up Duration Differences

Different studies may have vastly different median follow-up times (e.g., 12 months vs. 60 months). Short follow-up may underestimate long-term effects, while long follow-up may be affected by treatment crossover.

Solution: Record median follow-up time during data extraction and explore it as a potential source of heterogeneity via meta-regression.

Pitfall 6: Publication Bias

Studies with positive results (HR significantly < 1) are more likely to be published, while negative results may remain in file drawers.

Solution:

Start Your HR Meta-Analysis Now

Enter HR and 95% CI. MetaReview automatically handles log transformation, pooling, forest plots, and funnel plots. Free, no coding required.

Open MetaReview

See live demo: Aspirin vs Placebo meta-analysis (7 RCTs) →

Stay Updated

Get notified about new features and meta-analysis tips.

No spam. Unsubscribe anytime.