The performance of different propensity score methods for estimating the effects of multiple treatments or exposures: a neutral comparison study

Background — The generalized propensity score is an extension of the conventional propensity score to settings with a categorical exposure with more than two levels of treatment or exposure. Six different methods of using the generalized propensity score have been used in the general internal medical literature. However, no studies have evaluated the relative performance of these methods.

Methods — We used Monte Carlo simulations to evaluate the performance of seven methods for using the generalized propensity score to estimate the effect of three levels of exposure when outcomes are continuous or binary. We examined estimation of both the average treatment effect and the average treatment effect for the treated. These methods for using the generalized propensity score included: regression and weighting-based approaches proposed by Imbens, regression and weighting-based approaches proposed by McCaffrey, a regression-based approach proposed by Spreeuwenberg, Rubin’s pairwise comparison method, three-way matching, matching weights, and overlap weights. We illustrated the application of these methods by estimating the effect of smoking status (current smoker vs. former smoker vs. never smoker) on death within one year of hospitalization for acute myocardial infarction.
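
To make the weighting-based approaches concrete, the following is a minimal sketch of how weights can be derived from a generalized propensity score when there are three exposure levels. It is not the study's code. It assumes the score has already been estimated (for example, with a multinomial logistic regression) and uses weight definitions that are common in the multiple-treatment literature: inverse probability weights for the average treatment effect, weights that standardize to a chosen exposure group for the average treatment effect for the treated, matching weights, and generalized overlap weights. The function names `gps_weight` and `weighted_mean` and the toy numbers are hypothetical and chosen only for illustration.

```python
def gps_weight(gps, received, scheme="ATE", target=None):
    """Weight for one subject, given the estimated generalized propensity score.

    gps      -- list of estimated probabilities, one per exposure level (sums to 1)
    received -- index of the exposure level the subject actually received
    scheme   -- "ATE", "ATT", "matching", or "overlap" (assumed definitions)
    target   -- index of the reference ("treated") level; required for "ATT"
    """
    if scheme == "ATE":
        numerator = 1.0                      # inverse probability of exposure received
    elif scheme == "ATT":
        if target is None:
            raise ValueError("target level is required for ATT weights")
        numerator = gps[target]              # standardize to the target group
    elif scheme == "matching":
        numerator = min(gps)                 # smallest of the exposure probabilities
    elif scheme == "overlap":
        numerator = 1.0 / sum(1.0 / e for e in gps)  # harmonic-mean style tilting
    else:
        raise ValueError("unknown scheme: " + scheme)
    return numerator / gps[received]


def weighted_mean(values, weights):
    """Normalized weighted mean of an outcome within one exposure group."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Toy subject (made-up numbers): probabilities of being a current, former,
# or never smoker, with the subject observed in the third category.
example_gps = [0.5, 0.3, 0.2]
for scheme in ("ATE", "ATT", "matching", "overlap"):
    print(scheme, gps_weight(example_gps, received=2, scheme=scheme, target=0))
```

Under these assumed definitions, a weighted estimate of the mean outcome at each exposure level is obtained by applying `weighted_mean` to the outcomes and weights of the subjects who received that level, and pairwise effects are the differences between those means.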

Results — No method had consistently superior performance across all scenarios and target estimands.

Conclusion — We make recommendations for the preferred method depending on the nature of the outcome and the target estimand.

Information

Citation

Austin PC, Austin DE. BMC Med Res Methodol. 2026; Apr 1 [Epub ahead of print].

Contributing ICES Scientists

Peter Austin

Research Programs

Cardiovascular

Associated Sites

ICES Central
