A review of the use of propensity score diagnostics in papers published in high-ranking medical journals

Emily Granger; Tim Watkins; Jamie C Sergeant; Mark Lunt

doi:10.1186/s12874-020-00994-0

A review of the use of propensity score diagnostics in papers published in high-ranking medical journals

BMC Med Res Methodol. 2020 May 27;20(1):132. doi: 10.1186/s12874-020-00994-0.

Authors

Emily Granger¹, Tim Watkins², Jamie C Sergeant^{3

4}, Mark Lunt³

Affiliations

¹ Centre for Epidemiology Versus Arthritis, Centre for Musculoskeletal Research, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, M13 9PT, UK. emily.granger-2@postgrad.manchester.ac.uk.
² Department of Developmental Disability Neuropsychiatry, School of Psychiatry, University of New South Wales, Sydney, Australia.
³ Centre for Epidemiology Versus Arthritis, Centre for Musculoskeletal Research, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, M13 9PT, UK.
⁴ Centre for Biostatistics, Division of Population Health, Health Services Research and Primary Care, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, M13 9PT, UK.

Abstract

Background: Propensity scores are widely used to deal with confounding bias in medical research. An incorrectly specified propensity score model may lead to residual confounding bias; therefore it is essential to use diagnostics to assess propensity scores in a propensity score analysis. The current use of propensity score diagnostics in the medical literature is unknown. The objectives of this study are to (1) assess the use of propensity score diagnostics in medical studies published in high-ranking journals, and (2) assess whether the use of propensity score diagnostics differs between studies (a) in different research areas and (b) using different propensity score methods.

Methods: A PubMed search identified studies published in high-impact journals between Jan 1st 2014 and Dec 31st 2016 using propensity scores to answer an applied medical question. From each study we extracted information regarding how propensity scores were assessed and which propensity score method was used. Research area was defined using the journal categories from the Journal Citations Report.

Results: A total of 894 papers were included in the review. Of these, 187 (20.9%) failed to report whether the propensity score had been assessed. Commonly reported diagnostics were p-values from hypothesis tests (36.6%) and the standardised mean difference (34.6%). Statistical tests provided marginally stronger evidence for a difference in diagnostic use between studies in different research areas (p = 0.033) than studies using different propensity score methods (p = 0.061).

Conclusions: The use of diagnostics in the propensity score medical literature is far from optimal, with different diagnostics preferred in different areas of medicine. The propensity score literature may improve with focused efforts to change practice in areas where suboptimal practice is most common.

Keywords: Confounding; Covariate balance; Diagnostics; Epidemiology; Propensity scores.

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Bias
Humans
Periodicals as Topic*
Propensity Score
Publications
Research Design

Grants and funding

1789957/Medical Research Council (UK)/International