Adjusting for unmeasured confounding using validation data: Simplified two-stage calibration for survival and dichotomous outcomes

Vidar Hjellvik; Marie L De Bruin; Sven O Samuelsen; Øystein Karlstad; Morten Andersen; Jari Haukka; Peter Vestergaard; Frank de Vries; Kari Furu

doi:10.1002/sim.8131

Adjusting for unmeasured confounding using validation data: Simplified two-stage calibration for survival and dichotomous outcomes

Stat Med. 2019 Jul 10;38(15):2719-2734. doi: 10.1002/sim.8131. Epub 2019 Mar 3.

Authors

Vidar Hjellvik¹, Marie L De Bruin^{2

3}, Sven O Samuelsen^{1

4}, Øystein Karlstad¹, Morten Andersen^{5

6

7}, Jari Haukka⁸, Peter Vestergaard⁹, Frank de Vries^{2

10}, Kari Furu¹

Affiliations

¹ Department of Chronic Diseases and Ageing, Norwegian Institute of Public Health, Oslo, Norway.
² Division of Pharmacoepidemiology and Clinical Pharmacology, Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht, The Netherlands.
³ Department of Pharmacy, Copenhagen Centre for Regulatory Science, University of Copenhagen, Copenhagen, Denmark.
⁴ Department of Mathematics, University of Oslo, Oslo, Norway.
⁵ Centre for Pharmacoepidemiology, Karolinska Institutet, Clinical Epidemiology Division, Karolinska University Hospital, Solna, Sweden.
⁶ Department of Drug Design and Pharmacology, University of Copenhagen, Copenhagen, Denmark.
⁷ Research Unit of General Practice, University of Southern Denmark, Odense, Denmark.
⁸ Department of Public Health, University of Helsinki, Helsinki, Finland.
⁹ Department of Clinical Medicine and Department of Endocrinology, Aalborg University Hospital, Aalborg, Denmark.
¹⁰ Department of Clinical Pharmacy and Toxicology, Maastricht University Medical Centre, Maastricht, The Netherlands.

PMID: 30828842
DOI: 10.1002/sim.8131

Abstract

In epidemiology, one typically wants to estimate the risk of an outcome associated with an exposure after adjusting for confounders. Sometimes, outcome and exposure and maybe some confounders are available in a large data set, whereas some important confounders are only available in a validation data set that is typically a subset of the main data set. A generally applicable method in this situation is the two-stage calibration (TSC) method. We present a simplified easy-to-implement version of the TSC for the case where the validation data are a subset of the main data. We compared the simplified version to the standard TSC version for incidence rate ratios, odds ratios, relative risks, and hazard ratios using simulated data, and the simplified version performed better than our implementation of the standard version. The simplified version was also tested on real data and performed well.

Keywords: bias correction; epidemiology; two-stage calibration; unmeasured confounding; validation data.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Calibration
Computer Simulation
Confounding Factors, Epidemiologic*
Humans
Probability*
Proportional Hazards Models
Reproducibility of Results
Risk Assessment / methods*
Survival Analysis