A Comparison of Agent-Based Models and the Parametric G-Formula for Causal Inference

Am J Epidemiol. 2017 Jul 15;186(2):131-142. doi: 10.1093/aje/kwx091.

Abstract

Decision-making requires choosing from treatments on the basis of correctly estimated outcome distributions under each treatment. In the absence of randomized trials, 2 possible approaches are the parametric g-formula and agent-based models (ABMs). The g-formula has been used exclusively to estimate effects in the population from which data were collected, whereas ABMs are commonly used to estimate effects in multiple populations, necessitating stronger assumptions. Here, we describe potential biases that arise when ABM assumptions do not hold. To do so, we estimated 12-month mortality risk in simulated populations differing in prevalence of an unknown common cause of mortality and a time-varying confounder. The ABM and g-formula correctly estimated mortality and causal effects when all inputs were from the target population. However, whenever any inputs came from another population, the ABM gave biased estimates of mortality-and often of causal effects even when the true effect was null. In the absence of unmeasured confounding and model misspecification, both methods produce valid causal inferences for a given population when all inputs are from that population. However, ABMs may result in bias when extrapolated to populations that differ on the distribution of unmeasured outcome determinants, even when the causal network linking variables is identical.

Keywords: Monte Carlo methods; agent-based models; causal inference; decision analysis; individual-level models; mathematical models; medical decision making; parametric g-formula.

Publication types

  • Comparative Study

MeSH terms

  • Bias
  • Causality
  • Cause of Death / trends
  • Computer Simulation / statistics & numerical data
  • Data Interpretation, Statistical
  • Decision Making
  • Epidemiologic Research Design*
  • Humans
  • Models, Statistical*
  • Monte Carlo Method
  • Outcome Assessment, Health Care / methods
  • Outcome Assessment, Health Care / statistics & numerical data*
  • Risk Assessment / methods
  • Systems Analysis