Inferring Causal Relationships Between Risk Factors and Outcomes from Genome-Wide Association Study Data

Annu Rev Genomics Hum Genet. 2018 Aug 31:19:303-327. doi: 10.1146/annurev-genom-083117-021731. Epub 2018 Apr 25.

Abstract

An observational correlation between a suspected risk factor and an outcome does not necessarily imply that interventions on levels of the risk factor will have a causal impact on the outcome (correlation is not causation). If genetic variants associated with the risk factor are also associated with the outcome, then this increases the plausibility that the risk factor is a causal determinant of the outcome. However, if the genetic variants in the analysis do not have a specific biological link to the risk factor, then causal claims can be spurious. We review the Mendelian randomization paradigm for making causal inferences using genetic variants. We consider monogenic analysis, in which genetic variants are taken from a single gene region, and polygenic analysis, which includes variants from multiple regions. We focus on answering two questions: When can Mendelian randomization be used to make reliable causal inferences, and when can it be used to make relevant causal inferences?

Keywords: causal inference; drug discovery; genetic epidemiology; instrumental variable; target validation.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Causality
  • Genome-Wide Association Study*
  • Humans
  • Mendelian Randomization Analysis
  • Risk Factors