Statistical power in COVID-19 case-control host genomic study design

Genome Med. 2020 Dec 28;12(1):115. doi: 10.1186/s13073-020-00818-2.

Abstract

The identification of genetic variation that directly impacts infection susceptibility to SARS-CoV-2 and disease severity of COVID-19 is an important step towards risk stratification, personalized treatment plans, therapeutic, and vaccine development and deployment. Given the importance of study design in infectious disease genetic epidemiology, we use simulation and draw on current estimates of exposure, infectivity, and test accuracy of COVID-19 to demonstrate the feasibility of detecting host genetic factors associated with susceptibility and severity in published COVID-19 study designs. We demonstrate that limited phenotypic data and exposure/infection information in the early stages of the pandemic significantly impact the ability to detect most genetic variants with moderate effect sizes, especially when studying susceptibility to SARS-CoV-2 infection. Our insights can aid in the interpretation of genetic findings emerging in the literature and guide the design of future host genetic studies.

Keywords: Genetic epidemiology; Genome-wide association studies; Statistical genetics; Study design.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • COVID-19 / epidemiology*
  • COVID-19 / genetics
  • COVID-19 Testing
  • Case-Control Studies*
  • Computer Simulation
  • Confounding Factors, Epidemiologic
  • Exposome
  • False Negative Reactions
  • Genetic Predisposition to Disease
  • Genetic Variation
  • Genomics / methods*
  • Host-Pathogen Interactions / genetics
  • Humans
  • Pandemics*
  • Research Design* / statistics & numerical data
  • Reverse Transcriptase Polymerase Chain Reaction
  • Risk
  • SARS-CoV-2*
  • Sensitivity and Specificity