Prioritized subset analysis: improving power in genome-wide association studies

Hum Hered. 2008;65(3):129-41. doi: 10.1159/000109730. Epub 2007 Oct 12.

Abstract

Background: Genome-wide association studies (GWAS) are now feasible for studying the genetics underlying complex diseases. For many diseases, a list of candidate genes or regions exists and incorporation of such information into data analyses can potentially improve the power to detect disease variants. Traditional approaches for assessing the overall statistical significance of GWAS results ignore such information by inherently treating all markers equally.

Methods: We propose the prioritized subset analysis (PSA), in which a prioritized subset of markers is pre-selected from candidate regions, and the false discovery rate (FDR) procedure is carried out in the prioritized subset and its complementary subset, respectively.

Results: The PSA is more powerful than the whole-genome single-step FDR adjustment for a range of alternative models. The degree of power improvement depends on the fraction of associated SNPs in the prioritized subset and their nominal power, with higher fraction of associated SNPs and higher nominal power leading to more power improvement. The power improvement can be substantial; for disease loci not included in the prioritized subset, the power loss is almost negligible.

Conclusion: The PSA has the flexibility of allowing investigators to combine prior information from a variety of sources, and will be a useful tool for GWAS.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation
  • Gene Frequency
  • Genetic Linkage*
  • Genetic Markers
  • Genetic Predisposition to Disease
  • Genome, Human*
  • Genomics / methods*
  • Humans
  • Models, Genetic
  • Polymorphism, Single Nucleotide

Substances

  • Genetic Markers