Improving power and robustness for detecting genetic association with extreme-value sampling design

Hua Yun Chen; Mingyao Li

doi:10.1002/gepi.20631

Improving power and robustness for detecting genetic association with extreme-value sampling design

Genet Epidemiol. 2011 Dec;35(8):823-30. doi: 10.1002/gepi.20631. Epub 2011 Oct 17.

Authors

Hua Yun Chen¹, Mingyao Li

Affiliation

¹ Division of Epidemiology and Biostatistics, School of Public Health, University of Illinois at Chicago, IL, USA. hychen@uic.edu

PMID: 22006659
DOI: 10.1002/gepi.20631

Abstract

Extreme-value sampling design that samples subjects with extremely large or small quantitative trait values is commonly used in genetic association studies. Samples in such designs are often treated as "cases" and "controls" and analyzed using logistic regression. Such a case-control analysis ignores the potential dose-response relationship between the quantitative trait and the underlying trait locus and thus may lead to loss of power in detecting genetic association. An alternative approach to analyzing such data is to model the dose-response relationship by a linear regression model. However, parameter estimation from this model can be biased, which may lead to inflated type I errors. We propose a robust and efficient approach that takes into consideration of both the biased sampling design and the potential dose-response relationship. Extensive simulations demonstrate that the proposed method is more powerful than the traditional logistic regression analysis and is more robust than the linear regression analysis. We applied our method to the analysis of a candidate gene association study on high-density lipoprotein cholesterol (HDL-C) which includes study subjects with extremely high or low HDL-C levels. Using our method, we identified several SNPs showing a stronger evidence of association with HDL-C than the traditional case-control logistic regression analysis. Our results suggest that it is important to appropriately model the quantitative traits and to adjust for the biased sampling when dose-response relationship exists in extreme-value sampling designs.

Publication types

Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Case-Control Studies
Cholesterol, HDL / genetics*
Cross-Sectional Studies
Genetic Predisposition to Disease*
Humans
Models, Genetic*
Polymorphism, Single Nucleotide
Quantitative Trait Loci
Quantitative Trait, Heritable*
Regression Analysis
Selection Bias

Substances

Cholesterol, HDL

Abstract

Publication types

MeSH terms

Substances

Grants and funding