Efficient generalized least squares method for mixed population and family-based samples in genome-wide association studies

Genet Epidemiol. 2014 Jul;38(5):430-8. doi: 10.1002/gepi.21811. Epub 2014 May 20.

Abstract

Genome-wide association studies (GWAS) that draw samples from multiple studies with a mixture of relationship structures are becoming more common. Analytical methods exist for using mixed-sample data, but few methods have been proposed for the analysis of genotype-by-environment (G×E) interactions. Using GWAS data from a study of sarcoidosis susceptibility genes in related and unrelated African Americans, we explored the current analytic options for genotype association testing in studies using both unrelated and family-based designs. We propose a novel method-generalized least squares (GLX)-to estimate both SNP and G×E interaction effects for categorical environmental covariates and compared this method to generalized estimating equations (GEE), logistic regression, the Cochran-Armitage trend test, and the WQLS and MQLS methods. We used simulation to demonstrate that the GLX method reduces type I error under a variety of pedigree structures. We also demonstrate its superior power to detect SNP effects while offering computational advantages and comparable power to detect G×E interactions versus GEE. Using this method, we found two novel SNPs that demonstrate a significant genome-wide interaction with insecticide exposure-rs10499003 and rs7745248, located in the intronic and 3' UTR regions of the FUT9 gene on chromosome 6q16.1.

Keywords: GWAS; G×E; gene-by-environment; generalized least squares; mixed samples; sarcoidosis.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Black or African American / genetics*
  • Environment
  • Family*
  • Genetic Predisposition to Disease / genetics*
  • Genome-Wide Association Study / methods*
  • Genotype
  • Humans
  • Least-Squares Analysis
  • Logistic Models
  • Models, Genetic
  • Pedigree
  • Polymorphism, Single Nucleotide / genetics
  • Sarcoidosis / genetics*