Bipartite Community Structure of eQTLs

PLoS Comput Biol. 2016 Sep 12;12(9):e1005033. doi: 10.1371/journal.pcbi.1005033. eCollection 2016 Sep.

Abstract

Genome Wide Association Studies (GWAS) and expression quantitative trait locus (eQTL) analyses have identified genetic associations with a wide range of human phenotypes. However, many of these variants have weak effects and understanding their combined effect remains a challenge. One hypothesis is that multiple SNPs interact in complex networks to influence functional processes that ultimately lead to complex phenotypes, including disease states. Here we present CONDOR, a method that represents both cis- and trans-acting SNPs and the genes with which they are associated as a bipartite graph and then uses the modular structure of that graph to place SNPs into a functional context. In applying CONDOR to eQTLs in chronic obstructive pulmonary disease (COPD), we found the global network "hub" SNPs were devoid of disease associations through GWAS. However, the network was organized into 52 communities of SNPs and genes, many of which were enriched for genes in specific functional classes. We identified local hubs within each community ("core SNPs") and these were enriched for GWAS SNPs for COPD and many other diseases. These results speak to our intuition: rather than single SNPs influencing single genes, we see groups of SNPs associated with the expression of families of functionally related genes and that disease SNPs are associated with the perturbation of those functions. These methods are not limited in their application to COPD and can be used in the analysis of a wide variety of disease processes and other phenotypic traits.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Genome-Wide Association Study / methods*
  • Humans
  • Models, Statistical
  • Polymorphism, Single Nucleotide / genetics
  • Pulmonary Disease, Chronic Obstructive / genetics
  • Quantitative Trait Loci / genetics*