Systematic localization of common disease-associated variation in regulatory DNA

Science. 2012 Sep 7;337(6099):1190-5. doi: 10.1126/science.1222794. Epub 2012 Sep 5.

Abstract

Genome-wide association studies have identified many noncoding variants associated with common diseases and traits. We show that these variants are concentrated in regulatory DNA marked by deoxyribonuclease I (DNase I) hypersensitive sites (DHSs). Eighty-eight percent of such DHSs are active during fetal development and are enriched in variants associated with gestational exposure-related phenotypes. We identified distant gene targets for hundreds of variant-containing DHSs that may explain phenotype associations. Disease-associated variants systematically perturb transcription factor recognition sequences, frequently alter allelic chromatin states, and form regulatory networks. We also demonstrated tissue-selective enrichment of more weakly disease-associated variants within DHSs and the de novo identification of pathogenic cell types for Crohn's disease, multiple sclerosis, and an electrocardiogram trait, without prior knowledge of physiological mechanisms. Our results suggest pervasive involvement of regulatory DNA variation in common human disease and provide pathogenic insights into diverse disorders.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Alleles
  • Chromatin / metabolism
  • Chromatin / ultrastructure
  • Crohn Disease / genetics
  • DNA / genetics*
  • Deoxyribonuclease I / metabolism
  • Disease / genetics*
  • Electrocardiography
  • Fetal Development
  • Fetus / metabolism
  • Gene Regulatory Networks
  • Genetic Variation*
  • Genome, Human
  • Genome-Wide Association Study
  • Humans
  • Multiple Sclerosis / genetics
  • Phenotype
  • Polymorphism, Single Nucleotide*
  • Promoter Regions, Genetic
  • Regulatory Elements, Transcriptional*
  • Regulatory Sequences, Nucleic Acid*
  • Transcription Factors / chemistry
  • Transcription Factors / genetics
  • Transcription Factors / metabolism*

Substances

  • Chromatin
  • Transcription Factors
  • DNA
  • Deoxyribonuclease I

Associated data

  • GEO/GSE18927
  • GEO/GSE29692