Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model

Nat Genet. 2013 Sep;45(9):1021-1028. doi: 10.1038/ng.2713. Epub 2013 Jul 28.

Abstract

Despite continual progress in the cataloging of vertebrate regulatory elements, little is known about their organization and regulatory architecture. Here we describe a massively parallel experiment to systematically test the impact of copy number, spacing, combination and order of transcription factor binding sites on gene expression. A complex library of ∼5,000 synthetic regulatory elements containing patterns from 12 liver-specific transcription factor binding sites was assayed in mice and in HepG2 cells. We find that certain transcription factors act as direct drivers of gene expression in homotypic clusters of binding sites, independent of spacing between sites, whereas others function only synergistically. Heterotypic enhancers are stronger than their homotypic analogs and favor specific transcription factor binding site combinations, mimicking putative native enhancers. Exhaustive testing of binding site permutations suggests that there is flexibility in binding site order. Our findings provide quantitative support for a flexible model of regulatory element activity and suggest a framework for the design of synthetic tissue-specific enhancers.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Binding Sites
  • Cell Line
  • Cluster Analysis
  • Enhancer Elements, Genetic
  • Gene Amplification
  • Gene Dosage
  • Gene Expression
  • Gene Expression Regulation*
  • Genes, Reporter
  • Humans
  • Liver / metabolism
  • Male
  • Mice
  • Models, Biological*
  • Nucleotide Motifs
  • Organ Specificity / genetics
  • Protein Binding
  • Regulatory Sequences, Nucleic Acid*
  • Transcription Factors / metabolism*

Substances

  • Transcription Factors