Background: Transcription initiation is controlled by cis-regulatory modules. Although these modules are usually made of clusters of short transcription factor binding sites, a small minority of such clusters in the genome have cis-regulatory activity. This paradox is currently unsolved.
Results: To identify what discriminates active from inactive clusters, we focused our attention on short topologically unconstrained clusters of two ETS and two GATA binding sites, similar to the early neural enhancer of Ciona intestinalis Otx. We first computationally identified 55 such clusters, conserved between the two Ciona genomes. In vivo assay of the activity of 19 hits identified three novel early neural enhancers, all located next to genes coexpressed with Otx. Optimization of ETS and GATA binding sites was not always sufficient to confer activity to inactive clusters. Rather, a dinucleotide sequence code associated to nucleosome depletion showed a robust correlation with enhancer potential. Identification of a large collection of Ciona regulatory regions revealed that predicted nucleosome depletion constitutes a general signature of Ciona enhancers, which is conserved between orthologous loci in the two Ciona genomes and which partitions conserved noncoding sequences into a major nucleosome-bound fraction and a minor nucleosome-free fraction with higher cis-regulatory potential. We also found this signature in a large fraction of short Drosophila cis-regulatory modules.
Conclusion: This study indicates that a sequence-based dinucleotide signature, previously associated with nucleosome depletion and independent of transcription factor binding sites, contributes to the definition of a local cis-regulatory potential in two metazoa, Ciona intestinalis and Drosophila melanogaster.
(c) 2010 Elsevier Ltd. All rights reserved.