The structural basis for selective binding of non-methylated CpG islands by the CFP1 CXXC domain

Nat Commun. 2011:2:227. doi: 10.1038/ncomms1237.

Abstract

CFP1 is a CXXC domain-containing protein and an essential component of the SETD1 histone H3K4 methyltransferase complex. CXXC domain proteins direct different chromatin-modifying activities to various chromatin regions. Here, we report crystal structures of the CFP1 CXXC domain in complex with six different CpG DNA sequences. The crescent-shaped CFP1 CXXC domain is wedged into the major groove of the CpG DNA, distorting the B-form DNA, and interacts extensively with the major groove of the DNA. The structures elucidate the molecular mechanism of the non-methylated CpG-binding specificity of the CFP1 CXXC domain. The CpG motif is confined by a tripeptide located in a rigid loop, which only allows the accommodation of the non-methylated CpG dinucleotide. Furthermore, we demonstrate that CFP1 has a preference for a guanosine nucleotide following the CpG motif.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Binding Sites
  • Cell Nucleus / genetics
  • Cell Nucleus / metabolism
  • Chromatin / genetics
  • Chromatin / metabolism
  • Cloning, Molecular
  • CpG Islands / genetics*
  • Crystallization
  • Crystallography, X-Ray
  • DNA / chemistry
  • DNA / metabolism*
  • DNA Methylation
  • DNA-Binding Proteins* / genetics
  • DNA-Binding Proteins* / metabolism
  • Escherichia coli
  • Histones / genetics
  • Histones / metabolism
  • Humans
  • Models, Molecular
  • Molecular Sequence Data
  • Oligopeptides / genetics
  • Oligopeptides / metabolism
  • Protein Binding / genetics
  • Protein Structure, Tertiary
  • Recombinant Proteins* / genetics
  • Recombinant Proteins* / metabolism
  • Sequence Alignment
  • Structure-Activity Relationship
  • Trans-Activators

Substances

  • CXXC1 protein, human
  • Chromatin
  • DNA-Binding Proteins
  • Histones
  • Oligopeptides
  • Recombinant Proteins
  • Trans-Activators
  • DNA