MACE: model based analysis of ChIP-exo

Liguo Wang; Junsheng Chen; Chen Wang; Liis Uusküla-Reimand; Kaifu Chen; Alejandra Medina-Rivera; Edwin J Young; Michael T Zimmermann; Huihuang Yan; Zhifu Sun; Yuji Zhang; Stephen T Wu; Haojie Huang; Michael D Wilson; Jean-Pierre A Kocher; Wei Li

doi:10.1093/nar/gku846

MACE: model based analysis of ChIP-exo

Nucleic Acids Res. 2014 Nov 10;42(20):e156. doi: 10.1093/nar/gku846. Epub 2014 Sep 23.

Authors

Affiliations

¹ Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN 55905, USA Division of Biostatistics, Dan L. Duncan Cancer Center and Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA wl1@bcm.edu.
² School of Life Science and Technology, Tongji University, Shanghai 200092, China.
³ Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN 55905, USA.
⁴ Genetics & Genome Biology Program, SickKids Research Institute, 686 Bay St. Toronto, ON, M5G 0A4, Canada.
⁵ Division of Biostatistics, Dan L. Duncan Cancer Center and Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA.
⁶ Department of Biochemistry and Molecular Biology, Mayo Clinic, MN 55905, USA.
⁷ Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S1A8, Canada.
⁸ Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN 55905, USA wl1@bcm.edu.
⁹ Division of Biostatistics, Dan L. Duncan Cancer Center and Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA wl1@bcm.edu.

Abstract

Understanding the role of a given transcription factor (TF) in regulating gene expression requires precise mapping of its binding sites in the genome. Chromatin immunoprecipitation-exo, an emerging technique using λ exonuclease to digest TF unbound DNA after ChIP, is designed to reveal transcription factor binding site (TFBS) boundaries with near-single nucleotide resolution. Although ChIP-exo promises deeper insights into transcription regulation, no dedicated bioinformatics tool exists to leverage its advantages. Most ChIP-seq and ChIP-chip analytic methods are not tailored for ChIP-exo, and thus cannot take full advantage of high-resolution ChIP-exo data. Here we describe a novel analysis framework, termed MACE (model-based analysis of ChIP-exo) dedicated to ChIP-exo data analysis. The MACE workflow consists of four steps: (i) sequencing data normalization and bias correction; (ii) signal consolidation and noise reduction; (iii) single-nucleotide resolution border peak detection using the Chebyshev Inequality and (iv) border matching using the Gale-Shapley stable matching algorithm. When applied to published human CTCF, yeast Reb1 and our own mouse ONECUT1/HNF6 ChIP-exo data, MACE is able to define TFBSs with high sensitivity, specificity and spatial resolution, as evidenced by multiple criteria including motif enrichment, sequence conservation, direct sequence pileup, nucleosome positioning and open chromatin states. In addition, we show that the fundamental advance of MACE is the identification of two boundaries of a TFBS with high resolution, whereas other methods only report a single location of the same event. The two boundaries help elucidate the in vivo binding structure of a given TF, e.g. whether the TF may bind as dimers or in a complex with other co-factors.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms
Animals
Binding Sites
CCCTC-Binding Factor
Chromatin Immunoprecipitation / methods*
Computer Simulation
DNA-Binding Proteins / metabolism
Exodeoxyribonucleases
Genome
Hepatocyte Nuclear Factor 6 / metabolism
Humans
Male
Mice
Mice, Inbred C57BL
Repressor Proteins / metabolism
Saccharomyces cerevisiae Proteins / metabolism
Sequence Analysis, DNA / methods*
Transcription Factors / metabolism*

Substances

CCCTC-Binding Factor
CTCF protein, human
Ctcf protein, mouse
DNA-Binding Proteins
Hepatocyte Nuclear Factor 6
Onecut1 protein, mouse
REB1 protein, S cerevisiae
Repressor Proteins
Saccharomyces cerevisiae Proteins
Transcription Factors
Exodeoxyribonucleases

Abstract

Publication types

MeSH terms

Substances

Grants and funding