Transcription factors (TFs) bind to DNA and regulate the transcription of nearby genes. However, only a small fraction of TF binding sites have such regulatory effects. Here we search for predictors of functional binding sites by systematically screening a variety of contextual factors computationally (histone modifications, nuclear lamin binding, and cofactor binding). We used regression analysis to test whether contextual factors are associated with the upregulation or downregulation of neighboring genes following the induction or knockdown of nine TFs in mouse embryonic stem (ES) cells. Functional TF binding sites appeared to be either active (i.e., bound by P300, CHD7, Mediator, cohesin, and SWI/SNF) or repressed (i.e., marked by H3K27me3 and bound by Polycomb factors). Active binding sites mediated the downregulation of nearby genes upon the knockdown of activating TFs or the induction of repressors. Repressed TF binding sites mediated the upregulation of nearby genes (e.g., poised developmental regulators) upon the induction of TFs. In addition, repressed binding sites mediated the repressive effects of TFs, identified by the downregulation of target genes after the induction of TFs or by the upregulation of target genes after the knockdown of TFs. The contextual factors associated with the functions of DNA-bound TFs were then used to improve the identification of candidate target genes regulated by TFs.
Keywords: chromatin modification; cis-regulatory module; embryonic stem cells; enhancer; target genes; transcription factor binding site.