Unknown

Dataset Information

0

Inferring direct DNA binding from ChIP-seq.


ABSTRACT: Genome-wide binding data from transcription factor ChIP-seq experiments is the best source of information for inferring the relative DNA-binding affinity of these proteins in vivo. However, standard motif enrichment analysis and motif discovery approaches sometimes fail to correctly identify the binding motif for the ChIP-ed factor. To overcome this problem, we propose 'central motif enrichment analysis' (CMEA), which is based on the observation that the positional distribution of binding sites matching the direct-binding motif tends to be unimodal, well centered and maximal in the precise center of the ChIP-seq peak regions. We describe a novel visualization and statistical analysis tool--CentriMo--that identifies the region of maximum central enrichment in a set of ChIP-seq peak regions and displays the positional distributions of predicted sites. Using CentriMo for motif enrichment analysis, we provide evidence that one transcription factor (Nanog) has different binding affinity in vivo than in vitro, that another binds DNA cooperatively (E2f1), and confirm the in vivo affinity of NFIC, rescuing a difficult ChIP-seq data set. In another data set, CentriMo strongly suggests that there is no evidence of direct DNA binding by the ChIP-ed factor (Smad1). CentriMo is now part of the MEME Suite software package available at http://meme.nbcr.net. All data and output files presented here are available at: http://research.imb.uq.edu.au/t.bailey/sd/Bailey2011a.

SUBMITTER: Bailey TL 

PROVIDER: S-EPMC3458523 | biostudies-literature | 2012 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Inferring direct DNA binding from ChIP-seq.

Bailey Timothy L TL   Machanick Philip P  

Nucleic acids research 20120518 17


Genome-wide binding data from transcription factor ChIP-seq experiments is the best source of information for inferring the relative DNA-binding affinity of these proteins in vivo. However, standard motif enrichment analysis and motif discovery approaches sometimes fail to correctly identify the binding motif for the ChIP-ed factor. To overcome this problem, we propose 'central motif enrichment analysis' (CMEA), which is based on the observation that the positional distribution of binding sites  ...[more]

Similar Datasets

| S-EPMC4579343 | biostudies-literature
| S-EPMC3159476 | biostudies-literature
| S-EPMC2597701 | biostudies-literature
| S-EPMC2994895 | biostudies-literature
| S-EPMC4460594 | biostudies-literature
| S-EPMC5223348 | biostudies-literature
| S-EPMC2532738 | biostudies-literature
| S-EPMC4413818 | biostudies-literature