Unknown

Dataset Information

0

Epigenetic priors for identifying active transcription factor binding sites.


ABSTRACT: Accurate knowledge of the genome-wide binding of transcription factors in a particular cell type or under a particular condition is necessary for understanding transcriptional regulation. Using epigenetic data such as histone modification and DNase I, accessibility data has been shown to improve motif-based in silico methods for predicting such binding, but this approach has not yet been fully explored.We describe a probabilistic method for combining one or more tracks of epigenetic data with a standard DNA sequence motif model to improve our ability to identify active transcription factor binding sites (TFBSs). We convert each data type into a position-specific probabilistic prior and combine these priors with a traditional probabilistic motif model to compute a log-posterior odds score. Our experiments, using histone modifications H3K4me1, H3K4me3, H3K9ac and H3K27ac, as well as DNase I sensitivity, show conclusively that the log-posterior odds score consistently outperforms a simple binary filter based on the same data. We also show that our approach performs competitively with a more complex method, CENTIPEDE, and suggest that the relative simplicity of the log-posterior odds scoring method makes it an appealing and very general method for identifying functional TFBSs on the basis of DNA and epigenetic evidence.FIMO, part of the MEME Suite software toolkit, now supports log-posterior odds scoring using position-specific priors for motif search. A web server and source code are available at http://meme.nbcr.net. Utilities for creating priors are at http://research.imb.uq.edu.au/t.bailey/SD/Cuellar2011.t.bailey@uq.edu.auSupplementary data are available at Bioinformatics online.

SUBMITTER: Cuellar-Partida G 

PROVIDER: S-EPMC3244768 | biostudies-literature | 2012 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Epigenetic priors for identifying active transcription factor binding sites.

Cuellar-Partida Gabriel G   Buske Fabian A FA   McLeay Robert C RC   Whitington Tom T   Noble William Stafford WS   Bailey Timothy L TL  

Bioinformatics (Oxford, England) 20111108 1


<h4>Motivation</h4>Accurate knowledge of the genome-wide binding of transcription factors in a particular cell type or under a particular condition is necessary for understanding transcriptional regulation. Using epigenetic data such as histone modification and DNase I, accessibility data has been shown to improve motif-based in silico methods for predicting such binding, but this approach has not yet been fully explored.<h4>Results</h4>We describe a probabilistic method for combining one or mor  ...[more]

Similar Datasets

| S-EPMC547828 | biostudies-other
| S-EPMC2981494 | biostudies-literature
| S-EPMC2647310 | biostudies-literature
| S-EPMC4675984 | biostudies-literature
| S-EPMC3898213 | biostudies-literature
| S-EPMC6001751 | biostudies-literature
| S-EPMC4413818 | biostudies-literature