Unknown

Dataset Information

0

Unrealistic phylogenetic trees may improve phylogenetic footprinting.


ABSTRACT: The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily.Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting.The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo.: martin.nettling@informatik.uni-halle.de.Supplementary data are available at Bioinformatics online.

SUBMITTER: Nettling M 

PROVIDER: S-EPMC5447242 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Unrealistic phylogenetic trees may improve phylogenetic footprinting.

Nettling Martin M   Treutler Hendrik H   Cerquides Jesus J   Grosse Ivo I  

Bioinformatics (Oxford, England) 20170601 11


<h4>Motivation</h4>The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolut  ...[more]

Similar Datasets

| S-EPMC1156870 | biostudies-literature
| S-EPMC1538810 | biostudies-other
| S-EPMC3813836 | biostudies-other
| S-EPMC3669789 | biostudies-other
| S-EPMC2744684 | biostudies-literature
| S-EPMC1160516 | biostudies-literature
| S-EPMC1691382 | biostudies-literature
| S-EPMC30389 | biostudies-literature
| S-EPMC4012501 | biostudies-literature
| S-EPMC5333389 | biostudies-literature