Unknown

Dataset Information

0

'Multi-SpaM': a maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees.


ABSTRACT: Word-based or 'alignment-free' methods for phylogeny inference have become popular in recent years. These methods are much faster than traditional, alignment-based approaches, but they are generally less accurate. Most alignment-free methods calculate 'pairwise' distances between nucleic-acid or protein sequences; these distance values can then be used as input for tree-reconstruction programs such as neighbor-joining. In this paper, we propose the first word-based phylogeny approach that is based on 'multiple' sequence comparison and 'maximum likelihood'. Our algorithm first samples small, gap-free alignments involving four taxa each. For each of these alignments, it then calculates a quartet tree and, finally, the program 'Quartet MaxCut' is used to infer a super tree for the full set of input taxa from the calculated quartet trees. Experimental results show that trees produced with our approach are of high quality.

SUBMITTER: Dencker T 

PROVIDER: S-EPMC7671388 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

'Multi-SpaM': a maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees.

Dencker Thomas T   Leimeister Chris-André CA   Gerth Michael M   Bleidorn Christoph C   Snir Sagi S   Morgenstern Burkhard B  

NAR genomics and bioinformatics 20191030 1


Word-based or 'alignment-free' methods for phylogeny inference have become popular in recent years. These methods are much faster than traditional, alignment-based approaches, but they are generally less accurate. Most alignment-free methods calculate 'pairwise' distances between nucleic-acid or protein sequences; these distance values can then be used as input for tree-reconstruction programs such as neighbor-joining. In this paper, we propose the first word-based phylogeny approach that is bas  ...[more]

Similar Datasets

| S-EPMC5409309 | biostudies-literature
| S-EPMC9387344 | biostudies-literature
| S-EPMC2835736 | biostudies-literature
| S-EPMC6436989 | biostudies-literature
| S-EPMC6330006 | biostudies-literature
| S-EPMC4267948 | biostudies-literature
| S-EPMC11887328 | biostudies-literature
| S-EPMC3649670 | biostudies-literature
| S-EPMC5416903 | biostudies-literature
2024-03-20 | GSE261769 | GEO