Dataset Information

Asteroid: a new algorithm to infer species trees from gene trees under high proportions of missing data.


ABSTRACT:

Motivation

Missing data and incomplete lineage sorting (ILS) are two major obstacles to accurate species tree inference. Gene tree summary methods such as ASTRAL and ASTRID have been developed to account for ILS. However, they can be severely affected by high levels of missing data.

Results

We present Asteroid, a novel algorithm that infers an unrooted species tree from a set of unrooted gene trees. We show on both empirical and simulated datasets that Asteroid is substantially more accurate than ASTRAL and ASTRID for very high proportions (>80%) of missing data. Asteroid is several orders of magnitude faster than ASTRAL for datasets that contain thousands of genes. It offers advanced features such as parallelization, support value computation and support for multi-copy and multifurcating gene trees.
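To make the ">80% missing data" figure concrete, here is a minimal sketch of one way such a proportion could be computed. It assumes that "missing data" means the fraction of (gene tree, species) pairs where the species is absent from the gene tree. That definition, the function name `missing_data_proportion`, and the toy input are illustrative assumptions, not taken from the Asteroid paper or its code.

```python
from typing import Iterable, Set


def missing_data_proportion(gene_tree_taxa: Iterable[Set[str]]) -> float:
    """Fraction of (gene tree, species) pairs where the species is absent.

    Each element of `gene_tree_taxa` is the set of species labels found at
    the leaves of one gene tree. This definition is an assumption made for
    illustration only.
    """
    families = [set(taxa) for taxa in gene_tree_taxa]
    if not families:
        raise ValueError("need at least one gene tree")

    # The species set is the union of all labels seen in any gene tree.
    all_species = set().union(*families)
    if not all_species:
        raise ValueError("gene trees contain no species labels")

    total_cells = len(families) * len(all_species)
    present_cells = sum(len(taxa) for taxa in families)
    return 1.0 - present_cells / total_cells


# Hypothetical toy input: four gene trees over species A, B, C, D.
example = [{"A", "B", "C", "D"}, {"A", "B"}, {"C"}, {"D"}]
proportion = missing_data_proportion(example)
```

In this toy input, 8 of the 16 possible (gene tree, species) cells are filled, so half of the data is missing under the assumed definition.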

Availability and implementation

Asteroid is freely available at https://github.com/BenoitMorel/Asteroid.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Morel B 

PROVIDER: S-EPMC9838317 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

Publications

Asteroid: a new algorithm to infer species trees from gene trees under high proportions of missing data.

Morel B, Williams TA, Stamatakis A

Bioinformatics (Oxford, England), 2023-01-01, 1


Similar Datasets

| S-EPMC11232582 | biostudies-literature
| S-EPMC5998179 | biostudies-literature
| S-EPMC8932604 | biostudies-literature
| S-EPMC5998899 | biostudies-literature
| S-EPMC9116704 | biostudies-literature
| S-EPMC9236578 | biostudies-literature
| S-EPMC5920143 | biostudies-literature
| S-EPMC2576265 | biostudies-literature
| S-EPMC10942411 | biostudies-literature
| S-EPMC4061266 | biostudies-literature