Dataset Information


OrthoReD: a rapid and accurate orthology prediction tool with low computational requirement.



Identifying orthologous genes is an initial step required for phylogenetics, and it is also a common strategy in functional genetics for finding candidates for functionally equivalent genes across multiple species. However, in silico orthology prediction tools often require large computational resources available only on computing clusters. Here we present OrthoReD, an open-source orthology prediction tool that requires only a desktop computer and achieves accuracy comparable to published tools. OrthoReD keeps its computational requirement low by running the orthology search on one gene of interest at a time: for each gene of interest it generates a reduced dataset, which limits the scope of that gene's orthology search.
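The per-gene reduction described above can be illustrated with a small sketch. This is not OrthoReD's code. It is a toy example under stated assumptions: the k-mer Jaccard score stands in for whatever sequence similarity search a real tool would use, and the function names, the threshold of 0.3, and the example sequences are all hypothetical.

```python
from typing import Dict, Iterable, Set


def kmers(seq: str, k: int = 3) -> Set[str]:
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity of k-mer sets (assumed toy stand-in for a real search)."""
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / len(ka | kb)


def reduced_dataset(query_id: str, sequences: Dict[str, str],
                    threshold: float = 0.3) -> Dict[str, str]:
    """Keep only sequences similar enough to the query; the query is always kept."""
    query = sequences[query_id]
    return {
        seq_id: seq
        for seq_id, seq in sequences.items()
        if seq_id == query_id or similarity(query, seq) >= threshold
    }


def reduce_per_gene(genes_of_interest: Iterable[str], sequences: Dict[str, str],
                    threshold: float = 0.3) -> Dict[str, Dict[str, str]]:
    """Handle one gene of interest at a time, building a separate subset for each."""
    return {g: reduced_dataset(g, sequences, threshold) for g in genes_of_interest}


# Hypothetical sequences: two near-identical proteins and one unrelated protein.
sequences = {
    "spA|gene1": "MKTAYIAKQRQISFVKSHFSRQ",
    "spB|gene1": "MKTAYIAKQRQISFVKAHFSRQ",
    "spC|gene9": "GGGPLLWWNNDDEEHHCCTTVV",
}
subsets = reduce_per_gene(["spA|gene1"], sequences)
```

In this sketch, any downstream orthology analysis for a gene would see only that gene's subset rather than every sequence in every genome. This is the general reason a per-gene reduction can lower memory and CPU needs.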


The output of OrthoReD was highly similar to the outputs of two other published orthology prediction tools, OrthologID and/or OrthoDB, for the three datasets tested, which represented three phyla with different ranges of species diversity and different numbers of genomes. Median CPU time for ortholog prediction per gene by OrthoReD on a desktop computer was <15 min even for the largest dataset tested, which included all coding sequences of 100 bacterial species.


High-throughput sequencing has made unprecedented numbers of genes from non-model organisms available, increasing the need for clear information about their orthologies and/or functional equivalents in model organisms. OrthoReD is not only a fast and accurate orthology prediction tool, but also gives researchers flexibility in the number of genes analyzed at a time, without requiring a high-performance computing cluster.

SUBMITTER: Battenberg K 

PROVIDER: S-EPMC5479036 | BioStudies | 2017-01-01

REPOSITORIES: biostudies

Similar Datasets

2013-01-01 | S-EPMC3853218 | BioStudies
2015-01-01 | S-EPMC4301908 | BioStudies
1000-01-01 | S-EPMC3228541 | BioStudies
2015-01-01 | S-EPMC5467691 | BioStudies
1000-01-01 | S-EPMC3114741 | BioStudies
2019-01-01 | S-EPMC6352304 | BioStudies
2007-01-01 | S-EPMC2211326 | BioStudies
1000-01-01 | S-EPMC2820494 | BioStudies
2014-01-01 | S-EPMC4219706 | BioStudies
2019-01-01 | S-EPMC6325911 | BioStudies