Ontology highlight
ABSTRACT: Motivation
Extra-long tandem repeats (ETRs) are widespread in eukaryotic genomes and play an important role in fundamental cellular processes, such as chromosome segregation. Although emerging long-read technologies have enabled ETR assemblies, the accuracy of such assemblies is difficult to evaluate since there are no tools for their quality assessment. Moreover, since the mapping of error-prone reads to ETRs remains an open problem, it is not clear how to polish draft ETR assemblies.Results
To address these problems, we developed the TandemTools software that includes the TandemMapper tool for mapping reads to ETRs and the TandemQUAST tool for polishing ETR assemblies and their quality assessment. We demonstrate that TandemTools not only reveals errors in ETR assemblies but also improves the recently generated assemblies of human centromeres.Availability and implementation
https://github.com/ablab/TandemTools.Supplementary information
Supplementary data are available at Bioinformatics online.
SUBMITTER: Mikheenko A
PROVIDER: S-EPMC7355294 | biostudies-literature | 2020 Jul
REPOSITORIES: biostudies-literature
Mikheenko Alla A Bzikadze Andrey V AV Gurevich Alexey A Miga Karen H KH Pevzner Pavel A PA
Bioinformatics (Oxford, England) 20200701 Suppl_1
<h4>Motivation</h4>Extra-long tandem repeats (ETRs) are widespread in eukaryotic genomes and play an important role in fundamental cellular processes, such as chromosome segregation. Although emerging long-read technologies have enabled ETR assemblies, the accuracy of such assemblies is difficult to evaluate since there are no tools for their quality assessment. Moreover, since the mapping of error-prone reads to ETRs remains an open problem, it is not clear how to polish draft ETR assemblies.<h ...[more]