Unknown

Dataset Information

0

CloudSPAdes: assembly of synthetic long reads using de Bruijn graphs.


ABSTRACT: MOTIVATION:The recently developed barcoding-based synthetic long read (SLR) technologies have already found many applications in genome assembly and analysis. However, although some new barcoding protocols are emerging and the range of SLR applications is being expanded, the existing SLR assemblers are optimized for a narrow range of parameters and are not easily extendable to new barcoding technologies and new applications such as metagenomics or hybrid assembly. RESULTS:We describe the algorithmic challenge of the SLR assembly and present a cloudSPAdes algorithm for SLR assembly that is based on analyzing the de Bruijn graph of SLRs. We benchmarked cloudSPAdes across various barcoding technologies/applications and demonstrated that it improves on the state-of-the-art SLR assemblers in accuracy and speed. AVAILABILITY AND IMPLEMENTATION:Source code and installation manual for cloudSPAdes are available at https://github.com/ablab/spades/releases/tag/cloudspades-paper. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Tolstoganov I 

PROVIDER: S-EPMC6612831 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

cloudSPAdes: assembly of synthetic long reads using de Bruijn graphs.

Tolstoganov Ivan I   Bankevich Anton A   Chen Zhoutao Z   Pevzner Pavel A PA  

Bioinformatics (Oxford, England) 20190701 14


<h4>Motivation</h4>The recently developed barcoding-based synthetic long read (SLR) technologies have already found many applications in genome assembly and analysis. However, although some new barcoding protocols are emerging and the range of SLR applications is being expanded, the existing SLR assemblers are optimized for a narrow range of parameters and are not easily extendable to new barcoding technologies and new applications such as metagenomics or hybrid assembly.<h4>Results</h4>We descr  ...[more]

Similar Datasets

| S-EPMC5206522 | biostudies-literature
| S-EPMC5351550 | biostudies-literature
| S-EPMC8562525 | biostudies-literature
| S-EPMC3272472 | biostudies-literature
| S-EPMC2336801 | biostudies-literature
| S-EPMC3167803 | biostudies-literature
| S-EPMC3421212 | biostudies-literature
| S-EPMC6061703 | biostudies-literature
| S-EPMC5872255 | biostudies-literature
| S-EPMC4015147 | biostudies-literature