Unknown

Dataset Information

0

Swarm v3: towards tera-scale amplicon clustering.


ABSTRACT: Previously we presented swarm, an open-source amplicon clustering program that produces fine-scale molecular operational taxonomic units (OTUs) that are free of arbitrary global clustering thresholds. Here we present swarm v3 to address issues of contemporary datasets that are growing towards tera-byte sizes. When compared to previous swarm versions, swarm v3 has modernized C ++ source code, reduced memory footprint by up to 50%, optimized CPU-usage and multithreading (more than 7 times faster with default parameters), and it has been extensively tested for its robustness and logic. Source code and binaries are available at https://github.com/torognes/swarm. Supplementary data are available at Bioinformatics online.

SUBMITTER: Mahe F 

PROVIDER: S-EPMC8696092 | biostudies-literature | 2021 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Swarm v3: towards tera-scale amplicon clustering.

Mahé Frédéric F   Czech Lucas L   Stamatakis Alexandros A   Quince Christopher C   de Vargas Colomban C   Dunthorn Micah M   Rognes Torbjørn T  

Bioinformatics (Oxford, England) 20211201 1


<h4>Motivation</h4>Previously we presented swarm, an open-source amplicon clustering programme that produces fine-scale molecular operational taxonomic units (OTUs) that are free of arbitrary global clustering thresholds. Here, we present swarm v3 to address issues of contemporary datasets that are growing towards tera-byte sizes.<h4>Results</h4>When compared with previous swarm versions, swarm v3 has modernized C++ source code, reduced memory footprint by up to 50%, optimized CPU-usage and mult  ...[more]

Similar Datasets

| S-EPMC4690345 | biostudies-literature
| S-EPMC4178461 | biostudies-literature
| S-EPMC5716574 | biostudies-literature
| S-EPMC8554525 | biostudies-literature
| S-EPMC8118605 | biostudies-literature
| S-EPMC4488466 | biostudies-literature
| S-EPMC4562660 | biostudies-literature
| S-EPMC9409525 | biostudies-literature
| S-EPMC11911446 | biostudies-literature
| S-EPMC3295656 | biostudies-literature