Dataset Information

Scaling read aligners to hundreds of threads on general-purpose processors.


ABSTRACT:

Motivation

General-purpose processors can now contain many dozens of processor cores and support hundreds of simultaneous threads of execution. To make best use of these threads, genomics software must contend with new and subtle computer architecture issues. We discuss some of these and propose methods for improving thread scaling in tools that analyze each read independently, such as read aligners.

Results

We implement these methods in new versions of Bowtie, Bowtie 2 and HISAT. We greatly improve thread scaling in many scenarios, including on the recent Intel Xeon Phi architecture. We also highlight how bottlenecks are exacerbated by variable-record-length file formats like FASTQ and suggest changes that enable superior scaling.

Availability and implementation

Experiments for this study: https://github.com/BenLangmead/bowtie-scaling.

Bowtie: http://bowtie-bio.sourceforge.net.

Bowtie 2: http://bowtie-bio.sourceforge.net/bowtie2.

HISAT: http://www.ccb.jhu.edu/software/hisat.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Langmead B 

PROVIDER: S-EPMC6361242 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature

Publications

Scaling read aligners to hundreds of threads on general-purpose processors.

Ben Langmead, Christopher Wilks, Valentin Antonescu, Rone Charles

Bioinformatics (Oxford, England), 2019-02-01, issue 3



Similar Datasets

S-EPMC4165466 | biostudies-other
S-EPMC6642177 | biostudies-literature
S-EPMC7320598 | biostudies-literature
S-EPMC6821417 | biostudies-literature
S-EPMC8087178 | biostudies-literature
S-EPMC10458655 | biostudies-literature
S-EPMC3124697 | biostudies-literature
S-EPMC2858443 | biostudies-literature
S-EPMC10249055 | biostudies-literature
S-EPMC11315692 | biostudies-literature