Dataset Information

Accelerated nanopore basecalling with SLOW5 data format.


ABSTRACT:

Motivation

Nanopore sequencing is emerging as a key pillar in the genomic technology landscape, but computational constraints limiting its scalability remain to be overcome. The translation of raw current signal data into DNA or RNA sequence reads, known as 'basecalling', is a major source of friction in any nanopore sequencing workflow. Here, we exploit the advantages of the recently developed signal data format 'SLOW5' to streamline and accelerate nanopore basecalling in high-performance computing (HPC) and cloud environments.

Results

SLOW5 permits highly efficient sequential data access, eliminating a potential analysis bottleneck. To take advantage of this, we introduce Buttery-eel, an open-source wrapper for Oxford Nanopore's Guppy basecaller that enables SLOW5 data access, resulting in performance improvements that are essential for scalable, affordable basecalling.

Availability and implementation

Buttery-eel is available at https://github.com/Psy-Fer/buttery-eel.

SUBMITTER: Samarakoon H 

PROVIDER: S-EPMC10261880 | biostudies-literature | 2023 Jun

REPOSITORIES: biostudies-literature

Publications

Accelerated nanopore basecalling with SLOW5 data format.

Hiruna Samarakoon, James M. Ferguson, Hasindu Gamaarachchi, Ira W. Deveson

Bioinformatics (Oxford, England), 2023 Jun 1, issue 6


Similar Datasets

| S-EPMC9020074 | biostudies-literature
| S-EPMC11383704 | biostudies-literature
| S-EPMC11429927 | biostudies-literature
| S-EPMC6984161 | biostudies-literature
| S-EPMC7178565 | biostudies-literature
| S-EPMC6591954 | biostudies-literature
| S-EPMC10422362 | biostudies-literature
| S-EPMC10088207 | biostudies-literature
| S-EPMC10769248 | biostudies-literature
| S-EPMC11339354 | biostudies-literature