Unknown

Dataset Information

0

Finding Maximal Exact Matches Using the r-Index.


ABSTRACT: Efficiently finding maximal exact matches (MEMs) between a sequence read and a database of genomes is a key first step in read alignment. But until recently, it was unknown how to build a data structure in [Formula: see text] space that supports efficient MEM finding, where r is the number of runs in the Burrows-Wheeler Transform. In 2021, Rossi et al. showed how to build a small auxiliary data structure called thresholds in addition to the r-index in [Formula: see text] space. This addition enables efficient MEM finding using the r-index. In this article, we present the tool that implements this solution, which we call MONI. Namely, we give a high-level view of the main components of the data structure and show how the source code can be downloaded, compiled, and used to find MEMs between a set of sequence reads and a set of genomes.

SUBMITTER: Rossi M 

PROVIDER: S-EPMC8902461 | biostudies-literature | 2022 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Finding Maximal Exact Matches Using the r-Index.

Rossi Massimiliano M   Oliva Marco M   Bonizzoni Paola P   Langmead Ben B   Gagie Travis T   Boucher Christina C  

Journal of computational biology : a journal of computational molecular cell biology 20220117 2


Efficiently finding maximal exact matches (MEMs) between a sequence read and a database of genomes is a key first step in read alignment. But until recently, it was unknown how to build a data structure in [Formula: see text] space that supports efficient MEM finding, where <i>r</i> is the number of runs in the Burrows-Wheeler Transform. In 2021, Rossi et al. showed how to build a small auxiliary data structure called <i>thresholds</i> in addition to the <i>r</i>-index in [Formula: see text] spa  ...[more]

Similar Datasets

| S-EPMC8892979 | biostudies-literature
| S-EPMC2732316 | biostudies-literature
| S-EPMC6528274 | biostudies-literature
| S-EPMC403711 | biostudies-literature
| S-EPMC2722993 | biostudies-literature
| S-EPMC7430640 | biostudies-literature
| S-EPMC3436841 | biostudies-literature
| S-EPMC10209524 | biostudies-literature
| S-EPMC5172543 | biostudies-literature
| S-EPMC1764478 | biostudies-literature