Dataset Information

invMap: a sensitive mapping tool for long noisy reads with inversion structural variants.


ABSTRACT:

Motivation

Longer reads produced by PacBio or Oxford Nanopore sequencers can span the breakpoints of structural variations (SVs) more frequently than shorter reads. As a result, existing long-read mapping methods often generate wrong alignments and variant calls. Compared to deletions and insertions, inversion events are more difficult to detect because the anchors in inversion regions are nonlinear with respect to those in SV-free regions. To address this issue, this study presents a novel long-read mapping algorithm named invMap.

Results

For each long noisy read, invMap first locates the aligned region with a specifically designed scoring method for chaining, then checks the remaining anchors in the aligned region to discover potential inversions. We benchmark invMap on simulated datasets across different genomes and sequencing coverages; experimental results demonstrate that invMap locates aligned regions and calls inversion SVs more accurately than the competing methods. On the real human genome sequencing dataset of NA12878, invMap finds more candidate inversion variant calls than the competing methods.
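
The two-stage idea described above (chain anchors to locate the aligned region, then inspect the leftover anchors inside that region) can be illustrated with a minimal sketch. This is not invMap's scoring method, which the abstract does not specify. It assumes a hypothetical anchor representation `(read_pos, ref_pos, strand)`, a read on the forward strand, a plain longest-colinear-chain score, and an illustrative `max_gap` threshold.

```python
from typing import List, Tuple

# Hypothetical anchor format: (read_pos, ref_pos, strand), strand is "+" or "-".
Anchor = Tuple[int, int, str]


def best_colinear_chain(anchors: List[Anchor], max_gap: int = 1000) -> List[Anchor]:
    """Longest chain of '+' anchors that increase in both read and reference.

    A stand-in for a real chaining score: each anchor counts 1, and two
    anchors may be linked only if their diagonal shift is at most max_gap.
    """
    fwd = sorted(a for a in anchors if a[2] == "+")
    if not fwd:
        return []
    score = [1] * len(fwd)
    prev = [-1] * len(fwd)
    for i in range(len(fwd)):
        for j in range(i):
            d_read = fwd[i][0] - fwd[j][0]
            d_ref = fwd[i][1] - fwd[j][1]
            colinear = d_read > 0 and d_ref > 0 and abs(d_read - d_ref) <= max_gap
            if colinear and score[j] + 1 > score[i]:
                score[i] = score[j] + 1
                prev[i] = j
    # Backtrack from the best-scoring anchor.
    i = max(range(len(fwd)), key=score.__getitem__)
    chain = []
    while i != -1:
        chain.append(fwd[i])
        i = prev[i]
    return chain[::-1]


def inversion_candidates(anchors: List[Anchor], chain: List[Anchor]) -> List[Anchor]:
    """Leftover '-' anchors that fall strictly inside the chained region."""
    if not chain:
        return []
    r_lo, r_hi = chain[0][0], chain[-1][0]
    g_lo, g_hi = chain[0][1], chain[-1][1]
    in_chain = set(chain)
    return sorted(
        a for a in anchors
        if a not in in_chain and a[2] == "-"
        and r_lo < a[0] < r_hi and g_lo < a[1] < g_hi
    )


# Toy read: forward anchors flank an inverted segment; one anchor is noise.
anchors = [
    (0, 1000, "+"), (100, 1100, "+"), (200, 1200, "+"),
    (300, 1500, "-"), (400, 1400, "-"), (500, 1300, "-"),
    (350, 9000, "+"),
    (600, 1600, "+"), (700, 1700, "+"),
]
chain = best_colinear_chain(anchors)
candidates = inversion_candidates(anchors, chain)
```

In the toy data, the inverted anchors run backwards along the reference as the read position increases, so they cannot join the forward chain. They are recovered only in the second pass, which is the nonlinearity the abstract refers to.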

Availability and implementation

The invMap software is available at https://github.com/zhang134/invMap.git.

SUBMITTER: Wei ZG 

PROVIDER: S-EPMC11320709 | biostudies-literature | 2023 Dec

REPOSITORIES: biostudies-literature

Publications

invMap: a sensitive mapping tool for long noisy reads with inversion structural variants.

Wei Ze-Gang, Bu Peng-Yu, Zhang Xiao-Dan, Liu Fei, Qian Yu, Wu Fang-Xiang

Bioinformatics (Oxford, England), 2023-12-01, issue 12


Similar Datasets

| S-EPMC10959152 | biostudies-literature
| S-EPMC7879691 | biostudies-literature
| S-EPMC9117619 | biostudies-literature
| S-EPMC6547545 | biostudies-literature
| S-EPMC6902338 | biostudies-literature
| S-EPMC6298053 | biostudies-literature
| S-EPMC10997618 | biostudies-literature
| S-EPMC4908361 | biostudies-literature
| S-EPMC5131822 | biostudies-literature
| S-EPMC10682169 | biostudies-literature