Unknown

Dataset Information

0

Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak.


ABSTRACT: Long-read sequencing has demonstrated great potential for characterizing all types of structural variations (SVs). However, existing algorithms have insufficient sensitivity and precision. To address these limitations, we present DeBreak, a computational method for comprehensive and accurate SV discovery. Based on alignment results, DeBreak employs a density-based approach for clustering SV candidates together with a local de novo assembly approach for reconstructing long insertions. A partial order alignment algorithm ensures precise SV breakpoints with single base-pair resolution, and a k-means clustering method can report multi-allele SV events. DeBreak outperforms existing tools on both simulated and real long-read sequencing data from both PacBio and Nanopore platforms. An important application of DeBreak is analyzing cancer genomes for potentially tumor-driving SVs. DeBreak can also be used for supplementing whole-genome assembly-based SV discovery.

SUBMITTER: Chen Y 

PROVIDER: S-EPMC9845341 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak.

Chen Yu Y   Wang Amy Y AY   Barkley Courtney A CA   Zhang Yixin Y   Zhao Xinyang X   Gao Min M   Edmonds Mick D MD   Chong Zechen Z  

Nature communications 20230117 1


Long-read sequencing has demonstrated great potential for characterizing all types of structural variations (SVs). However, existing algorithms have insufficient sensitivity and precision. To address these limitations, we present DeBreak, a computational method for comprehensive and accurate SV discovery. Based on alignment results, DeBreak employs a density-based approach for clustering SV candidates together with a local de novo assembly approach for reconstructing long insertions. A partial o  ...[more]

Similar Datasets

| S-EPMC8128450 | biostudies-literature
| S-EPMC3161018 | biostudies-literature
| S-EPMC6735718 | biostudies-literature
| S-EPMC7721175 | biostudies-literature
| S-EPMC4699355 | biostudies-literature
| S-EPMC9122538 | biostudies-literature
| S-EPMC8082928 | biostudies-literature
| S-EPMC10013895 | biostudies-literature
| S-EPMC11601376 | biostudies-literature
| S-EPMC3865185 | biostudies-literature