Unknown

Dataset Information

0

BOA: A partitioned view of genome assembly.


ABSTRACT: De novo genome assembly is a fundamental problem in computational molecular biology that aims to reconstruct an unknown genome sequence from a set of short DNA sequences (or reads) obtained from the genome. The relative ordering of the reads along the target genome is not known a priori, which is one of the main contributors to the increased complexity of the assembly process. In this article, with the dual objective of improving assembly quality and exposing a high degree of parallelism, we present a partitioning-based approach. Our framework, BOA (bucket-order-assemble), uses a bucketing alongside graph- and hypergraph-based partitioning techniques to produce a partial ordering of the reads. This partial ordering enables us to divide the read set into disjoint blocks that can be independently assembled in parallel using any state-of-the-art serial assembler of choice. Experimental results show that BOA improves both the overall assembly quality and performance.

SUBMITTER: An X 

PROVIDER: S-EPMC9593263 | biostudies-literature | 2022 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications


<i>De novo</i> genome assembly is a fundamental problem in computational molecular biology that aims to reconstruct an unknown genome sequence from a set of short DNA sequences (or <i>reads</i>) obtained from the genome. The relative ordering of the reads along the target genome is <i>not</i> known <i>a priori</i>, which is one of the main contributors to the increased complexity of the assembly process. In this article, with the dual objective of improving assembly quality and exposing a high d  ...[more]

Similar Datasets

| S-EPMC6256578 | biostudies-literature
| S-EPMC9709994 | biostudies-literature
| S-EPMC8778117 | biostudies-literature
| S-EPMC9246002 | biostudies-literature
| S-EPMC10652540 | biostudies-literature
| S-EPMC11243985 | biostudies-literature
| S-EPMC5443515 | biostudies-literature
| S-EPMC10507586 | biostudies-literature
| PRJNA528644 | ENA
| PRJNA189799 | ENA