Dataset Information

OPERA-LG: efficient and exact scaffolding of large, repeat-rich eukaryotic genomes with performance guarantees.


ABSTRACT: The assembly of large, repeat-rich eukaryotic genomes represents a significant challenge in genomics. While long-read technologies have made the high-quality assembly of small, microbial genomes increasingly feasible, data generation can be expensive for larger genomes. OPERA-LG is a scalable, exact algorithm for the scaffold assembly of large, repeat-rich genomes, out-performing state-of-the-art programs for scaffold correctness and contiguity. It provides a rigorous framework for scaffolding of repetitive sequences and a systematic approach for combining data from different second-generation and third-generation sequencing technologies. OPERA-LG provides an avenue for systematic augmentation and improvement of thousands of existing draft eukaryotic genome assemblies.

SUBMITTER: Gao S 

PROVIDER: S-EPMC4864936 | biostudies-literature | 2016 May

REPOSITORIES: biostudies-literature

Publications

OPERA-LG: efficient and exact scaffolding of large, repeat-rich eukaryotic genomes with performance guarantees.

Gao S, Bertrand D, Chia BKH, Nagarajan N

Genome Biology, 2016-05-11


Similar Datasets

S-EPMC3268121 | biostudies-literature
S-EPMC4603742 | biostudies-literature
S-EPMC2687942 | biostudies-literature
S-EPMC6061838 | biostudies-other
S-EPMC3846640 | biostudies-literature
S-EPMC403711 | biostudies-literature
S-EPMC3124593 | biostudies-literature