Dataset Information


The hidden perils of read mapping as a quality assessment tool in genome sequencing.

ABSTRACT: This article presents a comparative analysis of genome sequencing methods, focusing on verification of assembly quality. Various de novo assembly tools and sequencing technologies are compared using the recently completed genome sequence of Lactobacillus fermentum 3872. In particular, assembly quality is assessed using CLC Genomics Workbench read mapping and optical mapping developed by OpGen. Over-extension of contigs without prior knowledge of contig location can lead to misassembled contigs, even when commonly used quality indicators such as read mapping suggest that a contig is well assembled. Precautions must also be taken when using long-read sequencing technology, which may likewise lead to misassembled contigs.


PROVIDER: S-EPMC5320493 | BioStudies | 2017-01-01

REPOSITORIES: biostudies

Similar Datasets

2015-01-01 | S-EPMC4542784 | BioStudies
2014-01-01 | S-EPMC4197248 | BioStudies
2011-01-01 | S-EPMC3128070 | BioStudies
2015-01-01 | S-EPMC4476701 | BioStudies
2015-01-01 | S-EPMC4696311 | BioStudies
1000-01-01 | S-EPMC4728574 | BioStudies
2017-01-01 | S-EPMC5382505 | BioStudies
1000-01-01 | S-EPMC4180827 | BioStudies
2019-01-01 | S-EPMC6889754 | BioStudies
2013-01-01 | S-EPMC3853357 | BioStudies