Unknown

Dataset Information

0

Local de novo assembly of RAD paired-end contigs using short sequencing reads.


ABSTRACT: Despite the power of massively parallel sequencing platforms, a drawback is the short length of the sequence reads produced. We demonstrate that short reads can be locally assembled into longer contigs using paired-end sequencing of restriction-site associated DNA (RAD-PE) fragments. We use this RAD-PE contig approach to identify single nucleotide polymorphisms (SNPs) and determine haplotype structure in threespine stickleback and to sequence E. coli and stickleback genomic DNA with overlapping contigs of several hundred nucleotides. We also demonstrate that adding a circularization step allows the local assembly of contigs up to 5 kilobases (kb) in length. The ease of assembly and accuracy of the individual contigs produced from each RAD site sequence suggests RAD-PE sequencing is a useful way to convert genome-wide short reads into individually-assembled sequences hundreds or thousands of nucleotides long.

SUBMITTER: Etter PD 

PROVIDER: S-EPMC3076424 | biostudies-literature | 2011 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Local de novo assembly of RAD paired-end contigs using short sequencing reads.

Etter Paul D PD   Preston Jessica L JL   Bassham Susan S   Cresko William A WA   Johnson Eric A EA  

PloS one 20110413 4


Despite the power of massively parallel sequencing platforms, a drawback is the short length of the sequence reads produced. We demonstrate that short reads can be locally assembled into longer contigs using paired-end sequencing of restriction-site associated DNA (RAD-PE) fragments. We use this RAD-PE contig approach to identify single nucleotide polymorphisms (SNPs) and determine haplotype structure in threespine stickleback and to sequence E. coli and stickleback genomic DNA with overlapping  ...[more]

Similar Datasets

| S-EPMC3158087 | biostudies-literature
| S-EPMC3527383 | biostudies-literature
| S-EPMC3614465 | biostudies-other
| S-EPMC7168855 | biostudies-literature
2010-07-13 | E-GEOD-22765 | biostudies-arrayexpress
| S-EPMC4252104 | biostudies-literature
2010-07-13 | GSE22765 | GEO
| S-EPMC3919575 | biostudies-literature