Dataset Information


Development and validation of an rDNA operon based primer walking strategy applicable to de novo bacterial genome finishing.

ABSTRACT: Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms has necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provided a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing projects.

PROVIDER: S-EPMC4301005 | BioStudies |

REPOSITORIES: biostudies

Similar Datasets

| S-EPMC4650653 | BioStudies
| S-EPMC7263694 | BioStudies
| S-EPMC3900909 | BioStudies
| S-EPMC5513972 | BioStudies
| S-EPMC4173024 | BioStudies
| S-EPMC3648348 | BioStudies
| S-EPMC4159541 | BioStudies
| S-EPMC4110674 | BioStudies
| S-EPMC540276 | BioStudies
| S-EPMC6511069 | BioStudies