Dataset Information


International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons.

ABSTRACT: This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing(®), or Ion Torrent PGM(®). The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies.


PROVIDER: S-EPMC4822220 | BioStudies | 2015-01-01

REPOSITORIES: biostudies

Similar Datasets

2019-01-01 | S-EPMC6377111 | BioStudies
2012-01-01 | S-EPMC3552681 | BioStudies
2005-01-01 | S-EPMC1151890 | BioStudies
2008-01-01 | S-EPMC2613232 | BioStudies
2017-01-01 | S-EPMC6044758 | BioStudies
2019-01-01 | S-EPMC6834636 | BioStudies
2012-01-01 | S-EPMC3335818 | BioStudies
1000-01-01 | S-EPMC168106 | BioStudies
1000-01-01 | S-EPMC4889936 | BioStudies
1000-01-01 | S-EPMC2813033 | BioStudies