Project description:Microbial community profiling using 16S rRNA gene sequences requires accurate taxonomy assignments. 'Universal' primers target conserved sequences and amplify sequences from many taxa, but they provide variable coverage of different environments, and regions of the rRNA gene differ in taxonomic informativeness--especially when high-throughput short-read sequencing technologies (for example, 454 and Illumina) are used. We introduce a new evaluation procedure that provides an improved measure of expected taxonomic precision when classifying environmental sequence reads from a given primer. Applying this measure to thousands of combinations of primers and read lengths, simulating single-ended and paired-end sequencing, reveals that these choices greatly affect taxonomic informativeness. The most informative sequence region may differ by environment, partly due to variable coverage of different environments in reference databases. Using our Rtax method of classifying paired-end reads, we found that paired-end sequencing provides substantial benefit in some environments including human gut, but not in others. Optimal primer choice for short reads totaling 96 nt provides 82-100% of the confident genus classifications available from longer reads.
Project description:Recent advances in genomic technologies have enabled more in-depth study of the oral microbiome. In this study, we compared the amplicons generated by primers targeting different sites of the 16S rRNA gene found in the Human Oral Microbiome Database (HOMD). Six sets of primer targeting V1-V2, V1-V3, V3-V4, V4-V5, V5-V7 and V6-V8 regions of 16S rRNA were tested via in silico simulation. Primers targeting the V1-V2, V3-V4, and V4-V5 regions generated more than 90% of the original input sequences. Primers targeting the V1-V2 and V1-V3 regions exhibited a low number of mismatches and unclassified sequences at the taxonomic level, but there were notable discrepancies at the species level. Phylogenetic tree comparisons showed primers targeting the V1-V2 and V3-V4 regions showed performances similar to primers targeting the whole 16s RNA region in terms of separating total oral microbiomes and periodontopathogens. In an analysis of clinical oral samples, V1-V2 primers showed superior performance for identifying more taxa and had better resolution sensitivity for Streptococcus than V3-V4 primers. In conclusion, primers targeting the V1-V2 region of 16S rRNA showed the best performance for oral microbiome studies. In addition, the study demonstrates the need for careful PCR primer selections.
Project description:AimTo design and validate broad-range 16S rRNA primers for use in high throughput sequencing to classify bacteria isolated from the human foregut microbiome.MethodsA foregut microbiome dataset was constructed using 16S rRNA gene sequences obtained from oral, esophageal, and gastric microbiomes produced by Sanger sequencing in previous studies represented by 219 bacterial species. Candidate primers evaluated were from the European rRNA database. To assess the effect of sequence length on accuracy of classification, 16S rRNA genes of various lengths were created by trimming the full length sequences. Sequences spanning various hypervariable regions were selected to simulate the amplicons that would be obtained using possible primer pairs. The sequences were compared with full length 16S rRNA genes for accuracy in taxonomic classification using online software at the Ribosomal Database Project (RDP). The universality of the primer set was evaluated using the RDP 16S rRNA database which is comprised of 433 306 16S rRNA genes, represented by 36 phyla.ResultsTruncation to 100 nucleotides (nt) downstream from the position corresponding to base 28 in the Escherichia coli 16S rRNA gene caused misclassification of 87 (39.7%) of the 219 sequences, compared with misclassification of only 29 (13.2%) sequences with truncation to 350 nt. Among 350-nt sequence reads within various regions of the 16S rRNA gene, the reverse read of an amplicon generated using the 343F/798R primers had the least (8.2%) effect on classification. In comparison, truncation to 900 nt mimicking single pass Sanger reads misclassified 5.0% of the 219 sequences. The 343F/798R amplicon accurately assigned 91.8% of the 219 sequences at the species level. Weighted by abundance of the species in the esophageal dataset, the 343F/798R amplicon yielded similar classification accuracy without a significant loss in species coverage (92%). Modification of the 343F/798R primers to 347F/803R increased their universality among foregut species. Assuming that a typical polymerase chain reaction can tolerate 2 mismatches between a primer and a template, the modified 347F and 803R primers should be able to anneal 98% and 99.6% of all 16S rRNA genes in the RDP database.Conclusion347F/803R is the most suitable pair of primers for classification of foregut 16S rRNA genes but also possess universality suitable for analyses of other complex microbiomes.
Project description:A highly sensitive quantitative PCR detection method has been developed and applied to the distribution analysis of human intestinal bifidobacteria by combining real-time PCR with Bifidobacterium genus- and species-specific primers. Real-time PCR detection of serially diluted DNA extracted from cultured bifidobacteria was linear for cell counts ranging from 10(6) to 10 cells per PCR assay. It was also found that the method was applicable to the detection of Bifidobacterium in feces when it was present at concentrations of >10(6) cells per g of feces. Concerning the distribution of Bifidobacterium species in intestinal flora, the Bifidobacterium adolescentis group, the Bifidobacterium catenulatum group, and Bifidobacterium longum were found to be the three predominant species by examination of DNA extracted from the feces of 46 healthy adults. We also examined changes in the population and composition of Bifidobacterium species in human intestinal flora of six healthy adults over an 8-month period. The results showed that the composition of bifidobacterial flora was basically stable throughout the test period.
Project description:Microbial community analysis based on the 16S rRNA-gene is used to investigate both beneficial and harmful microorganisms in various fields and environments. Recently, the next-generation sequencing (NGS) technology has enabled rapid and accurate microbial community analysis. Despite these advantages of NGS based metagenomics study, sample transport, storage conditions, amplification, library preparation kits, sequencing, and bioinformatics procedures can bias microbial community analysis results. In this study, eight mock communities were pooled from genomic DNA of Lactobacillus acidophilus KCTC 3164T, Limosilactobacillus fermentum KCTC 3112T, Lactobacillus gasseri KCTC 3163T, Lacticaseibacillus paracasei subsp. paracasei KCTC 3510T, Limosilactobacillus reuteri KCTC 3594T, Lactococcus lactis subsp. lactis KCTC 3769T, Bifidobacterium animalis subsp. lactis KCTC 5854T, and Bifidobacterium breve KCTC 3220T. The genomic DNAs were quantified by droplet digital PCR (ddPCR) and were mixed as mock communities. The mock communities were amplified with various 16S rRNA gene universal primer pairs and sequenced by MiSeq, IonTorrent, MGIseq-2000, Sequel II, and MinION NGS platforms. In a comparison of primer-dependent bias, the microbial profiles of V1-V2 and V3 regions were similar to the original ratio of the mock communities, while the microbial profiles of the V1-V3 region were relatively biased. In a comparison of platform-dependent bias, the sequence read from short-read platforms (MiSeq, IonTorrent, and MGIseq-2000) showed lower bias than that of long-read platforms (Sequel II and MinION). Meanwhile, the sequences read from Sequel II and MinION platforms were relatively biased in some mock communities. In the data of all NGS platforms and regions, L. acidophilus was greatly underrepresented while Lactococcus lactis subsp. lactis was generally overrepresented. In all samples of this study, the bias index (BI) was calculated and PCA was performed for comparison. The samples with biased relative abundance showed high BI values and were separated in the PCA results. In particular, analysis of regions rich in AT and GC poses problems for genome assembly, which can lead to sequencing bias. According to this comparative analysis, the development of reference material (RM) material has been proposed to calibrate the bias in microbiome analysis.
Project description:Selection of optimal primer pairs in 16S rRNA gene sequencing is a pivotal issue in microorganism diversity analysis. However, limited effort has been put into investigation of specific primer sets for analysis of the bacterial diversity of aging flue-cured tobaccos (AFTs), as well as prediction of the function of the bacterial community. In this study, the performance of four primer pairs in determining bacterial community structure based on 16S rRNA gene sequences in AFTs was assessed, and the functions of genes were predicted using Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt). Results revealed that the primer set 799F-1193R covering the amplification region V5V6V7 gave a more accurate picture of the bacterial community structure of AFTs, with lower co-amplification levels of chloroplast and mitochondrial genes, and more genera covered than when using the other primers. In addition, functional gene prediction suggested that the microbiome of AFTs was involved in kinds of interested pathways. A high abundance of functional genes involved in nitrogen metabolism was detected in AFTs, reflecting a high level of bacteria involved in degrading harmful nitrogen compounds and generating nitrogenous nutrients for others. Additionally, the functional genes involved in biosynthesis of valuable metabolites and degradation of toxic compounds provided information that the AFTs possess a huge library of microorganisms and genes that could be applied to further studies. All of these findings provide a significance reference for researchers working on the bacterial diversity assessment of tobacco-related samples.
Project description:Designing primers for PCR-based taxonomic surveys that amplify a broad range of phylotypes in varied community samples is a difficult challenge, and the comparability of data sets amplified with varied primers requires attention. Here, we examined the performance of modified 16S rRNA gene and internal transcribed spacer (ITS) primers for archaea/bacteria and fungi, respectively, with nonaquatic samples. We moved primer bar codes to the 5' end, allowing for a range of different 3' primer pairings, such as the 515f/926r primer pair, which amplifies variable regions 4 and 5 of the 16S rRNA gene. We additionally demonstrated that modifications to the 515f/806r (variable region 4) 16S primer pair, which improves detection of Thaumarchaeota and clade SAR11 in marine samples, do not degrade performance on taxa already amplified effectively by the original primer set. Alterations to the fungal ITS primers did result in differential but overall improved performance compared to the original primers. In both cases, the improved primers should be widely adopted for amplicon studies. IMPORTANCE We continue to uncover a wealth of information connecting microbes in important ways to human and environmental ecology. As our scientific knowledge and technical abilities improve, the tools used for microbiome surveys can be modified to improve the accuracy of our techniques, ensuring that we can continue to identify groundbreaking connections between microbes and the ecosystems they populate, from ice caps to the human body. It is important to confirm that modifications to these tools do not cause new, detrimental biases that would inhibit the field rather than continue to move it forward. We therefore demonstrated that two recently modified primer pairs that target taxonomically discriminatory regions of bacterial and fungal genomic DNA do not introduce new biases when used on a variety of sample types, from soil to human skin. This confirms the utility of these primers for maintaining currently recommended microbiome research techniques as the state of the art.
Project description:Recent studies suggest that the association between a metronidazole-resistant anaerobe, Atopobium vaginae, and bacterial vaginosis (BV) warrants further investigation. In the present study, specific primers enhanced detection of A. vaginae and provided additional evidence that this bacterium is prevalent among patients with BV but absent among patients with normal vaginal flora.
Project description:Based on comparative phylogenetic analysis of 16S rRNA gene sequences deposited in an RDP database, we constructed a local database of thaumarchaeotal 16S rRNA gene sequences and developed a novel PCR primer specific for the archaeal phylum Thaumarchaeota. Among 9,727 quality-filtered (chimeral-checked, size >1.2 kb) archaeal sequences downloaded from the RDP database, 1,549 thaumarchaeotal sequences were identified and included in our local database. In our study, Thaumarchaeota included archaeal groups MG-I, SAGMCG-I, SCG, FSCG, RC, and HWCG-III, forming a monophyletic group in the phylogenetic tree. Cluster analysis revealed 114 phylotypes for Thaumarchaeota. The majority of the phylotypes (66.7%) belonged to the MG-I and SCG, which together contained most (93.9%) of the thaumarchaeotal sequences in our local database. A phylum-directed primer was designed from a consensus sequence of the phylotype sequences, and the primer's specificity was evaluated for coverage and tolerance both in silico and empirically. The phylum-directed primer, designated THAUM-494, showed >90% coverage for Thaumarchaeota and <1% tolerance to non-target taxa, indicating high specificity. To validate this result experimentally, PCRs were performed with THAUM-494 in combination with a universal archaeal primer (ARC917R or 1017FAR) and DNAs from five environmental samples to construct clone libraries. THAUM-494 showed a satisfactory specificity in empirical studies, as expected from the in silico results. Phylogenetic analysis of 859 cloned sequences obtained from 10 clone libraries revealed that >95% of the amplified sequences belonged to Thaumarchaeota. The most frequently sampled thaumarchaeotal subgroups in our samples were SCG, MG-I, and SAGMCG-I. To our knowledge, THAUM-494 is the first phylum-level primer for Thaumarchaeota. Furthermore, the high coverage and low tolerance of THAUM-494 will make it a potentially valuable tool in understanding the phylogenetic diversity and ecological niche of Thaumarchaeota.
Project description:The objective of this research was to characterize the genetic diversity and phylogenetic diversity among 12 cattle breeds (10 Chinese breeds and two foreign taurine breeds as controls) utilizing gene mtDNA 16S rRNA. The complete sequences of the mtDNA 16S rRNA genes of the 251 animals were 1570?bp long. The mean percentages of the four nitrogen bases were 37.8?% for adenine (A), 23.7?% for thymine (T), 20.9?% for cytosine (C), and 17.6?% for guanine (G). The mtDNA 16S rRNA gene base percentages had a strong bias towards A? + ?T. All detected nucleotide variations in gene mtDNA 16S rRNA were either transitions (62.3?%) or transversions (37.7?%); no indels (insertions and deletions) were found. A total of 40 haplotypes were constructed based on these mutations. A total of 36 haplotypes of these 40 haplotypes were present in 10 Chinese cattle breeds. The haplotype diversity of all Chinese cattle populations was 0.903±0.077 , while the nucleotide diversity was 0.0071±0.0039 . Kimura's two-parameter genetic distances between pairs of the studied 12 breeds ranged from 0.001 to 0.010. The phylogenetic analysis assigned the 10 Chinese breeds to two distinct lineages that likely differed in their percentage of Bos taurus and Bos indicus ancestry.