Project description:Hypervariable regions V3-V5 of bacterial 16S rRNA genes. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Project description:This study presents a validated, open-source QIIME2- and R-based pipeline for 16S rRNA gene profiling using multi-amplicon sequencing. It aims to overcome the limitations of commercial, closed-source tools by offering a standardized and reproducible workflow. The pipeline was benchmarked against proprietary software using five mock communities and 12 child–caregiver fecal sample pairs, showing nearly identical microbial profiles, greater sequencing depth, and improved taxonomic resolution. High reproducibility (R = 0.99, p < 0.0001) was achieved across all datasets. Application to pediatric cancer samples revealed distinct Bifidobacterium variants in children whose microbiota closely matched their caregivers’. This highlights the pipeline’s utility in studying microbial relationships. Overall, the pipeline supports transparent, adaptable, and accurate microbiome analysis, advancing research in both clinical and experimental settings while promoting open-source solutions for reproducible science.
Project description:16S rRNA gene sequences are commonly analyzed for taxonomic and phylogenetic studies because they contain variable regions that can help distinguish different genera. However, intra-genus distinction using variable region homology is often impossible due to the high overall sequence identities among closely related species, even though some residues may be conserved within respective species. Using a computational method that included the allelic diversity within individual genomes, we discovered that certain Escherichia and Shigella species can be distinguished by a multi-allelic 16S rRNA variable region single nucleotide polymorphism (SNP). To evaluate the performance of 16S rRNAs with altered variable regions, we developed an in vivo system that measures the acceptance and distribution of variant 16S rRNAs into a large pool of natural versions supporting normal translation and growth. We found that 16S rRNAs containing evolutionarily disparate variable regions were underpopulated both in ribosomes and in active translation pools, even for an SNP. Overall, this study revealed that variable region sequences can substantially influence the performance of 16S rRNAs and that this biological constraint can be leveraged to justify refining taxonomic assignments of variable region sequence data. IMPORTANCE This study reevaluates the notion that 16S rRNA gene variable region sequences are uninformative for intra-genus classification and that single nucleotide variations within them have no consequence to strains that bear them. We demonstrated that the performance of 16S rRNAs in Escherichia coli can be negatively impacted by sequence changes in variable regions, even for single nucleotide changes that are native to closely related Escherichia and Shigella species; thus, biological performance is likely constraining the evolution of variable regions in bacteria. Further, the native nucleotide variations we tested occur in all strains of their respective species and across their multiple 16S rRNA gene copies, suggesting that these species evolved beyond what would be discerned from a consensus sequence comparison. Therefore, this work also reveals that the multiple 16S rRNA gene alleles found in most bacteria can provide more informative phylogenetic and taxonomic detail than a single reference allele.
Project description:Cells devote a significant effort toward the production of multiple modified nucleotides in rRNAs, which fine tune the ribosome function. Here, we report that two methyltransferases, RsmB and RsmF, are responsible for all four 5-methylcytidine (m(5)C) modifications in 16S rRNA of Thermus thermophilus. Like Escherichia coli RsmB, T. thermophilus RsmB produces m(5)C967. In contrast to E. coli RsmF, which introduces a single m(5)C1407 modification, T. thermophilus RsmF modifies three positions, generating m(5)C1400 and m(5)C1404 in addition to m(5)C1407. These three residues are clustered near the decoding site of the ribosome, but are situated in distinct structural contexts, suggesting a requirement for flexibility in the RsmF active site that is absent from the E. coli enzyme. Two of these residues, C1400 and C1404, are sufficiently buried in the mature ribosome structure so as to require extensive unfolding of the rRNA to be accessible to RsmF. In vitro, T. thermophilus RsmF methylates C1400, C1404, and C1407 in a 30S subunit substrate, but only C1400 and C1404 when naked 16S rRNA is the substrate. The multispecificity of T. thermophilus RsmF is potentially explained by three crystal structures of the enzyme in a complex with cofactor S-adenosyl-methionine at up to 1.3 A resolution. In addition to confirming the overall structural similarity to E. coli RsmF, these structures also reveal that key segments in the active site are likely to be dynamic in solution, thereby expanding substrate recognition by T. thermophilus RsmF.
Project description:Contaminated aquifer (Dusseldorf-Flinger, Germany) templates extracted from 5 sediment depths ranging between 6.4 and 8.4 m below ground and over 3 years of sampling were amplified for amplicon pyrosequencing using the primers Ba27f (5’-aga gtt tga tcm tgg ctc ag-3’) and Ba519r (5’- tat tac cgc ggc kgc tg-3’), extended as amplicon fusion primers with respective primer A or B adapters, key sequence and multiplex identifiers (MID) as recommended by 454/Roche. Amplicons were purified and pooled as specified by the manufacturer. Emulsion PCR (emPCR), purification of DNA-enriched beads and sequencing run were performed following protocols and using a 2nd generation pyrosequencer (454 GS FLX Titanium, Roche) as recommended by the developer. Quality filtering of the pyrosequencing reads was performed using the automatic amplicon pipeline of the GS Run Processor (Roche), with a slight modification concerning the valley filter (vfScanAllFlows false instead of TiOnly) to extract the sequences. Demultiplexed raw reads were furhter trimmed for quality and lenght (>250 bp). 15 samples examined in total from important plume zones of the aquifer sampled in Feb. 2006, Sep. 2008 and Jun. 2009 (5 every year of sampling).
Project description:In this work, we compare the resolution of V2-V3 and V3-V4 16S rRNA regions for the purposes of estimating microbial community diversity using paired-end Illumina MiSeq reads, and show that the fragment, including V2 and V3 regions, has higher resolution for lower-rank taxa (genera and species). It allows for a more precise distance-based clustering of reads into species-level OTUs. Statistically convergent estimates of the diversity of major species (defined as those that together are covered by 95% of reads) can be achieved at the sample sizes of 10000 to 15000 reads. The relative error of the Shannon index estimate for this condition is lower than 4%.
Project description:Contaminated aquifer (Dusseldorf-Flinger, Germany) templates extracted from 5 sediment depths ranging between 6.4 and 8.4 m below ground and over 3 years of sampling were amplified for amplicon pyrosequencing using the primers Ba27f (5’-aga gtt tga tcm tgg ctc ag-3’) and Ba519r (5’- tat tac cgc ggc kgc tg-3’), extended as amplicon fusion primers with respective primer A or B adapters, key sequence and multiplex identifiers (MID) as recommended by 454/Roche. Amplicons were purified and pooled as specified by the manufacturer. Emulsion PCR (emPCR), purification of DNA-enriched beads and sequencing run were performed following protocols and using a 2nd generation pyrosequencer (454 GS FLX Titanium, Roche) as recommended by the developer. Quality filtering of the pyrosequencing reads was performed using the automatic amplicon pipeline of the GS Run Processor (Roche), with a slight modification concerning the valley filter (vfScanAllFlows false instead of TiOnly) to extract the sequences. Demultiplexed raw reads were furhter trimmed for quality and lenght (>250 bp).
Project description:Microbial functions in the host physiology are a result of co-evolution between microbial communities and their hosts. Here we show that cold exposure leads to marked shift of the microbiota composition, referred to as cold microbiota. Transplantation of the cold microbiota to germ-free mice is sufficient to increase the insulin sensitivity of the host, and enable complete tolerance to cold partly by promoting the white fat browning, leading to increased energy expenditure and fat loss. During prolonged cold however, the body weight loss is attenuated, caused by adaptive mechanisms maximising caloric uptake and increasing intestinal, villi and microvilli lengths. This increased absorptive surface is promoted by the cold microbiota - effect that can be diminished by co-transplanting the most downregulated bacterial strain from the Verrucomicrobia phylum, Akkermansia muciniphila, during the cold microbiota transfer. Our results demonstrate the microbiota as a key factor orchestrating the overall energy homeostasis during increased demand.