The Awesome Power of Yeast Evolutionary Genetics: New Genome Sequences and Strain Resources for the Saccharomyces sensu stricto Genus.
ABSTRACT: High-quality, well-annotated genome sequences and standardized laboratory strains fuel experimental and evolutionary research. We present improved genome sequences of three species of Saccharomyces sensu stricto yeasts: S. bayanus var. uvarum (CBS 7001), S. kudriavzevii (IFO 1802(T) and ZP 591), and S. mikatae (IFO 1815(T)), and describe their comparison to the genomes of S. cerevisiae and S. paradoxus. The new sequences, derived by assembling millions of short DNA sequence reads together with previously published Sanger shotgun reads, have vastly greater long-range continuity and far fewer gaps than the previously available genome sequences. New gene predictions defined a set of 5261 protein-coding orthologs across the five most commonly studied Saccharomyces yeasts, enabling a re-examination of the tempo and mode of yeast gene evolution and improved inferences of species-specific gains and losses. To facilitate experimental investigations, we generated genetically marked, stable haploid strains for all three of these Saccharomyces species. These nearly complete genome sequences and the collection of genetically marked strains provide a valuable toolset for comparative studies of gene function, metabolism, and evolution, and render Saccharomyces sensu stricto the most experimentally tractable model genus. These resources are freely available and accessible through www.SaccharomycesSensuStricto.org.
Project description:Inter-specific hybridization leading to abrupt speciation is a well-known, common mechanism in angiosperm evolution; only recently, however, have similar hybridization and speciation mechanisms been documented to occur frequently among the closely related group of sensu stricto Saccharomyces yeasts. The economically important lager beer yeast Saccharomyces pastorianus is such a hybrid, formed by the union of Saccharomyces cerevisiae and Saccharomyces bayanus-related yeasts; efforts to understand its complex genome, searching for both biological and brewing-related insights, have been underway since its hybrid nature was first discovered. It had been generally thought that a single hybridization event resulted in a unique S. pastorianus species, but it has been recently postulated that there have been two or more hybridization events. Here, we show that there may have been two independent origins of S. pastorianus strains, and that each independent group--defined by characteristic genome rearrangements, copy number variations, ploidy differences, and DNA sequence polymorphisms--is correlated with specific breweries and/or geographic locations. Finally, by reconstructing common ancestral genomes via array-CGH data analysis and by comparing representative DNA sequences of the S. pastorianus strains with those of many different S. cerevisiae isolates, we have determined that the most likely S. cerevisiae ancestral parent for each of the independent S. pastorianus groups was an ale yeast, with different, but closely related ale strains contributing to each group's parentage.
Project description:Comparative genomics is a formidable tool to identify functional elements throughout a genome. In the past ten years, studies in the budding yeast Saccharomyces cerevisiae and a set of closely related species have been instrumental in showing the benefit of analyzing patterns of sequence conservation. Increasing the number of closely related genome sequences makes the comparative genomics approach more powerful and accurate.Here, we report the genome sequence and analysis of Saccharomyces arboricolus, a yeast species recently isolated in China, that is closely related to S. cerevisiae. We obtained high quality de novo sequence and assemblies using a combination of next generation sequencing technologies, established the phylogenetic position of this species and considered its phenotypic profile under multiple environmental conditions in the light of its gene content and phylogeny.We suggest that the genome of S. arboricolus will be useful in future comparative genomics analysis of the Saccharomyces sensu stricto yeasts.
Project description:Saccharomyces sensu stricto complex consist of yeast species, which are not only important in the fermentation industry but are also model systems for genomic and ecological analysis. Here, we present the complete genome assemblies of Saccharomyces jurei, a newly discovered Saccharomyces sensu stricto species from high altitude oaks. Phylogenetic and phenotypic analysis revealed that S. jurei is more closely related to S. mikatae, than S. cerevisiae, and S. paradoxus The karyotype of S. jurei presents two reciprocal chromosomal translocations between chromosome VI/VII and I/XIII when compared to the S. cerevisiae genome. Interestingly, while the rearrangement I/XIII is unique to S. jurei, the other is in common with S. mikatae strain IFO1815, suggesting shared evolutionary history of this species after the split between S. cerevisiae and S. mikatae The number of Ty elements differed in the new species, with a higher number of Ty elements present in S. jurei than in S. cerevisiae Phenotypically, the S. jurei strain NCYC 3962 has relatively higher fitness than the other strain NCYC 3947T under most of the environmental stress conditions tested and showed remarkably increased fitness in higher concentration of acetic acid compared to the other sensu stricto species. Both strains were found to be better adapted to lower temperatures compared to S. cerevisiae.
Project description:Yeasts within the Saccharomyces sensu stricto cluster can produce different killer toxins. Each toxin is encoded by a medium size (1.5-2.4 Kb) M dsRNA virus, maintained by a larger helper virus generally called L-A (4.6 Kb). Different types of L-A are found associated to specific Ms: L-A in K1 strains and L-A-2 in K2 strains. Here, we extend the analysis of L-A helper viruses to yeasts other than S. cerevisiae, namely S. paradoxus, S. uvarum and S. kudriavzevii. Our sequencing data from nine new L-A variants confirm the specific association of each toxin-producing M and its helper virus, suggesting co-evolution. Their nucleotide sequences vary from 10% to 30% and the variation seems to depend on the geographical location of the hosts, suggesting cross-species transmission between species in the same habitat. Finally, we transferred by genetic methods different killer viruses from S. paradoxus into S. cerevisiae or viruses from S. cerevisiae into S. uvarum or S. kudriavzevii. In the foster hosts, we observed no impairment for their stable transmission and maintenance, indicating that the requirements for virus amplification in these species are essentially the same. We also characterized new killer toxins from S. paradoxus and constructed "superkiller" strains expressing them.
Project description:Novel genes arising from random DNA sequences (de novo genes) have been suggested to be widespread in the genomes of different organisms. However, our knowledge about the origin and evolution of de novo genes is still limited. To systematically understand the general features of de novo genes, we established a robust pipeline to analyze >20,000 transcript-supported coding sequences (CDSs) from the budding yeast Saccharomyces cerevisiae. Our analysis pipeline combined phylogeny, synteny, and sequence alignment information to identify possible orthologs across 20 Saccharomycetaceae yeasts and discovered 4,340?S. cerevisiae-specific de novo genes and 8,871?S. sensu stricto-specific de novo genes. We further combine information on CDS positions and transcript structures to show that >65% of de novo genes arose from transcript isoforms of ancient genes, especially in the upstream and internal regions of ancient genes. Fourteen identified de novo genes with high transcript levels were chosen to verify their protein expressions. Ten of them, including eight transcript isoform-associated CDSs, showed translation signals and five proteins exhibited specific cytosolic localizations. Our results suggest that de novo genes frequently arise in the S. sensu stricto complex and have the potential to be quickly integrated into ancient cellular network.
Project description:Fermentation of xylose is a fundamental requirement for the efficient production of ethanol from lignocellulosic biomass sources. Although they aggressively ferment hexoses, it has long been thought that native Saccharomyces cerevisiae strains cannot grow fermentatively or non-fermentatively on xylose. Population surveys have uncovered a few naturally occurring strains that are weakly xylose-positive, and some S. cerevisiae have been genetically engineered to ferment xylose, but no strain, either natural or engineered, has yet been reported to ferment xylose as efficiently as glucose. Here, we used a medium-throughput screen to identify Saccharomyces strains that can increase in optical density when xylose is presented as the sole carbon source. We identified 38 strains that have this xylose utilization phenotype, including strains of S. cerevisiae, other sensu stricto members, and hybrids between them. All the S. cerevisiae xylose-utilizing strains we identified are wine yeasts, and for those that could produce meiotic progeny, the xylose phenotype segregates as a single gene trait. We mapped this gene by Bulk Segregant Analysis (BSA) using tiling microarrays and high-throughput sequencing. The gene is a putative xylitol dehydrogenase, which we name XDH1, and is located in the subtelomeric region of the right end of chromosome XV in a region not present in the S288c reference genome. We further characterized the xylose phenotype by performing gene expression microarrays and by genetically dissecting the endogenous Saccharomyces xylose pathway. We have demonstrated that natural S. cerevisiae yeasts are capable of utilizing xylose as the sole carbon source, characterized the genetic basis for this trait as well as the endogenous xylose utilization pathway, and demonstrated the feasibility of BSA using high-throughput sequencing.
Project description:Using the biological species definition, yeasts of the genus Saccharomyces sensu stricto comprise six species and one natural hybrid. Previous work has shown that reproductive isolation between the species is due primarily to sequence divergence acted upon by the mismatch repair system and not due to major gene differences or chromosomal rearrangements. Sequence divergence through mismatch repair has also been shown to cause partial reproductive isolation among populations within a species. We have surveyed sequence variation in populations of Saccharomyces sensu stricto yeasts and measured meiotic sterility in hybrids. This allows us to determine the divergence necessary to produce the reproductive isolation seen among species. Rather than a sharp transition from fertility to sterility, which may have been expected, we find a smooth monotonic relationship between diversity and reproductive isolation, even as far as the well-accepted designations of S. paradoxus and S. cerevisiae as distinct species. Furthermore, we show that one species of Saccharomyces--S. cariocanus--differs from a population of S. paradoxus by four translocations, but not by sequence. There is molecular evidence of recent introgression from S. cerevisiae into the European population of S. paradoxus, supporting the idea that in nature the boundary between these species is fuzzy.
Project description:Up to now, the only species in the complex Borrelia burgdorferi sensu lato known to cause Lyme borreliosis in the United States has been B. burgdorferi sensu stricto. However, some atypical strains closely related to the previously designated genomic group DN127 have been isolated in the United States, mostly in California. To explore the diversity of B. burgdorferi sensu lato group DN127, we analyzed the nucleotide sequences of the rrf-rrl intergenic spacer regions from 19 atypical strains (18 from California and one from New York) and 13 North American B. burgdorferi sensu stricto strains (6 from California). The spacer region sequences from the entire B. burgdorferi sensu lato complex available in data banks were used for comparison. Phylogenetic analysis of sequences shows that the main species of the B. burgdorferi sensu lato complex (B. afzelii, B. garinii, B. andersonii, B. japonica, B. burgdorferi sensu stricto, B. valaisiana, and B. lusitaniae) each form a coherent cluster. A heterogeneous group comprising strains belonging to the previously designated group DN127 clustered separately from B. burgdorferi sensu stricto. Within this cluster, the deep branches expressing the distances between the rrf-rrl sequences reflect a high level of divergence. This unexpected diversity contrasts with the monomorphism exhibited by B. burgdorferi sensu stricto. To clarify the taxonomic status of this highly heterogeneous group, analysis of the rrs sequences of selected strains chosen from deeply separated branches was performed. The results show that these strains significantly diverge at a level that is compatible with several distinct genomic groups. We conclude that the taxonomy and phylogeny of North American B. burgdorferi sensu lato should be reevaluated. For now, we propose that the genomic group DN127 should be referred to as a new species, B. bissettii sp. nov., and that other related but distinct strains, which require further characterization, be referred to as Borrelia spp.
Project description:Lyme disease is caused by spirochaetes of the Borrelia burgdorferi sensu lato genospecies. Complete genome assemblies are available for fewer than ten strains of Borrelia burgdorferi sensu stricto, the primary cause of Lyme disease in North America. MM1 is a sensu stricto strain originally isolated in the midwestern United States. Aside from a small number of genes, the complete genome sequence of this strain has not been reported. Here we present the complete genome sequence of MM1 in relation to other sensu stricto strains and in terms of its Multi Locus Sequence Typing. Our results indicate that MM1 is a new sequence type which contains a conserved main chromosome and 15 plasmids. Our results include the first contiguous 28.5 kb assembly of lp28-8, a linear plasmid carrying the vls antigenic variation system, from a Borrelia burgdorferi sensu stricto strain.
Project description:Candida glabrata complex includes three species identified through molecular biology methods: C. glabrata sensu stricto , C. nivariensis and C. bracarensis . In Mexico, the phenotypic methods are still used in the diagnosis; therefore, the presence of C. nivariensis and C. bracarensis among clinical isolates is still unknown. The aim of this study was to evaluate the utility of a multiplex PCR for the identification of the C. glabrata species complex. DNA samples from 92 clinical isolates that were previously identified through phenotypic characteristics as C. glabrata were amplified by four oligonucleotides (UNI-5.8S, GLA-f, BRA-f, and NIV-f) that generate amplicons of 397, 293 and 223-bp corresponding to C. glabrata sensu stricto , C. nivariensis , and C. bracarensis , respectively. The amplicon sequences were used to perform a phylogenetic analysis through the Maximum Likelihood method (MEGA6), including strains and reference sequences of species belonging to C. glabrata complex. In addition, recombination and linkage disequilibrium were estimated (DnaSP version 5.0) for C. glabrata sensu stricto isolate s . Eighty-eight isolates generated a 397-bp fragment and only in one isolate a 223-bp amplicon was observed. In the phylogenetic tree, the sequences of 397-bp were grouped with C. glabrata reference sequences , and the sequence of 223-bp was grouped with C. bracarensis reference sequences, corroborating the PCR identification. The number of recombination events for the isolates of C. glabrata sensu stricto was zero, suggesting a clonal population structure. Three isolates that did not amplify any of the expected fragments were identified as Saccharomyces cerevisiae through the sequencing of the D1/D2 domain region within the 28S rDNA gene. The multiplex PCR is a fast, cost-effective and reliable tool that can be used in clinical laboratories to identify C. glabrata complex species.