Project description:Organisms are defined by the information encoded in their genomes, and since the origin of life this information has been encoded using a two-base-pair genetic alphabet (A-T and G-C). In vitro, the alphabet has been expanded to include several unnatural base pairs (UBPs). We have developed a class of UBPs formed between nucleotides bearing hydrophobic nucleobases, exemplified by the pair formed between d5SICS and dNaM (d5SICS-dNaM), which is efficiently PCR-amplified and transcribed in vitro, and whose unique mechanism of replication has been characterized. However, expansion of an organism's genetic alphabet presents new and unprecedented challenges: the unnatural nucleoside triphosphates must be available inside the cell; endogenous polymerases must be able to use the unnatural triphosphates to faithfully replicate DNA containing the UBP within the complex cellular milieu; and finally, the UBP must be stable in the presence of pathways that maintain the integrity of DNA. Here we show that an exogenously expressed algal nucleotide triphosphate transporter efficiently imports the triphosphates of both d5SICS and dNaM (d5SICSTP and dNaMTP) into Escherichia coli, and that the endogenous replication machinery uses them to accurately replicate a plasmid containing d5SICS-dNaM. Neither the presence of the unnatural triphosphates nor the replication of the UBP introduces a notable growth burden. Lastly, we find that the UBP is not efficiently excised by DNA repair pathways. Thus, the resulting bacterium is the first organism to propagate stably an expanded genetic alphabet.
Project description:The bacterial strain JCVI-syn3.0 stands as the first example of a living organism with a minimized synthetic genome, derived from the Mycoplasma mycoides genome and chemically synthesized in vitro. Here, we report the experimental evolution of a syn3.0-derived strain. Ten independent replicates were evolved for several hundred generations, leading to growth rate improvements of >15%. Endpoint strains possessed an average of 8 mutations composed of indels and SNPs, with a pronounced C/G->A/T transversion bias. Multiple genes were repeated mutational targets across the independent lineages, including phase-variable lipoprotein activation, 5 distinct nonsynonymous substitutions in the same membrane transporter protein, and inactivation of an uncharacterized gene. Transcriptomic analysis revealed an overall tradeoff reflected in upregulated ribosomal proteins and downregulated DNA- and RNA-related proteins during adaptation. This work establishes the suitability of synthetic, minimal strains for laboratory evolution, providing a means to optimize strain growth characteristics and elucidate gene functionality.
Project description:Previously, we reported the creation of a semi-synthetic organism (SSO) that stores and retrieves increased information by virtue of stably maintaining an unnatural base pair (UBP) in its DNA, transcribing the corresponding unnatural nucleotides into the codons and anticodons of mRNAs and tRNAs, and then using them to produce proteins containing noncanonical amino acids (ncAAs). Here we report a systematic extension of the effort to optimize the SSO by exploring a variety of deoxy- and ribonucleotide analogues. Importantly, this includes the first in vivo structure-activity relationship (SAR) analysis of unnatural ribonucleoside triphosphates. Similarities and differences between how DNA and RNA polymerases recognize the unnatural nucleotides were observed, and remarkably, we found that a wide variety of unnatural ribonucleotides can be efficiently transcribed into RNA and then productively and selectively paired at the ribosome to mediate the synthesis of proteins with ncAAs. The results extend previous studies, which demonstrated that nucleotides bearing no significant structural or functional homology to the natural nucleotides can be efficiently and selectively paired during replication, to include each step of the entire process of information storage and retrieval. From a practical perspective, the results identify the optimal UBP for replication and transcription, as well as the optimal unnatural ribonucleoside triphosphates for transcription and translation. The optimized SSO is now, for the first time, able to efficiently produce proteins containing multiple, proximal ncAAs.
Project description:Since at least the last common ancestor of all life on Earth, genetic information has been stored in a four-letter alphabet that is propagated and retrieved by the formation of two base pairs. The central goal of synthetic biology is to create new life forms and functions, and the most general route to this goal is the creation of semi-synthetic organisms whose DNA harbours two additional letters that form a third, unnatural base pair. Previous efforts to generate such semi-synthetic organisms culminated in the creation of a strain of Escherichia coli that, by virtue of a nucleoside triphosphate transporter from Phaeodactylum tricornutum, imports the requisite unnatural triphosphates from its medium and then uses them to replicate a plasmid containing the unnatural base pair dNaM-dTPT3. Although the semi-synthetic organism stores increased information when compared to natural organisms, retrieval of the information requires in vivo transcription of the unnatural base pair into mRNA and tRNA, aminoacylation of the tRNA with a non-canonical amino acid, and efficient participation of the unnatural base pair in decoding at the ribosome. Here we report the in vivo transcription of DNA containing dNaM and dTPT3 into mRNAs with two different unnatural codons and tRNAs with cognate unnatural anticodons, and their efficient decoding at the ribosome to direct the site-specific incorporation of natural or non-canonical amino acids into superfolder green fluorescent protein. The results demonstrate that interactions other than hydrogen bonding can contribute to every step of information storage and retrieval. The resulting semi-synthetic organism both encodes and retrieves increased information and should serve as a platform for the creation of new life forms and functions.
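As a rough illustration of what "increased information" means for a six-letter alphabet, the sketch below compares the theoretical per-position capacity and triplet-codon count of a four-letter and a six-letter alphabet. It is a back-of-the-envelope upper bound only: the function name `alphabet_capacity` is hypothetical, and the calculation assumes every letter can occupy every position, which overstates what the organism described above uses (it decodes only two unnatural codons).

```python
import math


def alphabet_capacity(n_letters: int, codon_length: int = 3) -> dict:
    """Theoretical capacity of a genetic alphabet (hypothetical helper).

    Assumes every letter may appear at every position, so the numbers
    are upper bounds, not measured properties of any organism.
    """
    return {
        "letters": n_letters,
        # Maximum information per sequence position, in bits.
        "bits_per_position": math.log2(n_letters),
        # Number of distinct codons of the given length.
        "codons": n_letters ** codon_length,
    }


natural = alphabet_capacity(4)   # A, C, G, T
expanded = alphabet_capacity(6)  # natural letters plus one unnatural pair

for label, cap in (("natural", natural), ("expanded", expanded)):
    print(
        f"{label}: {cap['letters']} letters, "
        f"{cap['bits_per_position']:.3f} bits/position, "
        f"{cap['codons']} triplet codons"
    )
```

Under these assumptions, a third base pair raises the triplet-codon space from 64 to 216 and the per-position capacity from 2 bits to about 2.585 bits.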