Project description:Solanum lycopersicum and Solanum tuberosum are agriculturally important crop species as they are rich sources of starch, protein, antioxidants, lycopene, beta-carotene, vitamin C, and fiber. The genomes of S. lycopersicum and S. tuberosum are currently available. However the linear strings of nucleotides that together comprise a genome sequence are of limited significance by themselves. Computational and bioinformatics approaches can be used to exploit the genomes for fundamental research for improving their varieties. The comparative genome analysis, Pfam analysis of predicted reviewed paralogous proteins was performed. It was found that S. lycopersicum proteins belong to more families, domains and clans in comparison with S. tuberosum. It was also found that mostly intergenic regions are conserved in two genomes followed by exons, intron and UTR. This can be exploited to predict regions between genomes that are similar to each other and to study the evolutionary relationship between two genomes, leading towards the development of disease resistance, stress tolerance and improved varieties of tomato.
Project description:BACKGROUND: Many highly beneficial traits (e.g. disease or abiotic stress resistance) have been transferred into crops through crosses with their wild relatives. The 13 recognized species of tomato (Solanum section Lycopersicon) are closely related to each other and wild species genes have been extensively used for improvement of the crop, Solanum lycopersicum L. In addition, the lack of geographical barriers has permitted natural hybridization between S. lycopersicum and its closest wild relative Solanum pimpinellifolium in Ecuador, Peru and northern Chile. In order to better understand patterns of S. lycopersicum diversity, we sequenced 47 markers ranging in length from 130 to 1200?bp (total of 24?kb) in genotypes of S. lycopersicum and wild tomato species S. pimpinellifolium, Solanum arcanum, Solanum peruvianum, Solanum pennellii and Solanum habrochaites. Between six and twelve genotypes were comparatively analyzed per marker. Several of the markers had previously been hypothesized as carrying wild species alleles within S. lycopersicum, i.e., cryptic introgressions. RESULTS: Each marker was mapped with high confidence (e<1 x 10-30) to a single genomic location using BLASTN against tomato whole genome shotgun chromosomes (SL2.40) database. Neighbor-joining trees showed high mean bootstrap support (86.8 ± 2.34%) for distinguishing red-fruited from green-fruited taxa for 38 of the markers. Hybridization and parsimony splits networks, genomic map positions of markers relative to documented introgressions, and historical origins of accessions were used to interpret evolutionary patterns at nine markers with putatively introgressed alleles. CONCLUSION: Of the 47 genetic markers surveyed in this study, four were involved in linkage drag on chromosome 9 during introgression breeding, while alleles at five markers apparently originated from natural hybridization with S. pimpinellifolium and were associated with primitive genotypes of S. lycopersicum. The positive identification of introgressed genes within crop species such as S. lycopersicum will help inform conservation and utilization of crop germplasm diversity, for example, facilitating the purging of undesirable linkage drag or the exploitation of novel, favorable alleles.
Project description:Solanum lycopersicum and Solanum habrochaites (f. typicum) accession PI127826 emit a variety of sesquiterpenes. To identify terpene synthases involved in the production of these volatile sesquiterpenes, we used massive parallel pyrosequencing (RNA-seq) to obtain the transcriptome of the stem trichomes from these plants. This approach resulted initially in the discovery of six sesquiterpene synthase cDNAs from S. lycopersicum and five from S. habrochaites. Searches of other databases and the S. lycopersicum genome resulted in the discovery of two additional sesquiterpene synthases expressed in trichomes. The sesquiterpene synthases from S. lycopersicum and S. habrochaites have high levels of protein identity. Several of them appeared to encode for non-functional proteins. Functional recombinant proteins produced germacrenes, β-caryophyllene/α-humulene, viridiflorene and valencene from (E,E)-farnesyl diphosphate. However, the activities of these enzymes do not completely explain the differences in sesquiterpene production between the two tomato plants. RT-qPCR confirmed high levels of expression of most of the S. lycopersicum sesquiterpene synthases in stem trichomes. In addition, one sesquiterpene synthase was induced by jasmonic acid, while another appeared to be slightly repressed by the treatment. Our data provide a foundation to study the evolution of terpene synthases in cultivated and wild tomato.
Project description:Semi-polar metabolites such as flavonoids, phenolic acids, and alkaloids are very important health-related compounds in tomato. As a first step to identify genes responsible for the synthesis of semi-polar metabolites, quantitative trait loci (QTLs) that influence the semi-polar metabolite content in red-ripe tomato fruit were identified, by characterizing fruits of a population of introgression lines (ILs) derived from a cross between the cultivated tomato Solanum lycopersicum and the wild species Solanum chmielewskii. By analyzing fruits of plants grown at two different locations, we were able to identify robust metabolite QTLs for changes in phenylpropanoid glycoconjugation on chromosome 9, for accumulation of flavonol glycosides on chromosome 5, and for alkaloids on chromosome 7. To further characterize the QTLs we used a combination of genome sequencing, transcriptomics and targeted metabolomics to identify candidate key genes underlying the observed metabolic variation.
Project description:Solanum lycopersicum RNA degradome sequencing Overall design: Isolated polyadenylated RNA from total RNA extracts of Solanum lycopersicum, were ligated to 5'-adapter that includes an MmeI recognition site. The ligated products were purified again, reverse transcribed and cleaved with MmeI. The 5' fragments were purified from gel and ligated to a 3'- dsDNA adapter and PCR amplified. After PCR amplification the sample was subjected to Solexa/Illumina high throughput pyrosequencing. Please see www.illumina.com for details of the sequencing technology.
Project description:Solanum lycopersicum and Solanum habrochaites are closely related plant species; however, their cold tolerance capacities are different. The wild species S. habrochaites is more cold tolerant than the cultivated species S. lycopersicum.The transcriptomes of S. lycopersicum and S. habrochaites leaf tissues under cold stress were studied using Illumina high-throughput RNA sequencing. The results showed that more than 200 million reads could be mapped to identify genes, microRNAs (miRNAs), and alternative splicing (AS) events to confirm the transcript abundance under cold stress. The results indicated that 21% and 23% of genes were differentially expressed in the cultivated and wild tomato species, respectively, and a series of changes in S. lycopersicum and S. habrochaites transcriptomes occur when plants are moved from warm to cold conditions. Moreover, the gene expression patterns for S. lycopersicum and S. habrochaites were dissimilar; however, there were some overlapping genes that were regulated by low temperature in both tomato species. An AS analysis identified 75,885 novel splice junctions among 172,910 total splice junctions, which suggested that the relative abundance of alternative intron isoforms in S. lycopersicum and S. habrochaites shifted significantly under cold stress. In addition, we identified 89 miRNA sequences that may regulate relevant target genes. Our data indicated that some miRNAs (e.g., miR159, miR319, and miR6022) play roles in the response to cold stress.Differences in gene expression, AS events, and miRNAs under cold stress may contribute to the observed differences in cold tolerance of these two tomato species.
Project description:Polymerase chain reaction and different barcoding methods commonly used for plant identification from metagenomics samples are based on the amplification of a limited number of pre-selected barcoding regions. These methods are often inapplicable due to DNA degradation, low amplification success or low species discriminative power of selected genomic regions. Here we introduce a method for the rapid identification of plant taxon-specific k-mers, that is applicable for the fast detection of plant taxa directly from raw sequencing reads without aligning, mapping or assembling the reads. We identified more than 800 Solanum lycopersicum specific k-mers (32 nucleotides in length) from 42 different chloroplast genome regions using the developed method. We demonstrated that identified k-mers are also detectable in whole genome sequencing raw reads from S. lycopersicum. Also, we demonstrated the usability of taxon-specific k-mers in artificial mixtures of sequences from closely related species. Developed method offers a novel strategy for fast identification of taxon-specific genome regions and offers new perspectives for detection of plant taxa directly from sequencing raw reads.
Project description:Genome-wide association mapping is an efficient way to identify quantitative trait loci controlling the variation of phenotypes, but the approach suffers severe limitations when one is studying inbred crops like cultivated tomato (Solanum lycopersicum). Such crops exhibit low rates of molecular polymorphism and high linkage disequilibrium, which reduces mapping resolution. The cherry type tomato (S. lycopersicum var. cerasiforme) genome has been described as an admixture between the cultivated tomato and its wild ancestor, S. pimpinellifolium. We have thus taken advantage of the properties of this admixture to improve the resolution of association mapping in tomato. As a proof of concept, we sequenced 81 DNA fragments distributed on chromosome 2 at different distances in a core collection of 90 tomato accessions, including mostly cherry type tomato accessions. The 81 Sequence Tag Sites revealed 352 SNPs and indels. Molecular diversity was greatest for S. pimpinellifolium accessions, intermediate for S. l. cerasiforme accessions, and lowest for the cultivated group. We assessed the structure of molecular polymorphism and the extent of linkage disequilibrium over genetic and physical distances. Linkage disequilibrium decreased under r(2) = 0.3 within 1 cM, and minimal estimated value (r(2) = 0.13) was reached within 20 kb over the physical regions studied. Associations between polymorphisms and fruit weight, locule number, and soluble solid content were detected. Several candidate genes and quantitative trait loci previously identified were validated and new associations detected. This study shows the advantages of using a collection of S. l. cerasiforme accessions to overcome the low resolution of association mapping in tomato.
Project description:Marker-trait association studies in tomato have progressed rapidly due to the availability of several populations developed between wild species and domesticated tomato. However, in the absence of whole genome sequences for each wild species, molecular marker methods for whole genome comparisons and fine mapping are required. We describe the development and validation of a diversity arrays technology (DArT) platform for tomato using an introgression line (IL) population consisting of wild Solanum pennellii introgressed into Solanum lycopersicum (cv. M82). A tomato diversity array consisting of 6,912 clones from domesticated tomato and twelve wild tomato/Solanaceous species was constructed. We successfully bin-mapped 990 polymorphic DArT markers together with 108 RFLP markers across the IL population, increasing the number of markers available for each S. pennellii introgression by tenfold on average. A subset of DArT markers from ILs previously associated with increased levels of lycopene and carotene were sequenced, and 44% matched protein coding genes. The bin-map position and order of sequenced DArT markers correlated well with their physical position on scaffolds of the draft tomato genome sequence (SL2.40). The utility of sequenced DArT markers was illustrated by converting several markers in both the S. pennellii and S. lycopersicum phases to cleaved amplified polymorphic sequence (CAPS) markers. Genotype scores from the CAPS markers confirmed the genotype scores from the DArT hybridizations used to construct the bin map. The tomato diversity array provides additional "sequence-characterized" markers for fine mapping of QTLs in S. pennellii ILs and wild tomato species.
Project description:Solanum lycopersicum RNA degradome sequencing Isolated polyadenylated RNA from total RNA extracts of Solanum lycopersicum, were ligated to 5'-adapter that includes an MmeI recognition site. The ligated products were purified again, reverse transcribed and cleaved with MmeI. The 5' fragments were purified from gel and ligated to a 3'- dsDNA adapter and PCR amplified. After PCR amplification the sample was subjected to Solexa/Illumina high throughput pyrosequencing. Please see www.illumina.com for details of the sequencing technology.