Development of simple sequence repeat (SSR) markers of sesame (Sesamum indicum) from a genome survey.
ABSTRACT: Sesame (Sesamum indicum), an important oil crop, is widely grown in tropical and subtropical regions. It provides part of the daily edible oil allowance for almost half of the world's population. A limited number of co-dominant markers has been developed and applied in sesame genetic diversity and germplasm identity studies. Here we report for the first time a whole genome survey used to develop simple sequence repeat (SSR) markers and to detect the genetic diversity of sesame germplasm. From the initial assembled sesame genome, 23,438 SSRs (?5 repeats) were identified. The most common repeat motif was dinucleotide with a frequency of 84.24%, followed by 13.53% trinucleotide, 1.65% tetranucleotide, 0.3% pentanucleotide and 0.28% hexanucleotide motifs. From 1500 designed and synthesised primer pairs, 218 polymorphic SSRs were developed and used to screen 31 sesame accessions that from 12 countries. STRUCTURE and phylogenetic analyses indicated that all sesame accessions could be divided into two groups: one mainly from China and another from other countries. Cluster analysis classified Chinese major sesame varieties into three groups. These novel SSR markers are a useful tool for genetic linkage map construction, genetic diversity detection, and marker-assisted selective sesame breeding.
Project description:BACKGROUND: Sesame is an important and ancient oil crop in tropical and subtropical areas. China is one of the most important sesame producing countries with many germplasm accessions and excellent cultivars. Domestication and modern plant breeding have presumably narrowed the genetic basis of cultivated sesame. Several modern sesame cultivars were bred with a limited number of landrace cultivars in their pedigree. The genetic variation was subsequently reduced by genetic drift and selection. Characterization of genetic diversity of these cultivars by molecular markers is of great value to assist parental line selection and breeding strategy design. RESULTS: Three hundred and forty nine simple sequence repeat (SSR) and 79 insertion-deletion (InDel) markers were developed from cDNA library and reduced-representation sequencing of a sesame cultivar Zhongzhi 14, respectively. Combined with previously published SSR markers, 88 polymorphic markers were used to assess the genetic diversity, phylogenetic relationships, population structure, and allele distribution among 130 Chinese sesame accessions including 82 cultivars, 44 landraces and 4 wild germplasm accessions. A total of 325 alleles were detected, with the average gene diversity of 0.432. Model-based structure analysis revealed the presence of five subgroups belonging to two main groups, which were consistent with the results from principal coordinate analysis (PCA), phylogenetic clustering and analysis of molecular variance (AMOVA). Several missing or unique alleles were identified from particular types, subgroups or families, even though they share one or both parental/progenitor lines. CONCLUSIONS: This report presented a by far most comprehensive characterization of the molecular and genetic diversity of sesame cultivars in China. InDels are more polymorphic than SSRs, but their ability for deciphering genetic diversity compared to the later. Improved sesame cultivars have narrower genetic basis than landraces, reflecting the effect of genetic drift or selection during breeding processes. Comparative analysis of allele distribution revealed genetic divergence between improved cultivars and landraces, as well as between cultivars released in different years. These results will be useful for assessing cultivars and for marker-assisted breeding in sesame.
Project description:BACKGROUND: Sesame (Sesamum indicum L.) is one of the most important oil crops; however, a lack of useful molecular markers hinders current genetic research. We performed transcriptome sequencing of samples from different sesame growth and developmental stages, and mining of genic-SSR markers to identify valuable markers for sesame molecular genetics research. RESULTS: In this study, 75 bp and 100 bp paired-end RNA-seq was used to sequence 24 cDNA libraries, and 42,566 uni-transcripts were assembled from more than 260 million filtered reads. The total length of uni-transcript sequences was 47.99 Mb, and 7,324 SSRs (SSRs ≥15 bp) and 4,440 SSRs (SSRs ≥18 bp) were identified. On average, there was one genic-SSR per 6.55 kb (SSRs ≥15 bp) or 10.81 kb (SSRs ≥18 bp). Among perfect SSRs (≥18 bp), di-nucleotide motifs (48.01%) were the most abundant, followed by tri- (20.96%), hexa- (25.37%), penta- (2.97%), tetra- (2.12%), and mono-nucleotides (0.57%). The top four motif repeats were (AG/CT)n [1,268 (34.51%)], (CA/TG)n [281 (7.65%)], (AT/AT)n [215 (5.85%)], and (GAA/TTC)n [131 (3.57%)]. A total of 2,164 SSR primer pairs were identified in the 4,440 SSR-containing sequences (≥18 bp), and 300 SSR primer pairs were randomly chosen for validation. These SSR markers were amplified and validated in 25 sesame accessions (24 cultivated accessions, one wild species). 276 (92.0%) primer pairs yielded PCR amplification products in 24 cultivars. Thirty two primer pairs (11.59%) exhibited polymorphisms. Moreover, 203 primer pairs (67.67%) yielded PCR amplicons in the wild accession and 167 (60.51%) were polymorphic between species. A UPGMA dendrogram based on genetic similarity coefficients showed that the correlation between genotype and geographical source was low and that the genetic basis of sesame in China is narrow, as previously reported. The 32 polymorphic primer pairs were validated using an F2 mapping population; 18 primer pairs exhibited polymorphisms between the parents, and 14 genic-SSRs could be integrated into 9 main linkage groups. CONCLUSIONS: 2,164 genic-SSR markers have been developed in sesame using transcriptome sequencing. 276 of 300 validated primer pairs successfully yielded PCR amplicons in 24 cultivated sesame accessions. These markers increase current SSR marker resources and will greatly benefit genetic diversity, qualitative and quantitative trait mapping and marker-assisted selection studies in sesame.
Project description:Sesame is an important oil crop widely cultivated in Africa and Asia. Understanding the genetic diversity of accessions from these continents is critical to designing breeding methods and for additional collection of sesame germplasm. To determine the genetic diversity in relation to geographical regions, 96 sesame accessions collected from 22 countries distributed over six geographic regions in Africa and Asia were genotyped using 33 polymorphic SSR markers. Large genetic variability was found within the germplasm collection. The total number of alleles was 137, averaging 4.15 alleles per locus. The accessions from Asia displayed more diversity than those from Africa. Accessions from Southern Asia (SAs), Eastern Asia (EAs), and Western Africa (WAf) were highly diversified, while those from Western Asia (WAs), Northern Africa (NAf), and Southeastern Africa (SAf) had the lowest diversity. The analysis of molecular variance revealed that more than 44% of the genetic variance was due to diversity among geographic regions. Five subpopulations, including three in Asia and two in Africa, were cross-identified through phylogenetic, PCA, and STRUCTURE analyses. Most accessions clustered in the same population based on their geographical origins. Our results provide technical guidance for efficient management of sesame genetic resources in breeding programs and further collection of sesame germplasm from these different regions.
Project description:Sesame (Sesamum indicum L.) is an ancient oilseed crop known for its nutty seeds and high-quality edible oil. It is an unexplored crop with a great economic potential. The present study deals with assessment of genetic diversity in the crop. Twenty two RAPD and 18 SSR primers were used for analysis of the 47 different sesame accessions grown in different agroclimatic zones of India. A total of 256 bands were obtained with RAPD primers, of which 191 were polymorphic. SSR primers gave 64 DNA bands, of which all of were polymorphic. The Jaccard's similarity coefficient of RAPD, SSR, and pooled RAPD and SSR data ranged from 0.510 to 0.885, 0.167 to 0.867, and 0.505 to 0.853, respectively. Maximum polymorphic information content was reported with SSRs (0.194) compared to RAPDs (0.186). Higher marker index was observed with RAPDs (1.426) than with SSRs (0.621). Similarly, maximum resolving power was found with RAPD (4.012) primers than with SSRs (0.884). The RAPD primer RPI-B11 and SSR primer S16 were the most informative in terms of describing genetic variability among the varieties under study. At a molecular level, the seed coat colour was distinguishable by the presence and absence of a group of marker amplicon/s. White and brown seeded varieties clustered close to each other, while black seeded varieties remained distanced from the cluster. In the present study, we found higher variability in Sesamum indicum L. using RAPD and SSR markers and these could assist in DNA finger printing, conservation of germplasm, and crop improvement.
Project description:BACKGROUND: Jatropha curcas L. has attracted a great deal of attention worldwide, regarding its potential as a new biodiesel crop. However, the understanding of this crop remains very limited and little genomic research has been done. We used simple sequence repeat (SSR) markers that could be transferred from Manihot esculenta (cassava) to analyze the genetic relationships among 45 accessions of J. curcas from our germplasm collection. RESULTS: In total, 187 out of 419 expressed sequence tag (EST)-SSR and 54 out of 182 genomic (G)-SSR markers from cassava were polymorphic among the J. curcas accessions. The EST-SSR markers comprised 26.20% dinucleotide repeats, 57.75% trinucleotide repeats, 7.49% tetranucleotide repeats, and 8.56% pentanucleotide repeats, whereas the majority of the G-SSR markers were dinucleotide repeats (62.96%). The 187 EST-SSRs resided in genes that are involved mainly in biological and metabolic processes. Thirty-six EST-SSRs and 20 G-SSRs were chosen to analyze the genetic diversity among 45 J. curcas accessions. A total of 183 polymorphic alleles were detected. On the basis of the distribution of these polymorphic alleles, the 45 accessions were classified into six groups, in which the genotype showed a correlation with geographic origin. The estimated mean genetic diversity index was 0.5572, which suggests that our J. curcas germplasm collection has a high level of genetic diversity. This should facilitate subsequent studies on genetic mapping and molecular breeding. CONCLUSION: We identified 241 novel EST-SSR and G-SSR markers in J. curcas, which should be useful for genetic mapping and quantitative trait loci analysis of important agronomic traits. By using these markers, we found that the intergroup gene diversity of J. curcas was greater than the intragroup diversity, and that the domestication of the species probably occurred partly in America and partly in Hainan, China.
Project description:Background:Olive (Olea europaea L.) is an important oil and fruit crop worldwide, owning a rich germplasm with a large number of cultivars. Simple sequence repeats (SSRs) are excellent markers and have been used for the identification of olive cultivars. However, the limited number of SSR markers and the occurrence of confusion on the names of cultivars, as well as the possible appearance of clonal variation make it difficult to identify cultivars and interpret relationships among olive cultivars. Method:SSR markers were designed based on trinucleotide repeat sequences by screening the whole genome of olive, and the polymorphic SSR markers were developed that were applied to the identification of 53 olive accessions. The genetic characteristics and relationships of these olive accessions were evaluated based on the developed SSR markers. Results:Twenty-one highly polymorphic genomic-SSR markers were developed, covering most chromosomes of olive. These SSR markers could well distinguish all 53 olive accessions, confirming their effectiveness. DNA fingerprints of the 53 olive accessions were constructed based on the 21 SSR markers. The dendrogram clearly divided the tested accessions into two main groups, which was also supported by the results of principal coordinate analysis. A total of 31 private alleles were detected in 15 olive accessions, which reflected the genetic diversity within 53 olive accessions to some extent. Six homonymy cases were also clarified by genetic analysis. These results suggest that the newly developed olive SSR markers are informative for the exploitation, preservation and breeding of olive.
Project description:We estimated the genetic diversity of 50 Jatropha curcas samples from the Costa Rican germplasm bank using 18 EST-SSR, one G-SSR and nrDNA-ITS markers. We also evaluated the phylogenetic relationships among samples using nuclear ribosomal ITS markers. Non-toxicity was evaluated using G-SSRs and SCARs markers. A Neighbor-Joining (NJ) tree and a Maximum Likelihood (ML) tree were constructed using SSR markers and ITS sequences, respectively. Heterozygosity was moderate (He = 0.346), but considerable compared to worldwide values for J. curcas. The PIC (PIC = 0.274) and inbreeding coefficient (f = - 0.102) were both low. Clustering was not related to the geographical origin of accessions. International accessions clustered independently of collection sites, suggesting a lack of genetic structure, probably due to the wide distribution of this crop and ample gene flow. Molecular markers identified only one non-toxic accession (JCCR-24) from Mexico. This work is part of a countrywide effort to characterize the genetic diversity of the Jatropha curcas germplasm bank in Costa Rica.
Project description:Molecular characterization and genetic diversity among 82 soybean accessions was carried out by using 44 simple sequence repeat (SSR) markers. Of the 44 SSR markers used, 40 markers were found polymorphic among 82 soybean accessions. These 40 polymorphic markers produced a total of 119 alleles, of which five were unique alleles and four alleles were rare. The allele number for each SSR locus varied between two to four with an average of 2.97 alleles per marker. Polymorphic information content values of SSRs ranged from 0.101 to 0.742 with an average of 0.477. Jaccard's similarity coefficient was employed to study the molecular diversity of 82 soybean accessions. The pairwise genetic similarity among 82 soybean accessions varied from 0.28 to 0.90. The dendrogram constructed based on genetic similarities among 82 soybean accessions identified three major clusters. The majority of genotypes including four improved cultivars were grouped in a single subcluster IIIa of cluster III, indicating high genetic resemblance among soybean germplasm collection in India.
Project description:Microsatellite or simple sequence repeat (SSR) is one of the most widely distributed molecular markers that have been widely utilized to assess genetic diversity and genetic mapping for important traits in plants. However, the understanding of microsatellite characteristics in Arachis species and the currently available amount of high-quality SSR markers remain limited. In this study, we identified 16,435 genome survey sequences SSRs (GSS-SSRs) and 40,199 expressed sequence tag SSRs (EST-SSRs) in Arachis hypogaea and its wild relative species using the publicly available sequence data. The GSS-SSRs had a density of 159.9-239.8 SSRs/Mb for wild Arachis and 1,015.8 SSR/Mb for cultivated Arachis, whereas the EST-SSRs had the density of 173.5-384.4 SSR/Mb and 250.9 SSRs/Mb for wild and cultivated Arachis, respectively. The trinucleotide SSRs were predominant across Arachis species, except that the dinucleotide accounted for most in A. hypogaea GSSs. From Arachis GSS-SSR and EST-SSR sequences, we developed 2,589 novel SSR markers that showed a high polymorphism in six diverse A. hypogaea accessions. A genetic linkage map that contained 540 novel SSR loci and 105 anchor SSR loci was constructed by case of a recombinant inbred lines F6 population. A subset of 82 randomly selected SSR markers were used to screen 39 wild and 22 cultivated Arachis accessions, which revealed a high transferability of the novel SSRs across Arachis species. Our results provided informative clues to investigate microsatellite patterns across A. hypogaea and its wild relative species and potentially facilitate the germplasm evaluation and gene mapping in Arachis species.
Project description:The sequencing of the full nuclear genome of sesame (<i>Sesamum indicum</i> L.) provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellites markers or simple sequence repeats (SSR) in crop genotyping, genetics, and breeding applications is well established, only a little information exist concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primers pairs were supplied for 101,930 SSRs and used to <i>in silico</i> amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78%) were polymorphic between the three genomes thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/), which provides free access to SSRs data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries.