Characterization and Transferable Utility of Microsatellite Markers in the Wild and Cultivated Arachis Species.
ABSTRACT: Microsatellite or simple sequence repeat (SSR) is one of the most widely distributed molecular markers that have been widely utilized to assess genetic diversity and genetic mapping for important traits in plants. However, the understanding of microsatellite characteristics in Arachis species and the currently available amount of high-quality SSR markers remain limited. In this study, we identified 16,435 genome survey sequences SSRs (GSS-SSRs) and 40,199 expressed sequence tag SSRs (EST-SSRs) in Arachis hypogaea and its wild relative species using the publicly available sequence data. The GSS-SSRs had a density of 159.9-239.8 SSRs/Mb for wild Arachis and 1,015.8 SSR/Mb for cultivated Arachis, whereas the EST-SSRs had the density of 173.5-384.4 SSR/Mb and 250.9 SSRs/Mb for wild and cultivated Arachis, respectively. The trinucleotide SSRs were predominant across Arachis species, except that the dinucleotide accounted for most in A. hypogaea GSSs. From Arachis GSS-SSR and EST-SSR sequences, we developed 2,589 novel SSR markers that showed a high polymorphism in six diverse A. hypogaea accessions. A genetic linkage map that contained 540 novel SSR loci and 105 anchor SSR loci was constructed by case of a recombinant inbred lines F6 population. A subset of 82 randomly selected SSR markers were used to screen 39 wild and 22 cultivated Arachis accessions, which revealed a high transferability of the novel SSRs across Arachis species. Our results provided informative clues to investigate microsatellite patterns across A. hypogaea and its wild relative species and potentially facilitate the germplasm evaluation and gene mapping in Arachis species.
Project description:BACKGROUND: Lack of sufficient molecular markers hinders current genetic research in peanuts (Arachis hypogaea L.). It is necessary to develop more molecular markers for potential use in peanut genetic research. With the development of peanut EST projects, a vast amount of available EST sequence data has been generated. These data offered an opportunity to identify SSR in ESTs by data mining. RESULTS: In this study, we investigated 24,238 ESTs for the identification and development of SSR markers. In total, 881 SSRs were identified from 780 SSR-containing unique ESTs. On an average, one SSR was found per 7.3 kb of EST sequence with tri-nucleotide motifs (63.9%) being the most abundant followed by di- (32.7%), tetra- (1.7%), hexa- (1.0%) and penta-nucleotide (0.7%) repeat types. The top six motifs included AG/TC (27.7%), AAG/TTC (17.4%), AAT/TTA (11.9%), ACC/TGG (7.72%), ACT/TGA (7.26%) and AT/TA (6.3%). Based on the 780 SSR-containing ESTs, a total of 290 primer pairs were successfully designed and used for validation of the amplification and assessment of the polymorphism among 22 genotypes of cultivated peanuts and 16 accessions of wild species. The results showed that 251 primer pairs yielded amplification products, of which 26 and 221 primer pairs exhibited polymorphism among the cultivated and wild species examined, respectively. Two to four alleles were found in cultivated peanuts, while 3-8 alleles presented in wild species. The apparent broad polymorphism was further confirmed by cloning and sequencing of amplified alleles. Sequence analysis of selected amplified alleles revealed that allelic diversity could be attributed mainly to differences in repeat type and length in the microsatellite regions. In addition, a few single base mutations were observed in the microsatellite flanking regions. CONCLUSION: This study gives an insight into the frequency, type and distribution of peanut EST-SSRs and demonstrates successful development of EST-SSR markers in cultivated peanut. These EST-SSR markers could enrich the current resource of molecular markers for the peanut community and would be useful for qualitative and quantitative trait mapping, marker-assisted selection, and genetic diversity studies in cultivated peanut as well as related Arachis species. All of the 251 working primer pairs with names, motifs, repeat types, primer sequences, and alleles tested in cultivated and wild species are listed in Additional File 1.
Project description:Large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed in peanut (Arachis hypogaea L.) to obtain more informative genetic markers. A total of 10,102 potential non-redundant EST sequences, including 3,445 contigs and 6,657 singletons, were generated from cDNA libraries of the gynophore, roots, leaves and seedlings. A total of 3,187 primer pairs were designed on flanking regions of SSRs, some of which allowed one and two base mismatches. Among the 3,187 markers generated, 2,540 (80%) were trinucleotide repeats, 302 (9%) were dinucleotide repeats, and 345 (11%) were tetranucleotide repeats. Pre-polymorphic analyses of 24 Arachis accessions were performed using 10% polyacrylamide gels. A total of 1,571 EST-SSR markers showing clear polymorphisms were selected for further polymorphic analysis with a Fluoro-fragment Analyzer. The 16 Arachis accessions examined included cultivated peanut varieties as well as diploid species with the A or B genome. Altogether 1,281 (81.5%) of the 1,571 markers were polymorphic among the 16 accessions, and 366 (23.3%) were polymorphic among the 12 cultivated varieties. Diversity analysis was performed and the genotypes of all 16 Arachis accessions showed similarity coefficients ranging from 0.37 to 0.97. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11032-011-9604-8) contains supplementary material, which is available to authorized users.
Project description:The peanut (Arachis hypogaea) is an important oil crop. Breeding for high oil content is becoming increasingly important. Wild Arachis species have been reported to harbor genes for many valuable traits that may enable the improvement of cultivated Arachis hypogaea, such as resistance to pests and disease. However, only limited information is available on variation in oil content. In the present study, a collection of 72 wild Arachis accessions representing 19 species and 3 cultivated peanut accessions were genotyped using 136 genome-wide SSR markers and phenotyped for oil content over three growing seasons. The wild Arachis accessions showed abundant diversity across the 19 species. A. duranensis exhibited the highest diversity, with a Shannon-Weaver diversity index of 0.35. A total of 129 unique alleles were detected in the species studied. A. rigonii exhibited the largest number of unique alleles (75), indicating that this species is highly differentiated. AMOVA and genetic distance analyses confirmed the genetic differentiation between the wild Arachis species. The majority of SSR alleles were detected exclusively in the wild species and not in A. hypogaea, indicating that directional selection or the hitchhiking effect has played an important role in the domestication of the cultivated peanut. The 75 accessions were grouped into three clusters based on population structure and phylogenic analysis, consistent with their taxonomic sections, species and genome types. A. villosa and A. batizocoi were grouped with A. hypogaea, suggesting the close relationship between these two diploid wild species and the cultivated peanut. Considerable phenotypic variation in oil content was observed among different sections and species. Nine alleles were identified as associated with oil content based on association analysis, of these, three alleles were associated with higher oil content but were absent in the cultivated peanut. The results demonstrated that there is great potential to increase the oil content in A. hypogaea by using the wild Arachis germplasm.
Project description:<h4>Background</h4>The genus Arachis, originated in South America, is divided into nine taxonomical sections comprising of 80 species. Most of the Arachis species are diploids (2n = 2x = 20) and the tetraploid species (2n = 2x = 40) are found in sections Arachis, Extranervosae and Rhizomatosae. Diploid species have great potential to be used as resistance sources for agronomic traits like pests and diseases, drought related traits and different life cycle spans. Understanding of genetic relationships among wild species and between wild and cultivated species will be useful for enhanced utilization of wild species in improving cultivated germplasm. The present study was undertaken to evaluate genetic relationships among species (96 accessions) belonging to seven sections of Arachis by using simple sequence repeat (SSR) markers developed from Arachis hypogaea genomic library and gene sequences from related genera of Arachis.<h4>Results</h4>The average transferability rate of 101 SSR markers tested to section Arachis and six other sections was 81% and 59% respectively. Five markers (IPAHM 164, IPAHM 165, IPAHM 407a, IPAHM 409, and IPAHM 659) showed 100% transferability. Cluster analysis of allelic data from a subset of 32 SSR markers on 85 wild and 11 cultivated accessions grouped accessions according to their genome composition, sections and species to which they belong. A total of 109 species specific alleles were detected in different wild species, Arachis pusilla exhibited largest number of species specific alleles (15). Based on genetic distance analysis, the A-genome accession ICG 8200 (A. duranensis) and the B-genome accession ICG 8206 (A. ipaënsis) were found most closely related to A. hypogaea.<h4>Conclusion</h4>A set of cross species and cross section transferable SSR markers has been identified that will be useful for genetic studies of wild species of Arachis, including comparative genome mapping, germplasm analysis, population genetic structure and phylogenetic inferences among species. The present study provides strong support based on both genomic and genic markers, probably for the first time, on relationships of A. monticola and A. hypogaea as well as on the most probable donor of A and B-genomes of cultivated groundnut.
Project description:<h4>Background</h4>Cultivated peanut or groundnut (Arachis hypogaea L.) is the fourth most important oilseed crop in the world, grown mainly in tropical, subtropical and warm temperate climates. Due to its origin through a single and recent polyploidization event, followed by successive selection during breeding efforts, cultivated groundnut has a limited genetic background. In such species, microsatellite or simple sequence repeat (SSR) markers are very informative and useful for breeding applications. The low level of polymorphism in cultivated germplasm, however, warrants a need of larger number of polymorphic microsatellite markers for cultivated groundnut.<h4>Results</h4>A microsatellite-enriched library was constructed from the genotype TMV2. Sequencing of 720 putative SSR-positive clones from a total of 3,072 provided 490 SSRs. 71.2% of these SSRs were perfect type, 13.1% were imperfect and 15.7% were compound. Among these SSRs, the GT/CA repeat motifs were the most common (37.6%) followed by GA/CT repeat motifs (25.9%). The primer pairs could be designed for a total of 170 SSRs and were optimized initially on two genotypes. 104 (61.2%) primer pairs yielded scorable amplicon and 46 (44.2%) primers showed polymorphism among 32 cultivated groundnut genotypes. The polymorphic SSR markers detected 2 to 5 alleles with an average of 2.44 per locus. The polymorphic information content (PIC) value for these markers varied from 0.12 to 0.75 with an average of 0.46. Based on 112 alleles obtained by 46 markers, a phenogram was constructed to understand the relationships among the 32 genotypes. Majority of the genotypes representing subspecies hypogaea were grouped together in one cluster, while the genotypes belonging to subspecies fastigiata were grouped mainly under two clusters.<h4>Conclusion</h4>Newly developed set of 104 markers extends the repertoire of SSR markers for cultivated groundnut. These markers showed a good level of PIC value in cultivated germplasm and therefore would be very useful for germplasm analysis, linkage mapping, diversity studies and phylogenetic relationships in cultivated groundnut as well as related Arachis species.
Project description:<h4>Background</h4>The genus Arachis is native to a region that includes Central Brazil and neighboring countries. Little is known about the genetic variability of the Brazilian cultivated peanut (Arachis hypogaea, genome AABB) germplasm collection at the DNA level. The understanding of the genetic diversity of cultivated and wild species of peanut (Arachis spp.) is essential to develop strategies of collection, conservation and use of the germplasm in variety development. The identity of the ancestor progenitor species of cultivated peanut has also been of great interest. Several species have been suggested as putative AA and BB genome donors to allotetraploid A. hypogaea. Microsatellite or SSR (Simple Sequence Repeat) markers are co-dominant, multiallelic, and highly polymorphic genetic markers, appropriate for genetic diversity studies. Microsatellite markers may also, to some extent, support phylogenetic inferences. Here we report the use of a set of microsatellite markers, including newly developed ones, for phylogenetic inferences and the analysis of genetic variation of accessions of A. hypogea and its wild relatives.<h4>Results</h4>A total of 67 new microsatellite markers (mainly TTG motif) were developed for Arachis. Only three of these markers, however, were polymorphic in cultivated peanut. These three new markers plus five other markers characterized previously were evaluated for number of alleles per locus and gene diversity using 60 accessions of A. hypogaea. Genetic relationships among these 60 accessions and a sample of 36 wild accessions representative of section Arachis were estimated using allelic variation observed in a selected set of 12 SSR markers. Results showed that the Brazilian peanut germplasm collection has considerable levels of genetic diversity detected by SSR markers. Similarity groups for A. hypogaea accessions were established, which is a useful criteria for selecting parental plants for crop improvement. Microsatellite marker transferability was up to 76% for species of the section Arachis, but only 45% for species from the other eight Arachis sections tested. A new marker (Ah-041) presented a 100% transferability and could be used to classify the peanut accessions in AA and non-AA genome carriers.<h4>Conclusion</h4>The level of polymorphism observed among accessions of A. hypogaea analyzed with newly developed microsatellite markers was low, corroborating the accumulated data which show that cultivated peanut presents a relatively reduced variation at the DNA level. A selected panel of SSR markers allowed the classification of A. hypogaea accessions into two major groups. The identification of similarity groups will be useful for the selection of parental plants to be used in breeding programs. Marker transferability is relatively high between accessions of section Arachis. The possibility of using microsatellite markers developed for one species in genetic evaluation of other species greatly reduces the cost of the analysis, since the development of microsatellite markers is still expensive and time consuming. The SSR markers developed in this study could be very useful for genetic analysis of wild species of Arachis, including comparative genome mapping, population genetic structure and phylogenetic inferences among species.
Project description:BACKGROUND: Cultivated peanut (Arachis hypogaea) is an allotetraploid species whose ancestral genomes are most likely derived from the A-genome species, A. duranensis, and the B-genome species, A. ipaensis. The very recent (several millennia) evolutionary origin of A. hypogaea has imposed a bottleneck for allelic and phenotypic diversity within the cultigen. However, wild diploid relatives are a rich source of alleles that could be used for crop improvement and their simpler genomes can be more easily analyzed while providing insight into the structure of the allotetraploid peanut genome. The objective of this research was to establish a high-density genetic map of the diploid species A. duranensis based on de novo generated EST databases. Arachis duranensis was chosen for mapping because it is the A-genome progenitor of cultivated peanut and also in order to circumvent the confounding effects of gene duplication associated with allopolyploidy in A. hypogaea. RESULTS: More than one million expressed sequence tag (EST) sequences generated from normalized cDNA libraries of A. duranensis were assembled into 81,116 unique transcripts. Mining this dataset, 1236 EST-SNP markers were developed between two A. duranensis accessions, PI 475887 and Grif 15036. An additional 300 SNP markers also were developed from genomic sequences representing conserved legume orthologs. Of the 1536 SNP markers, 1054 were placed on a genetic map. In addition, 598 EST-SSR markers identified in A. hypogaea assemblies were included in the map along with 37 disease resistance gene candidate (RGC) and 35 other previously published markers. In total, 1724 markers spanning 1081.3 cM over 10 linkage groups were mapped. Gene sequences that provided mapped markers were annotated using similarity searches in three different databases, and gene ontology descriptions were determined using the Medicago Gene Atlas and TAIR databases. Synteny analysis between A. duranensis, Medicago and Glycine revealed significant stretches of conserved gene clusters spread across the peanut genome. A higher level of colinearity was detected between A. duranensis and Glycine than with Medicago. CONCLUSIONS: The first high-density, gene-based linkage map for A. duranensis was generated that can serve as a reference map for both wild and cultivated Arachis species. The markers developed here are valuable resources for the peanut, and more broadly, to the legume research community. The A-genome map will have utility for fine mapping in other peanut species and has already had application for mapping a nematode resistance gene that was introgressed into A. hypogaea from A. cardenasii.
Project description:With the aim to increase the number of functional markers in resource poor crop like cultivated peanut (Arachis hypogaea), large numbers of available expressed sequence tags (ESTs) in the public databases, were employed for the development of novel EST derived simple sequence repeat (SSR) markers. From 16424 unigenes, 2784 (16.95%) SSRs containing unigenes having 3373 SSR motifs were identified. Of these, 2027 (72.81%) sequences were annotated and 4124 gene ontology terms were assigned. Among different SSR motif-classes, tri-nucleotide repeats (33.86%) were the most abundant followed by di-nucleotide repeats (27.51%) while AG/CT (20.7%) and AAG/CTT (13.25%) were the most abundant repeat-motifs. A total of 2456 EST-SSR novel primer pairs were designed, of which 366 unigenes having relevance to various stresses and other functions, were PCR validated using a set of 11 diverse peanut genotypes. Of these, 340 (92.62%) primer pairs yielded clear and scorable PCR products and 39 (10.66%) primer pairs exhibited polymorphisms. Overall, the number of alleles per marker ranged from 1-12 with an average of 3.77 and the PIC ranged from 0.028 to 0.375 with an average of 0.325. The identified EST-SSRs not only enriched the existing molecular markers kitty, but would also facilitate the targeted research in marker-trait association for various stresses, inter-specific studies and genetic diversity analysis in peanut.
Project description:Despite several efforts in the last decade toward development of simple sequence repeat (SSR) markers in peanut, there is still a need for more markers for conducting different genetic and breeding studies. With the effort of the International Peanut Genome Initiative, the availability of reference genome for both the diploid progenitors of cultivated peanut allowed us to identify 135,529 and 199,957 SSRs from the A (Arachis duranensis) and B genomes (Arachis ipaensis), respectively. Genome sequence analysis showed uneven distribution of the SSR motifs across genomes with variation in parameters such as SSR type, repeat number, and SSR length. Using the flanking sequences of identified SSRs, primers were designed for 51,354 and 60,893 SSRs with densities of 49 and 45 SSRs per Mb in A. duranensis and A. ipaensis, respectively. In silico PCR analysis of these SSR markers showed high transferability between wild and cultivated Arachis species. Two physical maps were developed for the A genome and the B genome using these SSR markers, and two reported disease resistance quantitative trait loci (QTLs), qF2TSWV5 for tomato spotted wilt virus (TSWV) and qF2LS6 for leaf spot (LS), were mapped in the 8.135 Mb region of chromosome A04 of A. duranensis. From this genomic region, 719 novel SSR markers were developed, which provide the possibility for fine mapping of these QTLs. In addition, this region also harbors 652 genes and 49 of these are defense related genes, including two NB-ARC genes, three LRR receptor-like genes and three WRKY transcription factors. These disease resistance related genes could contribute to resistance to viral (such as TSWV) and fungal (such as LS) diseases in peanut. In summary, this study not only provides a large number of molecular markers for potential use in peanut genetic map development and QTL mapping but also for map-based gene cloning and molecular breeding.
Project description:BACKGROUND: Cultivated peanut or groundnut (Arachis hypogaea L.) is an important oilseed crop with an allotetraploid genome (AABB, 2n = 4x = 40). Both the low level of genetic variation within the cultivated gene pool and its polyploid nature limit the utilization of molecular markers to explore genome structure and facilitate genetic improvement. Nevertheless, a wealth of genetic diversity exists in diploid Arachis species (2n = 2x = 20), which represent a valuable gene pool for cultivated peanut improvement. Interspecific populations have been used widely for genetic mapping in diploid species of Arachis. However, an intraspecific mapping strategy was essential to detect chromosomal rearrangements among species that could be obscured by mapping in interspecific populations. To develop intraspecific reference linkage maps and gain insights into karyotypic evolution within the genus, we comparatively mapped the A- and B-genome diploid species using intraspecific F2 populations. Exploring genome organization among diploid peanut species by comparative mapping will enhance our understanding of the cultivated tetraploid peanut genome. Moreover, new sources of molecular markers that are highly transferable between species and developed from expressed genes will be required to construct saturated genetic maps for peanut. RESULTS: A total of 2,138 EST-SSR (expressed sequence tag-simple sequence repeat) markers were developed by mining a tetraploid peanut EST assembly including 101,132 unigenes (37,916 contigs and 63,216 singletons) derived from 70,771 long-read (Sanger) and 270,957 short-read (454) sequences. A set of 97 SSR markers were also developed by mining 9,517 genomic survey sequences of Arachis. An SSR-based intraspecific linkage map was constructed using an F2 population derived from a cross between K 9484 (PI 298639) and GKBSPSc 30081 (PI 468327) in the B-genome species A. batizocoi. A high degree of macrosynteny was observed when comparing the homoeologous linkage groups between A (A. duranensis) and B (A. batizocoi) genomes. Comparison of the A- and B-genome genetic linkage maps also showed a total of five inversions and one major reciprocal translocation between two pairs of chromosomes under our current mapping resolution. CONCLUSIONS: Our findings will contribute to understanding tetraploid peanut genome origin and evolution and eventually promote its genetic improvement. The newly developed EST-SSR markers will enrich current molecular marker resources in peanut.