Development of genic and genomic SSR markers of robusta coffee (Coffea canephora Pierre Ex A. Froehner).
ABSTRACT: Coffee breeding and improvement efforts can be greatly facilitated by availability of a large repository of simple sequence repeats (SSRs) based microsatellite markers, which provides efficiency and high-resolution in genetic analyses. This study was aimed to improve SSR availability in coffee by developing new genic-/genomic-SSR markers using in-silico bioinformatics and streptavidin-biotin based enrichment approach, respectively. The expressed sequence tag (EST) based genic microsatellite markers (EST-SSRs) were developed using the publicly available dataset of 13,175 unigene ESTs, which showed a distribution of 1 SSR/3.4 kb of coffee transcriptome. Genomic SSRs, on the other hand, were developed from an SSR-enriched small-insert partial genomic library of robusta coffee. In total, 69 new SSRs (44 EST-SSRs and 25 genomic SSRs) were developed and validated as suitable genetic markers. Diversity analysis of selected coffee genotypes revealed these to be highly informative in terms of allelic diversity and PIC values, and eighteen of these markers (? 27%) could be mapped on a robusta linkage map. Notably, the markers described here also revealed a very high cross-species transferability. In addition to the validated markers, we have also designed primer pairs for 270 putative EST-SSRs, which are expected to provide another ca. 200 useful genetic markers considering the high success rate (88%) of marker conversion of similar pairs tested/validated in this study.
Project description:Genic microsatellite markers, also known as functional markers, are preferred over anonymous markers as they reveal the variation in transcribed genes among individuals. In this study, we developed a total of 707 expressed sequence tag-derived simple sequence repeat markers (EST-SSRs) and used for development of a high-density integrated map using four individual mapping populations of B. rapa. This map contains a total of 1426 markers, consisting of 306 EST-SSRs, 153 intron polymorphic markers, 395 bacterial artificial chromosome-derived SSRs (BAC-SSRs), and 572 public SSRs and other markers covering a total distance of 1245.9 cM of the B. rapa genome. Analysis of allelic diversity in 24 B. rapa germplasm using 234 mapped EST-SSR markers showed amplification of 2 alleles by majority of EST-SSRs, although amplification of alleles ranging from 2 to 8 was found. Transferability analysis of 167 EST-SSRs in 35 species belonging to cultivated and wild brassica relatives showed 42.51% (Sysimprium leteum) to 100% (B. carinata, B. juncea, and B. napus) amplification. Our newly developed EST-SSRs and high-density linkage map based on highly transferable genic markers would facilitate the molecular mapping of quantitative trait loci and the positional cloning of specific genes, in addition to marker-assisted selection and comparative genomic studies of B. rapa with other related species.
Project description:The availability of large expressed sequence tag (EST) and whole genome databases of oil palm enabled the development of a data base of microsatellite markers. For this purpose, an EST database consisting of 40,979 EST sequences spanning 27?Mb and a chromosome-wise whole genome databases were downloaded. A total of 3,950 primer pairs were identified and developed from EST sequences. The tri and tetra nucleotide repeat motifs were most prevalent (each 24.75%) followed by di-nucleotide repeat motifs. Whole genome-wide analysis found a total of 245,654 SSR repeats across the 16 chromosomes of oil palm, of which 38,717 were compound microsatellite repeats. A web application, OpSatdb, the first microsatellite database of oil palm, was developed using the PHP and MySQL database ( https://ssr.icar.gov.in/index.php ). It is a simple and systematic web-based search engine for searching SSRs based on repeat motif type, repeat type, and primer details. High synteny was observed between oil palm and rice genomes. The mapping of ESTs having SSRs by Blast2GO resulted in the identification of 19.2% sequences with gene ontology (GO) annotations. Randomly, a set of ten genic SSRs and five genomic SSRs were used for validation and genetic diversity on 100 genotypes belonging to the world oil palm genetic resources. The grouping pattern was observed to be broadly in accordance with the geographical origin of the genotypes. The identified genic and genome-wide SSRs can be effectively useful for various genomic applications of oil palm, such as genetic diversity, linkage map construction, mapping of QTLs, marker-assisted selection, and comparative population studies.
Project description:BACKGROUND: Species-specific microsatellite markers are desirable for genetic studies and to harness the potential of MAS-based breeding for genetic improvement. Limited availability of such markers for coffee, one of the most important beverage tree crops, warrants newer efforts to develop additional microsatellite markers that can be effectively deployed in genetic analysis and coffee improvement programs. The present study aimed to develop new coffee-specific SSR markers and validate their utility in analysis of genetic diversity, individualization, linkage mapping, and transferability for use in other related taxa. RESULTS: A small-insert partial genomic library of Coffea canephora, was probed for various SSR motifs following conventional approach of Southern hybridisation. Characterization of repeat positive clones revealed a very high abundance of DNRs (1/15 Kb) over TNRs (1/406 kb). The relative frequencies of different DNRs were found as AT >> AG > AC, whereas among TNRs, AGC was the most abundant repeat. The SSR positive sequences were used to design 58 primer pairs of which 44 pairs could be validated as single locus markers using a panel of arabica and robusta genotypes. The analysis revealed an average of 3.3 and 3.78 alleles and 0.49 and 0.62 PIC per marker for the tested arabicas and robustas, respectively. It also revealed a high cumulative PI over all the markers using both sib-based (10-6 and 10-12 for arabicas and robustas respectively) and unbiased corrected estimates (10-20 and 10-43 for arabicas and robustas respectively). The markers were tested for Hardy-Weinberg equilibrium, linkage dis-equilibrium, and were successfully used to ascertain generic diversity/affinities in the tested germplasm (cultivated as well as species). Nine markers could be mapped on robusta linkage map. Importantly, the markers showed ~92% transferability across related species/genera of coffee. CONCLUSION: The conventional approach of genomic library was successfully employed although with low efficiency to develop a set of 44 new genomic microsatellite markers of coffee. The characterization/validation of new markers demonstrated them to be highly informative, and useful for genetic studies namely, genetic diversity in coffee germplasm, individualization/bar-coding for germplasm protection, linkage mapping, taxonomic studies, and use as conserved orthologous sets across secondary genepool of coffee. Further, the relative frequency and distribution of different SSR motifs in coffee genome indicated coffee genome to be relatively poor in microsatellites compared to other plant species.
Project description:Expressed sequence tags (EST) are potential source for the development of genic microsatellite markers, gene discovery, comparative genomics, and other genomic studies. In the present study, 7630 ESTs were examined from NCBI for SSR identification and characterization. A total of 263 SSRs were identified with an average density of one SSR/4.2?kb (3.4% frequency). Analysis revealed that trinucleotide repeats (47.52%) were most abundant followed by tetranucleotide (19.77%), dinucleotide (19.01%), pentanucleotide (9.12%), and hexanucleotide repeats (4.56%). Functional annotation was done through homology search and gene ontology, and 35 EST-SSRs were selected. Primer pairs were designed for evaluation of cross transferability and polymorphism among 11 plants belonging to five different families. Total 402 alleles were generated at 155 loci with an average of 2.6 alleles/locus and the polymorphic information content (PIC) ranged from 0.15 to 0.92 with an average of 0.75. The cross transferability ranged from 34.84% to 98.06% in different plants, with an average of 67.86%. Thus, the validation study of annotated 35 EST-SSR markers which correspond to particular metabolic activity revealed polymorphism and evolutionary nature in different families of Angiospermic plants.
Project description:Microsatellite or simple sequence repeat (SSR) is one of the most widely distributed molecular markers that have been widely utilized to assess genetic diversity and genetic mapping for important traits in plants. However, the understanding of microsatellite characteristics in Arachis species and the currently available amount of high-quality SSR markers remain limited. In this study, we identified 16,435 genome survey sequences SSRs (GSS-SSRs) and 40,199 expressed sequence tag SSRs (EST-SSRs) in Arachis hypogaea and its wild relative species using the publicly available sequence data. The GSS-SSRs had a density of 159.9-239.8 SSRs/Mb for wild Arachis and 1,015.8 SSR/Mb for cultivated Arachis, whereas the EST-SSRs had the density of 173.5-384.4 SSR/Mb and 250.9 SSRs/Mb for wild and cultivated Arachis, respectively. The trinucleotide SSRs were predominant across Arachis species, except that the dinucleotide accounted for most in A. hypogaea GSSs. From Arachis GSS-SSR and EST-SSR sequences, we developed 2,589 novel SSR markers that showed a high polymorphism in six diverse A. hypogaea accessions. A genetic linkage map that contained 540 novel SSR loci and 105 anchor SSR loci was constructed by case of a recombinant inbred lines F6 population. A subset of 82 randomly selected SSR markers were used to screen 39 wild and 22 cultivated Arachis accessions, which revealed a high transferability of the novel SSRs across Arachis species. Our results provided informative clues to investigate microsatellite patterns across A. hypogaea and its wild relative species and potentially facilitate the germplasm evaluation and gene mapping in Arachis species.
Project description:BACKGROUND: Over recent years, a growing effort has been made to develop microsatellite markers for the genomic analysis of the common bean (Phaseolus vulgaris) to broaden the knowledge of the molecular genetic basis of this species. The availability of large sets of expressed sequence tags (ESTs) in public databases has given rise to an expedient approach for the identification of SSRs (Simple Sequence Repeats), specifically EST-derived SSRs. In the present work, a battery of new microsatellite markers was obtained from a search of the Phaseolus vulgaris EST database. The diversity, degree of transferability and polymorphism of these markers were tested. RESULTS: From 9,583 valid ESTs, 4,764 had microsatellite motifs, from which 377 were used to design primers, and 302 (80.11%) showed good amplification quality. To analyze transferability, a group of 167 SSRs were tested, and the results showed that they were 82% transferable across at least one species. The highest amplification rates were observed between the species from the Phaseolus (63.7%), Vigna (25.9%), Glycine (19.8%), Medicago (10.2%), Dipterix (6%) and Arachis (1.8%) genera. The average PIC (Polymorphism Information Content) varied from 0.53 for genomic SSRs to 0.47 for EST-SSRs, and the average number of alleles per locus was 4 and 3, respectively. Among the 315 newly tested SSRs in the BJ (BAT93 X Jalo EEP558) population, 24% (76) were polymorphic. The integration of these segregant loci into a framework map composed of 123 previously obtained SSR markers yielded a total of 199 segregant loci, of which 182 (91.5%) were mapped to 14 linkage groups, resulting in a map length of 1,157 cM. CONCLUSIONS: A total of 302 newly developed EST-SSR markers, showing good amplification quality, are available for the genetic analysis of Phaseolus vulgaris. These markers showed satisfactory rates of transferability, especially between species that have great economic and genomic values. Their diversity was comparable to genomic SSRs, and they were incorporated in the common bean reference genetic map, which constitutes an important contribution to and advance in Phaseolus vulgaris genomic research.
Project description:BACKGROUND: During the last decade, numerous microsatellite markers were developed for genotyping and to identify closely related plant genotypes. In citrus, previously developed microsatellite markers were arisen from genomic libraries and more often located in non coding DNA sequences. To optimize the use of these EST-SSRs as genetic markers in genome mapping programs and citrus systematic analysis, we have investigated their polymorphism related to the type (di or trinucleotide) or their position in the coding sequences. RESULTS: Among 11000 unigenes from a Clementine EST library, we have found at least one microsatellite sequence (repeated units size ranged from 2 to 6 nucleotides) in 1500 unigenes (13.6%). More than 95% of these SSRs were di or trinucleotides. If trinucleotide microsatellites were encountered trough all part of EST sequences, dinucleotide microsatellites were preferentially (50%) concentrated in the 5' 100th nucleotides. We assessed the polymorphism of 41 EST-SSR, by PCR amplification droved with flanking primers among ten Citrus species plus 3 from other genera. More than 90% of EST-SSR markers were polymorphic. Furthermore, dinucleotide microsatellite markers were more polymorphic than trinucleotide ones, probably related to their distribution that was more often located in the 5' UnTranslated Region (UTR). We obtained a good agreement of diversity relationships between the citrus species and relatives assessed with EST-SSR markers with the established taxonomy and phylogeny. To end, the heterozygosity of each genotype and all dual combinations were studied to evaluate the percentage of mappable markers. Higher values (> 45%) were observed for putative Citrus inter-specific hybrids (lime lemon, or sour orange) than for Citrus basic true species (mandarin, pummelo and citron) (<30%). Most favorable combinations for genome mapping were observed in those involving interspecific hybrid genotypes. Those gave higher levels of mappable markers (>70%) with a significant proportion suitable for synteny analysis. CONCLUSION: Fourty one new EST-SSR markers were produced and were available for citrus genetic studies. Whatever the position of the SSR in the ESTs the EST-SSR markers we developed are powerful to investigate genetic diversity and genome mapping in citrus.
Project description:The recently acquired genome sequence of globe artichoke (Cynara cardunculus var. scolymus) has been used to catalog the genome's content of simple sequence repeat (SSR) markers. More than 177,000 perfect SSRs were revealed, equivalent to an overall density across the genome of 244.5 SSRs/Mbp, but some 224,000 imperfect SSRs were also identified. About 21% of these SSRs were complex (two stretches of repeats separated by <100 nt). Some 73% of the SSRs were composed of dinucleotide motifs. The SSRs were categorized for the numbers of repeats present, their overall length and were allocated to their linkage group. A total of 4,761 perfect and 6,583 imperfect SSRs were present in 3,781 genes (14.11% of the total), corresponding to an overall density across the gene space of 32,5 and 44,9 SSRs/Mbp for perfect and imperfect motifs, respectively. A putative function has been assigned, using the gene ontology approach, to the set of genes harboring at least one SSR. The same search parameters were applied to reveal the SSR content of 14 other plant species for which genome sequence is available. Certain species-specific SSR motifs were identified, along with a hexa-nucleotide motif shared only with the other two Compositae species (sunflower (Helianthus annuus) and horseweed (Conyza canadensis)) included in the study. Finally, a database, called "Cynara cardunculus MicroSatellite DataBase" (CyMSatDB) was developed to provide a searchable interface to the SSR data. CyMSatDB facilitates the retrieval of SSR markers, as well as suggested forward and reverse primers, on the basis of genomic location, genomic vs genic context, perfect vs imperfect repeat, motif type, motif sequence and repeat number. The SSR markers were validated via an in silico based PCR analysis adopting two available assembled transcriptomes, derived from contrasting globe artichoke accessions, as templates.
Project description:<h4>Background</h4>Expressed Sequence Tags (ESTs) are a source of simple sequence repeats (SSRs) that can be used to develop molecular markers for genetic studies. The availability of ESTs for Quercus robur and Quercus petraea provided a unique opportunity to develop microsatellite markers to accelerate research aimed at studying adaptation of these long-lived species to their environment. As a first step toward the construction of a SSR-based linkage map of oak for quantitative trait locus (QTL) mapping, we describe the mining and survey of EST-SSRs as well as a fast and cost-effective approach (bin mapping) to assign these markers to an approximate map position. We also compared the level of polymorphism between genomic and EST-derived SSRs and address the transferability of EST-SSRs in Castanea sativa (chestnut).<h4>Results</h4>A catalogue of 103,000 Sanger ESTs was assembled into 28,024 unigenes from which 18.6% presented one or more SSR motifs. More than 42% of these SSRs corresponded to trinucleotides. Primer pairs were designed for 748 putative unigenes. Overall 37.7% (283) were found to amplify a single polymorphic locus in a reference full-sib pedigree of Quercus robur. The usefulness of these loci for establishing a genetic map was assessed using a bin mapping approach. Bin maps were constructed for the male and female parental tree for which framework linkage maps based on AFLP markers were available. The bin set consisting of 14 highly informative offspring selected based on the number and position of crossover sites. The female and male maps comprised 44 and 37 bins, with an average bin length of 16.5 cM and 20.99 cM, respectively. A total of 256 EST-SSRs were assigned to bins and their map position was further validated by linkage mapping. EST-SSRs were found to be less polymorphic than genomic SSRs, but their transferability rate to chestnut, a phylogenetically related species to oak, was higher.<h4>Conclusion</h4>We have generated a bin map for oak comprising 256 EST-SSRs. This resource constitutes a first step toward the establishment of a gene-based map for this genus that will facilitate the dissection of QTLs affecting complex traits of ecological importance.
Project description:BACKGROUND: Eggplant (Solanum melongena L.) is a member of the Solanaceae family. In spite of its widespread cultivation and nutritional and economic importance, its genome has not as yet been extensively investigated. Few analyses have been carried out to determine the genetic diversity of eggplant at the DNA level, and linkage relationships have not been well characterised. As for the other Solanaceae crop species (potato, tomato and pepper), the level of intra-specific polymorphism appears to be rather limited, and so it is important that an effort is made to develop more informative DNA markers to make progress in understanding the genetics of eggplant and to advance its breeding. The aim of the present work was to develop a set of functional microsatellite (SSR) markers, via an in silico analysis of publicly available DNA sequence. RESULTS: From >3,300 genic DNA sequences, 50 SSR-containing candidates suitable for primer design were recovered. Of these, 39 were functional, and were then applied to a panel of 44 accessions, of which 38 were cultivated eggplant varieties, and six were from related Solanum species. The usefulness of the SSR assays for diversity analysis and taxonomic discrimination was demonstrated by constructing a phylogeny based on SSR polymorphisms, and by the demonstration that most were also functional when tested with template from tomato, pepper and potato. As a results of BLASTN analyses, several eggplant SSRs were found to have homologous counterparts in the phylogenetically related species, which carry microsatellite motifs in the same position. CONCLUSION: The set of eggplant EST-SSR markers was informative for phylogenetic analysis and genetic mapping. Since EST-SSRs lie within expressed sequence, they have the potential to serve as perfect markers for genes determining variation in phenotype. Their high level of transferability to other Solanaceae species can be used to provide anchoring points for the integration of genetic maps across species.