Project description:Sweetpotato (Ipomoea batatas) is the sixth most important food crop and plays a critical role in maintaining food security worldwide. Support for sweetpotato improvement research in breeding and genetics programs, and maintenance of sweetpotato germplasm collections is essential for preserving food security for future generations. Germplasm collections seek to preserve phenotypic and genotypic diversity through accession characterization. However, due to its genetic complexity, high heterogeneity, polyploid genome, phenotypic plasticity, and high flower production variability, sweetpotato genetic characterization is challenging. Here, we characterize the genetic diversity and population structure of 604 accessions from the sweetpotato germplasm collection maintained by the United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Plant Genetic Resources Conservation Unit (PGRCU) in Griffin, Georgia, United States. Using the genotyping-by-sequencing platform (GBSpoly) and bioinformatic pipelines (ngsComposer and GBSapp), a total of 102,870 polymorphic SNPs with hexaploid dosage calls were identified from the 604 accessions. Discriminant analysis of principal components (DAPC) and Bayesian clustering identified six unique genetic groupings across seven broad geographic regions. Genetic diversity analyses using the hexaploid data set revealed ample genetic diversity among the analyzed collection in concordance with previous analyses. Following population structure and diversity analyses, breeder germplasm subsets of 24, 48, 96, and 384 accessions were established using K-means clustering with manual selection to maintain phenotypic and genotypic diversity. The genetic characterization of the PGRCU sweetpotato germplasm collection and breeder germplasm subsets developed in this study provide the foundation for future association studies and serve as precursors toward phenotyping studies aimed at linking genotype with phenotype.

Project description:Phenotypic evaluation and efficient utilization of germplasm collections can be time-intensive, laborious, and expensive. However, with the plummeting costs of next-generation sequencing and the addition of genomic selection to the plant breeder's toolbox, we now can more efficiently tap the genetic diversity within large germplasm collections. In this study, we applied and evaluated genomic prediction's potential to a set of 482 pea (Pisum sativum L.) accessions-genotyped with 30,600 single nucleotide polymorphic (SNP) markers and phenotyped for seed yield and yield-related components-for enhancing selection of accessions from the USDA Pea Germplasm Collection. Genomic prediction models and several factors affecting predictive ability were evaluated in a series of cross-validation schemes across complex traits. Different genomic prediction models gave similar results, with predictive ability across traits ranging from 0.23 to 0.60, with no model working best across all traits. Increasing the training population size improved the predictive ability of most traits, including seed yield. Predictive abilities increased and reached a plateau with increasing number of markers presumably due to extensive linkage disequilibrium in the pea genome. Accounting for population structure effects did not significantly boost predictive ability, but we observed a slight improvement in seed yield. By applying the best genomic prediction model (e.g., RR-BLUP), we then examined the distribution of genotyped but nonphenotyped accessions and the reliability of genomic estimated breeding values (GEBV). The distribution of GEBV suggested that none of the nonphenotyped accessions were expected to perform outside the range of the phenotyped accessions. Desirable breeding values with higher reliability can be used to identify and screen favorable germplasm accessions. Expanding the training set and incorporating additional orthogonal information (e.g., transcriptomics, metabolomics, physiological traits, etc.) into the genomic prediction framework can enhance prediction accuracy.

Project description:BackgroundConservation of genetic diversity is an essential prerequisite for developing new cultivars with desirable agronomic traits. Although a large number of germplasm collections have been established worldwide, many of them face major difficulties due to large size and a lack of adequate information about population structure and genetic diversity. Core collection with a minimum number of accessions and maximum genetic diversity of pepper species and its wild relatives will facilitate easy access to genetic material as well as the use of hidden genetic diversity in Capsicum.ResultsTo explore genetic diversity and population structure, we investigated patterns of molecular diversity using a transcriptome-based 48 single nucleotide polymorphisms (SNPs) in a large germplasm collection comprising 3,821 accessions. Among the 11 species examined, Capsicum annuum showed the highest genetic diversity (HE = 0.44, I = 0.69), whereas the wild species C. galapagoense showed the lowest genetic diversity (HE = 0.06, I = 0.07). The Capsicum germplasm collection was divided into 10 clusters (cluster 1 to 10) based on population structure analysis, and five groups (group A to E) based on phylogenetic analysis. Capsicum accessions from the five distinct groups in an unrooted phylogenetic tree showed taxonomic distinctness and reflected their geographic origins. Most of the accessions from European countries are distributed in the A and B groups, whereas the accessions from Asian countries are mainly distributed in C and D groups. Five different sampling strategies with diverse genetic clustering methods were used to select the optimal method for constructing the core collection. Using a number of allelic variations based on 48 SNP markers and 32 different phenotypic/morphological traits, a core collection 'CC240' with a total of 240 accessions (5.2 %) was selected from within the entire Capsicum germplasm. Compared to the other core collections, CC240 displayed higher genetic diversity (I = 0.95) and genetic evenness (J' = 0.80), and represented a wider range of phenotypic variation (MD = 9.45 %, CR = 98.40 %).ConclusionsA total of 240 accessions were selected from 3,821 Capsicum accessions based on transcriptome-based 48 SNP markers with genome-wide distribution and 32 traits using a systematic approach. This core collection will be a primary resource for pepper breeders and researchers for further genetic association and functional analyses.

Project description:BackgroundThe economic importance of grapevine has driven significant efforts in genomics to accelerate the exploitation of Vitis resources for development of new cultivars. However, although a large number of clonally propagated accessions are maintained in grape germplasm collections worldwide, their use for crop improvement is limited by the scarcity of information on genetic diversity, population structure and proper phenotypic assessment. The identification of representative and manageable subset of accessions would facilitate access to the diversity available in large collections. A genome-wide germplasm characterization using molecular markers can offer reliable tools for adjusting the quality and representativeness of such core samples.ResultsWe investigated patterns of molecular diversity at 22 common microsatellite loci and 384 single nucleotide polymorphisms (SNPs) in 2273 accessions of domesticated grapevine V. vinifera ssp. sativa, its wild relative V. vinifera ssp. sylvestris, interspecific hybrid cultivars and rootstocks. Despite the large number of putative duplicates and extensive clonal relationships among the accessions, we observed high level of genetic variation. In the total germplasm collection the average genetic diversity, as quantified by the expected heterozygosity, was higher for SSR loci (0.81) than for SNPs (0.34). The analysis of the genetic structure in the grape germplasm collection revealed several levels of stratification. The primary division was between accessions of V. vinifera and non-vinifera, followed by the distinction between wild and domesticated grapevine. Intra-specific subgroups were detected within cultivated grapevine representing different eco-geographic groups. The comparison of a phenological core collection and genetic core collections showed that the latter retained more genetic diversity, while maintaining a similar phenotypic variability.ConclusionsThe comprehensive molecular characterization of our grape germplasm collection contributes to the knowledge about levels and distribution of genetic diversity in the existing resources of Vitis and provides insights into genetic subdivision within the European germplasm. Genotypic and phenotypic information compared in this study may efficiently guide further exploration of this diversity for facilitating its practical use.

Dataset Information

Genetic Diversity and Population Structure of a Large USDA Sesame Collection

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets