Natural selection among Eurasians at genomic regions associated with HIV-1 control.
ABSTRACT: BACKGROUND:HIV susceptibility and pathogenicity exhibit both interindividual and intergroup variability. The etiology of intergroup variability is still poorly understood, and could be partly linked to genetic differences among racial/ethnic groups. These genetic differences may be traceable to different regimes of natural selection in the 60,000 years since the human radiation out of Africa. Here, we examine population differentiation and haplotype patterns at several loci identified through genome-wide association studies on HIV-1 control, as determined by viral-load setpoint, in European and African-American populations. We use genome-wide data from the Human Genome Diversity Project, consisting of 53 world-wide populations, to compare measures of FST and relative extended haplotype homozygosity (REHH) at these candidate loci to the rest of the respective chromosome. RESULTS:We find that the Europe-Middle East and Europe-South Asia pairwise FST in the most strongly associated region are elevated compared to most pairwise comparisons with the sub-Saharan African group, which exhibit very low FST. We also find genetic signatures of recent positive selection (higher REHH) at these associated regions among all groups except for sub-Saharan Africans and Native Americans. This pattern is consistent with one in which genetic differentiation, possibly due to diversifying/positive selection, occurred at these loci among Eurasians. CONCLUSIONS:These findings are concordant with those from earlier studies suggesting recent evolutionary change at immunity-related genomic regions among Europeans, and shed light on the potential genetic and evolutionary origin of population differences in HIV-1 control.
Project description:BACKGROUND:Cytochrome P450 CYP2C19 metabolizes a wide range of pharmacologically active substances and a relatively small number of naturally occurring environmental toxins. Poor activity alleles of CYP2C19 are very frequent worldwide, particularly in Asia, raising the possibility that reduced metabolism could be advantageous in some circumstances. The evolutionary selective forces acting on this gene have not previously been investigated.We analyzed CYP2C19 genetic markers from 127 Gambians and on 120 chromosomes from Yoruba, Europeans and Asians (Japanese?+?Han Chinese) in the Hapmap database. Haplotype breakdown was explored using bifurcation plots and relative extended haplotype homozygosity (REHH). Allele frequency differentiation across populations was estimated using the fixation index (FST) and haplotype diversity with coalescent models. RESULTS:Bifurcation plots suggested conservation of alleles conferring slow metabolism (CYP2C19*2 and *3). REHH was high around CYP2C19*2 in Yoruba (REHH 8.3, at 133.3 kb from the core) and to a lesser extent in Europeans (3.5, at 37.7 kb) and Asians (2.8, at -29.7 kb). FST at the CYP2C19 locus was low overall (0.098). CYP2C19*3 was an FST outlier in Asians (0.293), CYP2C19 haplotype diversity?<?= 0.037, p <0.001. CONCLUSIONS:We found some evidence that the slow metabolizing allele CYP2C19*2 is subject to positive selective forces worldwide. Similar evidence was also found for CYP2C19*3 which is frequent only in Asia. FST is low at the CYP2C19 locus, suggesting balancing selection overall. The biological factors responsible for these selective pressures are currently unknown. One possible explanation is that early humans were exposed to a ubiquitous novel toxin activated by CYP2C19. The genetic adaptation took place within the last 10,000 years which coincides with the development of systematic agricultural practices.
Project description:BACKGROUND:Identification of selection signatures can provide a direct insight into the mechanism of artificial selection and allow further disclosure of the candidate genes related to the animals' phenotypic variation. Domestication and subsequent long-time selection have resulted in extensive phenotypic changes in domestic pigs, involving a number of traits, like behavior, body composition, disease resistance, reproduction and coat color. In this study, based on genotypes obtained from PorcineSNP60 Illumina assay we attempt to detect both diversifying and within-breed selection signatures in 530 pigs belonging to four breeds: Polish Landrace, Pu?awska, Z?otnicka White and Z?otnicka Spotted, of which the last three are a subject of conservative breeding and substantially represent the native populations. RESULTS:A two largely complementary statistical methods were used for signatures detection, including: pairwise FST and relative extended haplotype homozygosity (REHH) test. Breed-specific diversifying selection signals included several genes involved in processes connected with fertility, growth and metabolism which are potentially responsible for different phenotypes of the studied breeds. The diversifying selection signals also comprised PPARD gene that was previously found to have a large effect on the shape of the external ear in pigs or two genes encoding neuropeptide Y receptors (Y2 and Y5) involved in fat deposition and stress response which are important features differentiating the studied breeds. REHH statistics allowed detecting several within-breed selection signatures overlapping with genes connected with a range of functions including, among others: metabolic pathways, immune system response or implantation and development of the embryo. CONCLUSIONS:The study provides many potential candidate genes with implication for traits selected in the individual breeds and gives strong basis for further studies aiming at identification of sources of variation among the studied pig breeds.
Project description:We used a high-density single-nucleotide polymorphism array to genotype 75 Plasmodium falciparum isolates recently collected from Senegal and The Gambia to search for signals of selection in this malaria endemic region. We found little geographic or temporal stratification of the genetic diversity among the sampled parasites. Through application of the iHS and REHH haplotype-based tests for positive selection, we found evidence of recent selective sweeps at a known drug resistance locus, at several known antigenic loci, and at several genomic regions not previously identified as sites of recent selection. We discuss the value of deep population-specific genomic analyses for identifying selection signals within sampled endemic populations of parasites, which may correspond to local selection pressures such as distinctive therapeutic regimes or mosquito vectors.
Project description:The Pacific cod Gadus macrocephalus is a demersal, economically important fish in the family Gadidae. Population genetic differentiation of Pacific cod was examined across its northwestern Pacific range by screening variation of eight microsatellite loci in the present study. All four populations exhibited high genetic diversity. Pairwise fixation index (Fst) suggested a moderate to high level of genetic differentiation among populations. Population of the Yellow Sea (YS) showed higher genetic difference compared to the other three populations based on the results of pairwise Fst, three-dimensional factorial correspondence analysis (3D-FCA) and STRUCTURE, which implied restricted gene flow among them. Wilcoxon signed rank tests suggested no significant heterozygosity excess and no recent genetic bottleneck events were detected. Microsatellite DNA is an effective molecular marker for detecting the phylogeographic pattern of Pacific cod, and these Pacific cod populations should be three management units.
Project description:BACKGROUND: An increasing number of aquaculture species are subjected to artificial selection in systematic breeding programs. Rapid improvements of important commercial traits are reported, but little is known about the effects of the strong directional selection applied, on gene level variation. Large numbers of genetic markers are becoming available, making it feasible to detect and estimate these effects. Here a simulation tool was developed in order to explore the power by which single genetic loci subjected to uni-directional selection in parallel breeding populations may be detected. FINDINGS: Two simulation models were pursued: 1) screening for loci displaying higher genetic differentiation than expected (high-FST outliers), from neutral evolution between a pool of domesticated populations and a pool of wild populations; 2) screening for loci displaying lower genetic differentiation (low-FST outliers) between domesticated strains than expected from neutral evolution. The premise for both approaches was that the isolated domesticated strains are subjected to the same breeding goals. The power to detect outlier loci was calculated under the following parameter values: number of populations, effective population size per population, number of generations since onset of selection, initial FST, and the selection coefficient acting on the locus. Among the parameters investigated, selection coefficient, the number of generation since onset of selection, and number of populations, had the largest impact on power. The power to detect loci subjected to directional in breeding programmes was high when applying the between farmed and wild population approach, and low for the between farmed populations approach. CONCLUSIONS: A simulation tool was developed for estimating the power to detect artificial selection acting directly on single loci. The simulation tool should be applicable to most species subject to domestication, as long as a reasonable high accuracy in input parameters such as effective population size, number of generations since the initiation of selection, and initial differentiation (FST) can be obtained. Identification of genetic loci under artificial selection would be highly valuable, since such loci could be used to monitor maintenance of genetic variation in the breeding populations and monitoring possible genetic changes in wild populations from genetic interaction between escapees and their wild counterpart.
Project description:Variation in human skin pigmentation evolved in response to the selective pressure of ultra-violet radiation (UVR). Selection to maintain darker skin in high UVR environments is expected to constrain pigmentation phenotype and variation in pigmentation loci. Consistent with this hypothesis, the gene MC1R exhibits reduced diversity in African populations from high UVR regions compared to low-UVR non-African populations. However, MC1R diversity in non-African populations that have evolved under high-UVR conditions is not well characterized.In order to test the hypothesis that MC1R variation has been constrained in Melanesians the coding region of the MC1R gene was sequenced in 188 individuals from Northern Island Melanesia. The role of purifying selection was assessed using a modified McDonald Kreitman's test. Pairwise FST was calculated between Melanesian populations and populations from the 1000 Genomes Project. The SNP rs2228479 was genotyped in a larger sample (n = 635) of Melanesians and tested for associations with skin and hair pigmentation.We observe three nonsynonymous and two synonymous mutations. A modified McDonald Kreitman's test failed to detect a significant signal of purifying selection. Pairwise FST values calculated between the four islands sampled here indicate little regional substructure in MC1R. When compared to African, European, East and South Asian populations, Melanesians do not exhibit reduced population divergence (measured as FST) or a high proportion of haplotype sharing with Africans, as one might expect if ancestral haplotypes were conserved across high UVR populations in and out of Africa. The only common nonsynonymous polymorphism observed, rs2228479, is not significantly associated with skin or hair pigmentation in a larger sample of Melanesians.The pattern of sequence diversity here does not support a model of strong selective constraint on MC1R in Northern Island Melanesia This absence of strong constraint, as well as the recent population history of the region, may explain the observed frequencies of the derived rs2228479 allele. These results emphasize the complex genetic architecture of pigmentation phenotypes, which are controlled by multiple, possibly interacting loci. They also highlight the role that population history can play in influencing phenotypic diversity in the absence of strong natural selection.
Project description:Tsetse flies are confined to sub-Saharan Africa where they occupy discontinuous habitats. In anticipation of area-wide control programmes, estimates of gene flow among tsetse populations are necessary. Genetic diversities were partitioned at eight microsatellite loci and five mitochondrial loci in 21 Glossina pallidipes Austin populations. At microsatellite loci, Nei's unbiased gene diversity averaged over loci was 0.659 and the total number of alleles was 214, only four of which were shared among all populations. The mean number of alleles per locus was 26.8. Random mating was observed within but not among populations (fixation index FST=0.18) and 81% of the genetic variance was within populations. Thirty-nine mitochondrial variants were detected. Mitochondrial diversities in populations varied from 0 to 0.85 and averaged 0.42, and FST=0.51. High levels of genetic differentiation were characteristic, extending even to subpopulations separated by tens and hundreds of kilometres, and indicating low rates of gene flow.
Project description:Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 bc, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 bc, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height.
Project description:A large single nucleotide polymorphism (SNP) dataset was used to analyze genome-wide diversity in a diverse collection of watermelon cultivars representing globally cultivated, watermelon genetic diversity. The marker density required for conducting successful association mapping depends on the extent of linkage disequilibrium (LD) within a population. Use of genotyping by sequencing reveals large numbers of SNPs that in turn generate opportunities in genome-wide association mapping and marker-assisted selection, even in crops such as watermelon for which few genomic resources are available. In this paper, we used genome-wide genetic diversity to study LD, selective sweeps, and pairwise FST distributions among worldwide cultivated watermelons to track signals of domestication.We examined 183 Citrullus lanatus var. lanatus accessions representing domesticated watermelon and generated a set of 11,485 SNP markers using genotyping by sequencing. With a diverse panel of worldwide cultivated watermelons, we identified a set of 5,254 SNPs with a minor allele frequency of ? 0.05, distributed across the genome. All ancestries were traced to Africa and an admixture of various ancestries constituted secondary gene pools across various continents. A sliding window analysis using pairwise FST values was used to resolve selective sweeps. We identified strong selection on chromosomes 3 and 9 that might have contributed to the domestication process. Pairwise analysis of adjacent SNPs within a chromosome as well as within a haplotype allowed us to estimate genome-wide LD decay. LD was also detected within individual genes on various chromosomes. Principal component and ancestry analyses were used to account for population structure in a genome-wide association study. We further mapped important genes for soluble solid content using a mixed linear model.Information concerning the SNP resources, population structure, and LD developed in this study will help in identifying agronomically important candidate genes from the genomic regions underlying selection and for mapping quantitative trait loci using a genome-wide association study in sweet watermelon.
Project description:Native species that persist in urban environments may benefit from local adaptation to novel selection factors. We used double-digest restriction-side associated DNA (RAD) sequencing to evaluate shifts in genome-wide genetic diversity and investigate the presence of parallel evolution associated with urban-specific selection factors in wood frogs (Lithobates sylvaticus). Our replicated paired study design involved 12 individuals from each of 4 rural and urban populations to improve our confidence that detected signals of selection are indeed associated with urbanization. Genetic diversity measures were less for urban populations; however, the effect size was small, suggesting little biological consequence. Using an FST outlier approach, we identified 37 of 8344 genotyped single nucleotide polymorphisms with consistent evidence of directional selection across replicates. A genome-wide association study analysis detected modest support for an association between environment type and 12 of the 37 FST outlier loci. Discriminant analysis of principal components using the 37 FST outlier loci produced correct reassignment for 87.5% of rural samples and 93.8% of urban samples. Eighteen of the 37 FST outlier loci mapped to the American bullfrog (Rana [Lithobates] catesbeiana) genome, although none were in coding regions. This evidence of parallel evolution to urban environments provides a powerful example of the ability of urban landscapes to direct evolutionary processes.