Ancestry, admixture, and natural selection: a genetic analysis of New World populations
Ontology highlight
ABSTRACT: A population and admixture analysis of Mesoamerican Totonacs and South American Bolivians. A panel of highly informative ancestry informative markers (AIMs) for New World populations is identified. Regions coinciding with AIMs are have moderate signatures of selection. Population structure and differentiation were assessed with a genome-wide panel of 815,377 autosomal markers, Y-chromosome STR and SNPs, and mtDNA sequence data.
Project description:The genetic structure of some native Bolivians has been substantially influenced by admixture from Europeans, which we estimate to have occurred approximately 360 – 384 years ago. Consistent with historical accounts of male admixture, Y-chromosome haplogroups typical of Europeans were found in 39% of our Bolivian samples. No evidence of African admixture was found in native Bolivians. The Mesoamerican Totonacs have little evidence of European or African admixture. Our analysis indicates that some admixed Bolivians have Native American mtDNA and Y-chromosomes but harbor up to 30% European autosomal ancestry, demonstrating the need for autosomal markers to assess ancestry in admixed populations. From a dense genome-wide panel of 815,377 markers, we developed a set of 324 AIMs, specific for Native American ancestry. As few a 40-50 of these markers successfully predict New World ancestry in the ascertainment panel of Bolivians and Totonacs. The markers easily distinguish New World from Old World ancestry, even for populations more closely related to the Americas such as central and eastern Asians, and were effective for New World vs. Old World comparisons in five other geographically and culturally distinct populations of the Americas. SNPs demonstrating very high divergence between the two Native American populations and major Old World populations are found on haplotypes that are shared and occur at similar frequencies in other indigenous low-admixture American populations examined here (i.e. Pima, Maya, Colombian, Karitiana, and Surui). After excluding the possibility of recent relatedness, our results indicate that native Bolivians and Totonacs share ancestry with other American populations through a substantial contribution from a common founding population, population bottlenecks, and possible natural selection on functional variation.
Project description:Here we present genome-wide high-coverage genotyping data on a panel of 85 human samples from Eurasia that are used in addition to public data in studing the genomic context of a 24 kya old DNA sample from Southern Siberia that was sequenced to the avwerage depth of 1X. 85 samples were analysed with the Illumina platforms Human610-Quad v1.0, HumanHap650Yv3 (HumanHap650Yv3_A) and Human660W-Quad v1.0 Genotyping BeadChips and are described herein.
Project description:Populations of the Americas were founded by early migrants from Asia, and some have experienced recent genetic admixture. To better characterize the native and non-native ancestry components in populations from the Americas, we analyzed 815,377 autosomal SNPs, mitochondrial hypervariable segments I and II, and 36 Y-chromosome STRs from 24 Mesoamerican Totonacs and 23 South American Bolivians.We analyzed common genomic regions from native Bolivian and Totonac populations to identify 324 highly predictive Native American ancestry informative markers (AIMs). As few as 40-50 of these AIMs perform nearly as well as large panels of random genome-wide SNPs for predicting and estimating Native American ancestry and admixture levels. These AIMs have greater New World vs. Old World specificity than previous AIMs sets. We identify highly-divergent New World SNPs that coincide with high-frequency haplotypes found at similar frequencies in all populations examined, including the HGDP Pima, Maya, Colombian, Karitiana, and Surui American populations. Some of these regions are potential candidates for positive selection. European admixture in the Bolivian sample is approximately 12%, though individual estimates range from 0-48%. We estimate that the admixture occurred ~360-384?years ago. Little evidence of European or African admixture was found in Totonac individuals. Bolivians with pre-Columbian mtDNA and Y-chromosome haplogroups had 5-30% autosomal European ancestry, demonstrating the limitations of Y-chromosome and mtDNA haplogroups and the need for autosomal ancestry informative markers for assessing ancestry in admixed populations.
Project description:The human dimension of the Columbian Exchange entailed substantial genetic admixture between ancestral source populations from Africa, the Americas and Europe, which had evolved separately for many thousands of years. We sought to address the implications of the creation of admixed American genomes, containing novel allelic combinations, for human health and fitness via analysis of an admixed Colombian population from Medellin. Colombian genomes from Medellin show a wide range of three-way admixture contributions from ancestral source populations. The primary ancestry component for the population is European (average = 74.6%, range = 45.0%-96.7%), followed by Native American (average = 18.1%, range = 2.1%-33.3%) and African (average = 7.3%, range = 0.2%-38.6%). Locus-specific patterns of ancestry were evaluated to search for genomic regions that are enriched across the population for particular ancestry contributions. Adaptive and innate immune system related genes and pathways are particularly over-represented among ancestry-enriched segments, including genes (HLA-B and MAPK10) that are involved in defense against endemic pathogens such as malaria. Genes that encode functions related to skin pigmentation (SCL4A5) and cutaneous glands (EDAR) are also found in regions with anomalous ancestry patterns. These results suggest the possibility that ancestry-specific loci were differentially retained in the modern admixed Colombian population based on their utility in the New World environment.
Project description:The Amerindian group known as the Charrúas inhabited Uruguay at the timing of European colonial contact. Even though they were extinguished as an ethnic group as a result of a genocide, Charrúan heritage is part of the Uruguayan identity both culturally and genetically. While mitochondrial DNA studies have shown evidence of Amerindian ancestry in living Uruguayans, here we undertake whole-genome sequencing of 10 Uruguayan individuals with self-declared Charruan heritage. We detect chromosomal segments of Amerindian ancestry supporting the presence of indigenous genetic ancestry in living descendants. Specific haplotypes were found to be enriched in "Charrúas" and rare in the rest of the Amerindian groups studied. Some of these we interpret as the result of positive selection, as we identified selection signatures and they were located mostly within genes related to the infectivity of specific viruses. Historical records describe contacts of the Charrúas with other Amerindians, such as Guaraní, and patterns of genomic similarity observed here concur with genomic similarity between these groups. Less expected, we found a high genomic similarity of the Charrúas to Diaguita from Argentinian and Chile, which could be explained by geographically proximity. Finally, by fitting admixture models of Amerindian and European ancestry for the Uruguayan population, we were able to estimate the timing of the first pulse of admixture between European and Uruguayan indigenous peoples in approximately 1658 and the second migration pulse in 1683. Both dates roughly concurring with the Franciscan missions in 1662 and the foundation of the city of Colonia in 1680 by the Spanish.
Project description:Ancestry-informative markers (AIMs) show high allele frequency divergence between different ancestral or geographically distant populations. These genetic markers are especially useful in inferring the likely ancestral origin of an individual or estimating the apportionment of ancestry components in admixed individuals or populations. The study of AIMs is of great interest in clinical genetics research, particularly to detect and correct for population substructure effects in case-control association studies, but also in population and forensic genetics studies. This work presents a set of 46 ancestry-informative insertion deletion polymorphisms selected to efficiently measure population admixture proportions of four different origins (African, European, East Asian and Native American). All markers are analyzed in short fragments (under 230 basepairs) through a single PCR followed by capillary electrophoresis (CE) allowing a very simple one tube PCR-to-CE approach. HGDP-CEPH diversity panel samples from the four groups, together with Oceanians, were genotyped to evaluate the efficiency of the assay in clustering populations from different continental origins and to establish reference databases. In addition, other populations from diverse geographic origins were tested using the HGDP-CEPH samples as reference data. The results revealed that the AIM-INDEL set developed is highly efficient at inferring the ancestry of individuals and provides good estimates of ancestry proportions at the population level. In conclusion, we have optimized the multiplexed genotyping of 46 AIM-INDELs in a simple and informative assay, enabling a more straightforward alternative to the commonly available AIM-SNP typing methods dependent on complex, multi-step protocols or implementation of large-scale genotyping technologies.
Project description:Population genetic analyses of local ancestry tracts routinely assume that the ancestral admixture process is identical for both parents of an individual, an assumption that may be invalid when considering recent admixture. Here, we present Parental Admixture Proportion Inference (PAPI), a Bayesian tool for inferring the admixture proportions and admixture times for each parent of a single admixed individual. PAPI analyzes unphased local ancestry tracts and has two components: a binomial model that leverages genome-wide ancestry fractions to infer parental admixture proportions and a hidden Markov model (HMM) that infers admixture times from tract lengths. Crucially, the HMM accounts for unobserved within-ancestry recombination by approximating the pedigree crossover dynamics, enabling inference of parental admixture times. In simulations, we find that PAPI's admixture proportion estimates deviate from the truth by 0.047 on average, outperforming ANCESTOR and PedMix by 46.0% and 57.6%, respectively. Moreover, PAPI's admixture time estimates were strongly correlated with the truth (R=0.76) but have an average downward bias of 1.01 generations that is partly attributable to inaccuracies in local ancestry inference. As an illustration of its utility, we ran PAPI on African American genotypes from the PAGE study (N = 5,786) and found strong evidence of assortative mating by ancestry proportion: couples' ancestry proportions are highly correlated (R = 0.87) and are closer to each other than expected under random mating (p < 10-6). We anticipate that PAPI will be useful in studying the population dynamics of admixture and will also be of interest to individuals seeking to learn about their personal genealogies.
Project description:To carry out population genetics analyses of the Arctic gregion we carried out Illumina Bead-Array-based enotyping on 18 samples from Greenland. 19 samples were analysed with the Illumina platform Human660W-Quad v1.0 Genotyping BeadChip and are described herein.
Project description:The earlier version of AncestryPainter is a Perl program that displays the ancestry composition of numerous individuals using a rounded graph. Motivated by the requests of users in practical applications, we updated AncestryPainter to version 2.0 by coding in an R package and improving the layout, providing more options and compatible statistical functions for graphing. Apart from improving visualization functions per se in this update, we added an extra graphing module to visualize genetic distance through radial bars of varying lengths surrounding a core. Notably, AncestryPainter 2.0 allows for multiple pie charts at the center of the graph to display the ancestry composition of more than one target population and implements a method admixture history graph to infer the admixture sequence of multiple ancestry populations. We validated the six admixture history graph metrics using both simulated and real data and implemented a Pearson coefficient-based metric with the best performance in AncestryPainter 2.0. Furthermore, a statistical module was implemented to merge ancestry proportion matrices. AncestryPainter 2.0 is freely available at https://github.com/Shuhua-Group/AncestryPainterV2 and https://pog.fudan.edu.cn/#/Software.