Project description:The genetic structure of the indigenous hunter-gatherer peoples of Southern Africa, the oldest known lineage of modern man, holds an important key to understanding humanity's early history. Previously sequenced human genomes have been limited to recently diverged populations. Here we present the first complete genome sequences of an indigenous hunter-gatherer from the Kalahari Desert and of a Bantu from Southern Africa, as well as protein-coding regions from an additional three hunter-gatherers from disparate regions of the Kalahari. We characterize the extent of whole-genome and exome diversity among the five men, reporting 1.3 million novel DNA differences genome-wide, and 13,146 novel amino-acid variants. These data allow genetic relationships among Southern African foragers and neighboring agriculturalists to be traced more accurately than was previously possible. Adding the described variants to current databases will facilitate inclusion of Southern Africans in medical research efforts.
Project description:The genetic structure of the indigenous hunter-gatherer peoples of Southern Africa, the oldest known lineage of modern man, holds an important key to understanding humanity's early history. Previously sequenced human genomes have been limited to recently diverged populations. Here we present the first complete genome sequences of an indigenous hunter-gatherer from the Kalahari Desert and of a Bantu from Southern Africa, as well as protein-coding regions from an additional three hunter-gatherers from disparate regions of the Kalahari. We characterize the extent of whole-genome and exome diversity among the five men, reporting 1.3 million novel DNA differences genome-wide, and 13,146 novel amino-acid variants. These data allow genetic relationships among Southern African foragers and neighboring agriculturalists to be traced more accurately than was previously possible. Adding the described variants to current databases will facilitate inclusion of Southern Africans in medical research efforts. Copy number differences between NA18507 and KB1 were predicted from the depth of whole-genome shotgun sequence reads. These predictions were then validated using array-CGH using a a genome-wide design as well as a custom design targeted at specific regions of copy number difference
Project description:The history of click-speaking Khoe-San, and African populations in general, remains poorly understood. We genotyped ~2.3 million SNPs in 220 southern Africans and found that the Khoe-San diverged from other populations at least 100,000 years ago, but structure within the Khoe-San dated back to about 35,000 years ago. Genetic variation in various sub-Saharan populations did not localize the origin of modern humans to a single geographic region within Africa, instead, it indicated a history of admixture and stratification. We found evidence of adaptation targeting muscle function and immune response, potential adaptive introgression of UV-light protection, and selection predating modern human diversification involving skeletal and neurological development. These new findings illustrate the importance of African genomic diversity in understanding human evolutionary history .220 samples were analysed with the Illumina HumanOmni2.5-Quad BeadChip and are described herein.
Project description:RNA-seq reads from the selfing species Arabidopsis thaliana were produced from flowers to study the consequences of the transition from the ancestral state (outcrossing) to the derived state (selfing). This was done in the context of examining another species in the Arabidopsis genus (A. lyrata) and another species pair (Capsella rubella versus Capsella grandiflora, which are selfing and outcrossing, respectively). These samples were generated to complement part of this larger study. Briefly, the shift from outcrossing to selfing is common in flowering plants, but neither the genomic consequences nor the speed with which they appear are well understood. An excellent model for understanding the evolution of self fertilization is provided by Capsella rubella, which became self-compatible <200,000 years ago. We present a reference genome for the species, and compare RNA expression and polymorphism patterns between C. rubella and its outcrossing progenitor C. grandiflora. There is a clear shift in the expression of genes associated with flowering phenotypes; a similar shift is seen in the related genus Arabidopsis, where self-fertilization evolved about 1 million years ago. DNA sequence polymorphisms distinguishing the two Capsella species reveal rapid genome-wide relaxation of purifying selection in C. rubella but without a concomitant change in transposable element abundance. Overall, we document that the transition to selfing may be typified by shifts in expression for genes that function in pollen and flower development, along with a measurable reduction of purifying selection.
Project description:RNA-seq reads from the selfing species Arabidopsis thaliana were produced from flowers to study the consequences of the transition from the ancestral state (outcrossing) to the derived state (selfing). This was done in the context of examining another species in the Arabidopsis genus (A. lyrata) and another species pair (Capsella rubella versus Capsella grandiflora, which are selfing and outcrossing, respectively). These samples were generated to complement part of this larger study. Briefly, the shift from outcrossing to selfing is common in flowering plants, but neither the genomic consequences nor the speed with which they appear are well understood. An excellent model for understanding the evolution of self fertilization is provided by Capsella rubella, which became self-compatible <200,000 years ago. We present a reference genome for the species, and compare RNA expression and polymorphism patterns between C. rubella and its outcrossing progenitor C. grandiflora. There is a clear shift in the expression of genes associated with flowering phenotypes; a similar shift is seen in the related genus Arabidopsis, where self-fertilization evolved about 1 million years ago. DNA sequence polymorphisms distinguishing the two Capsella species reveal rapid genome-wide relaxation of purifying selection in C. rubella but without a concomitant change in transposable element abundance. Overall, we document that the transition to selfing may be typified by shifts in expression for genes that function in pollen and flower development, along with a measurable reduction of purifying selection. As part of a cross-species comparison of gene expression, RNA-seq data was generated in biological replication (2 replicates) from Arabidopsis thaliana at the floral stage. In total, two samples (biological replicates) were used. The reference strain was used for the experments (strain Col-0). Resulting data about gene expression was used as part of a larger study. The Capsella rubella and Capsella grandiflora data are included in GEO Series GSE45518.
Project description:In the United States, African-American (AA) women are more likely to develop early-onset breast cancer and have historically poorer outcomes due to this disease compared to European-American (EA) women. Here, we analyzed genomic profiles of breast tumors from young women (<50 years old), matched by tumor subtype, histological grade, and ethnicity (African-American, AA, compared to European-American, EA). DNA copy number alterations (CNAs) were analyzed using a 32K BAC tiling path array. The study provides insight into the genetic component of ethnicity-related breast cancer health disparities. Breast tumor samples from young women (< 50 years old) were matched as follows: a matched pair consists of one AA and one EA sample, matched for tumor grade and tumor subtype (based on immunohistochemical analysis of ER, PR, and HER2 status). 44 experiments; each experiment is tumor DNA versus reference control DNA (AF) isolated from the blood of a 25-year-old African-American female with no familial or personal history of breast cancer. Additional control experiments included the AF reference versus the well-characterized F1 reference, and 3 self-self hybridization controls (AF versus AF).