Dataset Information


Principal component analysis reveals the 1000 Genomes Project does not sufficiently cover the human genetic diversity in Asia.

ABSTRACT: The 1000 Genomes Project (1KG) aims to provide a comprehensive resource on human genetic variation. By sequencing 2,500 individuals, 1KG is expected to cover the majority of human genetic diversity worldwide. In this study, we used population structure analysis based on genome-wide single nucleotide polymorphism (SNP) data to examine and evaluate how well the 1KG samples cover genetic diversity, using available genome-wide SNP data from 3,831 individuals representing 140 population samples worldwide. We developed a method to quantitatively measure and evaluate the genetic diversity revealed by population structure analysis. Our results showed that 1KG does not sufficiently cover human genetic diversity in Asia, especially in Southeast Asia. We suggest that either 1KG include better coverage of Southeast Asian populations or a regional effort be initiated to provide a more comprehensive characterization of human genetic diversity in Asia, which is important for future evolutionary and medical studies.
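
The abstract does not spell out the quantitative measure, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a genotype matrix of allele counts (0/1/2), projects samples onto the top principal components with NumPy, and then computes a hypothetical coverage score: the fraction of non-panel samples that lie within a chosen distance of at least one panel sample in PC space. The function names, the `radius` parameter, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def pca_scores(genotypes, n_components=2):
    """Project samples onto the top principal components.

    genotypes: (n_samples, n_snps) array of allele counts (0, 1 or 2).
    Returns an (n_samples, n_components) array of PC coordinates.
    """
    X = np.asarray(genotypes, dtype=float)
    X = X - X.mean(axis=0)  # center each SNP column
    U, S, _ = np.linalg.svd(X, full_matrices=False)
    return U[:, :n_components] * S[:n_components]

def coverage_fraction(scores, is_panel, radius):
    """Hypothetical coverage measure (an assumption, not from the paper).

    Returns the fraction of non-panel samples whose nearest panel
    sample in PC space is at most `radius` away.
    """
    is_panel = np.asarray(is_panel, dtype=bool)
    panel = scores[is_panel]
    others = scores[~is_panel]
    # Pairwise Euclidean distances: (n_others, n_panel)
    dists = np.linalg.norm(others[:, None, :] - panel[None, :, :], axis=2)
    return float((dists.min(axis=1) <= radius).mean())

# Toy data: two well-separated groups, with the panel drawn from only one.
toy_genotypes = [
    [0, 0, 2, 2], [0, 0, 2, 2], [0, 0, 2, 2],  # group A
    [2, 2, 0, 0], [2, 2, 0, 0], [2, 2, 0, 0],  # group B
]
toy_is_panel = [True, True, False, False, False, False]

toy_scores = pca_scores(toy_genotypes)
print(coverage_fraction(toy_scores, toy_is_panel, radius=1.0))
```

In this toy setup the panel contains only group A samples, so the group B samples fall far from every panel sample in PC space and count as uncovered. A real analysis would also need SNP filtering, normalization, and a principled choice of distance threshold, none of which are specified here.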


PROVIDER: S-EPMC3701331 | BioStudies | 2013-01-01


REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC4112609 | BioStudies
2018-01-01 | S-EPMC5861096 | BioStudies
2019-12-31 | GSE74100 | GEO
2014-01-01 | S-EPMC3998037 | BioStudies
2020-01-01 | S-EPMC7412351 | BioStudies
2015-01-01 | S-EPMC4350547 | BioStudies
2016-01-01 | S-EPMC4870696 | BioStudies
2016-01-01 | S-EPMC4937191 | BioStudies
2020-01-01 | S-EPMC7163074 | BioStudies
2016-02-04 | GSE77508 | GEO