Unknown

Dataset Information

0

Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.


ABSTRACT:

Motivation

Measuring genetic diversity is an important problem because increasing genetic diversity is a key to making new genetic discoveries, while also being a major source of confounding to be aware of in genetics studies.

Results

Using the UK Biobank data, a prospective cohort study with deep genetic and phenotypic data collected on almost 500 000 individuals from across the UK, we carefully define 21 distinct ancestry groups from all four corners of the world. These ancestry groups can serve as a global reference of worldwide populations, with a handful of applications. Here, we develop a method that uses allele frequencies and principal components derived from these ancestry groups to effectively measure ancestry proportions from allele frequencies of any genetic dataset.

Availability and implementation

This method is implemented in function snp_ancestry_summary of R package bigsnpr.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Prive F 

PROVIDER: S-EPMC9237724 | biostudies-literature | 2022 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.

Privé Florian F  

Bioinformatics (Oxford, England) 20220601 13


<h4>Motivation</h4>Measuring genetic diversity is an important problem because increasing genetic diversity is a key to making new genetic discoveries, while also being a major source of confounding to be aware of in genetics studies.<h4>Results</h4>Using the UK Biobank data, a prospective cohort study with deep genetic and phenotypic data collected on almost 500 000 individuals from across the UK, we carefully define 21 distinct ancestry groups from all four corners of the world. These ancestry  ...[more]

Similar Datasets

| S-EPMC10387204 | biostudies-literature
| S-EPMC9929290 | biostudies-literature
| S-EPMC5972416 | biostudies-literature
| S-EPMC9884206 | biostudies-literature
| S-EPMC11534268 | biostudies-literature
| S-EPMC6612820 | biostudies-literature
| S-EPMC6417431 | biostudies-literature
| S-EPMC11196113 | biostudies-literature
| S-EPMC8419981 | biostudies-literature
| S-EPMC9114146 | biostudies-literature