Dataset Information

The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants.


ABSTRACT: Genome-wide association studies (GWAS) have long relied on proposed statistical significance thresholds to be able to differentiate true positives from false positives. Although the genome-wide significance P-value threshold of 5 × 10^-8 has become a standard for common-variant GWAS, it has not been updated to cope with the lower allele frequency spectrum used in many recent array-based GWAS studies and sequencing studies. Using a whole-genome- and -exome-sequencing data set of 2875 individuals of European ancestry from the Genetics of Type 2 Diabetes (GoT2D) project and a whole-exome-sequencing data set of 13 000 individuals from five ancestries from the GoT2D and T2D-GENES (Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples) projects, we describe guidelines for genome- and exome-wide association P-value thresholds needed to correct for multiple testing, explaining the impact of linkage disequilibrium thresholds for distinguishing independent variants, minor allele frequency and ancestry characteristics. We emphasize the advantage of studying recent genetic isolate populations when performing rare and low-frequency genetic association analyses, as the multiple testing burden is diminished due to higher genetic homogeneity.
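
The multiple-testing correction the abstract refers to can be illustrated with a minimal sketch. It assumes a plain Bonferroni correction at a family-wise error rate of 0.05, under which the conventional 5 × 10^-8 threshold corresponds to roughly one million independent tests. The function name and the larger test count are hypothetical illustrations, not values or methods taken from the paper.

```python
def bonferroni_threshold(n_independent_tests: int, alpha: float = 0.05) -> float:
    """Per-test P-value threshold under a Bonferroni correction.

    Assumption: the tests are treated as independent, e.g. after pruning
    variants by a linkage disequilibrium cutoff.
    """
    if n_independent_tests <= 0:
        raise ValueError("n_independent_tests must be positive")
    return alpha / n_independent_tests


# About one million independent common variants gives the familiar threshold.
conventional = bonferroni_threshold(1_000_000)
print(f"{conventional:.1e}")  # 5.0e-08

# Hypothetical count: adding low-frequency variants raises the number of
# independent tests, which makes the required threshold stricter.
stricter = bonferroni_threshold(5_000_000)
print(stricter < conventional)  # True
```

The sketch only shows the direction of the effect: more independent variants mean a smaller threshold, and populations with fewer independent variants carry a lighter multiple-testing burden.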

SUBMITTER: Fadista J 

PROVIDER: S-EPMC4970684 | biostudies-literature | 2016 Aug

REPOSITORIES: biostudies-literature

Publications

The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants.

João Fadista, Alisa K. Manning, Jose C. Florez, Leif Groop

European Journal of Human Genetics (EJHG), 2016-01-06, issue 8


Similar Datasets

| S-EPMC6418148 | biostudies-literature
2014-10-10 | GSE53567 | GEO
2014-10-10 | E-GEOD-53567 | biostudies-arrayexpress
| S-EPMC5510701 | biostudies-literature
| S-EPMC8065719 | biostudies-literature
| S-EPMC7804299 | biostudies-literature
| S-EPMC5302847 | biostudies-literature
| S-EPMC8022962 | biostudies-literature
| S-EPMC4757735 | biostudies-literature
| S-EPMC8498594 | biostudies-literature