Unknown

Dataset Information

0

Incorporating family disease history and controlling case-control imbalance for population-based genetic association studies.


ABSTRACT:

Motivation

In the genome-wide association analysis of population-based biobanks, most diseases have low prevalence, which results in low detection power. One approach to tackle the problem is using family disease history, yet existing methods are unable to address type I error inflation induced by increased correlation of phenotypes among closely related samples, as well as unbalanced phenotypic distribution.

Results

We propose a new method for genetic association test with family disease history, mixed-model-based Test with Adjusted Phenotype and Empirical saddlepoint approximation, which controls for increased phenotype correlation by adopting a two-variance-component mixed model, accounts for case-control imbalance by using empirical saddlepoint approximation, and is flexible to incorporate any existing adjusted phenotypes, such as phenotypes from the LT-FH method. We show through simulation studies and analysis of UK Biobank data of white British samples and the Korean Genome and Epidemiology Study of Korean samples that the proposed method is robust and yields better calibration compared to existing methods while gaining power for detection of variant-phenotype associations.

Availability and implementation

The summary statistics and code generated in this study are available at https://github.com/styvon/TAPE.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Zhuang Y 

PROVIDER: S-EPMC9477535 | biostudies-literature | 2022 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Incorporating family disease history and controlling case-control imbalance for population-based genetic association studies.

Zhuang Yongwen Y   Wolford Brooke N BN   Nam Kisung K   Bi Wenjian W   Zhou Wei W   Willer Cristen J CJ   Mukherjee Bhramar B   Lee Seunggeun S  

Bioinformatics (Oxford, England) 20220901 18


<h4>Motivation</h4>In the genome-wide association analysis of population-based biobanks, most diseases have low prevalence, which results in low detection power. One approach to tackle the problem is using family disease history, yet existing methods are unable to address type I error inflation induced by increased correlation of phenotypes among closely related samples, as well as unbalanced phenotypic distribution.<h4>Results</h4>We propose a new method for genetic association test with family  ...[more]

Similar Datasets

| S-EPMC6314034 | biostudies-literature
| S-EPMC6119127 | biostudies-literature
| S-EPMC10923938 | biostudies-literature
| S-EPMC6805595 | biostudies-literature
| S-EPMC3935236 | biostudies-literature
| S-EPMC1866832 | biostudies-literature
| S-EPMC4296133 | biostudies-literature
| S-EPMC4899219 | biostudies-literature
| S-EPMC1829402 | biostudies-literature
| S-EPMC8620394 | biostudies-literature