Unknown

Dataset Information

0

TwinEQTL: ultrafast and powerful association analysis for eQTL and GWAS in twin studies.


ABSTRACT: We develop a computationally efficient alternative, TwinEQTL, to a linear mixed-effects model for twin genome-wide association study data. Instead of analyzing all twin samples together with linear mixed-effects model, TwinEQTL first splits twin samples into 2 independent groups on which multiple linear regression analysis can be validly performed separately, followed by an appropriate meta-analysis-like approach to combine the 2 nonindependent test results. Through mathematical derivations, we prove the validity of TwinEQTL algorithm and show that the correlation between 2 dependent test statistics at each single-nucleotide polymorphism is independent of its minor allele frequency. Thus, the correlation is constant across all single-nucleotide polymorphisms. Through simulations, we show empirically that TwinEQTL has well controlled type I error with negligible power loss compared with the gold-standard linear mixed-effects models. To accommodate expression quantitative loci analysis with twin subjects, we further implement TwinEQTL into an R package with much improved computational efficiency. Our approaches provide a significant leap in terms of computing speed for genome-wide association study and expression quantitative loci analysis with twin samples.

SUBMITTER: Xia K 

PROVIDER: S-EPMC9339336 | biostudies-literature | 2022 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

TwinEQTL: ultrafast and powerful association analysis for eQTL and GWAS in twin studies.

Xia Kai K   Shabalin Andrey A AA   Yin Zhaoyu Z   Chung Wonil W   Sullivan Patrick F PF   Wright Fred A FA   Styner Martin M   Gilmore John H JH   Santelli Rebecca C RC   Zou Fei F  

Genetics 20220701 4


We develop a computationally efficient alternative, TwinEQTL, to a linear mixed-effects model for twin genome-wide association study data. Instead of analyzing all twin samples together with linear mixed-effects model, TwinEQTL first splits twin samples into 2 independent groups on which multiple linear regression analysis can be validly performed separately, followed by an appropriate meta-analysis-like approach to combine the 2 nonindependent test results. Through mathematical derivations, we  ...[more]

Similar Datasets

| S-EPMC4469571 | biostudies-literature
| S-EPMC5676241 | biostudies-literature
| S-EPMC6788777 | biostudies-literature
| S-EPMC6428872 | biostudies-literature
| S-EPMC8894384 | biostudies-literature
| S-EPMC9161451 | biostudies-literature
| S-EPMC8488198 | biostudies-literature
| S-EPMC3032061 | biostudies-literature
| S-EPMC9062329 | biostudies-literature
| S-EPMC8939372 | biostudies-literature