Dataset Information


An evolutionary framework for association testing in resequencing studies.

ABSTRACT: Sequencing technologies are becoming cheap enough to apply to large numbers of study participants and promise to provide new insights into human phenotypes by bringing to light rare and previously unknown genetic variants. We develop a new framework for the analysis of sequence data that incorporates all of the major features of previously proposed approaches, including those focused on allele counts and allele burden, but is both more general and more powerful. We harness population genetic theory to provide prior information on effect sizes and to create a pooling strategy for information from rare variants. Our method, EMMPAT (Evolutionary Mixed Model for Pooled Association Testing), generates a single test per gene (substantially reducing multiple testing concerns), facilitates graphical summaries, and improves the interpretation of results by allowing calculation of attributable variance. Simulations show that, relative to previously used approaches, our method increases the power to detect genes that affect phenotype when natural selection has kept alleles with large effect sizes rare. We demonstrate our approach on a population-based re-sequencing study of association between serum triglycerides and variation in ANGPTL4.


PROVIDER: S-EPMC2978703 | BioStudies | 2010-01-01

REPOSITORIES: biostudies

Similar Datasets

2007-01-01 | S-EPMC2762948 | BioStudies
2010-01-01 | S-EPMC3032073 | BioStudies
2016-01-01 | S-EPMC4850838 | BioStudies
1000-01-01 | S-EPMC3276137 | BioStudies
2012-01-01 | S-EPMC3276672 | BioStudies
2019-01-01 | S-EPMC6377669 | BioStudies
2010-01-01 | S-EPMC3055324 | BioStudies
2019-01-01 | S-EPMC6635791 | BioStudies
1000-01-01 | S-EPMC5477370 | BioStudies
1000-01-01 | S-EPMC3375640 | BioStudies