Unknown

Dataset Information

0

Identification of putative causal loci in whole-genome sequencing data via knockoff statistics.


ABSTRACT: The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a recently developed knockoff framework. It can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help distinguish the signal due to rare variants from shadow effects of significant common variants nearby; (3) integrate multiple knockoffs for improved power, stability, and reproducibility; and (4) flexibly incorporate state-of-the-art and future association tests to achieve the benefits proposed here. In applications to whole-genome sequencing data from the Alzheimer's Disease Sequencing Project (ADSP) and COPDGene samples from NHLBI Trans-Omics for Precision Medicine (TOPMed) Program we show that our method compared with conventional association tests can lead to substantially more discoveries.

SUBMITTER: He Z 

PROVIDER: S-EPMC8149672 | biostudies-literature | 2021 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identification of putative causal loci in whole-genome sequencing data via knockoff statistics.

He Zihuai Z   Liu Linxi L   Wang Chen C   Le Guen Yann Y   Lee Justin J   Gogarten Stephanie S   Lu Fred F   Montgomery Stephen S   Tang Hua H   Silverman Edwin K EK   Cho Michael H MH   Greicius Michael M   Ionita-Laza Iuliana I  

Nature communications 20210525 1


The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a recently developed knockoff framework. It can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help distinguish the signal due to r  ...[more]

Similar Datasets

| S-EPMC9606389 | biostudies-literature
| S-EPMC7590468 | biostudies-literature
| S-EPMC5762310 | biostudies-literature
| S-EPMC8947317 | biostudies-literature
| S-EPMC11361808 | biostudies-literature
| S-EPMC11312651 | biostudies-literature
| S-EPMC5359410 | biostudies-literature
| S-EPMC10104122 | biostudies-literature
| S-EPMC9926792 | biostudies-literature
| S-EPMC10248669 | biostudies-literature