Unknown

Dataset Information

0

Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates.


ABSTRACT: While the promise of electronic medical record and biobank data is large, major questions remain about patient privacy, computational hurdles, and data access. One promising area of recent development is pre-computing non-individually identifiable summary statistics to be made publicly available for exploration and downstream analysis. In this manuscript we demonstrate how to utilize pre-computed linear association statistics between individual genetic variants and phenotypes to infer genetic relationships between products of phenotypes (e.g., ratios; logical combinations of binary phenotypes using "and" and "or") with customized covariate choices. We propose a method to approximate covariate adjusted linear models for products and logical combinations of phenotypes using only pre-computed summary statistics. We evaluate our method's accuracy through several simulation studies and an application modeling ratios of fatty acids using data from the Framingham Heart Study. These studies show consistent ability to recapitulate analysis results performed on individual level data including maintenance of the Type I error rate, power, and effect size estimates. An implementation of this proposed method is available in the publicly available R package pcsstools.

SUBMITTER: Wolf JM 

PROVIDER: S-EPMC8546319 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates.

Wolf Jack M JM   Westra Jason J   Tintle Nathan N  

Frontiers in genetics 20211012


While the promise of electronic medical record and biobank data is large, major questions remain about patient privacy, computational hurdles, and data access. One promising area of recent development is pre-computing non-individually identifiable summary statistics to be made publicly available for exploration and downstream analysis. In this manuscript we demonstrate how to utilize pre-computed linear association statistics between individual genetic variants and phenotypes to infer genetic re  ...[more]

Similar Datasets

| S-EPMC4514926 | biostudies-literature
| S-EPMC6239891 | biostudies-literature
| S-EPMC10690198 | biostudies-literature
| S-EPMC4715495 | biostudies-literature
| S-EPMC10928990 | biostudies-literature
| S-EPMC6417828 | biostudies-literature
| S-EPMC9197066 | biostudies-literature
| S-EPMC6343668 | biostudies-literature
| S-EPMC10409646 | biostudies-literature
| S-EPMC4143328 | biostudies-literature