Unknown

Dataset Information

0

Application of clinical text data for phenome-wide association studies (PheWASs).


ABSTRACT: MOTIVATION:Genome-wide association studies (GWASs) are effective for describing genetic complexities of common diseases. Phenome-wide association studies (PheWASs) offer an alternative and complementary approach to GWAS using data embedded in the electronic health record (EHR) to define the phenome. International Classification of Disease version 9 (ICD9) codes are used frequently to define the phenome, but using ICD9 codes alone misses other clinically relevant information from the EHR that can be used for PheWAS analyses and discovery. RESULTS:As an alternative to ICD9 coding, a text-based phenome was defined by 23?384 clinically relevant terms extracted from Marshfield Clinic's EHR. Five single nucleotide polymorphisms (SNPs) with known phenotypic associations were genotyped in 4235 individuals and associated across the text-based phenome. All five SNPs genotyped were associated with expected terms (P<0.02), most at or near the top of their respective PheWAS ranking. Raw association results indicate that text data performed equivalently to ICD9 coding and demonstrate the utility of information beyond ICD9 coding for application in PheWAS.

SUBMITTER: Hebbring SJ 

PROVIDER: S-EPMC4481696 | BioStudies | 2015-01-01

REPOSITORIES: biostudies

Similar Datasets

2018-01-01 | S-EPMC5985339 | BioStudies
2013-01-01 | S-EPMC3637423 | BioStudies
2015-01-01 | S-EPMC4666492 | BioStudies
2016-01-01 | S-EPMC5480096 | BioStudies
2010-01-01 | S-EPMC2859132 | BioStudies
2019-01-01 | S-EPMC6323551 | BioStudies
2017-01-01 | S-EPMC5501393 | BioStudies
2015-01-01 | S-EPMC4765620 | BioStudies
2019-01-01 | S-EPMC6564422 | BioStudies
2018-01-01 | S-EPMC6109620 | BioStudies