Unknown

Dataset Information

0

Cox regression is robust to inaccurate EHR-extracted event time: an application to EHR-based GWAS.


ABSTRACT:

Motivation

Logistic regression models are used in genomic studies to analyze the genetic data linked to electronic health records (EHRs), and do not take full usage of the time-to-event information available in EHRs. Previous work has shown that Cox regression, which can account for left truncation and right censoring in EHRs, increased the power to detect genotype-phenotype associations compared to logistic regression. We extend this to evaluate the relative performance of Cox regression and various logistic regression models in the presence of positive errors in event time (delayed event time), relating to recorded event time accuracy.

Results

One Cox model and three logistic regression models were considered under different scenarios of delayed event time. Extensive simulations and a genomic study application were used to evaluate the impact of delayed event time. While logistic regression does not model the time-to-event directly, various logistic regression models used in the literature were more sensitive to delayed event time than Cox regression. Results highlighted the importance to identify and exclude the patients diagnosed before entry time. Cox regression had similar or modest improvement in statistical power over various logistic regression models at controlled type I error. This was supported by the empirical data, where the Cox models steadily had the highest sensitivity to detect known genotype-phenotype associations under all scenarios of delayed event time.

Availability and implementation

Access to individual-level EHR and genotype data is restricted by the IRB. Simulation code and R script for data process are at: https://github.com/QingxiaCindyChen/CoxRobustEHR.git.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Irlmeier R 

PROVIDER: S-EPMC10060718 | biostudies-literature | 2022 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Cox regression is robust to inaccurate EHR-extracted event time: an application to EHR-based GWAS.

Irlmeier Rebecca R   Hughey Jacob J JJ   Bastarache Lisa L   Denny Joshua C JC   Chen Qingxia Q  

Bioinformatics (Oxford, England) 20220401 8


<h4>Motivation</h4>Logistic regression models are used in genomic studies to analyze the genetic data linked to electronic health records (EHRs), and do not take full usage of the time-to-event information available in EHRs. Previous work has shown that Cox regression, which can account for left truncation and right censoring in EHRs, increased the power to detect genotype-phenotype associations compared to logistic regression. We extend this to evaluate the relative performance of Cox regressio  ...[more]

Similar Datasets

| S-EPMC10492844 | biostudies-literature
| S-EPMC10524851 | biostudies-literature
| S-EPMC7145010 | biostudies-literature
| S-EPMC5793916 | biostudies-literature
| S-EPMC5048533 | biostudies-literature
| S-EPMC11080814 | biostudies-literature
| S-EPMC6451633 | biostudies-literature
| S-EPMC7039372 | biostudies-literature
| S-EPMC11661057 | biostudies-literature
| S-EPMC3294270 | biostudies-literature