Dataset Information

Personalized survival probabilities for SARS-CoV-2 positive patients by explainable machine learning.


ABSTRACT: Interpretable risk assessment of SARS-CoV-2 positive patients can help clinicians implement precision medicine. Here we trained a machine learning model to predict mortality within 12 weeks of a first positive SARS-CoV-2 test. By leveraging data on 33,938 confirmed SARS-CoV-2 cases in eastern Denmark, we considered 2,723 variables extracted from electronic health records (EHR) including demographics, diagnoses, medications, laboratory test results and vital parameters. A discrete-time framework for survival modelling enabled us to predict personalized survival curves and explain individual risk factors. Performance on the test set was measured with a weighted concordance index of 0.95 and an area under the curve for precision-recall of 0.71. Age, sex, number of medications, previous hospitalizations and lymphocyte counts were identified as top mortality risk factors. Our explainable survival model developed on EHR data also revealed temporal dynamics of the 22 selected risk factors. Upon further validation, this model may allow direct reporting of personalized survival probabilities in routine care.
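
As a rough illustration of the discrete-time framing mentioned in the abstract, the sketch below turns per-interval hazards into a survival curve using the standard identity S(t_k) = (1 - h_1)(1 - h_2)...(1 - h_k). It is not the authors' implementation: the function name `survival_curve`, the weekly interval width, and the hazard values are all assumptions made for this example, and in practice the hazards would come from a trained model.

```python
from itertools import accumulate
from operator import mul


def survival_curve(hazards):
    """Convert per-interval hazards h_k into S(t_k) = prod_{j<=k} (1 - h_j)."""
    if any(not 0.0 <= h <= 1.0 for h in hazards):
        raise ValueError("each hazard must lie in [0, 1]")
    # Running product of the per-interval survival probabilities (1 - h_k).
    return list(accumulate((1.0 - h for h in hazards), mul))


# Hypothetical weekly hazards for one patient over 12 weeks (illustrative values only).
weekly_hazards = [0.02, 0.03, 0.02, 0.01, 0.01, 0.01,
                  0.005, 0.005, 0.005, 0.005, 0.005, 0.005]

curve = survival_curve(weekly_hazards)
# Predicted probability of death within the 12-week horizon.
mortality_12w = 1.0 - curve[-1]
```

Because each factor (1 - h_k) lies between 0 and 1, the resulting curve can never increase, which is what makes the per-interval hazards a convenient target for a classifier-style model.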

SUBMITTER: Zucco AG 

PROVIDER: S-EPMC9380679 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

Publications

Personalized survival probabilities for SARS-CoV-2 positive patients by explainable machine learning.

Zucco AG, Agius R, Svanberg R, Moestrup KS, Marandi RZ, MacPherson CR, Lundgren J, Ostrowski SR, Niemann CU

Scientific Reports, 16 Aug 2022, issue 1


Similar Datasets

S-EPMC10410015 | biostudies-literature
S-EPMC9708009 | biostudies-literature
S-EPMC11817524 | biostudies-literature
S-EPMC10394879 | biostudies-literature
S-EPMC10672177 | biostudies-literature
S-EPMC11338658 | biostudies-literature
S-EPMC8192966 | biostudies-literature
S-EPMC8294595 | biostudies-literature
S-EPMC11534633 | biostudies-literature
S-EPMC8574161 | biostudies-literature