Unknown

Dataset Information

0

Homogeneous ensemble models for predicting infection levels and mortality of COVID-19 patients: Evidence from China.


ABSTRACT:

Background

Persistence of long-term COVID-19 pandemic is putting high pressure on healthcare services worldwide for several years. This article aims to establish models to predict infection levels and mortality of COVID-19 patients in China.

Methods

Machine learning models and deep learning models have been built based on the clinical features of COVID-19 patients. The best models are selected by area under the receiver operating characteristic curve (AUC) scores to construct two homogeneous ensemble models for predicting infection levels and mortality, respectively. The first-hand clinical data of 760 patients are collected from Zhongnan Hospital of Wuhan University between 3 January and 8 March 2020. We preprocess data with cleaning, imputation, and normalization.

Results

Our models obtain AUC = 0.7059 and Recall (Weighted avg) = 0.7248 in predicting infection level, while AUC=0.8436 and Recall (Weighted avg) = 0.8486 in predicting mortality ratio. This study also identifies two sets of essential clinical features. One is C-reactive protein (CRP) or high sensitivity C-reactive protein (hs-CRP) and the other is chest tightness, age, and pleural effusion.

Conclusions

Two homogeneous ensemble models are proposed to predict infection levels and mortality of COVID-19 patients in China. New findings of clinical features for benefiting the machine learning models are reported. The evaluation of an actual dataset collected from January 3 to March 8, 2020 demonstrates the effectiveness of the models by comparing them with state-of-the-art models in prediction.

SUBMITTER: Wang J 

PROVIDER: S-EPMC9630904 | biostudies-literature | 2022 Jan-Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Homogeneous ensemble models for predicting infection levels and mortality of COVID-19 patients: Evidence from China.

Wang Jiafeng J   Zhou Xianlong X   Hou Zhitian Z   Xu Xiaoya X   Zhao Yueyue Y   Chen Shanshan S   Zhang Jun J   Shao Lina L   Yan Rong R   Wang Mingshan M   Ge Minghua M   Hao Tianyong T   Tu Yuexing Y   Huang Haijun H  

Digital health 20220101


<h4>Background</h4>Persistence of long-term COVID-19 pandemic is putting high pressure on healthcare services worldwide for several years. This article aims to establish models to predict infection levels and mortality of COVID-19 patients in China.<h4>Methods</h4>Machine learning models and deep learning models have been built based on the clinical features of COVID-19 patients. The best models are selected by area under the receiver operating characteristic curve (AUC) scores to construct two  ...[more]

Similar Datasets

| S-EPMC9795949 | biostudies-literature
| S-EPMC6823902 | biostudies-literature
| S-EPMC10866533 | biostudies-literature
| S-EPMC8350610 | biostudies-literature
| S-EPMC7963518 | biostudies-literature
| S-EPMC3147114 | biostudies-literature
| S-EPMC8282037 | biostudies-literature
| S-EPMC9685565 | biostudies-literature
| S-EPMC9975879 | biostudies-literature
| S-EPMC8927593 | biostudies-literature