Unknown

Dataset Information

0

Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study.


ABSTRACT:

Background

A large number of web-based COVID-19 symptom checkers and chatbots have been developed; however, anecdotal evidence suggests that their conclusions are highly variable. To our knowledge, no study has evaluated the accuracy of COVID-19 symptom checkers in a statistically rigorous manner.

Objective

The aim of this study is to evaluate and compare the diagnostic accuracies of web-based COVID-19 symptom checkers.

Methods

We identified 10 web-based COVID-19 symptom checkers, all of which were included in the study. We evaluated the COVID-19 symptom checkers by assessing 50 COVID-19 case reports alongside 410 non-COVID-19 control cases. A bootstrapping method was used to counter the unbalanced sample sizes and obtain confidence intervals (CIs). Results are reported as sensitivity, specificity, F1 score, and Matthews correlation coefficient (MCC).

Results

The classification task between COVID-19-positive and COVID-19-negative for "high risk" cases among the 460 test cases yielded (sorted by F1 score): Symptoma (F1=0.92, MCC=0.85), Infermedica (F1=0.80, MCC=0.61), US Centers for Disease Control and Prevention (CDC) (F1=0.71, MCC=0.30), Babylon (F1=0.70, MCC=0.29), Cleveland Clinic (F1=0.40, MCC=0.07), Providence (F1=0.40, MCC=0.05), Apple (F1=0.29, MCC=-0.10), Docyet (F1=0.27, MCC=0.29), Ada (F1=0.24, MCC=0.27) and Your.MD (F1=0.24, MCC=0.27). For "high risk" and "medium risk" combined the performance was: Symptoma (F1=0.91, MCC=0.83) Infermedica (F1=0.80, MCC=0.61), Cleveland Clinic (F1=0.76, MCC=0.47), Providence (F1=0.75, MCC=0.45), Your.MD (F1=0.72, MCC=0.33), CDC (F1=0.71, MCC=0.30), Babylon (F1=0.70, MCC=0.29), Apple (F1=0.70, MCC=0.25), Ada (F1=0.42, MCC=0.03), and Docyet (F1=0.27, MCC=0.29).

Conclusions

We found that the number of correctly assessed COVID-19 and control cases varies considerably between symptom checkers, with different symptom checkers showing different strengths with respect to sensitivity and specificity. A good balance between sensitivity and specificity was only achieved by two symptom checkers.

SUBMITTER: Munsch N 

PROVIDER: S-EPMC7541039 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study.

Munsch Nicolas N   Martin Alistair A   Gruarin Stefanie S   Nateqi Jama J   Abdarahmane Isselmou I   Weingartner-Ortner Rafael R   Knapp Bernhard B  

Journal of medical Internet research 20201006 10


<h4>Background</h4>A large number of web-based COVID-19 symptom checkers and chatbots have been developed; however, anecdotal evidence suggests that their conclusions are highly variable. To our knowledge, no study has evaluated the accuracy of COVID-19 symptom checkers in a statistically rigorous manner.<h4>Objective</h4>The aim of this study is to evaluate and compare the diagnostic accuracies of web-based COVID-19 symptom checkers.<h4>Methods</h4>We identified 10 web-based COVID-19 symptom ch  ...[more]

Similar Datasets

| S-EPMC10276326 | biostudies-literature
| S-EPMC11483353 | biostudies-literature
| S-EPMC11091811 | biostudies-literature
| S-EPMC9121216 | biostudies-literature
| S-EPMC9548469 | biostudies-literature
| S-EPMC10582809 | biostudies-literature
| S-EPMC6330198 | biostudies-literature
| S-EPMC8282353 | biostudies-literature
| S-EPMC9595333 | biostudies-literature
| S-EPMC11303907 | biostudies-literature