Unknown

Dataset Information

0

Evidence of questionable research practices in clinical prediction models.


ABSTRACT:

Background

Clinical prediction models are widely used in health and medical research. The area under the receiver operating characteristic curve (AUC) is a frequently used estimate to describe the discriminatory ability of a clinical prediction model. The AUC is often interpreted relative to thresholds, with "good" or "excellent" models defined at 0.7, 0.8 or 0.9. These thresholds may create targets that result in "hacking", where researchers are motivated to re-analyse their data until they achieve a "good" result.

Methods

We extracted AUC values from PubMed abstracts to look for evidence of hacking. We used histograms of the AUC values in bins of size 0.01 and compared the observed distribution to a smooth distribution from a spline.

Results

The distribution of 306,888 AUC values showed clear excesses above the thresholds of 0.7, 0.8 and 0.9 and shortfalls below the thresholds.

Conclusions

The AUCs for some models are over-inflated, which risks exposing patients to sub-optimal clinical decision-making. Greater modelling transparency is needed, including published protocols, and data and code sharing.

SUBMITTER: White N 

PROVIDER: S-EPMC10478406 | biostudies-literature | 2023 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evidence of questionable research practices in clinical prediction models.

White Nicole N   Parsons Rex R   Collins Gary G   Barnett Adrian A  

BMC medicine 20230904 1


<h4>Background</h4>Clinical prediction models are widely used in health and medical research. The area under the receiver operating characteristic curve (AUC) is a frequently used estimate to describe the discriminatory ability of a clinical prediction model. The AUC is often interpreted relative to thresholds, with "good" or "excellent" models defined at 0.7, 0.8 or 0.9. These thresholds may create targets that result in "hacking", where researchers are motivated to re-analyse their data until  ...[more]

Similar Datasets

| S-EPMC5351839 | biostudies-literature
| S-EPMC6047784 | biostudies-literature
| S-EPMC10621923 | biostudies-literature
| S-EPMC7561355 | biostudies-literature
| S-EPMC11472529 | biostudies-literature
| S-EPMC6883712 | biostudies-literature
| S-EPMC6468038 | biostudies-literature
| S-EPMC6016937 | biostudies-literature
| S-EPMC4856278 | biostudies-literature
| S-EPMC11318862 | biostudies-literature