Dataset Information

Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography.


ABSTRACT:

Objectives

To evaluate the ability of a commercially available comprehensive chest radiography deep convolutional neural network (DCNN) to detect simple and tension pneumothorax, as stratified by the following subgroups: the presence of an intercostal drain; rib, clavicular, scapular or humeral fractures or rib resections; subcutaneous emphysema; and erect versus non-erect positioning. The hypothesis was that performance in each of these subgroups would not differ significantly from performance on the overall test dataset.

Design

A retrospective case-control study was undertaken.

Setting

Community radiology clinics and hospitals in Australia and the USA.

Participants

A test dataset of 2557 chest radiography studies was ground-truthed by three subspecialty thoracic radiologists for the presence of simple or tension pneumothorax as well as each subgroup other than positioning. Radiograph positioning was derived from radiographer annotations on the images.

Outcome measures

DCNN performance for detecting simple and tension pneumothorax was evaluated over the entire test set, as well as within each subgroup, using the area under the receiver operating characteristic curve (AUC). A difference in AUC of more than 0.05 was considered clinically significant.
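
The subgroup comparison described above can be illustrated with a minimal sketch. It is not the study's analysis code: the function names, the toy labels and scores, and the subgroup masks are all hypothetical, and it compares AUC point estimates only, whereas the study reports confidence intervals and a non-inferiority assessment. It also assumes that the 0.05 threshold applies to a drop in subgroup AUC relative to the overall test set.

```python
from typing import Dict, Sequence, Tuple


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def subgroup_auc_report(
    labels: Sequence[int],
    scores: Sequence[float],
    subgroups: Dict[str, Sequence[bool]],
    margin: float = 0.05,
) -> Tuple[float, Dict[str, Dict[str, object]]]:
    """Compare each subgroup's AUC with the overall AUC against a fixed margin."""
    overall = auc(labels, scores)
    report: Dict[str, Dict[str, object]] = {}
    for name, mask in subgroups.items():
        sub_labels = [y for y, m in zip(labels, mask) if m]
        sub_scores = [s for s, m in zip(scores, mask) if m]
        sub_auc = auc(sub_labels, sub_scores)
        drop = overall - sub_auc  # positive when the subgroup performs worse
        report[name] = {"auc": sub_auc, "drop": drop, "exceeds_margin": drop > margin}
    return overall, report


# Hypothetical toy data: 1 = pneumothorax present, scores = model outputs.
labels = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.5, 0.1, 0.7, 0.2]
subgroups = {
    "intercostal_drain": [True, False, True, True, True, False, False, False],
    "erect": [False, True, False, False, False, True, True, True],
}

overall_auc, report = subgroup_auc_report(labels, scores, subgroups)
print(f"overall AUC = {overall_auc:.4f}")
for name, row in report.items():
    print(name, row)
```

On real data, the point-estimate comparison would be replaced by an interval-based test, since a small subgroup can show a large AUC difference by chance alone.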

Results

When compared with the overall test set, performance of the DCNN for detecting simple and tension pneumothorax was statistically non-inferior in all subgroups. The DCNN had an AUC of 0.981 (0.976-0.986) for detecting simple pneumothorax and 0.997 (0.995-0.999) for detecting tension pneumothorax.

Conclusions

Hidden stratification has significant implications for potential failures of deep learning when applied in clinical practice. This study demonstrated that a comprehensively trained DCNN can be resilient to hidden stratification in several clinically meaningful subgroups in detecting pneumothorax.

SUBMITTER: Seah J 

PROVIDER: S-EPMC8655590 | biostudies-literature | 2021 Dec

REPOSITORIES: biostudies-literature

Publications

Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography.

Seah J, Tang C, Buchlak QD, Milne MR, Holt X, Ahmad H, Lambert J, Esmaili N, Oakden-Rayner L, Brotchie P, Jones CM

BMJ Open, 2021 Dec 7; issue 12


Similar Datasets

S-EPMC6303023 | biostudies-literature
S-EPMC10482646 | biostudies-literature
S-EPMC11582483 | biostudies-literature
S-EPMC10772898 | biostudies-literature
S-EPMC1726321 | biostudies-literature
S-EPMC4057340 | biostudies-literature
S-EPMC10378683 | biostudies-literature
S-EPMC6245672 | biostudies-literature
S-EPMC7534757 | biostudies-literature
S-EPMC7652331 | biostudies-literature