Unknown

Dataset Information

0

Deploying deep learning models on unseen medical imaging using adversarial domain adaptation.


ABSTRACT: The fundamental challenge in machine learning is ensuring that trained models generalize well to unseen data. We developed a general technique for ameliorating the effect of dataset shift using generative adversarial networks (GANs) on a dataset of 149,298 handwritten digits and dataset of 868,549 chest radiographs obtained from four academic medical centers. Efficacy was assessed by comparing area under the curve (AUC) pre- and post-adaptation. On the digit recognition task, the baseline CNN achieved an average internal test AUC of 99.87% (95% CI, 99.87-99.87%), which decreased to an average external test AUC of 91.85% (95% CI, 91.82-91.88%), with an average salvage of 35% from baseline upon adaptation. On the lung pathology classification task, the baseline CNN achieved an average internal test AUC of 78.07% (95% CI, 77.97-78.17%) and an average external test AUC of 71.43% (95% CI, 71.32-71.60%), with a salvage of 25% from baseline upon adaptation. Adversarial domain adaptation leads to improved model performance on radiographic data derived from multiple out-of-sample healthcare populations. This work can be applied to other medical imaging domains to help shape the deployment toolkit of machine learning in medicine.

SUBMITTER: Valliani AA 

PROVIDER: S-EPMC9565422 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

Deploying deep learning models on unseen medical imaging using adversarial domain adaptation.

Valliani Aly A AA   Gulamali Faris F FF   Kwon Young Joon YJ   Martini Michael L ML   Wang Chiatse C   Kondziolka Douglas D   Chen Viola J VJ   Wang Weichung W   Costa Anthony B AB   Oermann Eric K EK  

PloS one 20221014 10


The fundamental challenge in machine learning is ensuring that trained models generalize well to unseen data. We developed a general technique for ameliorating the effect of dataset shift using generative adversarial networks (GANs) on a dataset of 149,298 handwritten digits and dataset of 868,549 chest radiographs obtained from four academic medical centers. Efficacy was assessed by comparing area under the curve (AUC) pre- and post-adaptation. On the digit recognition task, the baseline CNN ac  ...[more]

Similar Datasets

| S-EPMC10000732 | biostudies-literature
| S-EPMC10035842 | biostudies-literature
| S-EPMC7393676 | biostudies-literature
| S-EPMC7613452 | biostudies-literature
| S-EPMC5869052 | biostudies-other
| S-EPMC10805953 | biostudies-literature
| S-EPMC9477526 | biostudies-literature
| S-EPMC10132130 | biostudies-literature
| S-EPMC9798510 | biostudies-literature
| S-EPMC10436147 | biostudies-literature