
Dataset Information

PediCXR: An open, large-scale chest radiograph dataset for interpretation of common thoracic diseases in children.


ABSTRACT: Computer-aided diagnosis systems for adult chest radiography (CXR) have recently achieved great success thanks to the availability of large-scale annotated datasets and the advent of high-performance supervised learning algorithms. However, the development of diagnostic models for detecting and diagnosing pediatric diseases in CXR scans has been held back by the lack of high-quality physician-annotated datasets. To overcome this challenge, we introduce and release PediCXR, a new pediatric CXR dataset of 9,125 studies retrospectively collected from a major pediatric hospital in Vietnam between 2020 and 2021. Each scan was manually annotated by a pediatric radiologist with more than ten years of experience. The dataset was labeled for the presence of 36 critical findings and 15 diseases. In particular, each abnormal finding was localized with a rectangular bounding box on the image. To the best of our knowledge, this is the first and largest pediatric CXR dataset containing lesion-level annotations and image-level labels for the detection of multiple findings and diseases. For algorithm development, the dataset was divided into a training set of 7,728 studies and a test set of 1,397 studies. To encourage new advances in pediatric CXR interpretation using data-driven approaches, we provide a detailed description of the PediCXR data sample and make the dataset publicly available at https://physionet.org/content/vindr-pcxr/1.0.0/.
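
As a quick sanity check on the split described in the abstract, the sketch below confirms that the training and test counts add up to the full dataset and reports their proportions. The constant and function names are illustrative only and are not part of the dataset's own tooling.

```python
# Study counts as stated in the abstract.
TOTAL_STUDIES = 9125
TRAIN_STUDIES = 7728
TEST_STUDIES = 1397


def split_fractions(train: int, test: int) -> tuple[float, float]:
    """Return (train_fraction, test_fraction) of the combined count."""
    total = train + test
    if total <= 0:
        raise ValueError("split sizes must sum to a positive count")
    return train / total, test / total


# The two subsets should account for every study in the dataset.
assert TRAIN_STUDIES + TEST_STUDIES == TOTAL_STUDIES

train_frac, test_frac = split_fractions(TRAIN_STUDIES, TEST_STUDIES)
print(f"train: {train_frac:.1%}, test: {test_frac:.1%}")
```

The counts correspond to roughly an 85/15 train/test split.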

SUBMITTER: Pham HH 

PROVIDER: S-EPMC10133237 | biostudies-literature | 2023 Apr

REPOSITORIES: biostudies-literature

Publications

PediCXR: An open, large-scale chest radiograph dataset for interpretation of common thoracic diseases in children.

Pham HH, Nguyen NH, Tran TT, Nguyen TNM, Nguyen HQ

Scientific Data, 2023 Apr 27, issue 1


Similar Datasets

| S-EPMC11413054 | biostudies-literature
| S-EPMC10556963 | biostudies-literature
| S-EPMC9434361 | biostudies-literature
| S-EPMC9033522 | biostudies-literature
| S-EPMC8637309 | biostudies-literature
| S-EPMC8671604 | biostudies-literature
| S-EPMC10317964 | biostudies-literature
| S-EPMC9620940 | biostudies-literature
| S-EPMC8052417 | biostudies-literature
| S-EPMC7966819 | biostudies-literature