Unknown

Dataset Information

0

A deep learning model to classify neoplastic state and tissue origin from transcriptomic data.


ABSTRACT: Application of deep learning methods to transcriptomic data has the potential to enhance the accuracy and efficiency of tissue classification and cell state identification. Herein, we developed a multitask deep learning model for tissue classification combining publicly available whole transcriptomic (RNA-seq) datasets of non-neoplastic, neoplastic and peri-neoplastic tissue to classify disease state, tissue origin and neoplastic subclass. RNA-seq data from a total of 10,116 patient samples processed through a common pipeline were used for model training and validation. The model achieved 99% accuracy for disease state classification (ROC-AUC of 0.98) and 97% accuracy for tissue origin (ROC-AUC of 0.99). Moreover, the model achieved an accuracy of 92% (ROC-AUC 0.95) for neoplastic subclassification. This is the first multitask deep learning algorithm developed for tissue classification employing a uniform pipeline analysis of transcriptomic data with multiple tissue classifiers. This model serves as a framework for incorporating large transcriptomic datasets across conditions to facilitate clinical diagnosis and cell-based treatment strategies.

SUBMITTER: Hong J 

PROVIDER: S-EPMC9188604 | biostudies-literature | 2022 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

A deep learning model to classify neoplastic state and tissue origin from transcriptomic data.

Hong James J   Hachem Laureen D LD   Fehlings Michael G MG  

Scientific reports 20220611 1


Application of deep learning methods to transcriptomic data has the potential to enhance the accuracy and efficiency of tissue classification and cell state identification. Herein, we developed a multitask deep learning model for tissue classification combining publicly available whole transcriptomic (RNA-seq) datasets of non-neoplastic, neoplastic and peri-neoplastic tissue to classify disease state, tissue origin and neoplastic subclass. RNA-seq data from a total of 10,116 patient samples proc  ...[more]

Similar Datasets

| S-EPMC6929667 | biostudies-literature
| S-EPMC10808169 | biostudies-literature
| S-EPMC8909043 | biostudies-literature
| S-EPMC9281153 | biostudies-literature
| S-EPMC8345047 | biostudies-literature
| S-EPMC7783755 | biostudies-literature
| S-EPMC11261876 | biostudies-literature
| S-EPMC6544615 | biostudies-literature
| S-EPMC10448299 | biostudies-literature
| S-EPMC7924492 | biostudies-literature