Transcriptomics

Dataset Information

0

WBT-DC Pipeline: Whole Blood Transcriptomics data-based Disease Classification


ABSTRACT: Machine learning together with cell/tissue transcriptomics data has been widely used for disease classification. However, obtaining transcriptomics data for human tissues require invasive procedures, making it challenging for widespread application in the clinic. In this study, we developed the WBT-DC (Whole Blood Transcriptomics (WBT) data based Disease Classification). We utilized gene rank-based methods for feature extraction to mitigate issues associated with batch effects and gene noise. We applied the ensemble machine learning model, Random Forest, and performed cross-validation and model tuning. We evaluated our methods on four different diseases including crohn's disease (CD), ulcerative colitis (UC) and amyotrophic lateral sclerosis (ALS) and rheumatoid arthritis (RA) datasets, using data from seven independent cohorts and 2,452 participants, across RNA-Sequencing and microarrays. Our machine learning based WBT-DC pipeline demonstrated a robust performance across various disease datasets and different transcriptomics platforms, establishing itself as a valuable non-invasive tool for future disease classification and prediction.

ORGANISM(S): Homo sapiens

PROVIDER: GSE282218 | GEO | 2026/05/20

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2019-07-18 | GSE134056 | GEO
2019-07-18 | GSE134052 | GEO
2022-09-13 | PXD018996 | Pride
2022-11-12 | GSE211692 | GEO
2024-12-29 | GSE222979 | GEO
2023-11-06 | MSV000093325 | MassIVE
2020-06-05 | GSE142245 | GEO
2025-07-17 | PXD065892 | Pride
2024-07-23 | MODEL2407230001 | BioModels
2024-05-13 | MODEL2405130001 | BioModels