Dataset Information

Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis

ABSTRACT:

SUBMITTER: Liu G

PROVIDER: S-EPMC11016722 | biostudies-literature | 2024 Apr

REPOSITORIES: biostudies-literature


Similar Datasets

Project description: Background: Microglia play complex and crucial roles in multiple sclerosis (MS). This study aimed to explore the biological significance of microglia-associated genes in experimental autoimmune encephalomyelitis (EAE). Methods: Differentially expressed genes (DEGs) were screened with six machine learning (ML) methods, which were also used to validate the microglia-associated DEGs in three public databases. ceRNA and protein–protein interaction (PPI) network analyses were used to identify the interactions of the 6 novel biomarkers with other molecules. Then, CIBERSORT and single-sample gene set enrichment analysis (ssGSEA) were employed to quantify the relative abundance of infiltrating immune cells. qRT-PCR was performed to test the expression of key DEGs in murine models. Results: A total of 247 DEmRNAs, 499 DElncRNAs and 269 DEcircRNAs were identified. Using a screening strategy based on five ML algorithms, 6 DEmRNAs were obtained: NGP, HIST1H2BJ, PBLD1, MBLN3, CD180 and F10. The 6 DEmRNAs were then used as a multigene signature to construct models to differentiate EAE from normal microglia, and the AUC value for each model was greater than 0.8. The diagnostic value of these 6 DEmRNAs was identified and further verified by qRT-PCR. Differential expression was then confirmed for five of these 6 DEmRNAs, namely NGP, HIST1H2BJ, PBLD1, MBLN3 and F10. Using PPI analysis, DEmRNAs frequently interacting with transcription factors (TFs), potential drugs and RBPs were identified. Immune cell infiltration analyses showed that EAE microglia presented high levels of immune infiltration, especially of natural killer (NK) cells and CD8+ T cells. We also report that a circRNA (circRNA_00638) was predicted to bind to 76 RBPs.
Conclusions: We identified and validated 6 novel microglia-related genes and developed a multigene signature with ML methods to confirm their ability to accurately diagnose and characterize biological alterations in EAE microglia. The six key DEmRNAs might also be latent targets for immunoregulatory therapy.

Project description: Background: Early diagnosis of liver metastasis is of great importance for improving the survival of colorectal adenocarcinoma (CAD) patients, and combining single biomarkers in a classifier model has greatly improved the prediction of metastasis in several types of cancer. However, this has rarely been reported for CAD. This study therefore aimed to screen for an optimal classifier model of CAD with liver metastasis and to explore the metastatic mechanisms of the genes used by this classifier model. Methods: The differentially expressed genes between primary CAD samples and CAD with metastasis samples were screened from the Moffitt Cancer Center (MCC) dataset GSE131418. The classification performance of six selected algorithms, namely LR, RF, SVM, GBDT, NN and CatBoost, on CAD with liver metastasis samples was compared on the MCC dataset GSE131418 by test accuracy. In addition, the consortium datasets of GSE131418 and GSE81558 were used as internal and external validation sets to select the optimal method. Subsequently, functional analyses and drug‐target network construction were conducted for the feature genes of the optimal method. Results: The optimal model was CatBoost, which had the highest accuracy (99%) and an area under the curve of 1, and consisted of 33 feature genes. A functional analysis showed that the feature genes were closely associated with "steroid metabolic process" and "lipoprotein particle receptor binding" (e.g. APOB and APOC3). In addition, the feature genes were significantly enriched in the "complement and coagulation cascade" pathways (e.g. FGA, F2 and F9). In a drug‐target interaction network, F2 and F9 were predicted as targets of menadione. Conclusion: The CatBoost model constructed using 33 feature genes showed the optimal classification performance for identifying CAD with liver metastasis.
APOB, APOC3, FGA, F2, F9, and NKX2‐3 were potential biomarkers for classification of CAD with liver metastasis. Menadione might be a promising anti‐metastatic drug for CAD cells by acting on F2 and F9.

