Unknown

Dataset Information

0

Evaluating Ovarian Cancer Chemotherapy Response Using Gene Expression Data and Machine Learning.


ABSTRACT:

Background

Ovarian cancer (OC) is the most lethal gynecological cancer in the United States. Among the different types of OC, serous ovarian cancer (SOC) stands out as the most prevalent. Transcriptomics techniques generate extensive gene expression data, yet only a few of these genes are relevant to clinical diagnosis.

Methods

Methods for feature selection (FS) address the challenges of high dimensionality in extensive datasets. This study proposes a computational framework that applies FS techniques to identify genes highly associated with platinum-based chemotherapy response on SOC patients. Using SOC datasets from the Gene Expression Omnibus (GEO) database, LASSO and varSelRF FS methods were employed. Machine learning classification algorithms such as random forest (RF) and support vector machine (SVM) were also used to evaluate the performance of the models.

Results

The proposed framework has identified biomarkers panels with 9 and 10 genes that are highly correlated with platinum-paclitaxel and platinum-only response in SOC patients, respectively. The predictive models have been trained using the identified gene signatures and accuracy of above 90% was achieved.

Conclusions

In this study, we propose that applying multiple feature selection methods not only effectively reduces the number of identified biomarkers, enhancing their biological relevance, but also corroborates the efficacy of drug response prediction models in cancer treatment.

SUBMITTER: Amniouel S 

PROVIDER: S-EPMC11326537 | biostudies-literature | 2024 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evaluating Ovarian Cancer Chemotherapy Response Using Gene Expression Data and Machine Learning.

Amniouel Soukaina S   Yalamanchili Keertana K   Sankararaman Sreenidhi S   Jafri Mohsin Saleet MS  

BioMedInformatics 20240522 2


<h4>Background</h4>Ovarian cancer (OC) is the most lethal gynecological cancer in the United States. Among the different types of OC, serous ovarian cancer (SOC) stands out as the most prevalent. Transcriptomics techniques generate extensive gene expression data, yet only a few of these genes are relevant to clinical diagnosis.<h4>Methods</h4>Methods for feature selection (FS) address the challenges of high dimensionality in extensive datasets. This study proposes a computational framework that  ...[more]

Similar Datasets

| S-EPMC10830836 | biostudies-literature
| S-EPMC9230120 | biostudies-literature
| S-EPMC9121583 | biostudies-literature
| S-EPMC9071249 | biostudies-literature
| S-EPMC8128240 | biostudies-literature
| S-EPMC10233311 | biostudies-literature
| S-EPMC3720898 | biostudies-literature
| S-EPMC2973809 | biostudies-literature
| S-EPMC9879537 | biostudies-literature
| S-EPMC10919320 | biostudies-literature