Browse
Submit Data
Databases
API
Help

Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

114 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Prediction of Breast Cancer Estrogen Receptor Status using Machine Learning

ABSTRACT: Gene expression profiles were generated from 199 primary breast cancer patients. Samples 1-176 were used in another study, GEO Series GSE22820, and form the training data set in this study. Sample numbers 200-222 form a validation set. This data is used to model a machine learning classifier for Estrogen Receptor Status. RNA was isolated from 199 primary breast cancer patients. A machine learning classifier was built to predict ER status using only three gene features.

ORGANISM(S): Homo sapiens

SUBMITTER: Kathryn Graham

PROVIDER: E-GEOD-29210 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

ACCESS DATA

Json Xml

Similar Datasets

Prediction of Breast Cancer Estrogen Receptor Status using Machine Learning

Project description:Gene expression profiles were generated from 199 primary breast cancer patients. Samples 1-176 were used in another study, GEO Series GSE22820, and form the training data set in this study. Sample numbers 200-222 form a validation set. This data is used to model a machine learning classifier for Estrogen Receptor Status.

2013-01-01 | GSE29210 | GEO

Accurate Identification of Sites of Origin in Carcinoma of Unknown Primary (CUP) Tumors Using DNA Methylation II

Project description:53 tumors used in the clinical training and validation of a CUP machine learning classifier using methylation profiling

2025-01-15 | GSE249711 | GEO

Accurate Identification of Sites of Origin in Carcinoma of Unknown Primary (CUP) Tumors Using DNA Methylation I

Project description:53 tumors used in the clinical training and validation of a CUP machine learning classifier using methylation profiling

2025-01-15 | GSE249688 | GEO

A Machine Learning Classifier for Assigning Individual Patients with Systemic Sclerosis to Intrinsic Molecular Subsets

Project description:A Machine Learning Classifier for Assigning Individual Patients with Systemic Sclerosis to Intrinsic Molecular Subsets

| PRJNA515920 | ENA

DNA-Methylome based Tumor Hypoxia Classifier Identifies HPV-negative Head and Neck Cancer Patients at Risk for Locoregional Recurrence After Primary Radiochemotherapy

Project description:We used machine-learning algorithms to identify a hypoxia-associated methylation signature in patients with HPV negative HNSCC in the TCGA-HNSCC cohort. This current submission forms the basis of the independent validation cohort used to test the Hypoxia-M classifier in our study.

2023-04-05 | E-MTAB-12431 | biostudies-arrayexpress

Machine Learning Classifiers for Endometriosis Using Transcriptomics and Methylomics Data [Transcriptomics]

Project description:We experimented how well various supervised machine learning methods such as decision tree, partial least squares discriminant analysis (PLSDA), support vector machine and random forest perform in classifying endometriosis from the control samples trained on both transcriptomics and methylomics data. The assessment was done from two different perspectives for improving classification performances: (a) implication of three different normalization techniques, and (b) implication of differential analysis using the generalized linear model (GLM). We concluded that an appropriate machine learning diagnostic pipeline for endometriosis should use TMM normalization for transcriptomics data, and quantile or voom normalization for methylomics data, GLM for feature space reduction and classification performance maximization.

2019-07-18 | GSE134056 | GEO

Machine Learning Classifiers for Endometriosis Using Transcriptomics and Methylomics Data [Methylomics]

2019-07-18 | GSE134052 | GEO

Machine learning-guided evolution of pyrrolysyl-tRNA synthetase for improved incorporation efficiency of diverse noncanonical amino acids

Project description:The pyrrolysyl-tRNA synthetase (PylRS) is widely used to incorporate noncanonical amino acids (ncAAs) into proteins. However, most of ncAA-containing protein yields remain low due to the limited activity of PylRS variants. Here, we apply machine learning (ML) to engineer the tRNA-binding domain of PylRS. The FFT-PLSR model is first applied to explore pairwise combinations of 12 single mutations, generating a variant Com1-IFRS with an 11-fold increase in stop codon suppression efficiency. Deep learning models ESM-1v, Mutcompute, and ProRefiner then identify new mutation sites. Applying FFT-PLSR on these sites yields a variant Com2-IFRS showing a 30.8-fold increase in stop codon suppression efficiency. Transplanting these mutations into 7 other PylRS-derived synthetases improved ncAA-containing protein yield by up to 1149.7-fold. Molecular dynamics simulations are used to explore the molecular change caused by the mutations. This paper presents improved PylRS variants and a machine learning framework for optimizing the enzyme activity.

2025-07-25 | PXD065336 | Pride

An unbiased machine learning exploration reveals gene sets predictive of allograft tolerance after kidney transplantation

Project description:Efforts at finding potential biomarkers of tolerance after kidney transplantation have been hindered by limited sample size, as well as the complicated mechanisms underlying tolerance and the potential risk of rejection after immunosuppressant withdrawal. In this work, three different publicly available genome-wide expression data sets of peripheral blood lymphocyte (PBL) from 63 tolerant patients were used to compare 14 different machine learning models for their ability to predict spontaneous kidney graft tolerance. We found that the Best Subset Selection (BSS) regression approach was the most powerful with a sensitivity of 91.7% and a specificity of 93.8% in the test group, and a specificity of 86.1% and a sensitivity of 80% in the validation group. A feature set with five genes (HLA-DOA, TCL1A, EBF1, CD79B, and PNOC) was identified using the BSS model. EBF1 downregulation was also an independent factor predictive of graft rejection and graft loss. An AUC value of 84.4% was achieved using the two-gene signature (EBF1 and HLA-DOA) as an input to our classifier. Overall, our systematic machine learning exploration suggests novel biological targets that might affect tolerance to renal allografts, and provides clinical insights that can potentially guide patient selection for immunosuppressant withdrawal.

2021-06-04 | GSE166865 | GEO

Machine learning for discovery: deciphering RNA splicing logic

Project description:Machine learning for discovery: deciphering RNA splicing logic

| PRJNA822943 | ENA