Unknown

Dataset Information

0

Prediction of BRAF V600E variant from cancer gene expression data.


ABSTRACT:

Background

BRAF inhibitors have been approved for the treatment of melanoma, non-small cell lung cancer, and colon cancer. Real-time polymerase chain reaction or next-generation sequencing were clinically used for BRAF variant detection to select who responds to BRAF inhibitors. The prediction of BRAF variants using gene expression data might be an alternative test when the direct variant sequencing test is not feasible. In this study, we built a prediction model to detect BRAF V600 variants with mRNA gene expression data in various cancer types.

Methods

We adopted a penalized logistic regression for the BRAF V600E variants prediction model. Ten times bootstrap resampling was done with a combined target variable and cancer type stratification. Data preprocessing included knnimputation for missing value imputation, YeoJohnson transformation for skewness correction, center, and scale for standardization, synthetic minority over-sampling technique for class imbalance. Hyperparameter optimization with a grid search was undertaken for model selection in terms of area under the precision-recall.

Results

The area under the curve of the receiver operating characteristic curve on the test set was 0.98 in thyroid carcinoma, 0.90 in colon adenocarcinoma, and 0.85 in cutaneous melanoma. The area under the precision-recall of the test set was 0.98 in thyroid carcinoma, 0.71 in colon adenocarcinoma, and 0.65 in cutaneous melanoma.

Conclusions

Our penalized logistic regression model can predict BRAF V600E variants with good performance in thyroid carcinoma, cutaneous melanoma, and colon adenocarcinoma.

SUBMITTER: Kang J 

PROVIDER: S-EPMC9745377 | biostudies-literature | 2022 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Prediction of <i>BRAF</i> V600E variant from cancer gene expression data.

Kang Jun J   Lee Jieun J   Lee Ahwon A   Lee Youn Soo YS  

Translational cancer research 20221101 11


<h4>Background</h4>BRAF inhibitors have been approved for the treatment of melanoma, non-small cell lung cancer, and colon cancer. Real-time polymerase chain reaction or next-generation sequencing were clinically used for <i>BRAF</i> variant detection to select who responds to BRAF inhibitors. The prediction of <i>BRAF</i> variants using gene expression data might be an alternative test when the direct variant sequencing test is not feasible. In this study, we built a prediction model to detect  ...[more]

Similar Datasets

| S-EPMC3590666 | biostudies-other
2016-11-28 | GSE75317 | GEO
| S-EPMC7141227 | biostudies-literature
| S-EPMC2615043 | biostudies-literature
| S-EPMC3372405 | biostudies-literature
| PRJNA303175 | ENA
| S-EPMC2651569 | biostudies-literature
| S-EPMC7212105 | biostudies-literature
| S-EPMC5542387 | biostudies-literature
| S-EPMC7226510 | biostudies-literature