Unknown

Dataset Information

0

Machine learning-aided risk stratification in Philadelphia chromosome-positive acute lymphoblastic leukemia.


ABSTRACT: We used the eXtreme Gradient Boosting algorithm, an optimized gradient boosting machine learning library, and established a model to predict events in Philadelphia chromosome-positive acute lymphoblastic leukemia using a machine learning-aided method. A model was constructed using a training set (80%) and prediction was tested using a test set (20%). According to the feature importance score, BCR-ABL lineage, polymerase chain reaction value, age, and white blood cell count were identified as important features. These features were also confirmed by the permutation feature importance for the prediction using the test set. Both event-free survival and overall survival were clearly stratified according to risk groups categorized using these features: 80 and 100% in low risk (two or less factors), 42 and 47% in intermediate risk (three factors), and 0 and 10% in high risk (four factors) at 4?years. Machine learning-aided analysis was able to identify clinically useful prognostic factors using data from a relatively small number of patients.

SUBMITTER: Nishiwaki S 

PROVIDER: S-EPMC7890949 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Machine learning-aided risk stratification in Philadelphia chromosome-positive acute lymphoblastic leukemia.

Nishiwaki Satoshi S   Sugiura Isamu I   Koyama Daisuke D   Ozawa Yukiyasu Y   Osaki Masahide M   Ishikawa Yuichi Y   Kiyoi Hitoshi H  

Biomarker research 20210218 1


We used the eXtreme Gradient Boosting algorithm, an optimized gradient boosting machine learning library, and established a model to predict events in Philadelphia chromosome-positive acute lymphoblastic leukemia using a machine learning-aided method. A model was constructed using a training set (80%) and prediction was tested using a test set (20%). According to the feature importance score, BCR-ABL lineage, polymerase chain reaction value, age, and white blood cell count were identified as imp  ...[more]

Similar Datasets

| S-EPMC7042663 | biostudies-literature
| S-EPMC3971203 | biostudies-literature
| S-EPMC5680607 | biostudies-literature
| S-EPMC5638138 | biostudies-literature
| S-EPMC6171979 | biostudies-literature
2021-12-10 | GSE190397 | GEO
2010-09-10 | E-GEOD-23743 | biostudies-arrayexpress
2010-09-10 | GSE23743 | GEO