Machine learning-aided risk stratification in Philadelphia chromosome-positive acute lymphoblastic leukemia.
Satoshi NishiwakiIsamu SugiuraDaisuke KoyamaYukiyasu OzawaMasahide OsakiYuichi IshikawaHitoshi KiyoiPublished in: Biomarker research (2021)
We used the eXtreme Gradient Boosting algorithm, an optimized gradient boosting machine learning library, and established a model to predict events in Philadelphia chromosome-positive acute lymphoblastic leukemia using a machine learning-aided method. A model was constructed using a training set (80%) and prediction was tested using a test set (20%). According to the feature importance score, BCR-ABL lineage, polymerase chain reaction value, age, and white blood cell count were identified as important features. These features were also confirmed by the permutation feature importance for the prediction using the test set. Both event-free survival and overall survival were clearly stratified according to risk groups categorized using these features: 80 and 100% in low risk (two or less factors), 42 and 47% in intermediate risk (three factors), and 0 and 10% in high risk (four factors) at 4 years. Machine learning-aided analysis was able to identify clinically useful prognostic factors using data from a relatively small number of patients.
Keyphrases
- machine learning
- acute lymphoblastic leukemia
- prognostic factors
- free survival
- big data
- artificial intelligence
- allogeneic hematopoietic stem cell transplantation
- deep learning
- end stage renal disease
- single cell
- ejection fraction
- newly diagnosed
- chronic myeloid leukemia
- climate change
- chronic kidney disease
- dna methylation
- stem cells
- wastewater treatment
- gene expression
- peritoneal dialysis
- patient reported outcomes
- peripheral blood
- bone marrow
- neural network
- cell fate