Developing interpretable machine learning-Shapley additive explanations model for unconfined compressive strength of cohesive soils stabilized with geopolymer.
Anh Quan NgoLinh Quy NguyenVan Quan TranPublished in: PloS one (2023)
This paper seeks to develop an interpretable Machine Learning (ML) model for predicting the unconfined compressive strength (UCS) of cohesive soils stabilized with geopolymer at 28 days. Four models including Random Forest (RF), Artificial Neuron Network (ANN), Extreme Gradient Boosting (XGB), and Gradient Boosting (GB) are built. The database consists of 282 samples collected from the literature with three different types of cohesive soil stabilized with three geopolymer categories including Slag-based geopolymer cement, alkali-activated fly ash geopolymer and slag/fly ash-based geopolymer cement. The optimal model is selected by comparing their performances with each other. The values of hyperparameters are tuned by Particle Swarm Optimization (PSO) algorithm and K-Fold Cross Validation. Statistical indicators show the superior performance of the ANN model with three metrics performance such as coefficient of determination R2 = 0.9808, Root Mean Square Error RMSE = 0.8808 MPa and Mean Absolute Error MAE = 0.6344 MPa. In addition, a sensitivity analysis was performed to determine the influence of different input parameters on the UCS of cohesive soils stabilized with geopolymer. The order of feature effect can be ordered in descending order using the Shapley additive explanations (SHAP) value as follows: Ground granulated blast slag content (GGBFS) > Liquid limit (LL) > Alkali/Binder ratio (A/B) > Molarity (M) > Fly ash content (FA) > Na/Al > Si/Al. The ANN model can obtain the best accuracy using these seven inputs. LL has a negative correlation with the growth of unconfined compressive strength, whereas GGBFS has a positive correlation.