Login / Signup

Developing interpretable machine learning-Shapley additive explanations model for unconfined compressive strength of cohesive soils stabilized with geopolymer.

Anh Quan NgoLinh Quy NguyenVan Quan Tran
Published in: PloS one (2023)
This paper seeks to develop an interpretable Machine Learning (ML) model for predicting the unconfined compressive strength (UCS) of cohesive soils stabilized with geopolymer at 28 days. Four models including Random Forest (RF), Artificial Neuron Network (ANN), Extreme Gradient Boosting (XGB), and Gradient Boosting (GB) are built. The database consists of 282 samples collected from the literature with three different types of cohesive soil stabilized with three geopolymer categories including Slag-based geopolymer cement, alkali-activated fly ash geopolymer and slag/fly ash-based geopolymer cement. The optimal model is selected by comparing their performances with each other. The values of hyperparameters are tuned by Particle Swarm Optimization (PSO) algorithm and K-Fold Cross Validation. Statistical indicators show the superior performance of the ANN model with three metrics performance such as coefficient of determination R2 = 0.9808, Root Mean Square Error RMSE = 0.8808 MPa and Mean Absolute Error MAE = 0.6344 MPa. In addition, a sensitivity analysis was performed to determine the influence of different input parameters on the UCS of cohesive soils stabilized with geopolymer. The order of feature effect can be ordered in descending order using the Shapley additive explanations (SHAP) value as follows: Ground granulated blast slag content (GGBFS) > Liquid limit (LL) > Alkali/Binder ratio (A/B) > Molarity (M) > Fly ash content (FA) > Na/Al > Si/Al. The ANN model can obtain the best accuracy using these seven inputs. LL has a negative correlation with the growth of unconfined compressive strength, whereas GGBFS has a positive correlation.
Keyphrases
  • machine learning
  • heavy metals
  • systematic review
  • climate change
  • emergency department
  • neural network
  • deep learning
  • big data
  • human health
  • computed tomography
  • organic matter
  • ionic liquid
  • drug induced