Predicting Heavy Metal Adsorption on Soil with Machine Learning and Mapping Global Distribution of Soil Adsorption Capacities.
Hongrui YangKuan Z HuangKai ZhangQin WengHuichun ZhangFeier WangPublished in: Environmental science & technology (2021)
Studying heavy metal adsorption on soil is important for understanding the fate of heavy metals and properly assessing the related environmental risks. Existing experimental methods and traditional models for quantifying adsorption, however, are time-consuming and ineffective. In this study, we developed machine learning models for the soil adsorption of six heavy metals (Cd(II), Cr(VI), Cu(II), Pb(II), Ni(II), and Zn(II)) using 4420 data points (1105 soils) extracted from 150 journal articles. After a comprehensive comparison, our results showed that the gradient boosting decision tree had the best performance for a combined model based on all the data. The Shapley additive explanation method was used to identify the feature importance and the effects of these features on the adsorption, based on which six independent models were developed for the six metals to achieve better model performance than the combined model. Using these independent models, the global distribution of heavy metal adsorption capacities on soils was predicted with known soil properties. Reversed models, including one combined model for all the six metals and six independent models, were also built using the same data sets to predict the heavy metal concentration in water when the adsorbed amount is known for a soil/sediment.