Machine Learning Models of Groundwater Arsenic Spatial Distribution in Bangladesh: Influence of Holocene Sediment Depositional History.

Zhen Tan, Qiang Yang, Yan Zheng
Published in: Environmental science & technology (2020)
Recent advances in machine learning methods offer the opportunity to improve risk assessment and to decipher factors influencing the spatial variability of groundwater arsenic ([As]gw). A systematic comparison reveals that boosted regression trees (BRT) and random forest (RF) outperform logistic regression. The probability of [As]gw exceeding 5 μg/L (approximate median value of Bangladesh [As]gw), 10 μg/L (WHO provisional guideline value), and 50 μg/L (Bangladesh drinking water standard) is modeled by BRT and RF methods for Bangladesh and its four subregions demarcated by major rivers. Of the 109 geo-environmental and hydrochemical predictor variables, phosphorus and iron emerge as the most important across spatial scales, consistent with known As mobilization mechanisms. Well depth is significant only when hydrochemical parameters are not considered, consistent with prior studies. A peak of probability of [As]gw exceedance at ∼30 m depth is evident in the partial dependence plots (PDPs) for spatial-parameter-only models but not in the equivalent all-parameter models, suggesting that sediment depositional history explains interdependent spatial patterns of groundwater As-P-Fe in Holocene aquifers. The South region exhibits a decrease of probability of [As]gw exceedance below 150 m depth in PDPs for spatial-parameter-only and all-parameter models, supporting the view that the deeper Pleistocene aquifer is a low-As water resource.
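As a rough illustration of two ideas in the abstract, the sketch below shows how measured concentrations could be turned into binary exceedance labels at the three thresholds, and how a partial dependence curve is computed in general: one predictor is forced to each grid value in turn and the model's predicted probability is averaged over all wells. This is not the authors' code. The function names, the `depth_m` and `high_p` fields, and `toy_predict` (a hand-written stand-in for a fitted BRT or RF classifier, with made-up coefficients) are all assumptions for illustration only, and the numbers carry no empirical meaning.

```python
from typing import Callable, Dict, List, Sequence

# Thresholds named in the abstract, in micrograms per litre.
THRESHOLDS_UG_L = (5.0, 10.0, 50.0)

Row = Dict[str, float]


def exceedance_labels(as_ug_l: Sequence[float], threshold: float) -> List[int]:
    """Return 1 where the measured arsenic concentration exceeds the threshold, else 0."""
    return [1 if c > threshold else 0 for c in as_ug_l]


def partial_dependence(
    predict: Callable[[Row], float],
    rows: Sequence[Row],
    feature: str,
    grid: Sequence[float],
) -> List[float]:
    """Average predicted probability with `feature` forced to each grid value."""
    curve = []
    for value in grid:
        total = 0.0
        for row in rows:
            modified = dict(row)       # copy so the input rows are not mutated
            modified[feature] = value  # overwrite only the feature of interest
            total += predict(modified)
        curve.append(total / len(rows))
    return curve


def toy_predict(row: Row) -> float:
    """HYPOTHETICAL stand-in for a fitted classifier; coefficients are invented."""
    # Triangular bump centred on 30 m, zero beyond 0 m and 60 m.
    depth_term = max(0.0, 1.0 - abs(row["depth_m"] - 30.0) / 30.0)
    return 0.2 + 0.6 * depth_term + 0.1 * row["high_p"]


if __name__ == "__main__":
    # Invented example concentrations, used only to show the labelling step.
    concentrations = [3.0, 7.0, 60.0]
    for t in THRESHOLDS_UG_L:
        print(t, exceedance_labels(concentrations, t))

    # Invented wells; the curve is the mean prediction at each forced depth.
    wells = [{"depth_m": 10.0, "high_p": 0.0}, {"depth_m": 200.0, "high_p": 1.0}]
    print(partial_dependence(toy_predict, wells, "depth_m", [0.0, 30.0, 150.0]))
```

Because the averaging runs over every well's other predictor values, a depth curve from a model that also sees hydrochemical predictors can look different from one fitted on spatial parameters alone, which is the comparison the abstract draws between the two sets of PDPs.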
Keyphrases
  • drinking water
  • heavy metals
  • health risk assessment
  • health risk
  • machine learning
  • risk assessment
  • human health
  • optical coherence tomography
  • artificial intelligence
  • big data
  • neural network