Login / Signup

Optimizing the Quality of Machine Learning for Identifying the Share of Biogenic and Fossil Carbon in Solid Waste.

Dong-Ying LanPin-Jing HeYa-Ping QiTing-Wei WuHao-Yang XianRui-Heng WangFan LüHua Zhang
Published in: Analytical chemistry (2023)
Insights into carbon sources (biogenic and fossil carbon) and contents in solid waste are vital for estimating the carbon emissions from incineration plants. However, the traditional methods are time-, labor-, and cost-intensive. Herein, high-quality data sets were established after analyzing the carbon contents and infrared spectra of substantial samples using elemental analysis and attenuated total reflectance-Fourier transform infrared spectroscopy (ATR-FTIR), respectively. Then, five classification and eight regression machine learning (ML) models were evaluated to recognize the proportion of biogenic and fossil carbon in solid waste. Using the optimized data preprocessing approach, the random forest (RF) classifier with hyperparameter tuning ranked first in classifying the carbon group with a test accuracy of 0.969, and the carbon contents were successfully predicted by the RF regressor with R 2 = 0.926 considering performance-interpretability-computation time competition. The above proposed algorithms were further validated with real environmental samples, which exhibited robust performance with an accuracy of 0.898 for carbon group classification and an R 2 value of 0.851 for carbon content prediction. The reliable results indicate that ATR-FTIR coupled with ML algorithms is feasible for rapidly identifying both carbon groups and content, facilitating the calculation and assessment of carbon emissions from solid waste incineration.
Keyphrases
  • machine learning
  • deep learning
  • municipal solid waste
  • heavy metals
  • big data
  • sewage sludge
  • artificial intelligence
  • oxidative stress
  • electronic health record
  • drinking water
  • clinical evaluation