COMPARISON OF RANDOM FOREST AND MULTIPLE LINEAR REGRESSION TO MODEL THE MASS BALANCE OF BIOSOLIDS FROM A COMPLEX BIOSOLIDS MANAGEMENT AREA.
Thaís Bremm PluthDominic A BrosePublished in: Water environment research : a research publication of the Water Environment Federation (2021)
The use of biosolids as a soil amendment provides an important alternative to disposal and can improve soil health; however, distribution for water resource recovery facilities (WRRFs) in the U.S. can be challenging due to decreasing cropland, increased precipitation, variable plant operations, and financial constraints. Although statistical modeling is commonly used in the water sector, machine learning is still an emerging tool and can provide insights to optimize operations. Random forest (RF), a machine learning model, and multiple linear regression (MLR) were used in this study to model the mass balance of biosolids from a complex biosolids management area. The RF model outperformed (R2 =0.89) the MLR model (R2 =0.49) and showed that rainfall was a major factor impacting distribution. Storage for dried biosolids would help decouple drying operations from wet weather and increase distribution. This study demonstrated how machine learning can assist in decision-making processes for long-term planning at WRRFs.