Comparative analysis of supervised learning models for effluent quality prediction in wastewater treatment plants

  • 1State Key Laboratory of Regional and Urban Ecology, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, Fujian, China.
  • 2Rural Revitalization College, Fujian Agriculture and Forestry University, Fuzhou, China.

|

Abstract

Effluent quality prediction is critical for optimizing Wastewater Treatment Plant (WWTP) operations, ensuring regulatory compliance, and promoting environmental sustainability. This study evaluates the performance of five supervised learning models-AdaBoost, Backpropagation Neural Networks (BP-NN), Support Vector Machine (SVR), XGBoost, and Gradient Boosting (GB)-using data from a WWTP in Zhuhai, China. The Effluent Quality Index (EQI), integrating multiple pollutant concentrations and environmental impacts, was used as the target variable. The models were trained and tested on 84 monthly datasets, with their performances compared using R2, Mean Absolute Percentage Error (MAPE), and Mean Bias Error (MBE). XGBoost achieved the best balance between accuracy and robustness, with the lowest MAPE(6.11%) and a high R2(0.813), while SVR excelled in fitting accuracy(R2 = 0.826) but showed limitations in error control. Although we employed GridSearchCV with cross-validation to optimize hyperparameters and ensure a fair model comparison, the study is limited by the reliance on data from a single WWTP and the relatively small dataset size (84 records). Nevertheless, the findings provide valuable insights into selecting effective machine learning models for effluent quality prediction, supporting data-driven decision-making in wastewater management and advancing intelligent process optimization in WWTP.

Related Concept Videos

Typical Model Studies 01:30

344

Fluid mechanics model studies often utilize scaled-down systems to predict fluid behavior in full-scale environments, such as river flows, dam spillways, and structures interacting with open surfaces. Maintaining Froude number similarity in river models is crucial, as it replicates surface flow features like wave patterns and velocities.

Practical constraints, however, can introduce geometric distortions, impacting how accurately these models represent the real system. Adjustments, such as...

Design Example: Creating a Hydraulic Model of a Dam Spillway 01:21

137

Scaled hydraulic models of dam spillways provide a practical way to replicate and study the intricate flow dynamics of these structures. Often built to a 1:15 ratio, these models allow for observing critical water behavior, such as velocity distribution, flow patterns, and energy dissipation.

Through this controlled setup, researchers can closely examine how water flows over the spillway, supporting a detailed analysis of its performance. A fundamental principle behind these models is using...

Modeling and Similitude 01:12

249

Scaled modeling is a fundamental technique in engineering, enabling the study of large and complex systems by creating smaller, manageable replicas that recreate critical characteristics of the original. In hydrology and civil infrastructure, for example, scaled models of dams help analyze water flow, turbulence, and pressure. This method allows for accurate predictions of real-world behavior within a controlled environment, significantly reducing the cost and time involved in full-scale...