Toward the development of an ML-driven decision support system for wastewater treatment: A bacterial inactivation prediction approach in solar photochemical processes

Abstract

The design of efficient bacterial inactivation treatment in wastewater is challenging due to its numerous parameters and the complex composition of wastewater. Although solar photochemical processes (PCPs) provide energy-saving benefits, a balance must be maintained between bacterial inactivation efficiency and experimental costs. Predictive decision tools for bacterial inactivation under various conditions would significantly contribute to optimizing PCP design resources. This study evaluated four machine learning algorithms (ML) (i.e., Artificial Neural Network (ANN), Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boost (XGBoost)) for predicting bacterial inactivation behavior, using Escherichia coli, Enterococcus spp., and Salmonella spp. Several oxidant types, bacterial concentrations, and aqueous matrices were evaluated in two scenarios simulating real-world conditions. Results demonstrated that decision tree-based models (RF and XGBoost) outperformed SVM and ANN in accuracy. In Scenario I (prediction of intermediate experimental values over time) the XGBoost model was most effective, achieving a Root Mean Square Error (RMSE) of 0.81, 0.76 and 0.55 and an R2 of 0.84, 0.79, and 0.87 for the three bacteria, respectively. In Scenario II (prediction of full experimental values over time), the RF model excelled for Escherichia coli and Salmonella spp. with an RMSE of 0.88 for both and an R2 of 0.80 and 0.71, respectively. The XGBoost model showed moderate effectiveness for Enterococcus sp. with an RMSE of 1.31 and R2 of 0.50. Overall, the decision tree-based models demonstrated their potential for prediction in tests of a wide range of PCP parameters without requiring additional trials.


The publication is part of Projects TED2021-129969B-C31 and TED2021-129969B-C33 funded by MCIN/AEI/10.13039/501100011033 and by “European Union NextGenerationEU/PRTR”. This work also received funding from the Spanish Ministry of Economy and Competitiveness through the “Severo Ochoa Programme for Centres of Excellence in R&D” (CEX2018-000797-S) and the Generalitat de Catalonia through the CERCA Program.


Peer Reviewed


Postprint (published version)

Document Type

Article

Language

English

Publisher

Elsevier

Related items

https://www.sciencedirect.com/science/article/pii/S0301479724035230?via%3Dihub

TED2021-129969B-C31

TED2021-129969B-C33

CEX2018-000797-S

Recommended citation

This citation was generated automatically.

Rights

http://creativecommons.org/licenses/by/4.0/

Open Access

Attribution 4.0 International

This item appears in the following Collection(s)

E-prints [72987]