TY - JOUR
T1 - Machine learning-based modeling and analysis of PFOS removal from contaminated water by nanofiltration process
AU - Hosseinzadeh, Ahmad
AU - Zhou, John L.
AU - Zyaie, Javad
AU - AlZainati, Nahawand
AU - Ibrar, Ibrar
AU - Altaee, Ali
N1 - Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2022/5/15
Y1 - 2022/5/15
N2 - Per- and polyfluoroalkyl substances (PFAS) are hazardous chemicals that have been widely used in different industries and released into the environment through contaminated effluents. Nanofiltration (NF) is a promising process for removing PFAS from the effluents. This study aimed to model and analyze the performance of the NF membrane process in perfluorooctanesulfonic acid (PFOS) removal from contaminated effluents using machine learning (ML) algorithms. The modeling output of seven ML algorithms was evaluated using statistical indexes of determination coefficient (R2) and mean squared error (MSE) for robustness. The results demonstrated that random forest (RF), gradient boosting machine (GBM), and AdaBoost models were the most robust for the NF process. Accordingly, the optimization of these procedures was accomplished using a grid search. The optimized models were deeply analyzed using permutation variable importance (PVI) to quantify the relative importance of operating variables. The three ML procedures (RF, GBM, AdaBoost) presented high prediction strength for PFOS removal from contaminated effluents with low MSE values (4.726, 2.450, 2.879) and high R2 values (0.930, 0.975, 0.968). In addition, PVI-RF showed decreasing importance of pressure, initial PFOS concentration, membrane type, trivalent cation, pH, divalent cation and monovalent cation consecutively.
AB - Per- and polyfluoroalkyl substances (PFAS) are hazardous chemicals that have been widely used in different industries and released into the environment through contaminated effluents. Nanofiltration (NF) is a promising process for removing PFAS from the effluents. This study aimed to model and analyze the performance of the NF membrane process in perfluorooctanesulfonic acid (PFOS) removal from contaminated effluents using machine learning (ML) algorithms. The modeling output of seven ML algorithms was evaluated using statistical indexes of determination coefficient (R2) and mean squared error (MSE) for robustness. The results demonstrated that random forest (RF), gradient boosting machine (GBM), and AdaBoost models were the most robust for the NF process. Accordingly, the optimization of these procedures was accomplished using a grid search. The optimized models were deeply analyzed using permutation variable importance (PVI) to quantify the relative importance of operating variables. The three ML procedures (RF, GBM, AdaBoost) presented high prediction strength for PFOS removal from contaminated effluents with low MSE values (4.726, 2.450, 2.879) and high R2 values (0.930, 0.975, 0.968). In addition, PVI-RF showed decreasing importance of pressure, initial PFOS concentration, membrane type, trivalent cation, pH, divalent cation and monovalent cation consecutively.
KW - Machine learning
KW - Membrane performance
KW - Nanofiltration process
KW - PFOS
KW - Process modeling
UR - https://www.scopus.com/pages/publications/85125726086
U2 - 10.1016/j.seppur.2022.120775
DO - 10.1016/j.seppur.2022.120775
M3 - Article
AN - SCOPUS:85125726086
SN - 1383-5866
VL - 289
JO - Separation and Purification Technology
JF - Separation and Purification Technology
M1 - 120775
ER -