Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal

In the pursuit of understanding surface water quality for sustainable urban management, we created a machine learning modeling framework that utilized Random Forest (RF), Cubist, Extreme Gradient Boosting (XGB), Multivariate Adaptive Regression Splines (MARS), Gradient Boosting Machine (GBM), Suppor...

Full description

Bibliographic Details
Main Authors: Singha, Chiranjit, Bhattacharjee, Ishita, Sahoo, Satiprasad, Abdelrahman, Kamal, Uddin, Md Galal, Fnais, Mohammed S., Govind, Ajit, Abioui, Mohamed
Format: Journal Article
Language:Inglés
Published: Elsevier 2024
Subjects:
Online Access:https://hdl.handle.net/10568/173356
_version_ 1855532868090986496
author Singha, Chiranjit
Bhattacharjee, Ishita
Sahoo, Satiprasad
Abdelrahman, Kamal
Uddin, Md Galal
Fnais, Mohammed S.
Govind, Ajit
Abioui, Mohamed
author_browse Abdelrahman, Kamal
Abioui, Mohamed
Bhattacharjee, Ishita
Fnais, Mohammed S.
Govind, Ajit
Sahoo, Satiprasad
Singha, Chiranjit
Uddin, Md Galal
author_facet Singha, Chiranjit
Bhattacharjee, Ishita
Sahoo, Satiprasad
Abdelrahman, Kamal
Uddin, Md Galal
Fnais, Mohammed S.
Govind, Ajit
Abioui, Mohamed
author_sort Singha, Chiranjit
collection Repository of Agricultural Research Outputs (CGSpace)
description In the pursuit of understanding surface water quality for sustainable urban management, we created a machine learning modeling framework that utilized Random Forest (RF), Cubist, Extreme Gradient Boosting (XGB), Multivariate Adaptive Regression Splines (MARS), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), and their hybrid stacking ensemble RF (SE-RF), as well as stacking Cubist (SE-Cubist), to predict the distribution of water quality in the Howrah Municipal Corporation (HMC) area in West Bengal, India. Additionally, we employed the ReliefF and Shapley Additive exPlanations (SHAP) methods to elucidate the underlying factors driving water quality. We first estimated the water quality index (WQI) to model seven water quality parameters: total hardness (TH), pH, total dissolved solids (TDS), dissolved oxygen (DO), biochemical oxygen demand (BOD), calcium (Ca), magnesium (Mg). Then six independent factors were utilized (i.e. Precipitation (Pr), Maximum Temperature (Tmax), Minimum Temperature (Tmin), Normalized Difference Turbidity Index (NDTI), Normalized Difference Chlorophyll Index (NDCI), and Total Dissolved Solids (TDS)) for predicting the WQI mapping through the different ML models. This study demonstrated that the SE-Cubist model outperforms other ML models. During the testing phase, it achieved the best modeling results with an R2 = 0.975, RMSE = 0.351, and MAE = 0.197. The ReliefF and SHAP analyses identified Pr and Tmax as the most significant factors influencing WQI within the study area.
format Journal Article
id CGSpace173356
institution CGIAR Consortium
language Inglés
publishDate 2024
publishDateRange 2024
publishDateSort 2024
publisher Elsevier
publisherStr Elsevier
record_format dspace
spelling CGSpace1733562025-10-26T12:55:58Z Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal Singha, Chiranjit Bhattacharjee, Ishita Sahoo, Satiprasad Abdelrahman, Kamal Uddin, Md Galal Fnais, Mohammed S. Govind, Ajit Abioui, Mohamed urban areas water management surface water In the pursuit of understanding surface water quality for sustainable urban management, we created a machine learning modeling framework that utilized Random Forest (RF), Cubist, Extreme Gradient Boosting (XGB), Multivariate Adaptive Regression Splines (MARS), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), and their hybrid stacking ensemble RF (SE-RF), as well as stacking Cubist (SE-Cubist), to predict the distribution of water quality in the Howrah Municipal Corporation (HMC) area in West Bengal, India. Additionally, we employed the ReliefF and Shapley Additive exPlanations (SHAP) methods to elucidate the underlying factors driving water quality. We first estimated the water quality index (WQI) to model seven water quality parameters: total hardness (TH), pH, total dissolved solids (TDS), dissolved oxygen (DO), biochemical oxygen demand (BOD), calcium (Ca), magnesium (Mg). Then six independent factors were utilized (i.e. Precipitation (Pr), Maximum Temperature (Tmax), Minimum Temperature (Tmin), Normalized Difference Turbidity Index (NDTI), Normalized Difference Chlorophyll Index (NDCI), and Total Dissolved Solids (TDS)) for predicting the WQI mapping through the different ML models. This study demonstrated that the SE-Cubist model outperforms other ML models. During the testing phase, it achieved the best modeling results with an R2 = 0.975, RMSE = 0.351, and MAE = 0.197. The ReliefF and SHAP analyses identified Pr and Tmax as the most significant factors influencing WQI within the study area. 2024-11 2025-02-22T01:42:31Z 2025-02-22T01:42:31Z Journal Article https://hdl.handle.net/10568/173356 en Open Access Elsevier Singha, Chiranjit.; Bhattacharjee, Ishita.; Sahoo, Satiprasad.; Abdelrahman, Kamal.; Uddin, Md Galal.; Fnais, Mohammed S.; Govind, Ajit.; Abioui, Mohamed. Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal
spellingShingle urban areas
water management
surface water
Singha, Chiranjit
Bhattacharjee, Ishita
Sahoo, Satiprasad
Abdelrahman, Kamal
Uddin, Md Galal
Fnais, Mohammed S.
Govind, Ajit
Abioui, Mohamed
Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal
title Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal
title_full Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal
title_fullStr Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal
title_full_unstemmed Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal
title_short Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal
title_sort prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in howrah municipal corporation west bengal
topic urban areas
water management
surface water
url https://hdl.handle.net/10568/173356
work_keys_str_mv AT singhachiranjit predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal
AT bhattacharjeeishita predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal
AT sahoosatiprasad predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal
AT abdelrahmankamal predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal
AT uddinmdgalal predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal
AT fnaismohammeds predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal
AT govindajit predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal
AT abiouimohamed predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal