Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal
In the pursuit of understanding surface water quality for sustainable urban management, we created a machine learning modeling framework that utilized Random Forest (RF), Cubist, Extreme Gradient Boosting (XGB), Multivariate Adaptive Regression Splines (MARS), Gradient Boosting Machine (GBM), Suppor...
| Main Authors: | , , , , , , , |
|---|---|
| Format: | Journal Article |
| Language: | Inglés |
| Published: |
Elsevier
2024
|
| Subjects: | |
| Online Access: | https://hdl.handle.net/10568/173356 |
| _version_ | 1855532868090986496 |
|---|---|
| author | Singha, Chiranjit Bhattacharjee, Ishita Sahoo, Satiprasad Abdelrahman, Kamal Uddin, Md Galal Fnais, Mohammed S. Govind, Ajit Abioui, Mohamed |
| author_browse | Abdelrahman, Kamal Abioui, Mohamed Bhattacharjee, Ishita Fnais, Mohammed S. Govind, Ajit Sahoo, Satiprasad Singha, Chiranjit Uddin, Md Galal |
| author_facet | Singha, Chiranjit Bhattacharjee, Ishita Sahoo, Satiprasad Abdelrahman, Kamal Uddin, Md Galal Fnais, Mohammed S. Govind, Ajit Abioui, Mohamed |
| author_sort | Singha, Chiranjit |
| collection | Repository of Agricultural Research Outputs (CGSpace) |
| description | In the pursuit of understanding surface water quality for sustainable urban management, we created a machine learning modeling framework that utilized Random Forest (RF), Cubist, Extreme Gradient Boosting (XGB), Multivariate Adaptive Regression Splines (MARS), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), and their hybrid stacking ensemble RF (SE-RF), as well as stacking Cubist (SE-Cubist), to predict the distribution of water quality in the Howrah Municipal Corporation (HMC) area in West Bengal, India. Additionally, we employed the ReliefF and Shapley Additive exPlanations (SHAP) methods to elucidate the underlying factors driving water quality. We first estimated the water quality index (WQI) to model seven water quality parameters: total hardness (TH), pH, total dissolved solids (TDS), dissolved oxygen (DO), biochemical oxygen demand (BOD), calcium (Ca), magnesium (Mg). Then six independent factors were utilized (i.e. Precipitation (Pr), Maximum Temperature (Tmax), Minimum Temperature (Tmin), Normalized Difference Turbidity Index (NDTI), Normalized Difference Chlorophyll Index (NDCI), and Total Dissolved Solids (TDS)) for predicting the WQI mapping through the different ML models. This study demonstrated that the SE-Cubist model outperforms other ML models. During the testing phase, it achieved the best modeling results with an R2 = 0.975, RMSE = 0.351, and MAE = 0.197. The ReliefF and SHAP analyses identified Pr and Tmax as the most significant factors influencing WQI within the study area. |
| format | Journal Article |
| id | CGSpace173356 |
| institution | CGIAR Consortium |
| language | Inglés |
| publishDate | 2024 |
| publishDateRange | 2024 |
| publishDateSort | 2024 |
| publisher | Elsevier |
| publisherStr | Elsevier |
| record_format | dspace |
| spelling | CGSpace1733562025-10-26T12:55:58Z Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal Singha, Chiranjit Bhattacharjee, Ishita Sahoo, Satiprasad Abdelrahman, Kamal Uddin, Md Galal Fnais, Mohammed S. Govind, Ajit Abioui, Mohamed urban areas water management surface water In the pursuit of understanding surface water quality for sustainable urban management, we created a machine learning modeling framework that utilized Random Forest (RF), Cubist, Extreme Gradient Boosting (XGB), Multivariate Adaptive Regression Splines (MARS), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), and their hybrid stacking ensemble RF (SE-RF), as well as stacking Cubist (SE-Cubist), to predict the distribution of water quality in the Howrah Municipal Corporation (HMC) area in West Bengal, India. Additionally, we employed the ReliefF and Shapley Additive exPlanations (SHAP) methods to elucidate the underlying factors driving water quality. We first estimated the water quality index (WQI) to model seven water quality parameters: total hardness (TH), pH, total dissolved solids (TDS), dissolved oxygen (DO), biochemical oxygen demand (BOD), calcium (Ca), magnesium (Mg). Then six independent factors were utilized (i.e. Precipitation (Pr), Maximum Temperature (Tmax), Minimum Temperature (Tmin), Normalized Difference Turbidity Index (NDTI), Normalized Difference Chlorophyll Index (NDCI), and Total Dissolved Solids (TDS)) for predicting the WQI mapping through the different ML models. This study demonstrated that the SE-Cubist model outperforms other ML models. During the testing phase, it achieved the best modeling results with an R2 = 0.975, RMSE = 0.351, and MAE = 0.197. The ReliefF and SHAP analyses identified Pr and Tmax as the most significant factors influencing WQI within the study area. 2024-11 2025-02-22T01:42:31Z 2025-02-22T01:42:31Z Journal Article https://hdl.handle.net/10568/173356 en Open Access Elsevier Singha, Chiranjit.; Bhattacharjee, Ishita.; Sahoo, Satiprasad.; Abdelrahman, Kamal.; Uddin, Md Galal.; Fnais, Mohammed S.; Govind, Ajit.; Abioui, Mohamed. Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal |
| spellingShingle | urban areas water management surface water Singha, Chiranjit Bhattacharjee, Ishita Sahoo, Satiprasad Abdelrahman, Kamal Uddin, Md Galal Fnais, Mohammed S. Govind, Ajit Abioui, Mohamed Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal |
| title | Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal |
| title_full | Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal |
| title_fullStr | Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal |
| title_full_unstemmed | Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal |
| title_short | Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal |
| title_sort | prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in howrah municipal corporation west bengal |
| topic | urban areas water management surface water |
| url | https://hdl.handle.net/10568/173356 |
| work_keys_str_mv | AT singhachiranjit predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal AT bhattacharjeeishita predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal AT sahoosatiprasad predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal AT abdelrahmankamal predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal AT uddinmdgalal predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal AT fnaismohammeds predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal AT govindajit predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal AT abiouimohamed predictionofurbansurfacewaterqualityscenariosusinghybridstackingensemblesmachinelearningmodelinhowrahmunicipalcorporationwestbengal |