By Rajaul Karim | 06 Mar, 2025
Type: Journal Paper.
Journal Name: Scientific Data (Impact Factor: 9.8 | Cite Score: 11.2 | Q1 Journal).
Publisher: Nature Publication (Springer Nature).
Date: 06 March, 2025
🔗 Access Links:
📄 Article: Read Here
🗃️ Dataset and Code: Download Here
Abstract: Assessment and monitoring of surface water quality are essential for food security, public health, and ecosystem protection. Although water quality monitoring is a known phenomenon, little effort has been made to offer a comprehensive and harmonized dataset for surface water at the global scale. This study presents a comprehensive surface water quality dataset that preserves spatio-temporal variability, integrity, consistency, and depth of the data to facilitate empirical and data-driven evaluation, prediction, and forecasting. The dataset is assembled from a range of sources, including regional and global water quality databases, water management organizations, and individual research projects from five prominent countries in the world, e.g., the USA, Canada, Ireland, England, and China. The resulting dataset consists of 2.82 million measurements of eight water quality parameters that span 1940 - 2023. This dataset can support meta-analysis of water quality models and can facilitate Machine Learning (ML) based data and model-driven investigation of the spatial and temporal drivers and patterns of surface water quality at a cross-regional to global scale.
Step-by-Step process for Data Acquisition, Harmonization, Validation, and Dataset Preparation:
Data Collection Summary with Geographic Locations:
Performance Assessment Curves for the Machine Learning Models: