Recent advances in groundwater pollution research using machine learning from 2000 to 2023: a bibliometric analysis

Environ Res. 2024 Dec 20:120683. doi: 10.1016/j.envres.2024.120683. Online ahead of print.

Abstract

Groundwater pollution has become a global challenge, posing significant threats to human health and ecological environments. Machine learning, with its superior ability to capture non-linear relationships in data, has shown significant potential in addressing the groundwater pollution issues. This review presents a comprehensive bibliometric analysis of 1,462 articles published between 2000 and 2023, offering an overview of the current state of research, analyzing development trends, and suggesting future directions. The analysis reveals a growing trend in publications over the 24-year period, with a sharp expansion since 2020. China, the USA, India, and Iran are identified as the leading contributors to publications and citations, with prominent institutions such as Jilin University, the United States Geological Survey, and the University of Tabriz. Moreover, keyword frequency analysis indicates that principal component analysis (PCA) is the most commonly used method, followed by artificial neural network (ANN) and hierarchical clustering analysis (HCA). The most studied groundwater pollutants include nitrate, arsenic, heavy metals, and fluoride. As machine learning has rapidly advanced, research focuses have evolved from basic tasks like hydrochemical evolution analysis, water quality index evaluation, and groundwater vulnerability assessments to more complex issues, such as pollutant concentration prediction, pollution risk assessment, and pollution source identification. Despite these advances, challenges related to data quality, data scarcity, model generalization, and interpretability remain. Future research should prioritize data sharing, improving model interpretability, broadening research horizons and advancing theory-guided machine learning. These will enhance our understanding of groundwater pollution mechanisms, and ultimately facilitate more effective pollution control and remediation strategies. In summary, this review provides valuable insights and suggestions for researchers and policymakers working in this critical field.

Keywords: Groundwater pollution; Groundwater quality; Hydrochemical evolution; Machine learning; Pollution risk; Pollution source identification.

Publication types

  • Review