The application of machine learning to air pollution research: A bibliometric analysis
Machine learning (ML) is an advanced computer algorithm that simulates the human learning process to solve problems. With an explosion of monitoring data and the increasing demand for fast and accurate prediction, ML models have been rapidly developed and applied in air pollution research. In order to explore the status of ML applications in air pollution research, a bibliometric analysis was made based on 2962 articles published from 1990 to 2021. The number of publications increased sharply after 2017, comprising approximately 75% of the total. Institutions in China and United States contributed half of all publications with most research being conducted by individual groups rather than global collaborations. Cluster analysis revealed four main research topics for the application of ML: chemical characterization of pollutants, short-term forecasting, detection improvement and optimizing emission control. The rapid development of ML algorithms has increased the capability to explore the chemical characteristics of multiple pollutants, analyze chemical reactions and their driving factors, and simulate scenarios. Combined with multi-field data, ML models are a powerful tool for analyzing atmospheric chemical processes and evaluating the management of air quality and deserve greater attention in future.
| Item Type | Article |
|---|---|
| Open Access | Gold |
| Additional information | This work was supported by the Key Program of the National Natural Science Foundation of China (No. 41730855), the major project of Inner Mongolian Natural Science Foundation (No. 2019ZD02), and the High-level Innovative Talent Project of China Agricultural University. |
| Keywords | Air pollution, Machine learning, Bibliometric analysis, Research focuses, Development |
| Project | S2N - Soil to Nutrition [ISPG] |
| Date Deposited | 05 Dec 2025 10:36 |
| Last Modified | 19 Dec 2025 14:56 |


