A Physically Constrained Deep-Learning Fusion Method for Estimating Surface NO₂ Concentration from Satellite and Ground Monitors

Jia Xing; Bok H Baek; Siwei Li; Chi-Tsan Wang; Ge Song; Siqi Ma; Shuxin Zheng; Chang Liu; Daniel Tong; Jung-Hun Woo; Tie-Yan Liu; Joshua S Fu

doi:10.1021/acs.est.4c07341

Environ Sci Technol. 2024 Dec 3;58(48):21218-21228. doi: 10.1021/acs.est.4c07341. Epub 2024 Nov 20.

Authors

Jia Xing¹ ², Bok H Baek¹, Siwei Li³, Chi-Tsan Wang¹, Ge Song³, Siqi Ma¹, Shuxin Zheng⁴, Chang Liu⁴, Daniel Tong¹, Jung-Hun Woo⁵, Tie-Yan Liu⁴, Joshua S Fu²

Affiliations

¹ Center for Spatial Information Science and Systems, George Mason University, Fairfax, Virginia 22030, United States.
² Department of Civil and Environmental Engineering, The University of Tennessee, Knoxville, Tennessee 37996, United States.
³ Hubei Key Laboratory of Quantitative Remote Sensing of Land and Atmosphere, School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, Hubei 430000, China.
⁴ Microsoft Research AI for Science, Beijing 100080, China.
⁵ Graduate School of Environmental Studies, Seoul National University, Seoul 08826, Korea.

Abstract

Accurate estimation of atmospheric chemical concentrations from multiple observations is crucial for assessing the health effects of air pollution. However, existing methods are limited by imbalanced samples from observations. Here, we introduce a novel deep-learning model-measurement fusion method (DeepMMF), constrained by physical laws inferred from a chemical transport model (CTM), to estimate NO₂ concentrations over the Continental United States (CONUS). By pretraining with spatiotemporally complete CTM simulations, fine-tuning with satellite and ground measurements, and employing a novel optimization strategy for selecting a proper prior emission, DeepMMF delivers improved NO₂ estimates that show greater consistency with observations and better alignment with their daily variation (the normalized mean bias, NMB, is reduced in magnitude from -0.3 to -0.1 relative to the original CTM simulations). More importantly, DeepMMF effectively addresses the sample imbalance issue that causes other methods to overestimate downwind or rural concentrations by over 100%. Evaluated against surface NO₂ observations, it achieves an R² of 0.98 and an RMSE of 1.45 ppb, outperforming other approaches, which show R² values of 0.4-0.7 and RMSEs of 3-6 ppb. The method also offers a synergistic advantage by adjusting the corresponding emissions, in agreement with the changes (-10% to -20%) reported in the National Emissions Inventory (NEI) between 2019 and 2020. Our results demonstrate the great potential of DeepMMF in data fusion to better support air pollution exposure estimation and forecasting.
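For readers unfamiliar with the evaluation statistics quoted in the abstract (NMB, RMSE, R²), the following is a minimal sketch of their conventional definitions. It is not the authors' evaluation code. The function names and sample values are hypothetical and chosen only for illustration, and R² is assumed here to be the squared Pearson correlation between estimates and observations, which the abstract does not specify.

```python
import math

def nmb(model, obs):
    """Normalized mean bias: sum(model - obs) / sum(obs). Negative means underestimation."""
    return sum(m - o for m, o in zip(model, obs)) / sum(obs)

def rmse(model, obs):
    """Root-mean-square error, in the same units as the inputs (e.g. ppb)."""
    n = len(obs)
    return math.sqrt(sum((m - o) ** 2 for m, o in zip(model, obs)) / n)

def r_squared(model, obs):
    """Squared Pearson correlation (an assumed definition of R²)."""
    n = len(obs)
    mean_m = sum(model) / n
    mean_o = sum(obs) / n
    cov = sum((m - mean_m) * (o - mean_o) for m, o in zip(model, obs))
    var_m = sum((m - mean_m) ** 2 for m in model)
    var_o = sum((o - mean_o) ** 2 for o in obs)
    return cov ** 2 / (var_m * var_o)

# Hypothetical surface NO2 values in ppb (illustrative only, not data from the paper).
obs_ppb = [10.0, 20.0, 30.0, 40.0]
model_ppb = [9.0, 18.0, 27.0, 36.0]  # every estimate is 10% below the observation

example_nmb = nmb(model_ppb, obs_ppb)
example_rmse = rmse(model_ppb, obs_ppb)
example_r2 = r_squared(model_ppb, obs_ppb)
print(example_nmb, example_rmse, example_r2)
```

In this toy case every estimate is 10% low, so the NMB is -0.1 even though the estimates correlate perfectly with the observations. The two statistics capture different aspects of agreement, which is why bias and correlation measures are usually reported together.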

Keywords: NO2; TROPOMI satellite; deep learning; model-measurement fusion; physically constrained.

MeSH terms

Air Pollutants* / analysis
Air Pollution*
Deep Learning*
Environmental Monitoring* / methods
Nitrogen Dioxide* / analysis

Substances

Air Pollutants
Nitrogen Dioxide