Login / Signup

Combining Machine Learning and Numerical Simulation for High-Resolution PM 2.5 Concentration Forecast.

Jianzhao BiK Emma KnowlandChristoph A KellerYang Liu
Published in: Environmental science & technology (2022)
Forecasting ambient PM 2.5 concentrations with spatiotemporal coverage is key to alerting decision makers of pollution episodes and preventing detrimental public exposure, especially in regions with limited ground air monitoring stations. The existing methods rely on either chemical transport models (CTMs) to forecast spatial distribution of PM 2.5 with nontrivial uncertainty or statistical algorithms to forecast PM 2.5 concentration time series at air monitoring locations without continuous spatial coverage. In this study, we developed a PM 2.5 forecast framework by combining the robust Random Forest algorithm with a publicly accessible global CTM forecast product, NASA's Goddard Earth Observing System "Composition Forecasting" (GEOS-CF), providing spatiotemporally continuous PM 2.5 concentration forecasts for the next 5 days at a 1 km spatial resolution. Our forecast experiment was conducted for a region in Central China including the populous and polluted Fenwei Plain. The forecast for the next 2 days had an overall validation R 2 of 0.76 and 0.64, respectively; the R 2 was around 0.5 for the following 3 forecast days. Spatial cross-validation showed similar validation metrics. Our forecast model, with a validation normalized mean bias close to 0, substantially reduced the large biases in GEOS-CF. The proposed framework requires minimal computational resources compared to running CTMs at urban scales, enabling near-real-time PM 2.5 forecast in resource-restricted environments.
Keyphrases