Abstract
Data-driven statistical air quality prediction methods usually build models fast with moderate accuracy and have been studied a lot in recent years. However, due to the complexity of air quality prediction which usually involves multiple factors, such as meteorological, spatial, and temporal properties, it is still a challenge to propose a model with required accuracy. In this paper, we propose a hybrid ensemble model CERL to exploit the merits of both forward neural networks and recurrent neural networks that are designed for handling time serial data to predict air quality hourly. Measured air pollutant factors including Air Quality Index (AQI), PM2.5, PM10, CO, SO2, NO2, and O3 are used as input to predict air quality from 1 to 8 h ahead. Based on the air quality prediction evaluation in Lanzhou and Xi’an, which are two important provincial capitals in Northwest China, CERL provides better performance over other baseline models. Moreover, as the step length increases, CERL has more obvious improvement. For example, the improvements of CERL in the 1-step, 3-step, 5-step, and 8-step prediction for PM2.5 in Lanzhou are 1.82%, 8.01%, 9.98%, and 20.03%, respectively. The superiority of CERL is also proved by a hypothesis Diebold Mariano test with level of significance 5%.
Similar content being viewed by others
References
AirNow (2019) Air quality index (aqi) basics. https://airnow.gov/index.cfm?action=aqibasics.aqi. Accessed 29 Aug 2019
AirVisual (2019) Airvisual–air quality monitor and information you can trust. https://www.airvisual.com/. Accessed 26 Aug 2019
AirVisual I (2018) 2018 world air quality report-region & city pm2.5 ranking, Tech. Rep.
Atabay D (2019) pyrenn: a recurrent neural network toolbox for python and matlab–pyrenn 0.1 documentation. https://pyrenn.readthedocs.io/en/latest/. Accessed 02 Jul 2019
Athira V, Geetha P, Vinayakumar R, Soman KP (2018) Deepairnet: applying recurrent networks for air quality prediction. Procedia Computer Science 132:1394–1403. Retrieved from http://www.sciencedirect.com/science/article/pii/S1877050918308007 (International Conference on Computational Intelligence and Data Science)
Biancofiore F, Busilacchio M, Verdecchia M, Tomassetti B, Aruffo E, Bianco S, et al. (2017) Recursive neural network model for analysis and forecast of pm10 and pm2.5. Atmos Pollut Res 8(4):652–659. Retrieved from http://www.sciencedirect.com/science/article/pii/S1309104216304056
Cabaneros SM, Calautit JK, Hughes BR (2019) A review of artificial neural network models for ambient air pollution prediction. Environmental Modelling & Software 119:285–304. Retrieved from http://www.sciencedirect.com/science/article/pii/S1364815218306352
CNEMC (2019) China national environmental monitoring centre. http://www.cnemc.cn/. Accessed 08 Aug 2019
Feng X, Li Q, Zhu Y, Hou J, Jin L, Wang J (2015) Artificial neural networks forecasting of pm2.5 pollution using air mass trajectory based geographic model and wavelet transformation. Atmos Environ 107:118–128. Retrieved from http://www.sciencedirect.com/science/article/pii/S1352231015001491
Garcia JM, Teodoro F, Cerdeira R, Coelho LMR, Kumar P, Carvalho MG (2016) Developing a methodology to predict pm10 concentrations in urban areas using generalized linear models. Environ Technol 37 (18):2316–2325. Retrieved from https://doi.org/10.1080/09593330.2016.1149228 (PMID: 26839052)
Genc DD, Yesilyurt C, Tuncel G (2010) Air pollution forecasting in Ankara, Turkey using air pollution index and its relation to assimilative capacity of the atmosphere. Environ Monit Assess 166(1):11–27. Retrieved from https://doi.org/10.1007/s10661-009-0981-y
He H, Li M, Wang W, Wang Z, Xue Y (2018) Prediction of pm2.5 concentration based on the similarity in air quality monitoring network. Build and Environ 137:11–17. Retrieved from http://www.sciencedirect.com/science/article/pii/S0360132318301938
He Z, Chen Y, Shang Z, Li C, Li L, Xu M (2019) A novel wind speed forecasting model based on moving window and multi-objective particle swarm optimization algorithm. Appl Math Model 76:717–740. Retrieved from http://www.sciencedirect.com/science/article/pii/S0307904X19304020
HEI, IHME (2018) State of global air/2018 a special report on global exposure to air pollution and its disease burden (Tech. Rep.)
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780. Retrieved from https://doi.org/10.1162/neco.1997.9.8.1735
Jaeger H (2001, 01) The”echo state” approach to analysing and training recurrent neural networks-with an erratum note’. Bonn, Germany: German National Research Center for Information Technology GMD Technical Report, 148
Jaeger H, Lukoševičius M, Popovici D, Siewert U (2007) Optimization and applications of echo state networks with leaky-integrator neurons. Neural Networks 20(3):335–352. Retrieved from http://www.sciencedirect.com/science/article/pii/S089360800700041X (Echo State Networks and Liquid State Machines)
Jiang F, He J, Tian T (2019) A clustering-based ensemble approach with improved pigeon-inspired optimization and extreme learning machine for air quality prediction. Appl Soft Comput 85:105827. Retrieved from http://www.sciencedirect.com/science/article/pii/S1568494619306088
Kwok L, Lam Y, Tam CY (2017) Developing a statistical based approach for predicting local air quality in complex terrain area. Atmos Pollut Res 8(1):114–126. Retrieved from http://www.sciencedirect.com/science/article/pii/S130910421630174X
Lang S, Bravo-Marquez F, Beckham C, Hall M, Frank E (2019) Wekadeeplearning4j: a deep learning package for weka based on deeplearning4j. Knowledge-Based Systems 178:48–50. Retrieved from http://www.sciencedirect.com/science/article/pii/S0950705119301789
Leksmono N, Longhurst J, Ling K, Chatterton T, Fisher B, Irwin J (2006) Assessment of the relationship between industrial and traffic sources contributing to air quality objective exceedences: a theoretical modelling exercise. Environmental Modelling & Software 21(4):494–500. Retrieved from http://www.sciencedirect.com/science/article/pii/S1364815204003123 (Urban Air Quality Modelling)
Li C, Hsu NC, Tsay SC (2011) A study on the potential applications of satellite data in air quality monitoring and forecasting. Atmos Environ 45(22):3663–3675. Retrieved from http://www.sciencedirect.com/science/article/pii/S1352231011004109
Lin YC, Lee SJ, Ouyang CS, Wu CH (2020) Air quality prediction by neuro-fuzzy modeling approach. Appl Soft Comput 86:105898. Retrieved from http://www.sciencedirect.com/science/article/pii/S1568494619306799
Lukoševičius M, Jaeger H (2009) Reservoir computing approaches to recurrent neural network training. Comput Sci Rev 3(3):127–149. Retrieved from http://www.sciencedirect.com/science/article/pii/S1574013709000173
Ma J, Cheng JC, Lin C, Tan Y, Zhang J (2019) Improving air quality prediction accuracy at larger temporal resolutions using deep learning and transfer learning techniques. Atmos Environ 214:116885. Retrieved from http://www.sciencedirect.com/science/article/pii/S1352231019305151
Ma J, Li Z, Cheng JC, Ding Y, Lin C, Xu Z (2019) Air quality prediction at new stations using spatially transferred bi-directional long short-term memory network. Science of The Total Environment, pp. 135771. Retrieved from http://www.sciencedirect.com/science/article/pii/S0048969719357663
Shen L, Chen W, Wang J, Wei J, Yu Z (2016) Functional echo state network for time series classification. Inf Sci 373:1–20. Retrieved from http://www.sciencedirect.com/science/article/pii/S0020025516306661
Maciag PS, Kasabov N, Kryszkiewicz M, Bembenik R (2019) Air pollution prediction with clustering-based ensemble of evolving spiking neural networks and a case study for London area. Environmental Modelling & Software 118:262–280. Retrieved from http://www.sciencedirect.com/science/article/pii/S1364815218307448 http://www.sciencedirect.com/science/article/pii/S1364815218307448
Mallet V, Sportisse B (2008) Air quality modeling: from deterministic to stochastic approaches. Comput Math Appl 55(10):2329–2337. Retrieved from http://www.sciencedirect.com/science/article/pii/S089812210700733X (Advanced Numerical Algorithms for Large-Scale Computations)
Osowski S, Garanty K (2007) Forecasting of the daily meteorological pollution using wavelets and support vector machine. Eng Appl Artif Intel 20(6):745–755. Retrieved from http://www.sciencedirect.com/science/article/pii/S0952197606001904
Perez P, Reyes J (2006) An integrated neural network model for pm10 forecasting. Atmos Environ 40 (16):2845–2851. Retrieved from http://www.sciencedirect.com/science/article/pii/S1352231006000495
Pineda F (1987) Generalization of back-propagation to recurrent neural networks. Phys Rev Lett 59 (19):2229–2232
Shang Z, Deng T, He J, Duan X (2019) A novel model for hourly pm2.5 concentration prediction based on cart and eelm. Sci Total Environ 651:3043–3052. Retrieved from http://www.sciencedirect.com/science/article/pii/S0048969718340841
Singh KP, Gupta S, Kumar A, Shukla SP (2012) Linear and nonlinear modeling approaches for urban air quality prediction. Sci Total Environ 426:244–255. Retrieved from http://www.sciencedirect.com/science/article/pii/S0048969712004809
Stadlober E, Hörmann S, Pfeiler B (2008) Quality and performance of a pm10 daily forecasting model. Atmos Environ 42(6):1098–1109. Retrieved from http://www.sciencedirect.com/science/article/pii/S1352231007009909
Tengeleng S, Armand N (2014) Performance of using cascade forward back propagation neural networks for estimating rain parameters with rain drop size distribution. Atmosphere 5(2):454–472. Retrieved from https://www.mdpi.com/2073-4433/5/2/454 https://www.mdpi.com/2073-4433/5/2/454
Wang J (2019) China air quality online monitoring and analysis platform. https://www.aqistudy.cn/. Accessed 14 Aug 2019
Wang J, Song G (2018) A deep spatial-temporal ensemble model for air quality prediction. Neurocomputing 314:198–206. Retrieved from http://www.sciencedirect.com/science/article/pii/S0925231218307859
Wang X (2019) Historical data of air quality in China. http://beijingair.sinaapp.com/. Accessed 04 Aug 2019
Warsito B, Santoso R, Suparti S, Yasin H (2018, 05) Cascade forward neural network for time series prediction. J Phys Conf Ser 1025:012097
Wu Q, Lin H (2019) A novel optimal-hybrid model for daily air quality index prediction considering air pollutant factors. Sci Total Environ 683:808–821. Retrieved from http://www.sciencedirect.com/science/article/pii/S0048969719323290
Deng M, Xu F, Wang H (2018) Prediction of hourly pm2.5 using a space-time support vector regression model. Atmos Environ 181:12–19. Retrieved from http://www.sciencedirect.com/science/article/pii/S1352231018301535
Yoon H, Jun SC, Hyun Y, Bae GO, Lee KK (2011) A comparative study of artificial neural networks and support vector machines for predicting groundwater levels in a coastal aquifer. J Hydrol 396(1):128–138. Retrieved from http://www.sciencedirect.com/science/article/pii/S0022169410006761
Zhang Y, Bocquet M, Mallet V, Seigneur C, Baklanov A (2012) Real-time air quality forecasting, part i: history, techniques, and current status. Atmos Environ 60:632–655. Retrieved from http://www.sciencedirect.com/science/article/pii/S1352231012005900
Zhao S, Yuan X, Xiao D, Zhang J, Li Z (2018) Airnet: a machine learning dataset for air quality forecasting. Retrieved from https://openreview.net/forum?id=SkymMAxAb
Funding
This work is supported by the National Natural Science Foundation of China (Project No. 61702240).
Author information
Authors and Affiliations
Corresponding author
Additional information
Responsible Editor: Constantini Samara
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhao, Z., Qin, J., He, Z. et al. Combining forward with recurrent neural networks for hourly air quality prediction in Northwest of China. Environ Sci Pollut Res 27, 28931–28948 (2020). https://doi.org/10.1007/s11356-020-08948-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11356-020-08948-1