skip to main content
research-article
Open Access

Scalable Belief Updating for Urban Air Quality Modeling and Prediction

Published:03 January 2021Publication History
Skip Abstract Section

Abstract

Air pollution is one of the major concerns in global urbanization. Data science can help to understand the dynamics of air pollution and build reliable statistical models to forecast air pollution levels. To achieve these goals, one needs to learn the statistical models which can capture the dynamics from the historical data and predict air pollution in the future. Furthermore, the large size and heterogeneity of today’s big urban data pose significant challenges on the scalability and flexibility of the statistical models. In this work, we present a scalable belief updating framework that is able to produce reliable predictions, using over millions of historical hourly air pollutant and meteorology records. We also present a non-parametric approach to learn the statistical model which reveals interesting periodical dynamics and correlations of the dataset. Based on the scalable belief update framework and the non-parametric model learning approach, we propose an iterative update algorithm to accelerate Gaussian process, which is notorious for its prohibitive computation with large input data. Finally, we demonstrate how to integrate information from heterogeneous data by regarding the beliefs produced by other models as the informative prior. Numerical examples and experimental results are presented to validate the proposed method.

References

  1. U. S. Environmental Protection Agency. 1999. Nitrogen Oxides (NOx), Why and How They Are Controlled.Google ScholarGoogle Scholar
  2. V. Athira, P. Geetha, R. Vinayakumar, and K. P. Soman. 2018. Deepairnet: Applying recurrent networks for air quality prediction. Procedia Computer Science 132 (2018), 1394--1403.Google ScholarGoogle Scholar
  3. Matthias Bauer, Mark van der Wilk, and Carl Edward Rasmussen. 2016. Understanding probabilistic sparse Gaussian process approximations. In Advances in Neural Information Processing Systems. 1533--1541.Google ScholarGoogle Scholar
  4. Christopher M. Bishop. 2006. Pattern Recognition and Machine Learning. Springer.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Pier Giovanni Bissiri, Chris C. Holmes, and Stephen G. Walker. 2016. A general framework for updating belief distributions. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 78, 5 (2016), 1103--1130.Google ScholarGoogle ScholarCross RefCross Ref
  6. George E. P. Box. 1979. Robustness in the strategy of scientific model building. In Robustness in Statistics. Elsevier, 201--236.Google ScholarGoogle Scholar
  7. George E. P. Box and David R. Cox. 1964. An analysis of transformations. Journal of the Royal Statistical Society: Series B (Methodological) 26, 2 (1964), 211--243.Google ScholarGoogle ScholarCross RefCross Ref
  8. Tien-Cuong Bui, Van-Duc Le, and Sang-Kyun Cha. 2018. A deep learning approach for forecasting air pollution in south korea using LSTM. Arxiv Preprint Arxiv:1804.07891 (2018).Google ScholarGoogle Scholar
  9. Jerome T. Connor, R. Douglas Martin, and Les E. Atlas. 1994. Recurrent neural networks and robust time series prediction. IEEE Transactions on Neural Networks 5, 2 (1994), 240--254.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Paulo S. R. Diniz, Eduardo A. B, Da Silva, and Sergio L. Netto. 2010. Digital Signal Processing: System Analysis and Design. Cambridge University Press.Google ScholarGoogle Scholar
  11. Yanjie Duan, Yisheng Lv, and Fei-Yue Wang. 2016. Travel time prediction with LSTM neural network. In 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC). IEEE, 1053--1058.Google ScholarGoogle Scholar
  12. Dongbing Gu and Huosheng Hu. 2012. Spatial gaussian process regression with mobile sensor networks. IEEE Transactions on Neural Networks and Learning Systems 23, 8 (2012), 1279--1290.Google ScholarGoogle ScholarCross RefCross Ref
  13. Min Han, Jianhui Xi, Shiguo Xu, and Fu-Liang Yin. 2004. Prediction of chaotic time series based on the recurrent predictor neural network. IEEE Transactions on Signal Processing 52, 12 (2004), 3409--3416.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ching-Hui Huang, Heng-Cheng Lin, Chen-Dao Tsai, Hung-Kai Huang, Ie-Bin Lian, and Chia-Chu Chang. 2017. The interaction effects of meteorological factors and air pollution on the development of acute coronary syndrome. Scientific Reports 7 (2017), 44004.Google ScholarGoogle Scholar
  15. Rishee K. Jain, Jose M. F. Moura, and Constantine E. Kontokosta. 2014. Big data+ big cities: Graph signals of urban air pollution [exploratory sp]. IEEE Signal Processing Magazine 31, 5 (2014), 130--136.Google ScholarGoogle ScholarCross RefCross Ref
  16. Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. Arxiv Preprint Arxiv:1412.6980 (2014).Google ScholarGoogle Scholar
  17. Andreas Krause, Ajit Singh, and Carlos Guestrin. 2008. Near-optimal sensor placements in gaussian processes: Theory, efficient algorithms and empirical studies. Journal of Machine Learning Research 9, Feb. (2008), 235--284.Google ScholarGoogle Scholar
  18. Yuxuan Liang, Songyu Ke, Junbo Zhang, Xiuwen Yi, and Yu Zheng. 2018. Geoman: Multi-level attention networks for geo-sensory time series prediction. In Proceedings of IJCAI. 3428--3434.Google ScholarGoogle Scholar
  19. Haitao Liu, Yew-Soon Ong, Xiaobo Shen, and Jianfei Cai. 2018. When Gaussian process meets big data: A review of scalable GPs. Arxiv Preprint Arxiv:1807.01065 (2018).Google ScholarGoogle Scholar
  20. M. Lototzis, G. K. Papadopoulos, F. Droulia, A. Tseliou, and I. X. Tsiros. 2018. A note on the correlation between circular and linear variables with an application to wind direction and air temperature data in a mediterranean climate. Meteorology and Atmospheric Physics 130, 2 (2018), 259--264.Google ScholarGoogle Scholar
  21. World Health Organization. 2015. Economic cost of the health impact of air pollution in Europe: Clean air, health and wealth.Google ScholarGoogle Scholar
  22. C. Arden Pope III and Douglas W. Dockery. 1992. Acute health effects of PM10 pollution on symptomatic and asymptomatic children. American Review of Respiratory Disease 145, 5 (1992), 1123--1128.Google ScholarGoogle Scholar
  23. Zhongang Qi, Tianchun Wang, Guojie Song, Weisong Hu, Xi Li, and Zhongfei Zhang. 2018. Deep air learning: Interpolation, prediction, and feature analysis of fine-grained air quality. IEEE Transactions on Knowledge and Data Engineering 30, 12 (2018), 2285--2297.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Joaquin Quiñonero-Candela and Carl Edward Rasmussen. 2005. A unifying view of sparse approximate Gaussian process regression. Journal of Machine Learning Research 6, Dec (2005), 1939--1959.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Carl Edward Rasmussen and Christopher Williams. 2006. Gaussian Processes for Machine Learning. The MIT Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Matthias Seeger, Christopher Williams, and Neil Lawrence. 2003. Fast Forward Selection to Speed up Sparse Gaussian Process Regression. Technical Report.Google ScholarGoogle Scholar
  27. Khaled Bashir Shaban, Abdullah Kadri, and Eman Rezk. 2016. Urban air pollution monitoring system with forecasting models. IEEE Sensors Journal 16, 8 (2016), 2598--2606.Google ScholarGoogle ScholarCross RefCross Ref
  28. Robert H. Shumway and David Stoffer. 2017. Time Series Analysis and Its Applications: with R Examples. Springer.Google ScholarGoogle Scholar
  29. Edward Snelson and Zoubin Ghahramani. 2006. Sparse Gaussian processes using pseudo-inputs. In Advances in Neural Information Processing Systems. 1257--1264.Google ScholarGoogle Scholar
  30. In-Kwon Yeo and Richard A. Johnson. 2000. A new family of power transformations to improve normality or symmetry. Biometrika 87, 4 (2000), 954--959.Google ScholarGoogle ScholarCross RefCross Ref
  31. Xiuwen Yi, Junbo Zhang, Zhaoyuan Wang, Tianrui Li, and Yu Zheng. 2018. Deep distributed fusion network for air quality prediction. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery 8 Data Mining. 965--973.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Yu Zheng, Licia Capra, Ouri Wolfson, and Hai Yang. 2014. Urban computing: Concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology (TIST) 5, 3 (2014), 38.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Yu Zheng, Furui Liu, and Hsun-Ping Hsieh. 2013. U-air: When urban air quality inference meets big data. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1436--1444.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Yu Zheng, Xiuwen Yi, Ming Li, Ruiyuan Li, Zhangqing Shan, Eric Chang, and Tianrui Li. 2015. Forecasting fine-grained air quality based on big data. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2267--2276.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Scalable Belief Updating for Urban Air Quality Modeling and Prediction

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Article Metrics

          • Downloads (Last 12 months)58
          • Downloads (Last 6 weeks)5

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format
        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!