skip to main content
research-article

Enabling Cost-Effective Population Health Monitoring By Exploiting Spatiotemporal Correlation: An Empirical Study

Published:04 January 2021Publication History
Skip Abstract Section

Abstract

Because of its important role in health policy-shaping, population health monitoring (PHM) is considered a fundamental block for public health services. However, traditional public health data collection approaches, such as clinic-visit-based data integration or health surveys, could be very costly and time-consuming. To address this challenge, this article proposes a cost-effective approach called Compressive Population Health (CPH), where a subset of a given area is selected in terms of regions within the area for data collection in the traditional way, while leveraging inherent spatial correlations of neighboring regions to perform data inference for the rest of the area. By alternating selected regions longitudinally, this approach can validate and correct previously assessed spatial correlations. To verify whether the idea of CPH is feasible, we conduct an in-depth study based on spatiotemporal morbidity rates of chronic diseases in more than 500 regions around London for over 10 years. We introduce our CPH approach and present three extensive analytical studies. The first confirms that significant spatiotemporal correlations do exist. In the second study, by deploying multiple state-of-the-art data recovery algorithms, we verify that these spatiotemporal correlations can be leveraged to do data inference accurately using only a small number of samples. Finally, we compare different methods for region selection for traditional data collection and show how such methods can further reduce the overall cost while maintaining high PHM quality.

References

  1. Cichocki Andrzej, Zdunek Rafal, Phan Anh Huy, and Amari Shun Ichi. 2009. Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-Way Data Analysis and Blind Source Separation. Wiley Publishing, Chapter 2.Google ScholarGoogle Scholar
  2. Hasthanasombat Apinan and Cecilia Mascolo. 2019. Understanding the effects of the neighbourhood built environment on public health with open data. In Proceedings of the World Wide Web Conference (2019), 648--658.Google ScholarGoogle Scholar
  3. Karen Barnett, Mercer Stewart W., Michael Norbury, Graham Watt, Sally Wyke, and Bruce Guthrie. 2012. Epidemiology of multimorbidity and implications for health care, research, and medical education: A cross-sectional study. Lancet 380, 9836 (2012), 37--43.Google ScholarGoogle Scholar
  4. Mel Bartley. 2004. Health Inequality: An Introduction to Concepts, Theories and Methods. Cambridge: Polity Press.Google ScholarGoogle Scholar
  5. D. L. Blackwell, J. W. Lucas, and T. C. Clarke. 2014. Summary health statistics for U.S. adults: National health interview survey, 2012. Vital Health Stat 251, 223 (2014), 1--161.Google ScholarGoogle Scholar
  6. Guestrin Carlos, Andreas Krause, and Ajit Paul Singh. 2005. Near-optimal sensor placements in Gaussian processes. In Proceedings of the 22nd International Conference on Machine Learning. 265--272.Google ScholarGoogle Scholar
  7. Chao Chen, Yan Ding, Xuefeng Xie, Shu Zhang, Zhu Wang, and Liang Feng. 2020. TrajCompressor: An online map-matching-based trajectory compression framework leveraging vehicle heading direction and change. IEEE Transactions on Intelligent Transportation Systems 21, 5 (2020), 2012--2028.Google ScholarGoogle ScholarCross RefCross Ref
  8. Lusignan S. De, H. Liyanage, Iorio Ct Di, T. Chan, and S. T. Liaw. 2015. Using routinely collected health data for surveillance, quality improvement and research: Framework and key questions to assess ethics, privacy and data access. Journal of Innovation in Health Informatics 22, 4 (2015), 426.Google ScholarGoogle Scholar
  9. Baraniuk Richard G. 2007. Compressive sensing. IEEE Signal Processing Magazine 24, 4 (2007).Google ScholarGoogle Scholar
  10. Baraniuk Richard G., Volkan Cevher, Marco F. Duarte, and Chinmay Hegde. 2010. Model-based compressive sensing. IEEE Transactions on Information Theory 56, 4 (2010), 1982--2001.Google ScholarGoogle Scholar
  11. Quer Giorgio, Riccardo Masiero, Gianluigi Pillonetto, Michele Rossi, and Michele Zorzi. 2012. Sensing, compression, and recovery for WSNs: Sparse signal modeling and monitoring framework. IEEE Transactions on Wireless Communications 11, 10 (2012), 3447--3461.Google ScholarGoogle ScholarCross RefCross Ref
  12. S. Grover and G. S. Aujla. 2014. Prediction model for influenza epidemic based on Twitter data. International Journal of Advanced Research in Computer and Communication Engineering 3, 7 (2014), 7541--7545.Google ScholarGoogle Scholar
  13. Paul Michael J. and Mark Dredze. 2011. You are what you tweet: Analyzing Twitter for public health. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media.Google ScholarGoogle Scholar
  14. Eamonn Keogh and Chotirat Ann Ratanamahatana. 2005. Exact indexing of dynamic time warping. Knowledge 8 Information Systems 7, 3 (2005), 358--386.Google ScholarGoogle Scholar
  15. Tamara G. Kolda and Brett W. Bader. 2009. Tensor decompositions and applications. SIAM Rev. 51, 3 (2009), 455--500.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Linghe Kong, Mingyuan Xia, Xiao-Yang Liu, Guangshuo Chen, Yu Gu, Min-You Wu, and Xue Liu. 2013. Data loss and reconstruction in wireless sensor networks. IEEE Transactions on Parallel and Distributed Systems 25, 11 (2013), 2818--2828.Google ScholarGoogle ScholarCross RefCross Ref
  17. Kruse, Clemens Scott, Anna Stein, Heather Thomas, and Harmander Kaur. 2018. The use of electronic health records to support population health: A systematic review of the literature. Journal of Medical Systems 42, 11 (2018), 214.Google ScholarGoogle Scholar
  18. Andrew B. Lawson, Sidipto Banerjee, Rober P. Haining, and M. D. Ugarte. 2016. Handbook of Spatial Epidemiology. CRC Press.Google ScholarGoogle Scholar
  19. Daniel D. Lee and H. Sebastian Seung. 1999. Learning the parts of objects by nonnegative matrix factorization. Nature 401 (1999), 788--791.Google ScholarGoogle ScholarCross RefCross Ref
  20. Chih-Jen Lin. 2007. Projected gradient methods for non-negative matrix factorization. Neural Computation 19 (2007), 2756--2779.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. K. E. Mason, N. Pearce, and S. Cummins. 2018. Associations between fast food and physical activity environments and adiposity in mid-life: Cross-sectional, observational evidence from UK Biobank. Lancet Public Health 3, 1 (2018), 24--33.Google ScholarGoogle ScholarCross RefCross Ref
  22. Yelena Mejova, Hamed Haddadi, Anastasios Noulas, and Ingmar Weber. 2015. FoodPorn: Obesity patterns in culinary interactions. In Proceedings of the 5th International Conference on Digital Health. 51--58.Google ScholarGoogle Scholar
  23. Lauren Meyers. 2007. Contact network epidemiology: Bond percolation applied to infectious disease prediction and control. Bull. Amer. Math. Soc. 44, 1 (2007), 63--68.Google ScholarGoogle ScholarCross RefCross Ref
  24. Fang P., Dong S., Xiao J., Liu C., Feng X., and Wang Y. 2010. Regional inequality in health and its determinants: Evidence from China. Health Policy 94, 1 (2010), 14--25.Google ScholarGoogle Scholar
  25. Kind P., Dolan P., Gudex C., and Williams. 1998. Variations in population health status: Results from a United Kingdom national questionnaire survey. BMJ 316, 7133 (1998), 736--741.Google ScholarGoogle Scholar
  26. Perlman S. E., McVeigh K. H., Thorpe L. E., Jacobson L., Greene C. M., and Gwynn R. C. 2017. Innovations in population health surveillance: Using electronic health records for chronic disease surveillance. American Journal of Public Health 107, 6 (2017), 853--857.Google ScholarGoogle Scholar
  27. Mary C. Seiler and Fritz A. Seiler. 2010. Numerical recipes in C: The art of scientific computing. Risk Analysis 9, 3 (2010), 415--416.Google ScholarGoogle ScholarCross RefCross Ref
  28. Yiran Shen, Wen Hu, Mingrui Yang, Bo Wei, Simon Lucey, and Chun Tung Chou. 2014. Face recognition on smartphones via optimised sparse representation classification. In Proceedings of the 13th International Symposium on Information Processing in Sensor Networks. 237--248.Google ScholarGoogle Scholar
  29. Xiaoyuan Su and Taghi M. Khoshgoftaar. 2009. A survey of collaborative filtering techniques. Advances in Artificial Intelligence 2009, 4 (2009). https://doi.org/10.115/2009/421425.Google ScholarGoogle Scholar
  30. Hanna Tolonen, Päivikki Koponen, Ala’A Al-Kerwi, Nada Capkova, Simona Giampaoli, Jennifer Mindell, Laura Paalanen, Maria Ruiz-Castell, Antonia Trichopoulou, and Kari Kuulasmaa. 2018. European health examination surveys -- A tool for collecting objective information about the health of the population. Archives of Public Health 76, 1 (2018), 38.Google ScholarGoogle Scholar
  31. C. Trattner, D. Parra, and D. Elsweiler. 2017. Monitoring obesity prevalence in the United States through bookmarking activities in online food portals. PloS One 12, 6 (2017). DOI:https://doi.org/10.1371/journal.pone.0179144Google ScholarGoogle Scholar
  32. Shikha Verma, Younghee Park, and Mihui Kim. 2017. Predicting flu-rate using big data analytics based on social data and weather conditions. Advanced Science Letters 23, 12 (2017), 12775--12779.Google ScholarGoogle Scholar
  33. Verschuuren, Marieke, and eds Hans Van Oers. 2018. Population health monitoring: Climbing the information pyramid. Springer (2018).Google ScholarGoogle Scholar
  34. Leye Wang, Daqing Zhang, Animesh Pathak, Chao Chen, Haoyi Xiong, Dingqi Yang, and Yasha Wang. 2015. CCS-TA: Quality-guaranteed online task allocation in compressive crowdsensing. In Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing. 683--694.Google ScholarGoogle Scholar
  35. Yingzi Wang, Xiao Zhou, Anastasios Noulas, Cecilia Mascolo, Xing Xie, and Enhong Chen. 2018. Predicting the spatio-temporal evolution of chronic diseases in population with human mobility data. IJCAI (2018), 3578--3584.Google ScholarGoogle Scholar
  36. Xiuwen Yi, Yu Zheng, Junbo Zhang, and Tianrui Li. 2016. ST-MVL: Filling missing values in geo-sensory time series data. In Proceedings of the 25th International Joint Conference on Artificial Intelligence (2016), 2704--2710.Google ScholarGoogle Scholar
  37. Yin Zhang, Matthew Roughan, Walter Willinger, and Lili Qiu. 2009. Spatio-temporal compressive sensing and Internet traffic matrices. ACM SIGCOMM Computer Communication Review 39, 4 (2009), 267--278.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Yu Zheng, Tong Liu, Yilun Wang, Yanmin Zhu, Yanchi Liu, and Eric Chang. 2014. Diagnosing New York city’s noises with ubiquitous data. In Proceedings of the 16th ACM International Conference on Ubiquitous Computing.Google ScholarGoogle Scholar
  39. Yanmin Zhu, Zhi Li, Hongzi Zhu, Minglu Li, and Qian Zhang. 2012. A compressive sensing approach to urban traffic estimation with probe vehicles. IEEE Transactions on Mobile Computing 12, 11 (2012), 2289--2302.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Enabling Cost-Effective Population Health Monitoring By Exploiting Spatiotemporal Correlation: An Empirical Study

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in

              Full Access

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader

              HTML Format

              View this article in HTML Format .

              View HTML Format
              About Cookies On This Site

              We use cookies to ensure that we give you the best experience on our website.

              Learn more

              Got it!