Abstract
Because of its important role in health policy-shaping, population health monitoring (PHM) is considered a fundamental block for public health services. However, traditional public health data collection approaches, such as clinic-visit-based data integration or health surveys, could be very costly and time-consuming. To address this challenge, this article proposes a cost-effective approach called Compressive Population Health (CPH), where a subset of a given area is selected in terms of regions within the area for data collection in the traditional way, while leveraging inherent spatial correlations of neighboring regions to perform data inference for the rest of the area. By alternating selected regions longitudinally, this approach can validate and correct previously assessed spatial correlations. To verify whether the idea of CPH is feasible, we conduct an in-depth study based on spatiotemporal morbidity rates of chronic diseases in more than 500 regions around London for over 10 years. We introduce our CPH approach and present three extensive analytical studies. The first confirms that significant spatiotemporal correlations do exist. In the second study, by deploying multiple state-of-the-art data recovery algorithms, we verify that these spatiotemporal correlations can be leveraged to do data inference accurately using only a small number of samples. Finally, we compare different methods for region selection for traditional data collection and show how such methods can further reduce the overall cost while maintaining high PHM quality.
- Cichocki Andrzej, Zdunek Rafal, Phan Anh Huy, and Amari Shun Ichi. 2009. Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-Way Data Analysis and Blind Source Separation. Wiley Publishing, Chapter 2.Google Scholar
- Hasthanasombat Apinan and Cecilia Mascolo. 2019. Understanding the effects of the neighbourhood built environment on public health with open data. In Proceedings of the World Wide Web Conference (2019), 648--658.Google Scholar
- Karen Barnett, Mercer Stewart W., Michael Norbury, Graham Watt, Sally Wyke, and Bruce Guthrie. 2012. Epidemiology of multimorbidity and implications for health care, research, and medical education: A cross-sectional study. Lancet 380, 9836 (2012), 37--43.Google Scholar
- Mel Bartley. 2004. Health Inequality: An Introduction to Concepts, Theories and Methods. Cambridge: Polity Press.Google Scholar
- D. L. Blackwell, J. W. Lucas, and T. C. Clarke. 2014. Summary health statistics for U.S. adults: National health interview survey, 2012. Vital Health Stat 251, 223 (2014), 1--161.Google Scholar
- Guestrin Carlos, Andreas Krause, and Ajit Paul Singh. 2005. Near-optimal sensor placements in Gaussian processes. In Proceedings of the 22nd International Conference on Machine Learning. 265--272.Google Scholar
- Chao Chen, Yan Ding, Xuefeng Xie, Shu Zhang, Zhu Wang, and Liang Feng. 2020. TrajCompressor: An online map-matching-based trajectory compression framework leveraging vehicle heading direction and change. IEEE Transactions on Intelligent Transportation Systems 21, 5 (2020), 2012--2028.Google Scholar
Cross Ref
- Lusignan S. De, H. Liyanage, Iorio Ct Di, T. Chan, and S. T. Liaw. 2015. Using routinely collected health data for surveillance, quality improvement and research: Framework and key questions to assess ethics, privacy and data access. Journal of Innovation in Health Informatics 22, 4 (2015), 426.Google Scholar
- Baraniuk Richard G. 2007. Compressive sensing. IEEE Signal Processing Magazine 24, 4 (2007).Google Scholar
- Baraniuk Richard G., Volkan Cevher, Marco F. Duarte, and Chinmay Hegde. 2010. Model-based compressive sensing. IEEE Transactions on Information Theory 56, 4 (2010), 1982--2001.Google Scholar
- Quer Giorgio, Riccardo Masiero, Gianluigi Pillonetto, Michele Rossi, and Michele Zorzi. 2012. Sensing, compression, and recovery for WSNs: Sparse signal modeling and monitoring framework. IEEE Transactions on Wireless Communications 11, 10 (2012), 3447--3461.Google Scholar
Cross Ref
- S. Grover and G. S. Aujla. 2014. Prediction model for influenza epidemic based on Twitter data. International Journal of Advanced Research in Computer and Communication Engineering 3, 7 (2014), 7541--7545.Google Scholar
- Paul Michael J. and Mark Dredze. 2011. You are what you tweet: Analyzing Twitter for public health. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media.Google Scholar
- Eamonn Keogh and Chotirat Ann Ratanamahatana. 2005. Exact indexing of dynamic time warping. Knowledge 8 Information Systems 7, 3 (2005), 358--386.Google Scholar
- Tamara G. Kolda and Brett W. Bader. 2009. Tensor decompositions and applications. SIAM Rev. 51, 3 (2009), 455--500.Google Scholar
Digital Library
- Linghe Kong, Mingyuan Xia, Xiao-Yang Liu, Guangshuo Chen, Yu Gu, Min-You Wu, and Xue Liu. 2013. Data loss and reconstruction in wireless sensor networks. IEEE Transactions on Parallel and Distributed Systems 25, 11 (2013), 2818--2828.Google Scholar
Cross Ref
- Kruse, Clemens Scott, Anna Stein, Heather Thomas, and Harmander Kaur. 2018. The use of electronic health records to support population health: A systematic review of the literature. Journal of Medical Systems 42, 11 (2018), 214.Google Scholar
- Andrew B. Lawson, Sidipto Banerjee, Rober P. Haining, and M. D. Ugarte. 2016. Handbook of Spatial Epidemiology. CRC Press.Google Scholar
- Daniel D. Lee and H. Sebastian Seung. 1999. Learning the parts of objects by nonnegative matrix factorization. Nature 401 (1999), 788--791.Google Scholar
Cross Ref
- Chih-Jen Lin. 2007. Projected gradient methods for non-negative matrix factorization. Neural Computation 19 (2007), 2756--2779.Google Scholar
Digital Library
- K. E. Mason, N. Pearce, and S. Cummins. 2018. Associations between fast food and physical activity environments and adiposity in mid-life: Cross-sectional, observational evidence from UK Biobank. Lancet Public Health 3, 1 (2018), 24--33.Google Scholar
Cross Ref
- Yelena Mejova, Hamed Haddadi, Anastasios Noulas, and Ingmar Weber. 2015. FoodPorn: Obesity patterns in culinary interactions. In Proceedings of the 5th International Conference on Digital Health. 51--58.Google Scholar
- Lauren Meyers. 2007. Contact network epidemiology: Bond percolation applied to infectious disease prediction and control. Bull. Amer. Math. Soc. 44, 1 (2007), 63--68.Google Scholar
Cross Ref
- Fang P., Dong S., Xiao J., Liu C., Feng X., and Wang Y. 2010. Regional inequality in health and its determinants: Evidence from China. Health Policy 94, 1 (2010), 14--25.Google Scholar
- Kind P., Dolan P., Gudex C., and Williams. 1998. Variations in population health status: Results from a United Kingdom national questionnaire survey. BMJ 316, 7133 (1998), 736--741.Google Scholar
- Perlman S. E., McVeigh K. H., Thorpe L. E., Jacobson L., Greene C. M., and Gwynn R. C. 2017. Innovations in population health surveillance: Using electronic health records for chronic disease surveillance. American Journal of Public Health 107, 6 (2017), 853--857.Google Scholar
- Mary C. Seiler and Fritz A. Seiler. 2010. Numerical recipes in C: The art of scientific computing. Risk Analysis 9, 3 (2010), 415--416.Google Scholar
Cross Ref
- Yiran Shen, Wen Hu, Mingrui Yang, Bo Wei, Simon Lucey, and Chun Tung Chou. 2014. Face recognition on smartphones via optimised sparse representation classification. In Proceedings of the 13th International Symposium on Information Processing in Sensor Networks. 237--248.Google Scholar
- Xiaoyuan Su and Taghi M. Khoshgoftaar. 2009. A survey of collaborative filtering techniques. Advances in Artificial Intelligence 2009, 4 (2009). https://doi.org/10.115/2009/421425.Google Scholar
- Hanna Tolonen, Päivikki Koponen, Ala’A Al-Kerwi, Nada Capkova, Simona Giampaoli, Jennifer Mindell, Laura Paalanen, Maria Ruiz-Castell, Antonia Trichopoulou, and Kari Kuulasmaa. 2018. European health examination surveys -- A tool for collecting objective information about the health of the population. Archives of Public Health 76, 1 (2018), 38.Google Scholar
- C. Trattner, D. Parra, and D. Elsweiler. 2017. Monitoring obesity prevalence in the United States through bookmarking activities in online food portals. PloS One 12, 6 (2017). DOI:https://doi.org/10.1371/journal.pone.0179144Google Scholar
- Shikha Verma, Younghee Park, and Mihui Kim. 2017. Predicting flu-rate using big data analytics based on social data and weather conditions. Advanced Science Letters 23, 12 (2017), 12775--12779.Google Scholar
- Verschuuren, Marieke, and eds Hans Van Oers. 2018. Population health monitoring: Climbing the information pyramid. Springer (2018).Google Scholar
- Leye Wang, Daqing Zhang, Animesh Pathak, Chao Chen, Haoyi Xiong, Dingqi Yang, and Yasha Wang. 2015. CCS-TA: Quality-guaranteed online task allocation in compressive crowdsensing. In Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing. 683--694.Google Scholar
- Yingzi Wang, Xiao Zhou, Anastasios Noulas, Cecilia Mascolo, Xing Xie, and Enhong Chen. 2018. Predicting the spatio-temporal evolution of chronic diseases in population with human mobility data. IJCAI (2018), 3578--3584.Google Scholar
- Xiuwen Yi, Yu Zheng, Junbo Zhang, and Tianrui Li. 2016. ST-MVL: Filling missing values in geo-sensory time series data. In Proceedings of the 25th International Joint Conference on Artificial Intelligence (2016), 2704--2710.Google Scholar
- Yin Zhang, Matthew Roughan, Walter Willinger, and Lili Qiu. 2009. Spatio-temporal compressive sensing and Internet traffic matrices. ACM SIGCOMM Computer Communication Review 39, 4 (2009), 267--278.Google Scholar
Digital Library
- Yu Zheng, Tong Liu, Yilun Wang, Yanmin Zhu, Yanchi Liu, and Eric Chang. 2014. Diagnosing New York city’s noises with ubiquitous data. In Proceedings of the 16th ACM International Conference on Ubiquitous Computing.Google Scholar
- Yanmin Zhu, Zhi Li, Hongzi Zhu, Minglu Li, and Qian Zhang. 2012. A compressive sensing approach to urban traffic estimation with probe vehicles. IEEE Transactions on Mobile Computing 12, 11 (2012), 2289--2302.Google Scholar
Digital Library
Index Terms
Enabling Cost-Effective Population Health Monitoring By Exploiting Spatiotemporal Correlation: An Empirical Study
Recommendations
The use of Electronic Health Records to Support Population Health: A Systematic Review of the Literature
Electronic health records (EHRs) have emerged among health information technology as "meaningful use" to improve the quality and efficiency of healthcare, and health disparities in population health. In other instances, they have also shown lack of ...
Research challenges in measuring data for population health to enable predictive modeling for improving healthcare
At the core of the healthcare crisis is a fundamental lack of actionable data, needed to stratify individuals within populations, to predict which persons have which outcomes. A new health system with better health management will require better health ...
Nonlinear Relationship Between Health Factors and Health Outcomes
CSS 2017: Proceedings of the 2017 International Conference of The Computational Social Science Society of the AmericasThe relationship between Health Factors and Health Outcomes is a topic of great practical importance in the understanding of the genesis of and solution to the problem of health disparities. We have investigated the data compiled by the Population ...






Comments