Abstract
With advances in computing technology and its widening range of applications, large sets of multivariate time series are now collected at multiple geographical locations, which raises the problem of identifying interesting spatio-temporal patterns in them. Because of the dependent nature of the data, we consider a new spatial structure of the data in the pattern discovery process. This article presents an information-theoretic approach for detecting temporal patterns in multivariate time series from multiple locations. Based on the occurrences of the discovered temporal patterns, we propose a method that identifies interesting spatio-temporal patterns by means of a statistical significance test. The identified spatio-temporal patterns can then be used for clustering and classification. To evaluate performance, we test the approach on a simulated dataset, validating the quality of the identified patterns and comparing it with other approaches. The results indicate that the approach effectively identifies patterns that characterize the dataset and that support good clustering quality in further analysis. Experiments on real-world datasets and case studies further illustrate the applicability and practicality of the proposed approach.
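The abstract does not say which significance test is applied to the pattern occurrences, so the following is only an illustrative sketch under stated assumptions. It assumes the occurrences have been tallied into a table of counts (temporal pattern by location) and uses the adjusted residual of a contingency table, a common choice for such counts, to flag pattern-location pairs that occur more often than independence would predict. The function names, the toy counts, and the 1.96 threshold are hypothetical and not taken from the article.

```python
import math


def adjusted_residuals(counts):
    """Adjusted residuals for a table where counts[i][j] is the number of
    occurrences of temporal pattern i at location j (assumed layout)."""
    row_totals = [sum(row) for row in counts]
    col_totals = [sum(col) for col in zip(*counts)]
    total = sum(row_totals)
    residuals = []
    for i, row in enumerate(counts):
        out = []
        for j, observed in enumerate(row):
            # Expected count if pattern and location were independent.
            expected = row_totals[i] * col_totals[j] / total
            # Variance term of the adjusted residual.
            variance = (expected
                        * (1 - row_totals[i] / total)
                        * (1 - col_totals[j] / total))
            if variance > 0:
                out.append((observed - expected) / math.sqrt(variance))
            else:
                out.append(0.0)  # degenerate row or column: no evidence
        residuals.append(out)
    return residuals


def significant_cells(counts, threshold=1.96):
    """Return (pattern, location) index pairs whose adjusted residual
    exceeds the threshold, i.e. positive associations only."""
    residuals = adjusted_residuals(counts)
    return [(i, j)
            for i, row in enumerate(residuals)
            for j, value in enumerate(row)
            if value > threshold]


# Toy counts (hypothetical): two patterns observed at two locations.
toy_counts = [[30, 10],
              [10, 30]]
flagged = significant_cells(toy_counts)
```

In this toy table each pattern is concentrated at one location, so the sketch flags the two diagonal cells. The 1.96 threshold corresponds to a two-sided 5% level under a normal approximation; a real analysis would also need to account for multiple comparisons.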
Index Terms
Discovery of Spatio-Temporal Patterns in Multivariate Spatial Time Series