Abstract
Smart communities are composed of groups, organizations, and individuals who share information and make use of that shared information for better decision making. Shared information can come from many sources, particularly, but not exclusively, from sensors and social media. Social media has become an important source of near-instantaneous user-generated information that can be shared and analyzed to support better decision making. One domain where social media data can add value is transportation and traffic management. This article looks at the exploitation of Twitter data in the traffic reporting domain. A key challenge is how to identify relevant information from a huge amount of user-generated data and then analyze the relevant data for automatic geocoded incident detection. The article proposes an instant traffic alert and warning system based on a novel latent Dirichlet allocation (LDA) approach (“tweet-LDA”). The system is evaluated and shown to perform better than related approaches.
- Simon Beddus and Mike Fisher. 2011. Enabling smart communities. Journal of the Institute of Telecommunications Professionals 5, 4, 34--38.Google Scholar
- Illinois Department of Transportation. 2017. Illinois Traffic Alert System. Retrieved September 23, 2017, from http://www.iltrafficalert.com/.Google Scholar
- UK Traffic News. 2017. Ross Targett publishing. Retrieved July 8, 2017, from https://uk-traffic-news-twitraffic.soft112.com/.Google Scholar
- Yiming Gu, Zhen (Sean) Qian, and Feng Chen. 2016. From Twitter to detector: Real-time traffic incident detection using social media data. Transportation Research Part C 67, 321--342.Google Scholar
Cross Ref
- Eleonora D'Andrea, Pietro Ducange, Beatrice Lazzerini, and Francesco Marcelloni. 2015. Real-time detection of traffic from Twitter stream analysis. IEEE Transactions on Intelligent Transportation Systems 16, 4, 2269--2283. Google Scholar
Digital Library
- Jie Yin, Andrew Lampert, Bella Robinson, and Robert Power. 2012. Using social media to enhance emergency situation awareness. IEEE Intelligent Systems 27, 6, 52--59. Google Scholar
Digital Library
- Takumi Ichimura and Shin Kamada. 2012. A generation method of filtering rules of Twitter via smartphone based Participatory Sensing system for tourist by interactive GHSOM and C4.5. In Proceedings of the 2012 IEEE International Conference on Systems, Man, and Cybernetics. Google Scholar
Cross Ref
- Takeshi Sakaki, Yutaka Matsuo, Tadashi Yanagihara, Naiwala P. Chandrasiri, and Kazunari Nawa. 2012. Real-time event extraction for driving information from social sensors. In Proceedings of the 2012 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems. Google Scholar
Cross Ref
- M. A. H. Khan, D. Bollegala, G. Liu, and K. Sezaki. 2013. Multi-tweet summarization of real-time events. In Proceedings of the International Conference on Social Computing (SocialCom’13). 128--133. Google Scholar
Digital Library
- Takeshi Sakaki, Makoto Okazaki, and Yutaka Matsuo. 2013. Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Transactions on Knowledge and Data Engineering 25, 4, 919--931. Google Scholar
Digital Library
- Kirill Kireyev, Leysia Palen, and Kenneth M. Anderson. 2009. Applications of topics models to analysis of disaster-related Twitter data. In Proceedings of the NIPS Workshop on Applications for Topic Models: Text and Beyond.Google Scholar
- Michael J. Paul and Mark Dredze. 2011. You are what you tweet: Analyzing Twitter for public health. Proceedings of the 5th International AAAI Conference on Weblogs and Social Media.Google Scholar
- Kyle W. Prier, Matthew S. Smith, Christophe Giraud-Carrier, and Carl L. Hanson. 2011. Identifying health-related topics on Twitter. Lecture Notes in Computer Science, Vol. 6589. Springer, 18--25. Google Scholar
Cross Ref
- Jiang Bian, Umit Topaloglu, and Fan Yu. 2012. Towards large-scale Twitter mining for drug-related adverse events. In Proceedings of the 2012 International Workshop on Smart Health and Wellbeing (SHB’12). ACM, New York, NY, 25--32. Google Scholar
Digital Library
- Brendan O'Connor, Michel Krieger, and David Ahn. 2010. Tweetmotif: Exploratory search and topic summarization for Twitter. In Proceedings of the 4th International AAAI Conference on Weblogs and Social Media.Google Scholar
- Guang Xiang, Bin Fan, Ling Wang, Jason I. Hong, and Carolyn P. Rose. 2012. Detecting offensive tweets via topical feature discovery over a large scale Twitter corpus. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). 1980--1984. Google Scholar
Digital Library
- Aron Culotta. 2010. Towards detecting influenza epidemics by analyzing Twitter messages. In Proceedings of the 1st Workshop on Social Media Analytics (SOMA’10). 115--122. Google Scholar
Digital Library
- Bharath Sriram, David Fuhry, Engin Demir, Hakan Ferhatosmanoglu, and Murat Demirbas. 2010. Short text classification in Twitter to improve information filtering. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’10). 841--842. Google Scholar
Digital Library
- Alan Ritter, Sam Clark, Mausam, and Oren Etzioni. 2011. Named entity recognition in tweets: An experimental study. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. 1524--1534.Google Scholar
Digital Library
- Max Kaufmann. 2010. Syntactic normalization of Twitter messages. In Proceedings of the International Conference on Natural Language Processing.Google Scholar
- D. M. Blei, A. Y. Ng, and M. I. Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research 3, 993--1022.Google Scholar
Digital Library
- X. Zhao Wayne, Jiang Jing, Wng Jianshu, He Jing, Lim Ee-Peng, Yan Hongfei, and Li Xiaoming, 2011. Comparing Twitter and traditional media using topic models. In Proceedings of the 33rd European Conference on Advances in Information Retrieval (ECIR’11). 338--349.Google Scholar
- M. Steyvers, P. Smyth, M. Rosen-Zvi, and T. Griffiths. 2004. Probabilistic author-topic models for information discovery. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’04). 206--315. Google Scholar
Digital Library
- Liangjie Hong and Brian D. Davison. 2010. Empirical study of topic modeling in Twitter. In Proceedings of the 1st Workshop on Social Media Analytics (SOMA’10). Google Scholar
Digital Library
- Jeff Huang, Katherine M. Thornton, Efthimis N. Efthimiadis. 2010. Conversational tagging in Twitter. In Proceedings of the 21st ACM Conference on Hypertext and Hypermedia (HT’10). 173--178. Google Scholar
Digital Library
- D. Blei and J. D. McAuliffe. 2007. Supervised topic models. In Proceedings of Advances in Neural Information Processing Systems (NIPS’07). 121--128.Google Scholar
- D. Wang, M. Thint, and A. Al-Rubaie. 2012. Semi-supervised latent Dirichlet allocation and its application for document classification. In Proceedings of the IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (WI-IAT’12). 306--310. Google Scholar
Digital Library
- Matthew D. Hoffman, David M. Blei, and Francis R. Bach. 2010. Online learning for latent Dirichlet allocation. In Proceedings of Advances in Neural Information Processing Systems (NIPS’10). 856--864.Google Scholar
- Kevin R. Canini, Lei Shi, and Thomas L. Griffiths. 2009. Online inference of topics with latent Dirichlet allocation. In Proceedings of the 12th International Conference on Artificial Intelligence and Statistics (AISTATS’09).Google Scholar
- Issei Sato, Kenichi Kurihara, and Hiroshi Nakagawa. 2010. Deterministic single-pass algorithm for LDA. In Proceedings of the 23rd International Conference on Neural Information Processing Systems (NIPS’10). 2074--2082.Google Scholar
- Zekai J. Gao, Yangqiu Song, Shixia Liu, Haixun Wang, Hao Weil, and Weiwei Cui. 2011. Tracking and connecting topics via incremental hierarchical Dirichlet processes. In Proceedings of the 2011 11th IEEE International Conference on Data Mining. Google Scholar
Digital Library
- B. Han and T. Baldwin. Lexical normalisation of short text messages: Makn sens a #twitter. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics. 368--378.Google Scholar
- Github. 2017. First20hours/google-10000-english. Retrieved September 23, 2017, from https://github.com/first20hours/google-10000-english/blob/master/google-10000-english.txt.Google Scholar
- Jalal Mahmud, Jeffrey Nichols, and Clemens Drews. 2014. Home location identification of Twitter users. ACM Transactions on Intelligent Systems and Technology 5, 3, Article No. 47. Google Scholar
Digital Library
- Clodoveu A. Davis Jr, Gisele L. Pappa, Diogo Rennó Rocha de Oliveira, and Filipe de L. Arcanjo. 2011. Inferring the location of Twitter messages based on user relationships. Transactions in GIS 15, 6, 735--751.Google Scholar
Cross Ref
- Ahmed Talal, Khaled Al Kaabi, Di Wang, Ahmad Al-Rubaie, Ahmed Al Dhanhani, John Davies, and Sandra Stincic Clarke. 2016. Event identification and assertion from social media using auto-extendable knowledge base. In Proceedings of the 2016 IEEE World Congress on Computational Intelligence (IEEE-WCCI’16).Google Scholar
- Wikipedia. 2017. Dartford Crossing. Retrieved September 23, 2017, from https://en.wikipedia.org/wiki/Dartford_Crossing.Google Scholar
Index Terms
Real-Time Traffic Event Detection From Social Media
Recommendations
Who are the spoilers in social media marketing? Incremental learning of latent semantics for social spam detection
With the rise of social web, there has also been a great concern about the quality of user-generated content on social media sites (SMSs). Deceptive comments harm users' trust in online social media and cause financial loss to firms. Previous studies ...






Comments