ABSTRACT
This paper focuses on the problem of identifying influential users of micro-blogging services. Twitter, one of the most notable micro-blogging services, employs a social-networking model called "following", in which each user can choose whom she wants to "follow" and receive tweets from, without first requiring the latter's permission. In a dataset prepared for this study, it is observed that (1) 72.4% of the users in Twitter follow more than 80% of their followers, and (2) 80.5% of the users have 80% of the users they are following follow them back. Our study reveals that the presence of this "reciprocity" can be explained by the phenomenon of homophily. Based on this finding, TwitterRank, an extension of the PageRank algorithm, is proposed to measure the influence of users in Twitter. TwitterRank measures influence by taking both the topical similarity between users and the link structure into account. Experimental results show that TwitterRank outperforms the algorithm Twitter currently uses and other related algorithms, including the original PageRank and Topic-sensitive PageRank.
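The two reciprocity statistics quoted in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function name `reciprocity_stats`, the `(follower, followee)` edge format, and the choice to compute each share only over users who have at least one follower (or follow at least one account) are all assumptions made for illustration.

```python
from collections import defaultdict


def reciprocity_stats(follow_edges, threshold=0.8):
    """Return two shares for a list of (follower, followee) pairs.

    Share 1: users who follow back MORE than `threshold` of their followers.
    Share 2: users for whom AT LEAST `threshold` of the accounts they follow
             follow them back.
    Assumption: each share is taken over users with a non-empty
    follower set (share 1) or a non-empty following set (share 2).
    """
    following = defaultdict(set)  # user -> accounts the user follows
    followers = defaultdict(set)  # user -> accounts that follow the user
    for src, dst in follow_edges:
        following[src].add(dst)
        followers[dst].add(src)

    # Snapshot the user set before any further lookups.
    users = set(following) | set(followers)

    followed_back = 0      # numerator of share 1
    have_followers = 0     # denominator of share 1
    reciprocated = 0       # numerator of share 2
    have_followees = 0     # denominator of share 2

    for user in users:
        fans = followers.get(user, set())
        friends = following.get(user, set())
        mutual = len(fans & friends)  # links that exist in both directions

        if fans:
            have_followers += 1
            if mutual / len(fans) > threshold:
                followed_back += 1
        if friends:
            have_followees += 1
            if mutual / len(friends) >= threshold:
                reciprocated += 1

    share1 = followed_back / have_followers if have_followers else 0.0
    share2 = reciprocated / have_followees if have_followees else 0.0
    return share1, share2


# Toy graph (hypothetical data, not from the paper's dataset).
toy_edges = [("a", "b"), ("b", "a"), ("c", "a"), ("a", "c"), ("d", "a")]
share1, share2 = reciprocity_stats(toy_edges)
```

In the toy graph, user `a` follows back only two of three followers, so `a` does not count toward the first share, while `d` follows `a` without being followed back and so does not count toward the second.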
Index Terms
TwitterRank: finding topic-sensitive influential twitterers