Abstract
With the increasing use of Twitter for encouraging users to instigate violent behavior with hate and racial content, it becomes necessary to investigate the uniqueness in the dynamics of the spread of tweets made during violent communal incidents and the challenges they pose in early identification of potential viral content. In this article, we study the spread of the tweets made during several violent communal incidents along four major dimensions — the underlying follower network of the users, their structural and engagement characteristics, the cascades, and the cognitive aspects of the content, each of which plays a vital role in the spread of content. Using large public and collected data, we compare these features with tweets related to other subjects from several major domains, such as non-violent political events, celebrities, and technology, that contribute to a large fraction of the viral content over Twitter. We discover that while the spread of cascades and the users involved may provide strong early evidence of the viral content for several domains, the early phases of the spread of viral tweets related to violent communal incidents are characterized by cascades with protracted growth involving fringe or low-importance users, which would possibly make early prediction difficult. Our findings indicate that an interplay of certain network and cascade properties, together with the cognitive characteristics of tweets and the behavioral patterns of the engaging users, may provide stronger early indicators of the virality of this content.
- [1] . 2020. Big five personality traits: The OCEAN model explained. Positive Psychology. Retrieved from https://positivepsychologyprogram.com/big-five-personality-theory/.Google Scholar
- [2] . 2015. Human behaviour in different social medias: A case study of Twitter and Disqus. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015, Paris, France, August 25-28, 2015. ACM, 270–273. Google Scholar
Digital Library
- [3] . 2017. Deep learning for hate speech detection in tweets. In Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia, April 3-7, 2017, Rick Barrett, Rick Cummings, Eugene Agichtein, and Evgeniy Gabrilovich (Eds.). ACM, 759–760. Google Scholar
Digital Library
- [4] . 2011. Everyone’s an influencer: Quantifying influence on Twitter. In Proceedings of the Forth International Conference on Web Search and Web Data Mining, WSDM 2011, Hong Kong, China, February 9-12, 2011, Irwin King, Wolfgang Nejdl, and Hang Li (Eds.). ACM, 65–74. Google Scholar
Digital Library
- [5] . 2017. Multilingual emotion classification using supervised learning: Comparative experiments. Inf. Process. Manag. 53, 3 (2017), 684–704. Google Scholar
Cross Ref
- [6] . 2020. Angry tweets: A corpus-assisted study of anger in populist political discourse. Journal of Language Aggression and Conflict 8, 1 (2020), 118–145.Google Scholar
Cross Ref
- [7] . 2016. A comparison of retweet prediction approaches: The superiority of Random Forest learning method. Telkonika (Telecommun Comput Electron Control) 14, 3 (2016), 1052–1058.Google Scholar
Cross Ref
- [8] Abhijnan Chakraborty, Johnnatan Messias, Fabrício Benevenuto, Saptarshi Ghosh, Niloy Ganguly, and Krishna P. Gummadi. 2017. Who Makes Trends? Understanding Demographic Biases in Crowdsourced Recommendations. In Proceedings of the Eleventh International Conference on Web and Social Media, ICWSM 2017, Montréal, Québec, Canada, May 15-18, 2017. AAAI Press, 22–31. https://aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/view/15680Google Scholar
- [9] . 2014. Can cascades be predicted? In 23rd International World Wide Web Conference (WWW’14), Seoul, Republic of Korea, April 7-11, 2014, Chin-Wan Chung, Andrei Z. Broder, Kyuseok Shim, and Torsten Suel (Eds.). ACM, 925–936. Google Scholar
Digital Library
- [10] . 1995. Lag order and critical values of the augmented Dickey–Fuller test. Journal of Business & Economic Statistics 13, 3 (1995), 277–280.Google Scholar
- [11] . 2017. Automated hate speech detection and the problem of offensive language. In Proceedings of the Eleventh International Conference on Web and Social Media, ICWSM 2017, Montréal, Québec, Canada, May 15-18, 2017. AAAI Press, 512–515. https://aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/view/15665Google Scholar
- [12] . 1950. Testing for serial correlation in least squares regression: I. Biometrika 37, 3/4 (1950), 409–428.Google Scholar
Cross Ref
- [13] . 2015. STREAMCUBE: Hierarchical spatio-temporal hashtag clustering for event exploration over the Twitter stream. In 31st IEEE International Conference on Data Engineering, ICDE 2015, Seoul, South Korea, April 13-17, 2015, Johannes Gehrke, Wolfgang Lehner, Kyuseok Shim, Sang Kyun Cha, and Guy M. Lohman (Eds.). IEEE Computer Society, 1561–1572. Google Scholar
Cross Ref
- [14] . 2010. Outtweeting the twitterers-predicting information cascades in microblogs. In 3rd Workshop on Online Social Networks, WOSN 2010, Boston, MA, USA, June 22, 2010, Bruce M. Maggs and Andrew Tomkins (Eds.). USENIX Association.Google Scholar
- [15] . 2018. Quantifying controversy on social media. ACM Trans. Soc. Comput. 1, 1 (2018), 3:1–3:27. Google Scholar
Digital Library
- [16] . 1982. Measurement of linear dependence and feedback between multiple time series. Journal of the American Statistical Association 77, 378 (1982), 304–313.Google Scholar
Cross Ref
- [17] . 2016. The structural virality of online diffusion. Manag. Sci. 62, 1 (2016), 180–196. Google Scholar
Cross Ref
- [18] . 2011. Predicting personality from Twitter. In PASSAT/SocialCom 2011, Privacy, Security, Risk and Trust (PASSAT), 2011 IEEE Third International Conference on and 2011 IEEE Third International Conference on Social Computing (SocialCom), Boston, MA, USA, 9-11 Oct., 2011. IEEE Computer Society, 149–156. Google Scholar
Cross Ref
- [19] . 2018. Learning sequential features for cascade outbreak prediction. Knowl. Inf. Syst. 57, 3 (2018), 721–739. Google Scholar
Digital Library
- [20] . 1969. Investigating causal relations by econometric models and cross-spectral methods. Econometrica: Journal of the Econometric Society (1969), 424–438.Google Scholar
Cross Ref
- [21] . 2020. The psychology of online activism and social movements: Relations between online and offline collective action. Current Opinion in Psychology 35 (2020), 49–54.Google Scholar
Cross Ref
- [22] Florian Greinert. 2019. A methodology and initial exploration of personality traits of GitHub developers as expressed in GitHub issues. B.S. thesis.Google Scholar
- [23] Clayton J. Hutto and Eric Gilbert. 2014. VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text. In Proceedings of the Eighth International Conference on Weblogs and Social Media, ICWSM 2014, Ann Arbor, Michigan, USA, June 1-4, 2014, Eytan Adar, Paul Resnick, Munmun De Choudhury, Bernie Hogan, and Alice Oh (Eds.). The AAAI Press. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8109Google Scholar
- [24] Maximilian Jenders, Gjergji Kasneci, and Felix Naumann. 2013. Analyzing and predicting viral tweets. In 22nd International World Wide Web Conference (WWW’13), Rio de Janeiro, Brazil, May 13-17, 2013, Companion Volume, Leslie Carr, Alberto H. F. Laender, Bernadette Farias Lóscio, Irwin King, Marcus Fontoura, Denny Vrandecic, Lora Aroyo, José Palazzo M. de Oliveira, Fernanda Lima, and Erik Wilde (Eds.). International World Wide Web Conferences Steering Committee/ACM, 657–664. Google Scholar
Digital Library
- [25] Bin Jiang. 2013. Head/tail breaks: A new classification scheme for data with a heavy-tailed distribution. The Professional Geographer 65, 3 (2013), 482–494.Google Scholar
- [26] Aditya Joshi, Ross Sparks, James McHugh, Sarvnaz Karimi, Cecile Paris, and C. Raina MacIntyre. 2020. Harnessing tweets for early detection of an acute disease event. Epidemiology (Cambridge, Mass.) 31, 1 (2020), 90.Google Scholar
- [27] . 2019. Social media-predicted personality traits and values can help match people to their ideal jobs. Proceedings of the National Academy of Sciences 116, 52 (2019), 26459–26464.Google Scholar
Cross Ref
- [28] Younghoon Kim and Jiwon Seo. 2020. Detection of Rapidly Spreading Hashtags via Social Networks. IEEE Access 8 (2020), 39847–39860. Google Scholar
Cross Ref
- [29] Jon M. Kleinberg. 1999. Authoritative Sources in a Hyperlinked Environment. J. ACM 46, 5 (1999), 604–632. Google Scholar
Digital Library
- [30] . 2019. Beyond the Coverage of Information Spreading: Analytical and Empirical Evidence of Re-exposure in Large-scale Online Social Networks.
arxiv:physics.soc-ph/1907.12389 .Google Scholar - [31] Zongyang Ma, Aixin Sun, and Gao Cong. 2013. On predicting the popularity of newly emerging hashtags in Twitter. J. Assoc. Inf. Sci. Technol. 64, 7 (2013), 1399–1410. Google Scholar
Cross Ref
- [32] Binny Mathew, Ritam Dutt, Pawan Goyal, and Animesh Mukherjee. 2019. Spread of Hate Speech in Online Social Media. In Proceedings of the 11th ACM Conference on Web Science, WebSci 2019, Boston, MA, USA, June 30 - July 03, 2019, Paolo Boldi, Brooke Foucault Welles, Katharina Kinder-Kurlanda, Christo Wilson, Isabella Peters, and Wagner Meira Jr. (Eds.). ACM, 173–182. Google Scholar
Digital Library
- [33] Binny Mathew, Punyajoy Saha, Hardik Tharad, Subham Rajgaria, Prajwal Singhania, Suman Kalyan Maity, Pawan Goyal, and Animesh Mukherjee. 2019. Thou Shalt Not Hate: Countering Online Hate Speech. In Proceedings of the Thirteenth International Conference on Web and Social Media, ICWSM 2019, Munich, Germany, June 11-14, 2019, Jürgen Pfeffer, Ceren Budak, Yu-Ru Lin, and Fred Morstatter (Eds.). AAAI Press, 369–380. https://ojs.aaai.org/index.php/ICWSM/article/view/3237Google Scholar
- [34] Rishabh Narang, Simran Sarin, Prajjwal Singh, and Rinkaj Goyal. 2018. Impact of Reciprocity in Information Spreading Using Epidemic Model Variants. Inf. 9, 6 (2018), 136. Google Scholar
Cross Ref
- [35] Mariam Nouh, Jason R. C. Nurse, and Michael Goldsmith. 2019. Understanding the Radical Mind: Identifying Signals to Detect Extremist Content on Twitter. In 2019 IEEE International Conference on Intelligence and Security Informatics, ISI 2019, Shenzhen, China, July 1-3, 2019. IEEE, 98–103. Google Scholar
Digital Library
- [36] Alexandra Olteanu, Carlos Castillo, Jeremy Boy, and Kush R. Varshney. 2018. The Effect of Extremist Violence on Hateful Speech Online. In Proceedings of the Twelfth International Conference on Web and Social Media, ICWSM 2018, Stanford, California, USA, June 25-28, 2018. AAAI Press, 221–230. https://aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17908.Google Scholar
- [37] Toni Pano and Rasha Kashef. 2020. A Complete VADER-Based Sentiment Analysis of Bitcoin (BTC) Tweets during the Era of COVID-19. Big Data Cogn. Comput. 4, 4 (2020), 33. Google Scholar
Cross Ref
- [38] . 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12 (2011), 2825–2830.Google Scholar
Digital Library
- [39] . 2015. The Development and Psychometric Properties of LIWC2015.
Technical Report .Google Scholar - [40] . 2014. The impact of network characteristics on the diffusion of innovations. Physica A: Statistical Mechanics and Its Applications 402 (2014), 330–343.Google Scholar
Cross Ref
- [41] René Pfitzner, Antonios Garas, and Frank Schweitzer. 2012. Emotional Divergence Influences Information Spreading in Twitter. In Proceedings of the Sixth International Conference on Weblogs and Social Media, Dublin, Ireland, June 4-7, 2012, John G. Breslin, Nicole B. Ellison, James G. Shanahan, and Zeynep Tufekci (Eds.). The AAAI Press. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM12/paper/view/4596.Google Scholar
- [42] . 2019. Survey on prediction of smartphone virality using Twitter analytics. In 2019 IEEE International Conference on System, Computation, Automation and Networking (ICSCAN’19). IEEE, 1–5.Google Scholar
Cross Ref
- [43] Shalini Priya, Saharsh Singh, Sourav Kumar Dandapat, Kripabandhu Ghosh, and Joydeep Chandra. 2019. Identifying infrastructure damage during earthquake using deep active learning. In International Conference on Advances in Social Networks Analysis and Mining (ASONAM’19), Vancouver, British Columbia, Canada, 27-30 August, 2019, Francesca Spezzano, Wei Chen, and Xiaokui Xiao (Eds.). ACM, 551–552. Google Scholar
Digital Library
- [44] Manoel Horta Ribeiro, Pedro H. Calais, Yuri A. Santos, Virgílio A. F. Almeida, and Wagner Meira Jr. 2018. Characterizing and Detecting Hateful Users on Twitter. In Proceedings of the Twelfth International Conference on Web and Social Media (ICWSM’18), Stanford, California, USA, June 25-28, 2018. AAAI Press, 676–679. https://aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17837.Google Scholar
- [45] Daniel M. Romero, Brendan Meeder, and Jon M. Kleinberg. 2011. Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter. In Proceedings of the 20th International Conference on World Wide Web (WWW’11), Hyderabad, India, March 28 - April 1, 2011, Sadagopan Srinivasan, Krithi Ramamritham, Arun Kumar, M. P. Ravindra, Elisa Bertino, and Ravi Kumar (Eds.). ACM, 695–704. Google Scholar
Digital Library
- [46] Saswata Roy, Brijendra Kumar Suman, Joydeep Chandra, and Sourav Kumar Dandapat. 2020. Forecasting the Future: Leveraging RNN based Feature Concatenation for Tweet Outbreak Prediction. In 7th ACM IKDD CoDS and 25th COMAD (CoDS-COMAD’20), Hyderabad India, January 5-7, 2020, Rishiraj Saha Roy (Ed.). ACM, 219–223. Google Scholar
Digital Library
- [47] Koustav Rudra, Ashish Sharma, Niloy Ganguly, and Saptarshi Ghosh. 2018. Characterizing and Countering Communal Microblogs During Disaster Events. IEEE Trans. Comput. Soc. Syst. 5, 2 (2018), 403–417. Google Scholar
Cross Ref
- [48] . 2020. A hybridized feature extraction approach to suicidal ideation detection from social media post. In 2020 IEEE Region 10 Symposium (TENSYMP’20). IEEE, 985–988.Google Scholar
- [49] Judy Hanwen Shen and Frank Rudzicz. 2017. Detecting Anxiety through Reddit. In Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology - From Linguistic Signal to Clinical Reality, [email protected] 2017, Vancouver, Canada, August 3, 2017, Kristy Hollingshead, Molly Ireland, and Kate Loveys (Eds.). Association for Computational Linguistics, 58–65. Google Scholar
Cross Ref
- [50] Benjamin Shulman, Amit Sharma, and Dan Cosley. 2016. Predictability of Popularity: Gaps between Prediction and Understanding. In Proceedings of the Tenth International Conference on Web and Social Media, Cologne, Germany, May 17-20, 2016. AAAI Press, 348–357. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM16/paper/view/13129Google Scholar
- [51] . 2020. Demonstrating in the Internet age: A test of Castells’ theory. Social Movement Studies 19, 2 (2020), 201–221.Google Scholar
Cross Ref
- [52] . 2014. A Ferguson Twitter archive [Blog post]. Retrieved from Recuperado de https://inkdroid. org/2014/08/30/a-ferguson-twitter-archive.Google Scholar
- [53] . 2020. The language of well-being: Tracking fluctuations in emotion experience through everyday speech. Journal of Personality and Social Psychology 118, 2 (2020), 364.Google Scholar
Cross Ref
- [54] Sho Tsugawa and Hiroyuki Ohsaki. 2015. Negative Messages Spread Rapidly and Widely on Social Media. In Proceedings of the 2015 ACM on Conference on Online Social Networks (COSN’15), Palo Alto, California, USA, November 2-3, 2015, Aneesh Sharma, Rakesh Agrawal, and Matthias Grossglauser (Eds.). ACM, 151–160. Google Scholar
Digital Library
- [55] . 1973. Generalization of the Durbin-Watson statistic for higher order autoregressive processes. Communications in Statistics-Theory and Methods 2, 2 (1973), 115–144.Google Scholar
- [56] . 2012. Model selection and psychological theory: A discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). Psychological Methods 17, 2 (2012), 228.Google Scholar
Cross Ref
- [57] . 1943. Tests of statistical hypotheses concerning several parameters when the number of observations is large. Transactions of the American Mathematical Society 54, 3 (1943), 426–482.Google Scholar
Cross Ref
- [58] Lei Wang, Jianwei Niu, and Shui Yu. 2020. SentiDiff: Combining Textual Information and Sentiment Diffusion Patterns for Twitter Sentiment Analysis. IEEE Trans. Knowl. Data Eng. 32, 10 (2020), 2026–2039. Google Scholar
Cross Ref
- [59] Helena Webb, Pete Burnap, Rob Procter, Omer F. Rana, Bernd Carsten Stahl, Matthew L. Williams, William Housley, Adam Edwards, and Marina Jirotka. 2016. Digital Wildfires: Propagation, Verification, Regulation, and Responsible Innovation. ACM Trans. Inf. Syst. 34, 3 (2016), 15:1–15:23. Google Scholar
Digital Library
- [60] . 2013. Virality prediction and community structure in social networks. Scientific Reports 3, 1 (2013), 1–6.Google Scholar
Cross Ref
- [61] Lilian Weng, Filippo Menczer, and Yong-Yeol Ahn. 2014. Predicting successful memes using network and community structure. In Proceedings of the Eighth International Conference on Weblogs and Social Media (ICWSM’14), Ann Arbor, Michigan, USA, June 1-4, 2014. The AAAI Press. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8081.Google Scholar
- [62] Yiming Yang and Jan O. Pedersen. 1997. A Comparative Study on Feature Selection in Text Categorization. In Proceedings of the Fourteenth International Conference on Machine Learning (ICML’97), Nashville, Tennessee, USA, July 8-12, 1997, Douglas H. Fisher (Ed.). Morgan Kaufmann, 412–420.Google Scholar
- [63] . 2017. Users’ participation and social influence during information spreading on Twitter. PloS One 12, 9 (2017).Google Scholar
Cross Ref
- [64] Qingyuan Zhao, Murat A. Erdogdu, Hera Y. He, Anand Rajaraman, and Jure Leskovec. 2015. SEISMIC: A Self Exciting Point Process Model for Predicting Tweet Popularity. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, NSW, Australia, August 10-13, 2015, Longbing Cao, Chengqi Zhang, Thorsten Joachims, Geoffrey I. Webb, Dragos D. Margineantu, and Graham Williams (Eds.). ACM, 1513–1522. Google Scholar
Digital Library
- [65] Fan Zhou, Xovee Xu, Goce Trajcevski, and Kunpeng Zhang. 2021. A Survey of Information Cascade Analysis: Models, Predictions, and Recent Advances. ACM Comput. Surv. 54, 2 (2021), 27:1–27:36. Google Scholar
Digital Library
- [66] . 2020. Political effects of the Internet and social media. Annual Review of Economics 12, 1 (2020), 415–438.Google Scholar
Cross Ref
- [67] . 2006. Vector autoregressive models for multivariate time series. Modeling Financial Time Series with S-PLUS® (2006), 385–429.Google Scholar
Index Terms
Spotting Flares: The Vital Signs of the Viral Spread of Tweets Made During Communal Incidents
Recommendations
Analyzing and predicting viral tweets
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide WebTwitter and other microblogging services have become indispensable sources of information in today's web. Understanding the main factors that make certain pieces of information spread quickly in these platforms can be decisive for the analysis of ...
Measuring and Detecting Virality on Social Media: The Case of Twitter’s Viral Tweets Topic
WWW '23 Companion: Companion Proceedings of the ACM Web Conference 2023Social media posts may go viral and reach large numbers of people within a short period of time. Such posts may threaten the public dialogue if they contain misleading content, making their early detection highly crucial. Previous works proposed their ...
Make Tweets Great Again: Who Are Opinion Leaders, and What Did They Tweet About Donald Trump?
Twitter gained new levels of political prominence with Donald J. Trump’s use of the platform. Although previous work has been done studying the content of Trump’s tweets, there remains a dearth of research exploring who opinion leaders were in the early ...






Comments