Abstract
Users’ polarization and confirmation bias play a key role in misinformation spreading on online social media. Our aim is to use this information to determine in advance potential targets for hoaxes and fake news. In this article, we introduce a framework for promptly identifying polarizing content on social media and, thus, “predicting” future fake news topics. We validate the performances of the proposed methodology on a massive Italian Facebook dataset, showing that we are able to identify topics that are susceptible to misinformation with 77% accuracy. Moreover, such information may be embedded as a new feature in an additional classifier able to recognize fake news with 91% accuracy. The novelty of our approach consists in taking into account a series of characteristics related to users’ behavior on online social media such as Facebook, making a first, important step towards the mitigation of misinformation phenomena by supporting the identification of potential misinformation targets and thus the design of tailored counter-narratives.
- Statista. 2018. Number of monthly active Facebook users worldwide as of 3rd quarter 2017 (in millions). Retrieved from https://www.statista.com/statistics/264810/number-of-monthly-active-facebook-users-worldwide/.Google Scholar
- Oxford Dictionaries. 2017. Oxford dictionaries word of the year 2016 is...post-truth. Retrieved from https://www.oxforddictionaries.com/press/news/2016/12/11/WOTY-16.Google Scholar
- Robert Allen. 2017. What happens online in 60 seconds? Retrieved from https://www.smartinsights.com/internet-marketing-statistics/happens-online-60-seconds/.Google Scholar
- Nic Newman, Richard Fletcher, Antonis Kalogeropoulos, David AL Levy, and Rasmus Kleis Nielsen. 2017. Reuters digital news report.Google Scholar
- Michela Del Vicario, Alessandro Bessi, Fabiana Zollo, Fabio Petroni, Antonio Scala, Guido Caldarelli, H. Eugene Stanley, and Walter Quattrociocchi. 2016. The spreading of misinformation online. Proc. Natl. Acad. Sci. 113, 3 (2016), 554--559.Google Scholar
Cross Ref
- Ana LucÃa Schmidt, Fabiana Zollo, Michela Del Vicario, Alessandro Bessi, Antonio Scala, Guido Caldarelli, H. Eugene Stanley, and Walter Quattrociocchi. 2017. Anatomy of news consumption on Facebook. Proc. Natl. Acad. Sci. 114, 12 (2017).Google Scholar
- Michela Del Vicario, Fabiana Zollo, Guido Caldarelli, Antonio Scala, and Walter Quattrociocchi. 2017. Mapping social dynamics on Facebook: The Brexit debate. Soc. Netw. 50, Supplement C (2017), 6--16.Google Scholar
Cross Ref
- Ana Lucηa Schmidt, Fabiana Zollo, Antonio Scala, Cornelia Betsch, and Walter Quattrociocchi. 2018. Polarization of the vaccination debate on Facebook. Vaccine 36, 25 (2018), 3,606--3,612.Google Scholar
- Fabiana Zollo, Alessandro Bessi, Michela Del Vicario, Antonio Scala, Guido Caldarelli, Louis Shekhtman, Shlomo Havlin, and Walter Quattrociocchi. 2017. Debunking in a world of tribes. PLOS ONE 12, 7 (07 2017), 1--27.Google Scholar
- W. Lee Howell. 2013. Digital Wildfires in a Hyperconnected World. Technical Report Global Risks. World Economic Forum.Google Scholar
- Fabiana Zollo and Walter Quattrociocchi. 2018 (in press). Social dynamics in the age of credulity: The misinformation risk and its fallout. In Digital Dominance. The Power of Google, Amazon, Facebook, and Apple, Martin Moore and Damian Tambini (Eds.). Oxford University Press, Oxford.Google Scholar
- Fabiana Zollo and Walter Quattrociocchi. 2018. Misinformation spreading on Facebook. In Complex Spreading Phenomena in Social Systems, Sune Lehmann and Yong-Yeol Ahn (Eds.). Springer Nature.Google Scholar
- Sotirios Antoniadis, Iouliana Litou, and Vana Kalogeraki. 2015. A model for identifying misinformation in online social networks. In On the Move to Meaningful Internet Systems: OTM 2015 Conferences, Christophe Debruyne, Hervé Panetto, Robert Meersman, Tharam Dillon, Georg Weichhart, Yuan An, and Claudio Agostino Ardagna (Eds.). Springer International Publishing, Cham, 473--482. Google Scholar
Digital Library
- Meet Rajdev and Kyumin Lee. 2015. Fake and spam messages: Detecting misinformation during natural disasters on social media. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT’15), Vol. 1. IEEE, 17--20. Google Scholar
Digital Library
- Christina Boididou, Symeon Papadopoulos, Lazaros Apostolidis, and Yiannis Kompatsiaris. 2017. Learning to detect misleading content on Twitter. In Proceedings of the 2017 ACM International Conference on Multimedia Retrieval. ACM, 278--286. Google Scholar
Digital Library
- Christina Boididou, Stuart E Middleton, Zhiwei Jin, Symeon Papadopoulos, Duc-Tien Dang-Nguyen, Giulia Boato, and Yiannis Kompatsiaris. 2017. Verifying information with multimedia content on Twitter. Multimed. Tools Appl. 77, 12 (2017), 1--27. Google Scholar
Digital Library
- Ana-Maria Popescu and Marco Pennacchiotti. 2010. Detecting controversial events from Twitter. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management. ACM, 1873--1876. Google Scholar
Digital Library
- Aditi Gupta, Hemank Lamba, Ponnurangam Kumaraguru, and Anupam Joshi. 2013. Faking Sandy: Characterizing and identifying fake images on Twitter during Hurricane Sandy. In Proceedings of the 22nd International Conference on World Wide Web. ACM, 729--736. Google Scholar
Digital Library
- Carlos Castillo, Marcelo Mendoza, and Barbara Poblete. 2011. Information credibility on Twitter. In Proceedings of the 20th International Conference on World Wide Web. ACM, 675--684. Google Scholar
Digital Library
- Cody Buntain and Jennifer Golbeck. 2017. Automatically identifying fake news in popular Twitter threads. In Proceedings of the IEEE International Conference on Smart Cloud (SmartCloud). IEEE, 208--215.Google Scholar
Cross Ref
- Srijan Kumar, Robert West, and Jure Leskovec. 2016. Disinformation on the web: Impact, characteristics, and detection of Wikipedia hoaxes. In Proceedings of the 25th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 591--602. Google Scholar
Digital Library
- Stefan Siersdorfer, Sergiu Chelaru, Jose San Pedro, Ismail Sengor Altingovde, and Wolfgang Nejdl. 2014. Analyzing and mining comments and comment ratings on the social web. ACM Trans. Web 8, 3 (2014), 17. Google Scholar
Digital Library
- Sadia Afroz, Michael Brennan, and Rachel Greenstadt. 2012. Detecting hoaxes, frauds, and deception in writing style online. In Proceedings of the IEEE Symposium on Security and Privacy (SP’12). IEEE, 461--475. Google Scholar
Digital Library
- Yelena Mejova, Amy X. Zhang, Nicholas Diakopoulos, and Carlos Castillo. 2014. Controversy and sentiment in online news. arXiv preprint arXiv:1409.8152 (2014).Google Scholar
- Lada A. Adamic and Natalie Glance. 2005. The political blogosphere and the 2004 US election: Divided they blog. In Proceedings of the 3rd International Workshop on Link Discovery. ACM, 36--43. Google Scholar
Digital Library
- Kiran Garimella, Gianmarco De Francisci Morales, Aristides Gionis, and Michael Mathioudakis. 2016. Quantifying controversy in social media. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining. ACM, 33--42. Google Scholar
Digital Library
- Andrew Guess, Brendan Nyhan, and Jason Reifler. 2018. Selective exposure to misinformation: Evidence from the consumption of fake news during the 2016 US presidential campaign. European Research Council, 9.Google Scholar
- Johan Ugander, Lars Backstrom, Cameron Marlow, and Jon Kleinberg. 2012. Structural diversity in social contagion. Proc. Natl. Acad. Sci. 109, 16 (2012), 5,962--5,966.Google Scholar
Cross Ref
- Pedro Henrique Calais Guerra, Wagner Meira Jr, Claire Cardie, and Robert Kleinberg. 2013. A measure of polarization on social media networks based on community boundaries. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM’13).Google Scholar
- Ana Lucía Schmidt, Fabiana Zollo, Antonio Scala, and Walter Quattrociocchi. 2018. Polarization rank: A study on European news consumption on Facebook. arXiv preprint arXiv:1805.08030 (2018).Google Scholar
- Mauro Conti, Daniele Lain, Riccardo Lazzeretti, Giulio Lovisotto, and Walter Quattrociocchi. 2017. It’s always April fools’ day! On the difficulty of social network misinformation classification via propagation features. arXiv preprint arXiv:1701.04221 (2017).Google Scholar
- Kai Shu, Amy Sliva, Suhang Wang, Jiliang Tang, and Huan Liu. 2017. Fake news detection on social media: A data mining perspective. ACM SIGKDD Explor. Newslett. 19, 1 (2017), 22--36. Google Scholar
Digital Library
- Savvas Zannettou, Michael Sirivianos, Jeremy Blackburn, and Nicolas Kourtellis. 2018. The web of false information: Rumors, fake news, hoaxes, clickbait, and various other shenanigans. arXiv preprint arXiv:1804.03461 (2018).Google Scholar
- Srijan Kumar and Neil Shah. 2018. False information on web and social media: A survey. arXiv preprint arXiv:1804.08559 (2018).Google Scholar
- Srijan Kumar, Meng Jiang, Taeho Jung, Roger Jie Luo, and Jure Leskovec. 2018. MIS2: Misinformation and misbehavior mining on the web. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining. ACM, 799--800. Google Scholar
Digital Library
- Jooyeon Kim, Behzad Tabibian, Alice Oh, Bernhard Schölkopf, and Manuel Gomez-Rodriguez. 2018. Leveraging the crowd to detect and reduce the spread of fake news and misinformation. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining. ACM, 324--332. Google Scholar
Digital Library
- Nikhil Karamchandani and Massimo Franceschetti. 2013. Rumor source detection under probabilistic sampling. In Proceedings of the IEEE International Symposium on Information Theory (ISIT’13). IEEE, 2,184--2,188.Google Scholar
- Sejeong Kwon, Meeyoung Cha, and Kyomin Jung. 2017. Rumor detection over varying time windows. PloS One 12, 1 (2017), e0168344.Google Scholar
Cross Ref
- Zhaoxu Wang, Wenxiang Dong, Wenyi Zhang, and Chee Wei Tan. 2014. Rumor source detection with multiple observations: Fundamental limits and algorithms. In ACM SIGMETRICS Perf. Eval. Rev., Vol. 42. ACM, 1--13. Google Scholar
Digital Library
- Sam Spencer and R. Srikant. 2016. Maximum likelihood rumor source detection in a star network. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP’16). IEEE, 2,199--2,203.Google Scholar
- Yang Liu and Songhua Xu. 2016. Detecting rumors through modeling information propagation networks in a social media environment. IEEE Trans. Comput. Soc. Systems 3, 2 (2016), 46--62.Google Scholar
Cross Ref
- Zhe Zhao, Paul Resnick, and Qiaozhu Mei. 2015. Enquiring minds: Early detection of rumors in social media from enquiry posts. In Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 1,395--1,405. Google Scholar
Digital Library
- Srijan Kumar, Francesca Spezzano, and V. S. Subrahmanian. 2015. Vews: A Wikipedia vandal early warning system. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’15). ACM, 607--616. Google Scholar
Digital Library
- Ming Yang, Melody Kiang, and Wei Shang. 2015. Filtering big data from social media—Building an early warning system for adverse drug reactions. J. Biomed. Inform. 54 (2015), 230--240. Google Scholar
Digital Library
- Michela Del Vicario, Sabrina Gaito, Walter Quattrociocchi, Matteo Zignani, and Fabiana Zollo. 2017. News consumption during the Italian Referendum: A cross-platform analysis on Facebook and Twitter. In Proceedings of the 4th IEEE International Conference on Data Science and Advanced Analytics. IEEE.Google Scholar
Cross Ref
- F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12 (2011), 2,825--2,830. Google Scholar
Digital Library
- Issa Alsmadi and Gan Keng Hoon. 2018. Term weighting scheme for short-text classification: Twitter corpuses. Neural Comput. Appl. (2018), 1--13.Google Scholar
- Selma Ayşe Özel, Esra Saraç, Seyran Akdemir, and Hülya Aksu. 2017. Detection of cyberbullying on social media messages in Turkish. In Proceedings of the International Conference on Computer Science and Engineering (UBMK’17). IEEE, 366--370.Google Scholar
Cross Ref
- Apalak Khatua and Aparup Khatua. 2017. Cricket World Cup 2015: Predicting users’ orientation through mix tweets on Twitter platform. In Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. ACM, 948--951. Google Scholar
Digital Library
- Despoina Antonakaki, Iasonas Polakis, Elias Athanasopoulos, Sotiris Ioannidis, and Paraskevi Fragopoulou. 2016. Exploiting abused trending topics to identify spam campaigns in Twitter. Soc. Netw. Anal. Mining 6, 1 (2016), 48.Google Scholar
Cross Ref
- Jeff Hemsley, Sikana Tanupabrungsun, and Bryan Semaan. 2017. Call to retweet: Negotiated diffusion of strategic political messages. In Proceedings of the 8th International Conference on Social Media 8 Society. ACM, 9. Google Scholar
Digital Library
- Ward van Zoonen and Toni G. L. A. van der Meer. 2016. Social media research: The application of supervised machine learning in organizational communication research. Comput. Hum. Behav. 63 (2016), 132--141. Google Scholar
Digital Library
- Soroush Vosoughi and Deb Roy. 2016. Tweet acts: A speech act classifier for Twitter. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM’16). 711--715.Google Scholar
- Che-Chia Chang, Shu-I Chiu, and Kuo-Wei Hsu. 2017. Predicting political affiliation of posts on Facebook. In Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication. ACM, 57. Google Scholar
Digital Library
- Ryan M. Rifkin and Ross A. Lippert. 2007. Notes on regularized least squares. MIT CSAIL Technical Reports. http://hdl.handle.net/1721.1/37318.Google Scholar
- Mark Schmidt, Nicolas Le Roux, and Francis Bach. 2017. Minimizing finite sums with the stochastic average gradient. Math. Program. 162, 1--2 (2017), 83--112. Google Scholar
Digital Library
- Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Mach. Learn. 20, 3 (1995), 273--297. Google Scholar
Digital Library
- Jon Louis Bentley. 1975. Multidimensional binary search trees used for associative searching. Commun. ACM 18, 9 (1975), 509--517. Google Scholar
Digital Library
- David E. Rumelhart, Geoffrey E. Hinton, and Ronald J. Williams. 1988. Learning representations by back-propagating errors. Cog. Model. 5, 3 (1988), 1.Google Scholar
- Leo Breiman. 2017. Classification and Regression Trees. Routledge.Google Scholar
- Marina Sokolova and Guy Lapalme. 2009. A systematic analysis of performance measures for classification tasks. Inform. Proc. Manag. 45, 4 (2009), 427--437. Google Scholar
Digital Library
- Charles E. Metz. 1978. Basic principles of ROC analysis. In Sem. Nucl. Med., Vol. 8. Elsevier, 283--298.Google Scholar
- ADS. 2016. Elenchi Testate. Retrieved from http://www.adsnotizie.it/_testate.asp.Google Scholar
- Bufale.net. 2016. The Black List: La Lista Nera Del Web. (2016). Retrieved from http://www.adsnotizie.it/_testate.asp.Google Scholar
- BUTAC. 2016. The Black List. (2016). Retrieved from http://www.butac.it/the-black-list/.Google Scholar
- Facebook. 2013. Using the Graph API. (2013). Retrieved from https://developers.facebook.com/docs/graph-api/using-graph-api/.Google Scholar
- SpazioDati. 2017. Dandelion API. Retrieved from https://dandelion.eu/docs/.Google Scholar
- Raquel Fonseca Canales and Edgar Casasola Murillo. 2017. Evaluation of entity recognition algorithms in short texts. CLEI Electron. J. 20, 1 (2017).Google Scholar
- Xu-Ying Liu, Jianxin Wu, and Zhi-Hua Zhou. 2009. Exploratory undersampling for class-imbalance learning. IEEE Trans. Systems, Man, Cyber., Part B (Cyber.) 39, 2 (2009), 539--550. Google Scholar
Digital Library
- Chris Drummond and Robert C. Holte. 2003. C4.5, class imbalance, and cost sensitivity: Why under-sampling beats over-sampling. In Workshop on Learning from Imbalanced Datasets II, Vol. 11. Citeseer, 1--8.Google Scholar
- N. V. Chawla, N. Japkowicz, and A. Kotcz. 2004. Editorial: Special issue on learning from imbalanced data sets. SIGKDD Explor. Newslett. 6: 1--6. Google Scholar
Digital Library
- Kai Shu, Suhang Wang, and Huan Liu. 2017. Exploiting tri-relationship for fake news detection. arXiv preprint arXiv:1712.07709 (2017).Google Scholar
- Katya Demidova. 2016. Getting real about fake news. GitHub. Retrieved from https://github.com/demidovakatya/competitions/blob/master/fake-news/README.md.Google Scholar
Index Terms
Polarization and Fake News: Early Warning of Potential Misinformation Targets
Recommendations
Fake News Research: Theories, Detection Strategies, and Open Problems
KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data MiningFake news has become a global phenomenon due its explosive growth, particularly on social media. The goal of this tutorial is to (1) clearly introduce the concept and characteristics of fake news and how it can be formally differentiated from other ...
Fake News on Facebook and Twitter: Investigating How People (Don't) Investigate
CHI '20: Proceedings of the 2020 CHI Conference on Human Factors in Computing SystemsWith misinformation proliferating online and more people getting news from social media, it is crucial to understand how people assess and interact with low-credibility posts. This study explores how users react to fake news posts on their Facebook or ...
Consuming Fake News: A Matter of Age? The Perception of Political Fake News Stories in Facebook Ads
Human Aspects of IT for the Aged Population. Technology and SocietyAbstractSocial media are increasingly being used by young and old as a source of information. Fake news is also on the rise. The role played by age in the consumption of fake news on social media, however, is unclear. This paper explores the generational ...






Comments