Abstract
The proliferation of mobile networked devices has made it easier and faster than ever for people to obtain and share information. However, this occasionally results in the propagation of erroneous information, which may be difficult to distinguish from the truth. The widespread diffusion of such information can result in irrational and poor decision making on potentially important issues. In 2020, this coincided with the global outbreak of Coronavirus Disease (COVID-19), a highly contagious and deadly virus. The proliferation of misinformation about COVID-19 on social media has already been identified as an “infodemic” by the World Health Organization (WHO), posing significant challenges for global governments seeking to manage the pandemic. This has driven an urgent need for methods to automatically detect and identify such misinformation. The research uses multiple deep learning model frameworks to detect misinformation in Chinese and English, and compare them based on different text feature selections. The model learns the textual characteristics of each type of true and misinformation for subsequent true/false prediction. The long and short-term memory (LSTM) model, the gated recurrent unit (GRU) model, and the bidirectional long and short-term memory (BiLSTM) model were selected for fake news detection. BiLSTM produces the best detection result, with detection accuracy reaching 94% for short-sentence English texts, and 99% for long-sentence English texts, while the accuracy for Chinese texts was 82%.
- [1] . 2019. “Fake news” and the defection of 2012 Obama voters in the 2016 presidential election. Electoral Studies 61 (2019), [102030]. Google Scholar
Cross Ref
- [2] . 2018. The spread of true and false news online. Science 359, 6380 (2018), 1146–1151. Google Scholar
Cross Ref
- [3] WHO Coronavirus Disease (COVID-19) Dashboard. World Health Organization. Retrieved Sep. 31, 2021 from https://covid19.who.int/.Google Scholar
- [4] . 2020. The COVID-19 social media infodemic. Scientific Reports 10. Google Scholar
Cross Ref
- [5] Battling the coronavirus ‘infodemic’. (2020, May 29). Nature. Retrieved Nov. 19, 2021 from https://www.nature.com/articles/d41586-020-01136-8.Google Scholar
- [6] . 2021. Online hate network spreads malicious COVID-19 content outside the control of individual social media platforms. Scientific Reports 11. Google Scholar
Cross Ref
- [7] . 2020. Defining misinformation and understanding its bounded nature: Using expertise and evidence for describing misinformation. Political Communication 37, 1 (2020), 136–144. Google Scholar
Cross Ref
- [8] . 2020. Inoculating against fake news about COVID-19. Frontiers in Psychology 11. Google Scholar
Cross Ref
- [9] . 2020. Detecting misleading information on COVID-19. IEEE Access 8 (2020), 165201–165215. Google Scholar
Cross Ref
- [10] . 2019. Understanding online falsehood from the perspective of social problem. Advances in Media, Entertainment, and the Arts, 1–17. Google Scholar
Cross Ref
- [11] . 2019. A survey on fake news and rumour detection techniques. Information Sciences 497 (2019), 38–55. Google Scholar
Digital Library
- [12] . 2015. Deception detection for news: Three types of fakes. In Proceedings of the Association for Information Science and Technology 52, 1 (2015), 1–4. Google Scholar
Cross Ref
- [13] . 2015. Towards news verification: Deception detection methods for news discourse. In Proceedings of the Hawaii International Conference on System Sciences (HICSS48) Symposium on Rapid Screening Technologies, Deception Detection and Credibility Assessment Symposium. 5–8. Google Scholar
Cross Ref
- [14] . 2009. Automatic satire detection: Are you having a laugh? In Proceedings of the ACL-IJCNLP 2009 Conference Short Papers. Google Scholar
Cross Ref
- [15] . 2007. Rumor, gossip and urban legends. Diogenes 54, 1 (2007), 19–35. Google Scholar
Cross Ref
- [16] . 2015. Towards detecting rumours in social media. arXiv: 1504.04712. Retrieved from https://arxiv.org/abs/1504.04712.Google Scholar
- [17] . 2018. Detection and resolution of rumours in social media. ACM Computing Surveys (CSUR) 51, 2 (2018), 1–36. Google Scholar
Digital Library
- [18] . 2015. Misleading online content: Recognizing clickbait as “false news”. In Proceedings of the 2015 ACM on Workshop on Multimodal Deception Detection. Google Scholar
Digital Library
- [19] . 2015. Lies, damn lies and viral content. Columbia University. Google Scholar
Cross Ref
- [20] . 2020. FakeNewsNet: A data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big Data 8 3 (2020), 171–188. Google Scholar
Cross Ref
- [21] What is a content farm? (2019, September 19). EContent Magazine. Retrieved Sep. 17, 2020 from http://www.econtentmag.com/Articles/Resources/Defining-EContent/What-is-a-Content-Farm-78370.htm.Google Scholar
- [22] . 2020. Types, sources, and claims of COVID-19 misinformation. Reuters Institute for the Study of Journalism.Google Scholar
- [23] . 2018. The spread of medical fake news in social media – The pilot quantitative study. Health Policy and Technology 7, 2 (2018), 115–118. Google Scholar
Cross Ref
- [24] . 2012. An improving deception detection method in computer-mediated communication. Journal of Networks 7, 11 (2012), 1811–1816. Google Scholar
Cross Ref
- [25] . 2017. Detection of online fake news using n-gram analysis and machine learning techniques. Lecture Notes in Computer Science. 127–138. Google Scholar
Cross Ref
- [26] . 2016. Fake news or truth? Using satirical cues to detect potentially misleading news. In Proceedings of the Second Workshop on Computational Approaches to Deception Detection. Google Scholar
Cross Ref
- [27] . 2020. Detection of Bangla fake news using MNB and SVM classifier. 2020 International Conference on Computing, Electronics & Communications Engineering (ICCECE). 81–85. Google Scholar
Cross Ref
- [28] . 2020. Machine learning algorithm based model for classification of fake news on Twitter. 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC). 1–4. .Google Scholar
Cross Ref
- [29] . 2020. Fake news classification using random forest and decision tree (J48). Al-Nahrain Journal of Science 23, 4 (2020), 49–55. Google Scholar
Cross Ref
- [30] . 2017. Creating a labeled dataset for medical misinformation in health forums. 2017 IEEE International Conference on Healthcare Informatics (ICHI). 456–461. .Google Scholar
Cross Ref
- [31] . 2017. Keynote speaker II. Procedia Computer Science 116 (2017), 3–9. Google Scholar
Digital Library
- [32] . 2019. Enforcing position-based confidentiality with machine learning paradigm through mobile edge computing in real-time industrial informatics. IEEE Transactions on Industrial Informatics 15, 7 (2019), 4189–4196.Google Scholar
Cross Ref
- [33] . 2020. Attention-based LSTM network for rumor veracity estimation of tweets. Information Systems Frontiers. 1–16. Google Scholar
Digital Library
- [34] . 2019. Defending against neural fake news. arXiv: 1905.12616. Retrieved from https://arxiv.org/abs/1905.12616.Google Scholar
- [35] . 2016. News verification by exploiting conflicting social viewpoints in microblogs. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press, 2972–2978.Google Scholar
Digital Library
- [36] . 2017. Understand short texts by harvesting and analyzing semantic knowledge. IEEE Transactions on Knowledge and Data Engineering, 29, 499–512. Google Scholar
Digital Library
- [37] . 2020. Mapping the landscape of artificial intelligence applications against COVID-19. Journal of Artificial Intelligence Research 69 (2020), 807–845. Google Scholar
Cross Ref
- [38] . 2020. Assessing the risks of 'infodemics' in response to COVID-19 epidemics. Nature Human Behaviour 4 (2020), 1285–1293. Google Scholar
Cross Ref
- [39] . 2021. An optimized hybrid deep learning model to detect COVID-19 misleading information. Computational Intelligence and Neuroscience, 2021. Google Scholar
Digital Library
- [40] . 2021. Identifying COVID-19 fake news in social media. arXiv: 2101.11954. Retrieved from https://arxiv.org/abs/2101.11954.Google Scholar
- [41] . 2021. Transformer based automatic COVID-19 fake news detection system. arXiv:2101.00180. Retrieved from https://arxiv.org/abs/2101.00180.Google Scholar
- [42] . 2021. COVID-19 outbreak: An ensemble pre-trained deep learning model for detecting informative tweets. Applied Soft Computing 107 (2021), 107495. Google Scholar
Cross Ref
- [43] . 2020. BANANA at WNUT-2020 Task 2: Identifying COVID-19 information on Twitter by combining deep learning and transfer learning models. Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020). 366–370. https://10.18653/v1/2020.wnut-1.50.Google Scholar
Cross Ref
- [44] . 2020. No rumours please! A multi-Indic-lingual approach for COVID fake-tweet detection. arXiv: 2010.06906. Retrieved from https://arxiv.org/abs/2010.06906.Google Scholar
- [45] . 2020. MM-COVID: A multilingual and multimodal data repository for combating COVID-19 disinformation. arXiv: 2011.04088. Retrieved from https://arxiv.org/abs/2011.04088.Google Scholar
- [46] . 2021. Cross-lingual COVID-19 fake news detection. arXiv: 2110.06495. Retrieved from https://arxiv.org/abs/2110.06495.Google Scholar
- [47] . 2021. Combating the infodemic: A Chinese infodemic dataset for misinformation identification. Healthcare, 9. Google Scholar
Cross Ref
- [48] Fact check rating. Snopes Media Group Inc. Retrieved Sep. 10, 2020 from https://www.snopes.com/fact-check-ratings/.Google Scholar
- [49] . 1997. Long short-term memory. Neural Computation 9 (1997), 1735–1780. Google Scholar
Digital Library
- [50] . 1997. Bidirectional recurrent neural networks. Signal Processing, IEEE Transactions 45 (1997), 2673–2681. Google Scholar
Digital Library
- [51] . 2017. Stance Detection for Fake News Identification. Stanford University.Google Scholar
- [52] . 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv: 1412.3555. Retrieved from https://arxiv.org/abs/1412.3555.Google Scholar
- [53] . 2020. FakeCovid - A multilingual cross-domain fact check news dataset for COVID-19. arXiv: 2006.11343. Retrieved from https://arxiv.org/abs/2006.11343.Google Scholar
- [54] COVID-19-News-Corpus. 2020. GitHub. Retrieved Sep. 15, 2020 from https://github.com/KangGu96/COVID-19-News-Corpus/.Google Scholar
- [55] 【Cofacts 真的假的】Open Datasets. 2020. Github. Retrieved Sep. 15, 2020 from https://github.com/cofacts/opendata.Google Scholar
- [56] . 2019. Neural abstractive text summarization and fake news detection. arXiv: 1904.00788. Retrieved from https://arxiv.org/abs/1904.00788.Google Scholar
Index Terms
Using Deep Learning Models to Detect Fake News about COVID-19
Recommendations
Using Fuzzy Clustering with Deep Learning Models for Detection of COVID-19 Disinformation
Since the beginning of 2020, the COVID-19 pandemic has killed millions of people around the world, leading to a worldwide panic that has fueled the rapid and widespread dissemination of COVID-19-related disinformation on social media. The phenomenon, ...
CovidMis20: COVID-19 Misinformation Detection System on Twitter Tweets Using Deep Learning Models
Intelligent Human Computer InteractionAbstractOnline news and information sources are convenient and accessible ways to learn about current issues. For instance, more than 300 million people engage with posts on Twitter globally, which provides the possibility to disseminate misleading ...
Towards COVID-19 fake news detection using transformer-based models
AbstractThe COVID-19 pandemic has resulted in a surge of fake news, creating public health risks. However, developing an effective way to detect such news is challenging, especially when published news involves mixing true and false ...






Comments