Abstract
Increased connectivity has greatly facilitated rapid access to information and reliable communication. However, the uncontrolled dissemination of information has also led to the spread of fake news. Fake news may be spread by a group of people or organizations to serve ulterior motives, such as political or financial gain, or to damage a country’s public image. Because timely detection of fake news is important, the problem has attracted researchers from all over the world. Most work on fake news detection focuses on English, yet automated detection matters regardless of the language in which false information is spread. To help advance research on fake news detection for low-resource languages, this work proposes a novel semantically enriched technique for detecting fake news in Urdu, a low-resource language. The proposed model is based on deep contextual semantics learned by a convolutional neural network. The features learned by the network are combined with n-gram-based features and fed to a conventional majority voting ensemble classifier with three base learners: Adaptive Boosting, Gradient Boosting, and Multi-Layer Perceptron. Experiments with different models show that enriching the traditional ensemble learner with deep contextual semantics, along with other standard features, gives the best results and outperforms the state-of-the-art Urdu fake news detection model.
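The pipeline described above (learned features concatenated with n-gram features, then a majority vote over three base learners) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the bigram vocabulary, the feature values, and the three rule-based stand-in learners are all hypothetical; in practice the learners would be trained Adaptive Boosting, Gradient Boosting, and Multi-Layer Perceptron models, and the dense vector would come from a trained convolutional network.

```python
from collections import Counter


def word_ngrams(tokens, n):
    """Return the word n-grams (as tuples) of a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_count_vector(tokens, vocabulary, n=2):
    """Count how often each n-gram in a fixed vocabulary occurs in the tokens."""
    counts = Counter(word_ngrams(tokens, n))
    return [float(counts[gram]) for gram in vocabulary]


def combine_features(cnn_features, ngram_features):
    """Concatenate learned (CNN) features with n-gram features."""
    return list(cnn_features) + list(ngram_features)


def majority_vote(labels):
    """Return the most frequent label (hard voting).

    With three voters and two classes a tie cannot occur.
    """
    return Counter(labels).most_common(1)[0][0]


def ensemble_predict(base_learners, feature_vector):
    """Ask every base learner for a label and return the majority label."""
    votes = [learner(feature_vector) for learner in base_learners]
    return majority_vote(votes)


# Hypothetical stand-ins for three trained base learners. Each one maps a
# feature vector to a label; real models would replace these rules.
def learner_a(x):
    return "fake" if x[0] > 0.5 else "real"


def learner_b(x):
    return "fake" if sum(x) > 2.0 else "real"


def learner_c(x):
    return "fake" if x[-1] >= 1.0 else "real"


# Illustrative input: a made-up dense vector standing in for CNN features,
# plus bigram counts over a tiny made-up vocabulary.
tokens = ["breaking", "news", "shocking", "claim"]
vocabulary = [("breaking", "news"), ("shocking", "claim")]
cnn_features = [0.7, 0.1]

features = combine_features(cnn_features, ngram_count_vector(tokens, vocabulary))
prediction = ensemble_predict([learner_a, learner_b, learner_c], features)
```

Hard voting is used here because the abstract names a majority voting ensemble; whether the original work weights the votes or votes on class probabilities is not stated, so neither is assumed.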
Enriching Conventional Ensemble Learner with Deep Contextual Semantics to Detect Fake News in Urdu