skip to main content
research-article

A Survey of Offensive Language Detection for the Arabic Language

Published:09 March 2021Publication History
Skip Abstract Section

Abstract

The use of offensive language in user-generated content is a serious problem that needs to be addressed with the latest technology. The field of Natural Language Processing (NLP) can support the automatic detection of offensive language. In this survey, we review previous NLP studies that cover Arabic offensive language detection. This survey investigates the state-of-the-art in offensive language detection for the Arabic language, providing a structured overview of previous approaches, including core techniques, tools, resources, methods, and main features used. This work also discusses the limitations and gaps of the previous studies. Findings from this survey emphasize the importance of investing further effort in detecting Arabic offensive language, including the development of benchmark resources and the invention of novel preprocessing and feature extraction techniques.

References

  1. Kheireddine Abainia, Siham Ouamour, and Halim Sayoud. 2017. A novel robust Arabic light stemmer. J. Exper. Theor. Artif. Intell. 29, 3 (2017), 557--573. arXiv:https://doi.org/10.1080/0952813X.2016.1212100.Google ScholarGoogle ScholarCross RefCross Ref
  2. Kareem E. Abdelfatah, Gabriel Terejanu, and Ayman A. Alhelbawy. 2017. Unsupervised detection of violent content in Arabic social media. Comput. Sci. Inf. Technol. 7, 4 (2017), 1--7. Retrieved from http://airccse.org/V7N66.html.Google ScholarGoogle Scholar
  3. Muhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim Elmadany, Arun Rajendran, and Lyle Ungar. 2019. DiaNet: BERT and hierarchical attention multi-task learning of fine-grained dialect. ArXiv abs/1910.14243 (2019).Google ScholarGoogle Scholar
  4. Muhammad Abdul-Mageed, Chiyu Zhang, Azadeh Hashemi, and El Moatez Billah Nagoudi. 2020. AraNet: A deep learning toolkit for Arabic social media. In Proceedings of the 4th Workshop on Open-source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. European Language Resource Association, 16--23. Retrieved from https://www.aclweb.org/anthology/2020.osact-1.3.Google ScholarGoogle Scholar
  5. Ehab Abozinadah, Alex Mbaziira, and James H. Jones. 2015. Detection of abusive accounts with Arabic tweets. Int. J. Knowl. Eng. 1 (2015), 113--119. DOI:https://doi.org/10.7763/IJKE.2015.V1.19Google ScholarGoogle ScholarCross RefCross Ref
  6. Ehab A. Abozinadah. 2017. Detecting Abusive Arabic Language Twitter Accounts Using a Multidimensional Analysis Model. Ph.D. Dissertation. George Mason University, Fairfax, VA.Google ScholarGoogle Scholar
  7. Ehab A. Abozinadah and James H. Jones. 2017. A statistical learning approach to detect abusive Twitter accounts. In Proceedings of the International Conference on Compute and Data Analysis (ICCDA’17). Association for Computing Machinery, New York, NY, 6--13. DOI:https://doi.org/10.1145/3093241.3093281Google ScholarGoogle Scholar
  8. Ibrahim Abu Farha and Walid Magdy. 2020. Multitask learning for Arabic offensive language and hate-speech detection. In Proceedings of the 4th Workshop on Open-source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. European Language Resource Association, 86--90. Retrieved from https://www.aclweb.org/anthology/2020.osact-1.14.Google ScholarGoogle Scholar
  9. Areej Al-Hassan and Hmood Al-Dossari. 2019. Detection of hate speech in social networks: A survey on multilingual corpus. Comput. Sci. Inf. Technol. 9, 2 (2019), 83--100. https://doi.org/10.5121/csit.2019.90208.Google ScholarGoogle Scholar
  10. Azalden Alakrot, Liam Murray, and Nikola S. Nikolov. 2018b. Dataset construction for the detection of anti-social behaviour in online communication in Arabic. Procedia Comput. Sci. 142 (2018), 174--181. DOI:https://doi.org/10.1016/j.procs.2018.10.473Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Azalden Alakrot, Liam Murray, and Nikola S. Nikolov. 2018a. Towards accurate detection of offensive language in online communication in Arabic. Procedia Comput. Sci. 142 (2018), 315--320. DOI:https://doi.org/10.1016/j.procs.2018.10.491Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Nuha Albadi, Maram Kurdi, and Shivakant Mishra. 2018. Are they our brothers? Analysis and detection of religious hate speech in the Arabic Twittersphere. In Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’18). 69--76. DOI:https://doi.org/10.1109/ASONAM.2018.8508247Google ScholarGoogle ScholarCross RefCross Ref
  13. Nuha Albadi, Maram Kurdi, and Shivakant Mishra. 2019a. Hateful people or hateful bots? Detection and characterization of bots spreading religious hatred in Arabic social media. Proc. ACM Hum.-comput. Interact. 3, 61 (Nov. 2019), 25. DOI:https://doi.org/10.1145/3359163Google ScholarGoogle Scholar
  14. Nuha Albadi, Maram Kurdi, and Shivakant Mishra. 2019b. Investigating the effect of combining GRU neural networks with handcrafted features for religious hatred detection on Arabic Twitter space. Soc. Netw. Anal. Mining 9, 41 (Aug. 2019), 1--19. DOI:https://doi.org/10.1007/s13278-019-0587-5Google ScholarGoogle ScholarCross RefCross Ref
  15. Abdullah I. Alharbi and Mark Lee. 2020. Combining character and word embeddings for the detection of offensive language in Arabic. In Proceedings of the 4th Workshop on Open-source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. European Language Resource Association, 91--96. Retrieved from https://www.aclweb.org/anthology/2020.osact-1.15.Google ScholarGoogle Scholar
  16. Sarah Alhumoud, Mawaheb Altuwaijri, Tarfa Albuhairi, and Wejdan Alohaideb. 2015. Survey on Arabic sentiment analysis in Twitter. World Acad. Sci., Eng. Technol. Int. J. Soc., Behav., Educ., Econ., Bus. Industr. Eng. 9, 1, 364--378.Google ScholarGoogle Scholar
  17. Ibrahim Aljarah, Maria Habib, Neveen Hijazi, Hossam Faris, Raneem Qaddoura, Bassam Hammo, Mohammad Abushariah, and Mohammad Alfawareh. 2020. Intelligent detection of hate speech in Arabic social network: A machine learning approach. J. Inf. Sci. (May 2020), 0165551520917651. DOI:https://doi.org/10.1177/0165551520917651Google ScholarGoogle ScholarCross RefCross Ref
  18. Ali Alshehri, El Moatez Billah Nagoudi, Hassan Alhuzali, and Muhammad Abdul-Mageed. 2018. Think before your click: Data and models for adult content in Arabic Twitter. In Proceedings of the 2nd Workshop on Text Analytics for Cybersecurity and Online Safety (TA-COS’18). European Language Resources Association (ELRA).Google ScholarGoogle Scholar
  19. Wissam Antoun, Fady Baly, and Hazem Hajj. 2020. AraBERT: Transformer-based Model for Arabic Language Understanding. arxiv:cs.CL/2003.00104(2020).Google ScholarGoogle Scholar
  20. Naaima Boudad, Rdouan Faizi, Rachid Oulad Haj Thami, and Raddouane Chiheb. 2018. Sentiment analysis in Arabic: A review of the literature. Ain Shams Eng. J. 9, 4 (2018), 2479--2490. DOI:https://doi.org/10.1016/j.asej.2017.04.007Google ScholarGoogle ScholarCross RefCross Ref
  21. Arijit Ghosh Chowdhury, Aniket Didolkar, Ramit Sawhney, and Rajiv Ratn Shah. 2019. ARHNet—Leveraging community interaction for detection of religious hate speech in Arabic. In Proceedings of the 57th Meeting of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics, 273--280. DOI:https://doi.org/10.18653/v1/P19-2038Google ScholarGoogle ScholarCross RefCross Ref
  22. Shammur Absar Chowdhury, Hamdy Mubarak, Ahmed Abdelali, Soon-gyo Jung, Bernard J. Jansen, and Joni Salminen. 2020. A multi-platform Arabic news comment dataset for offensive language detection. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, 6203--6212. Retrieved from https://www.aclweb.org/anthology/2020.lrec-1.761.Google ScholarGoogle Scholar
  23. Maral Dadvar, Dolf Trieschnigg, Roeland Ordelman, and Franciska de Jong. 2013. Improving cyberbullying detection with user context. In Advances in Information Retrieval, Pavel Serdyukov, Pavel Braslavski, Sergei O.Kuznetsov, Jaap Kamps, Stefan Ruger, Eugene Agichtein, Ilya Segalovich, and Emine Yilmaz (Eds.). Springer Berlin, 693--696.Google ScholarGoogle Scholar
  24. Kareem Darwish. 2014. Arabizi detection and conversion to Arabic. In Proceedings of the EMNLP Workshop on Arabic Natural Language Processing (ANLP’14). Association for Computational Linguistics, 217--224. DOI:https://doi.org/10.3115/v1/W14-3629Google ScholarGoogle ScholarCross RefCross Ref
  25. Thomas Davidson, Dana Warmsley, Michael Macy, and Ingmar Weber. 2017. Automated hate speech detection and the problem of offensive language. In Proceedings of the 11th International AAAI Conference on Web and Social Media (ICWSM’17). 512--515. Retrieved from https://aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/view/15665/14843.Google ScholarGoogle Scholar
  26. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arxiv:cs.CL/1810.04805 (2018).Google ScholarGoogle Scholar
  27. Marc Djandji, Fady Baly, Wissam Antoun, and Hazem M. Hajj. 2020. Multi-task learning using AraBert for offensive language detection. In Proceedings of the 4th Workshop on Open-source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection.Google ScholarGoogle Scholar
  28. Obeida ElJundi, Wissam Antoun, Nour El Droubi, Hazem Hajj, Wassim El-Hajj, and Khaled Shaban. 2019. hULMonA: The universal language model in Arabic. In Proceedings of the 4th Arabic Natural Language Processing Workshop. Association for Computational Linguistics, 68--77. DOI:https://doi.org/10.18653/v1/W19-4608Google ScholarGoogle ScholarCross RefCross Ref
  29. AbdelRahim Elmadany, Chiyu Zhang, Muhammad Abdul-Mageed, and Azadeh Hashemi. 2020. Leveraging Affective Bidirectional Transformers for Offensive Language Detection. arxiv:cs.CL/2006.01266 (2020).Google ScholarGoogle Scholar
  30. Paula Fortuna and Sergio Nunes. 2018. A survey on automatic detection of hate speech in text. ACM Comput. Surv. 51, 4 (2018). DOI:https://doi.org/10.1145/3232676Google ScholarGoogle Scholar
  31. Antigoni-Maria Founta, Despoina Chatzakou, Nicolas Kourtellis, Jeremy Blackburn, Athena Vakali, and Ilias Leontiadis. 2019. A unified deep learning architecture for abuse detection. In Proceedings of the 10th ACM Conference on Web Science.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Neamat El Gayar and Ching Suen. 2018. Series on Language Processing, Pattern Recognition, and Intelligent Systems, Vol. 4. World Scientific. https://doi.org/10.1142/10693.Google ScholarGoogle Scholar
  33. Nizar Y. Habash. 2010. Synthesis Lectures on Human Language Technologies, Vol. 3. Morgan & Claypool Publishers. https://doi.org/10.2200/S00277ED1V01Y201008HLT010.Google ScholarGoogle Scholar
  34. Bushr Haddad, Zoher Orabe, Anas Al-Abood, and Nada Ghneim. 2020. Arabic offensive language detection with attention-based deep neural networks. In Proceedings of the 4th Workshop on Open-source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. European Language Resource Association, 76--81. Retrieved from https://www.aclweb.org/anthology/2020.osact-1.12.Google ScholarGoogle Scholar
  35. Hatem Haddad, Hala Mulki, and Asma Oueslati. 2019. T-HSAB: A Tunisian hate speech and abusive dataset. In Arabic Language Processing: From Theory to Practice, Kamel Smaïli (Ed.). Springer International Publishing, Cham, 251--263.Google ScholarGoogle Scholar
  36. Batoul Haidar, Maroun Chamoun, and Ahmed Serhrouchni. 2017. Multilingual cyberbullying detection system: Detecting cyberbullying in Arabic content. In Proceedings of the 1st Cyber Security in Networking Conference (CSNet’17). 1--8. DOI:https://doi.org/10.1109/CSNET.2017.8242005Google ScholarGoogle ScholarCross RefCross Ref
  37. Batoul Haidar, Maroun Chamoun, and Ahmed Serhrouchni. 2018. Arabic cyberbullying detection: Using deep learning. In Proceedings of the 7th International Conference on Computer and Communication Engineering (ICCCE’18). 284--289. DOI:https://doi.org/10.1109/ICCCE.2018.8539303Google ScholarGoogle ScholarCross RefCross Ref
  38. Batoul Haidar, Maroun Chamoun, and Ahmed Serhrouchni. 2019. Arabic cyberbullying detection: Enhancing performance by using ensemble machine learning. In Proceedings of the International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData). 323--327. DOI:https://doi.org/10.1109/iThings/GreenCom/CPSCom/SmartData.2019.00074Google ScholarGoogle Scholar
  39. Sabit Hassan, Younes Samih, Hamdy Mubarak, Ahmed Abdelali, Ammar Rashed, and Shammur Absar Chowdhury. 2020. ALT submission for OSACT shared task on offensive language detection. In Proceedings of the 4th Workshop on Open-source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. European Language Resource Association, 61--65. Retrieved from https://www.aclweb.org/anthology/2020.osact-1.9.Google ScholarGoogle Scholar
  40. Cynthia Van Hee, Els Lefever, Ben Verhoeven, Julie Mennes, Bart Desmet, Guy De Pauw, Walter Daelemans, and Veronique Hoste. 2015. Detection and fine-grained classification of cyberbullying events. In Proceedings of the International Conference on Recent Advances in Natural Language Processing, Galia Angelova, Kalina Bontcheva, and Ruslan Mitkov (Eds.). 672--680.Google ScholarGoogle Scholar
  41. Mohammad Hijjawi and Yousef Elsheikh. 2015. Arabic language challenges in text based conversational agents compared to the English language. Int. J. Comput. Sci. Inf. Technol. 7 (June 2015), 13. DOI:https://doi.org/Doi.org/10.5121/ijcsit.2015.7301Google ScholarGoogle Scholar
  42. Fatemah Husain. 2020a. Arabic Offensive Language Detection Using Machine Learning and Ensemble Machine Learning Approaches. arxiv:cs.CL/2005.08946 (2020).Google ScholarGoogle Scholar
  43. Fatemah Husain. 2020b. OSACT4 shared task on offensive language detection: Intensive preprocessing based approach. In Proceedings of the 4th Workshop on Open-source Arabic Corpora and Processing Tools (OSACT’20).Google ScholarGoogle Scholar
  44. Andrew Johnston and Gary Weiss. 2017. Identifying Sunni extremist propaganda with deep learning. In Proceedings of the IEEE Symposium Series on Computational Intelligence (SSCI’17). IEEE, 1--6. DOI:https://doi.org/10.1109/SSCI.2017.8280944Google ScholarGoogle ScholarCross RefCross Ref
  45. Lisa Kaati, Enghin Omer, Nico Prucha, and Amendra Shrestha. 2015. Detecting multipliers of jihadism on Twitter. In Proceedings of the IEEE International Conference on Data Mining Workshop (ICDMW’15). IEEE, 954--960. DOI:https://doi.org/10.1109/ICDMW.2015.9Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Raghav Kapoor, Yaman Kumar, Kshitij Rajput, Rajiv Ratn Shah, Ponnurangam Kumaraguru, and Roger Zimmermann. 2018. Mind Your Language: Abuse and Offense Detection for Code-Switched Languages. Retrieved from https://arxiv.org/pdf/1809.08652.pdf.Google ScholarGoogle Scholar
  47. Amr Keleg, Samhaa R. El-Beltagy, and Mahmoud Khalil. 2020. ASU_OPTO at OSACT4—Offensive language detection for Arabic text. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. European Language Resource Association, 66--70. Retrieved from https://www.aclweb.org/anthology/2020.osact-1.10.Google ScholarGoogle Scholar
  48. Samantha Kent. 2018. German hate speech detection on Twitter. In Proceedings of the 14th Conference on Natural Language Processing (KONVENS’18). 120--124. Retrieved from https://epub.oeaw.ac.at/0xc1aa5576_0x003a10f4.pdf.Google ScholarGoogle Scholar
  49. Irene Kwok and Yuzhou Wang. 2013. Locate the hate: Detecting tweets against blacks. In Proceedings of the 27th AAAI Conference on Artificial Intelligence (AAAI’13). AAAI Press, 1621--1622.Google ScholarGoogle Scholar
  50. Zachary Laub. 2019. Hate Speech on Social Media: Global Comparison. Retrieved from https://www.cfr.org/backgrounder/hate-speech-social-media-global-comparisons.Google ScholarGoogle Scholar
  51. Walid Magdy, Kareem Darwish, and Ingmar Weber. 2016. Failed Revolutions: Using Twitter to study the antecedents of ISIS support. First Mond. 21, 2 (January 2016). DOI:https://doi.org/10.5210/fm.v21i2.6372Google ScholarGoogle Scholar
  52. Marcin Michał Mironczuk and Jarosław Protasiewicz. 2018. A recent overview of the state-of-the-art elements of text classification. Exp. Syst. Applic. 106 (2018), 36--54. DOI:https://doi.org/10.1016/j.eswa.2018.03.058Google ScholarGoogle ScholarCross RefCross Ref
  53. Hanane Mohaouchane, Asmaa Mourhir, and Nikola Nikolov. 2019. Detecting offensive language on Arabic social media using deep learning. In Proceedings of the 6th International Conference on Social Networks Analysis, Management and Security (SNAMS’10). 466--471. DOI:https://doi.org/10.1109/SNAMS.2019.8931839Google ScholarGoogle ScholarCross RefCross Ref
  54. Hamdy Mubarak. 2018. Build fast and accurate lemmatization for Arabic. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC’18). European Language Resources Association (ELRA). Retrieved from https://www.aclweb.org/anthology/L18-1181.Google ScholarGoogle Scholar
  55. Hamdy Mubarak, Kareem Darwish, and Walid Magdy. 2017. Abusive language detection on Arabic social media. In Proceedings of the 1st Workshop on Abusive Language Online. Association for Computational Linguistics, 52--56. DOI:https://doi.org/10.18653/v1/W17-3008Google ScholarGoogle ScholarCross RefCross Ref
  56. Hamdy Mubarak, Kareem Darwish, Walid Magdy, Tamer Elsayed, and Hend Al-Khalifa. 2020. Overview of OSACT4 Arabic offensive language detection shared task. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection.Google ScholarGoogle Scholar
  57. Hala Mulki, Hatem Haddad, Chedi Bechikh Ali, and Halima Alshabani. 2019. L-HSAB: A levantine Twitter dataset for hate speech and abusive language. In Proceedings of the 3rd Workshop on Abusive Language Online. Association for Computational Linguistics, 111--118. DOI:https://doi.org/10.18653/v1/W19-3512Google ScholarGoogle ScholarCross RefCross Ref
  58. Raza Mustafa, M. Saqib Nawaz, Javed Ferzund, M. Ikram Ullah Lali, Basit Shahzad, and Philippe Fournier-Viger. 2017. Early detection of controversial Urdu speeches from social media. Data Sci. Pattern Recog. 1 (2017), 26--42. Retrieved from http://www.ikelab.net/dspr-pdf/vol1-2/dspr-paper3.pdf.Google ScholarGoogle Scholar
  59. Ahmed Omar, Tarek M. Mahmoud, and Tarek Abd-El-Hafeez. 2020. Comparative performance of machine learning and deep learning algorithms for Arabic hate speech detection in OSNs. In Proceedings of the International Conference on Artificial Intelligence and Computer Vision (AICV’20), Aboul-Ella Hassanien, Ahmad Taher Azar, Tarek Gaber, Diego Oliva, and Fahmy M. Tolba (Eds.). Springer International Publishing, Cham, 247--257.Google ScholarGoogle ScholarCross RefCross Ref
  60. Selma Ayse Ozel, Esra Sarac, Seyran Akdemir, and Hulya Aksu. 2017. Detection of cyberbullying on social media messages in Turkish. In Proceedings of the International Conference on Computer Science and Engineering (UBMK’17). 366--370. DOI:https://doi.org/10.1109/UBMK.2017.8093411Google ScholarGoogle ScholarCross RefCross Ref
  61. Arfath Pasha, Mohamed Al-Badrashiny, Mona Diab, Ahmed El Kholy, Ramy Eskander, Nizar Habash, Manoj Pooleery, Owen Rambow, and Ryan Roth. 2014. MADAMIRA: A fast, comprehensive tool for morphological analysis and disambiguation of Arabic. In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC’14). European Language Resources Association (ELRA), 1094--1101. Retrieved from http://www.lrec-conf.org/proceedings/lrec2014/pdf/593_Paper.pdf.Google ScholarGoogle Scholar
  62. Georgios Pitsilis, Heri Ramampiaro, and Helge Langseth. 2018. Effective hate-speech detection in Twitter data using recurrent neural networks. Appl. Intell. 48 (2018), 4730--4742. https://doi.org/10.1007/s10489-018-1242-y.Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. protranslate. 2016. Arabic Speaking Population in the World. Retrieved from https://www.https://www.protranslate.net/blog/en/arabic-speaking-population-in-the-world-2/#:∼:text=Arabic%20is%20widely%20spoken%20around,million%20with%2030%20different%20dialects.Google ScholarGoogle Scholar
  64. Bjorn Ross, Michael Rist, Guillermo Carbonell, Benjamin Cabrera, Nils Kurowsky, and Michael Wojatzki. 2016. Measuring the reliability of hate speech annotations: The case of the european refugee crisis. In Proceedings of the Workshop on Natural Language Processing for Computer-mediated Communication (NLP4CMC’16), Vol. 17. Bochumer Linguistische Arbeitsberichte, 6--9. DOI:https://doi.org/10.17185/duepublico/42132Google ScholarGoogle Scholar
  65. Hafiz Hassaan Saeed, Toon Calders, and Faisal Kamiran. 2020. OSACT4 shared tasks: Ensembled stacked classification for offensive and hate speech in Arabic tweets. In Proceedings of the 4th Workshop on Open-source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. European Language Resource Association, 71--75. Retrieved from https://www.aclweb.org/anthology/2020.osact-1.11.Google ScholarGoogle Scholar
  66. Maarten Sap, Dallas Card, Saadia Gabriel, Yejin Choi, and Noah Smith. 2019. The risk of racial bias in hate speech detection. In Proceedings of the 57th Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 1668--1678. DOI:https://doi.org/10.18653/v1/P19-1163Google ScholarGoogle ScholarCross RefCross Ref
  67. Benjamin Elisha Sawe. 2018. Arabic Speaking Countries. Retrieved from https://www.worldatlas.com/articles/arabic-speaking-countries.html.Google ScholarGoogle Scholar
  68. Anna Schmidt and Michael Wiegand. 2017. Survey on hate speech detection using natural language processing. In Proceedings of the 5th International Workshop on Natural Language Processing for Social Media. Association for Computational Linguistics, 1--10. DOI:https://doi.org/10.18653/v1/W17-1101Google ScholarGoogle ScholarCross RefCross Ref
  69. Gudbjartur Ingi Sigurbergsson and Leon Derczynski. 2019. Offensive language and hate speech detection for Danish. ArXiv abs/1908.04531 (2019), 1--13.Google ScholarGoogle Scholar
  70. Abu Bakr Soliman, Kareem Eissa, and Samhaa R. El-Beltagy. 2017. AraVec: A set of Arabic word embedding models for use in Arabic NLP. Procedia Comput. Sci. 117 (2017), 256--265.Google ScholarGoogle ScholarCross RefCross Ref
  71. Hui-Po Su, Zhen-Jie Huang, Hao-Tsung Chang, and Chuan-Jie Lin. 2017. Rephrasing profanity in Chinese text. In Proceedings of the 1st Workshop on Abusive Language Online. Association for Computational Linguistics, 18--24. DOI:https://doi.org/10.18653/v1/W17-3003Google ScholarGoogle ScholarCross RefCross Ref
  72. Baptist Vandersmissen. 2012. Automated Detection of Offensive Language Behavior on Social Networking Sites. Master’s thesis. Ghent University.Google ScholarGoogle Scholar
  73. Gregor Wiedemann, Eugen Ruppert, Raghav Jindal, and Chris Biemann. 2018. Transfer learning from LDA to BiLSTM-CNN for offensive language detection in Twitter, In Proceedings of the 14th Conference on Natural Language Processing.ArXiv abs/1811.02906. (2018).Google ScholarGoogle Scholar
  74. Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, and Ritesh Kumar. 2019. SemEval-2019 Task 6: Identifying and categorizing offensive language in social media (OffensEval). In Proceedings of the 13th International Workshop on Semantic Evaluation. 75--86.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Survey of Offensive Language Detection for the Arabic Language

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!