Abstract
Offensive language in user-generated content is a serious problem that calls for technological solutions, and Natural Language Processing (NLP) can support its automatic detection. In this survey, we review previous NLP studies on Arabic offensive language detection and examine the state of the art, providing a structured overview of previous approaches, including the core techniques, tools, resources, methods, and main features used. We also discuss the limitations and gaps of previous studies. Our findings emphasize the importance of investing further effort in detecting Arabic offensive language, including developing benchmark resources and devising novel preprocessing and feature extraction techniques.
A Survey of Offensive Language Detection for the Arabic Language