skip to main content
research-article

Sentiment Analysis in Hindi—A Survey on the State-of-the-art Techniques

Published:29 November 2021Publication History
Skip Abstract Section

Abstract

Sentiment Analysis (SA) has been a core interest in the field of text mining research, dealing with computational processing of sentiments, views, and subjective nature of the text. Due to the availability of extensive web-based data in Indian languages such as Hindi, Marathi, Kannada, Tamil, and so on. It has become extremely significant to analyze this data and recover valuable and relevant information. Hindi being the first language of the majority of the population in India, SA in Hindi has turned out to be a critical task particularly for companies and government organizations. This research portrays a systematic review specifically in the field of Hindi SA. The major contribution of this article includes the categorization of numerous articles based on techniques that have attracted researchers in performing SA tasks in Hindi language. This survey classifies these state-of-the-art computational intelligence techniques into four major categories namely lexicon-based techniques, machine learning techniques, deep learning techniques, and hybrid techniques. It discusses the importance of these techniques based on different aspects such as their impact on the issues of SA, levels of analysis, and performance evaluation measures. The research puts forward a comprehensive overview of the majority of the work done in Hindi SA. This study will help researchers in finding out resources such as annotated datasets, linguistic resources, and lexical resources. This survey delivers some significant findings and presents overall future research directions in the field of Hindi SA.

REFERENCES

  1. [1] Pang B. and Lee L.. 2008. Opinion mining and sentiment analysis. Foundations and Trends R in Information Retrieval 2, 1–2 (2008), 1135.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Kumar Singh Vivek. 2015. A survey of opinion mining research. In Hindi Language, International Journal of Advanced Scientific Research & Development 02, 03 Ver. II, (2015), 135138.Google ScholarGoogle Scholar
  3. [3] D. Jain, A. Kumar, and G. Garg. 2020. Sarcasm detection in mash-up language using soft-attention based bi- directional LSTM and feature-rich CNN. Applied Soft Computing Journal 91, Article 106198 (2020).Google ScholarGoogle Scholar
  4. [4] S. R. Narang, M. K. Jindal, S. Ahuja, and M. Kumar. 2020. On the recognition of devanagari ancient handwritten characters using SIFT and Gabor features. Soft Computing 24, 22 (2020), 17279--17289. DOI: https://doi.org/10.1007/1229s00500-020-05018-zGoogle ScholarGoogle Scholar
  5. [5] S. Dargan, M. Kumar, A. Garg, and K. Thakur. 2020. Writer identification system for pre-segmented offline hand- written devanagari characters using k-NN and SVM. Soft Computing 24, 13 (2020), 10111--10122. DOI: https://doi.org/123210.1007/s00500-019-04525-yGoogle ScholarGoogle Scholar
  6. [6] Richa Sharma, Shweta Nigam, and Rekha Jain. 2014. Opinion mining in hindi language: A survey. International Journal in Foundations of Computer Science & Technology 4, 2 (2014), 41--47.Google ScholarGoogle Scholar
  7. [7] S. Rani and P. Kumar. 2019. A journey of Indian languages over sentiment analysis: A systematic review. Artificial Intelligence Review 52, 2 (2019), 1415--1462.Google ScholarGoogle Scholar
  8. [8] Pontiki M., Galanis D., Pavlopoulos J., Papageorgiou H., Androutsopoulos I., and Manandhar S.. 2014. Semeval-2014 task 4: Aspect based sentiment analysis. In Proceedings of the 8th International Workshop on Semantic Evaluation.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] S. R. Narang, M. K. Jindal, and M. Kumar. 2019. Drop flow method: an iterative algorithm for complete segmentation of Devanagari ancient manuscripts. Multimedia Tools and Applications 78, 16 (2019), 23255--23280. DOI: 10.1007/s11042-1241019-7620-6Google ScholarGoogle Scholar
  10. [10] Sneha Mulatkar. 2014. Sentiment classification in Hindi. International Journal of Scientific & Technology Research 3, 5 (2014), 204--206.Google ScholarGoogle Scholar
  11. [11] Bakliwal Akshat, Arora Piyush, and Varma Vasudeva. 2012. Hindi subjective lexicon: A lexical resource for Hindi polarity classification. In Proceedings of the 8th International Conference on Language Resources and Evaluation.Google ScholarGoogle Scholar
  12. [12] Das A. and Bandyopadhyay S.. 2010. SentiWordnet for Indian Languages. In Proceedings of the 8th Workshop on Asian Language Resources 2010, 5663.Google ScholarGoogle Scholar
  13. [13] Shelke Rita and Singh Thakore Devendra. 2020. A novel approach for named entity recognition on Hindi language using residual bilstm network. International Journal on Natural Language Computing (IJNLC) 9, 2 (2020), 18.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Arora, P. 2013. Sentiment Analysis for Hindi Language. MS Thesis. International Institute of Information Technology, Hyderabad, India (2013).Google ScholarGoogle Scholar
  15. [15] Kaur Jasleen and Saini Jatinderkumar R.. 2014. A study and analysis of opinion mining research in indo-aryan, dravidian and tibeto-burman language families. International Journal of Data Mining and Emerging Technologies 4, 2 (2014), 5360.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Mukesh Yadav and Varunakshi Bhojane. 2015. Sentiment analysis on Hindi content: A survey. International Journal of Innovations & Advancement in Computer Science 4, 12 (2015), 14--21.Google ScholarGoogle Scholar
  17. [17] Garg Komal and Preetpal Kaur Buttar. 2017. Survey on sentiment analysis in Hindi language. International Journal of Advanced Research in Computer Science 8, 5 (2017), 1360--1363.Google ScholarGoogle Scholar
  18. [18] Sheetal SharmaBharti S. K., and Kumar Goel Raj. 2018. Sentiment analysis of Indian language. International Research Journal of Engineering and Technology (IRJET) 5, 05 (2018), 42514253.Google ScholarGoogle Scholar
  19. [19] Ahmad Gazi Imtiyaz, Singla Jimmy, and Nikita. 2019. Review on sentiment analysis of indian languages with a special focus on code mixed Indian languages. In Proceedings of the International Conference on Automation, Computational and Technology Management. IEEE, 352356.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Taj Soonh, Shaikh Baby Bakhtawer, and Meghji Areej Fatemah. 2019. Sentiment analysis of news articles: A lexicon based approach. In Proceedings of the 2nd International Conference on Computing, Mathematics and Engineering Technologies. IEEE, 15.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Rajput Rahul and Kumar Solanki Arun. 2016. Review of sentimental analysis methods using lexicon based approach. International Journal of Computer Science and Mobile Computing 5, 2 (2016), 159166Google ScholarGoogle Scholar
  22. [22] Joshi A., Balamurali A. R., and Bhattacharyya P.. 2010. A fallback strategy for sentiment analysis in Hindi: A case study. In Proceedings of the 8th International Conference on Natural Language Processing.Google ScholarGoogle Scholar
  23. [23] Mittal N., Agarwal B., Chouhan G., Bania N., and Pareek P.. 2013. Sentiment analysis of Hindi review based on negation and discourse relation. In Proceedings of International Joint Conference on Natural Language Processing 2013, 4550.Google ScholarGoogle Scholar
  24. [24] Arora Piyush, Bakliwal Akshat, and Varma Vasudeva. 2012. Hindi subjective lexicon generation using wordnet graph traversal. International Journal of Computational Linguistics and Applications 3, 1 (2012), 2539.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Raksha Sharma and Bhattacharyya Pushpak. 2014. A sentiment analyzer for Hindi using Hindi senti lexicon. In Proceedings of the 11th International Conference on Natural Language Processing. 150155.Google ScholarGoogle Scholar
  26. [26] Jha Vandana, Manjunath N., Deepa Shenoy P., and Venugopal K. R.. 2016. Sentiment analysis in a resource scarce language: Hindi. International Journal of Scientific & Engineering Research 7, 9 (2016), 968990.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Mishra Deepali, Manju Venugopalan, and Deepa Gupta. 2016. Context specific Lexicon for Hindi reviews. Procedia Computer Science 93 (2016), 554--563.Google ScholarGoogle Scholar
  28. [28] Modi Deepa and Nain Neeta. 2016. Part-of-speech tagging of Hindi corpus using rule-based method. In Proceedings of the International Conference on Recent Cognizance in Wireless Communication & Image Processing. Springer, 241247.Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Jha Vandana, Savitha R., Deepa Shenoy P., Venugopal K. R., and Sangaiah Arun Kumar. 2018. A novel sentiment aware dictionary for multi-domain sentiment classification. Computers & Electrical Engineering 69, (2018), 585597.Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Firdous Hussaini, S. Padmaja, and S. Sameen Fatima. 2018. Score-based sentiment analysis of book reviews in Hindi language. International Journal on Natural Language Computing 7, 5 (2018), 115--127.Google ScholarGoogle Scholar
  31. [31] Sharma Yakshi, Mangat Veenu, and Kaur Mandeep. 2015. A practical approach to sentiment analysis of Hindi tweets. In Proceedings of the 1st International Conference on Next Generation Computing Technologies. 677680.Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Jha Vandana, Manjunath N., Deepa Shenoy P., and Venugopal K. R.. 2015. HSAS: Hindi subjectivity analysis system. In Proceedings of the 2015 Annual IEEE India Conference (INDICON). IEEE, 16.Google ScholarGoogle Scholar
  33. [33] Hale Shapiro Adam, Sudhof Moritz, and Wilson Daniel. 2020. Measuring news sentiment. Journal of Econometrics (2020). DOI: https://doi.org/10.1016/j.jeconom.2020.07.053Google ScholarGoogle Scholar
  34. [34] Garg Kanika. 2019. Sentiment analysis of Indian PM's “Mann Ki Baat”. International Journal of Information Technology 12, 1 (2019), 3748.Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Sharma Richa, Shweta Nigam, and Rekha Jain. 2014. Polarity detection movie reviews in Hindi language. International Journal on Computational Sciences & Applications (IJCSA) 4, 4 (2014), 49--57.Google ScholarGoogle Scholar
  36. [36] Gilliar Meng and Heba Saddeh. 2020. Applications of machine learning and soft computing techniques in real world. International Journal of Computer Applications & Information Technology 12, 1 (2020), 298--302.Google ScholarGoogle Scholar
  37. [37] Ansari Fatima Anees, Arsalaan Shaikh, Arbaz Shaikh, and Sufiyan Shaikh. 2020. Survey paper on sentiment analysis: Techniques and challenges. EasyChair 2516-2314 (2020), 1--4.Google ScholarGoogle Scholar
  38. [38] Monica C. and Nagarathna N.. 2020. Detection of fake tweets using sentiment analysis. SN Computer Science 1, 89 (2020), Springernature.Google ScholarGoogle Scholar
  39. [39] Mungra D., Agrawal A., and Thakkar A.. 2020. A voting-based sentiment classification model. In Intelligent Communication, Control and Devices. Choudhury S., Mishra R., Mishra R., Kumar A. (Eds.), Advances in Intelligent Systems and Computing, Vol. 989, Springer.Google ScholarGoogle ScholarCross RefCross Ref
  40. [40] S. Narang, M. K. Jindal, and M. Kumar. 2019. Devanagari ancient documents recognition using statistical feature extraction techniques. Sadhana 44, 6 (2019), 1--8. DOI: https://doi.org/10.1007/s12046-019-1126-9Google ScholarGoogle Scholar
  41. [41] Yadlapalli S. S., Reddy R. R., and Sasikala T.. 2020. Advanced Twitter sentiment analysis using supervised techniques and minimalistic features, ambient communications, and computer systems. In Advances in Intelligent Systems and Computing. Hu Y. C., Tiwari S., Trivedi M., and Mishra K. (Eds.). Springer, 91104.Google ScholarGoogle Scholar
  42. [42] Kabir Monika, Kabir Mir Md. Jahangir, Xu Shuxiang, and Badhon Bodrunnessa. 2019. An empirical research on sentiment analysis using machine learning approaches. International Journal of Computers and Applications 2019, 19. DOI: 10.1080/1206212X.2019.1643584Google ScholarGoogle Scholar
  43. [43] Sangeetha K. and Prabha D.. 2020. Sentiment analysis of student feedback using multihead attention fusion model of word and context embedding for LSTM. Journal of Ambient Intelligence and Humanized Computing 12, 6 (2020), 41174126.Google ScholarGoogle Scholar
  44. [44] Patra B. G., Das D., Das A., and Prasath R.. 2015. Shared task on sentiment analysis in Indian languages (sail) tweets an overview. In International Conference on Mining Intelligence and Knowledge Exploration. Prasath R., Vuppala A., and Kathirvalavakumar T. (Eds.), Lecture Notes in Computer Science, Vol. 9468, Springer, 650655.Google ScholarGoogle Scholar
  45. [45] Prasad Sudha Shanker, Kumar Jitendra, Kumar Prabhakar Dinesh, and Pal Sukomal. 2015. Sentiment classification: An approach for Indian language tweets using decision tree. In Proceedings of the 3rd International Conference on Mining Intelligence and Knowledge Exploration. Springer, 656663.Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. [46] Sarkar K. and Chakraborty S.. 2015. A sentiment analysis system for Indian language tweets. In International Conference On Mining Intelligence and Knowledge Exploration. Prasath R., Vuppala A., Kathirvalavakumar T. (Eds.), Lecture Notes in Computer Science, Vol. 9468, Springer, 694702.Google ScholarGoogle Scholar
  47. [47] Sachin Kumar S., Premjith B., Anand Kumar M., and Dr. Soman K. P.. 2015. [email protected]: Sentiment analysis in Indian language using regularized least square approach with randomized feature learning. In Proceedings of the International Conference on Mining Intelligence and Knowledge Exploration. Lecture Notes in Computer Science, Vol. 9468, 671683.Google ScholarGoogle Scholar
  48. [48] Ayush Kumar, Kohail S., Ekbal A., and Biemann C.. 2015. Iit-tuda: System for sentiment analysis in Indian language using lexical acquisition. In Proceedings of the 3rd International Conference on Mining Intelligence and Knowledge Exploration. Springer, 684693.Google ScholarGoogle Scholar
  49. [49] Se Shriya, Vinayakumar R., Kumar M. A., and Soman K.. 2015. Amrita-cen@ sail2015: Sentiment analysis in Indian languages. In Proceedings of the International Conference on Mining Intelligence and Knowledge Exploration. Springer, 703710.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. [50] Jha V., Manjunath N., Shenoy P. D., Venugopal K., and Patnaik L. M.. 2015. Homs: Hindi opinion mining system. In Proceedings of the 2nd International Conference on Recent Trends in Information Systems. IEEE, 366–37.Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Sharma P. and Moh T. S.. 2015. Prediction of Indian election using sentiment analysis on Hindi twitter. In Proceedings of the International Conference on Big Data. IEEE, 19661971.Google ScholarGoogle Scholar
  52. [52] Akhtar M. S., Ekbal A., and Bhattacharyya P.. 2016. Aspect based sentiment analysis in Hindi: Resource creation and evaluation. In Proceedings of the 10th International Conference on Language Resources and Evaluation, 17.Google ScholarGoogle Scholar
  53. [53] Akhtar M. S., Ekbal A., and Bhattacharyya P.. 2016. Aspect based sentiment analysis: Category detection and sentiment classification for Hindi. In Proceedings of the 17th International Conference on Intelligent Text Processing and Computational Linguistics, 112.Google ScholarGoogle Scholar
  54. [54] Prof. Nikita Desai and Anandkumar D. 2016. Sarcasm detection in Hindi sentences using support vector machine. International Journal of Advance Research in Computer Science and Management Studies 4, 7 (2016), 8--15.Google ScholarGoogle Scholar
  55. [55] Prafulla B. Bafna, Jatinderkumar R. Saini. 2020. On exhaustive evaluation of eager machine learning algorithms for classification of Hindi verses. International Journal of Advanced Computer Science and Applications 11, 2 (2020), 181--185.Google ScholarGoogle Scholar
  56. [56] Khandelwal Ankush, Swami Sahil, Akhtar Syed Sarfaraz, and Shrivastava Manish. 2018. Gender prediction in english-hindi code-mixed social media content: Corpus and baseline system. Computación y Sistemas 22, 4 (2018), 12411247.Google ScholarGoogle Scholar
  57. [57] Vijay Deepanshu, Bohra Aditya, Singh Vinay, Akhtar Syed S., and Shrivastava Manish. 2018. A dataset for detecting irony in hindi-english code-mixed social media text. In Proceedings of the 15th Extended Semantic Web Conference (ESWC-2018).Google ScholarGoogle Scholar
  58. [58] Ravi Kumar and Ravi Vadlamani. 2016. Sentiment classification of Hinglish text. In Proceedings of the 3rd International Conference on Recent Advances in Information Technology (RAIT), IEEE.Google ScholarGoogle ScholarCross RefCross Ref
  59. [59] Nanda Charu, Dua Mohit, and Nanda Garima. 2018. Sentiment analysis of movie reviews in Hindi language using machine learning. In Proceedings of the International Conference on Communication and Signal Processing 2018.Google ScholarGoogle ScholarCross RefCross Ref
  60. [60] Yadav Mukesh and Bhojane Varunakshi. 2019. Semi supervised mix Hindi sentiment analysis using neural network. In Proceedings of the 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence).Google ScholarGoogle ScholarCross RefCross Ref
  61. [61] Kaushika Pal and Biraj V. Patel. 2020. Model for classification of poems in Hindi language based on ras. Smart Systems and IoT: Innovations in Computing 141 (2020), 655--661. DOI: https://doi.org/10.1007/978-981-13-8406-6Google ScholarGoogle Scholar
  62. [62] Venugopalan M. and Gupta D.. 2015. Sentiment classification for Hindi tweets in a constrained environment augmented using tweet specific features. In Proceedings of the International Conference On Mining Intelligence and Knowledge Exploration. Springer, 664670.Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. [63] Yaman Kumar, Debanjan Mahata, Sagar Aggarwal, Anmol Chugh, Rajat Maheshwari, and Rajiv Ratn Shah. 2019. BHAAV (_pg)---A Text Corpus for Emotion Analysis from Hindi Stories. CoRR abs/1910.04073 (2019). DOI: 10.5281/zenodo.3457467Google ScholarGoogle Scholar
  64. [64] Agarwal Basant and Mittal Namita. 2016. Prominent feature extraction for review analysis: An empirical study. Journal of Experimental & Theoretical Artificial Intelligence 28, 3 (2016), 485498.Google ScholarGoogle ScholarCross RefCross Ref
  65. [65] Vijay Deepanshu, Bohra Aditya, Singh Vinay, Akhtar Syed S., and Shrivastava Manish. 2018. Corpus creation and emotion prediction for hindi-english code-mixed social media text. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, 128135.Google ScholarGoogle ScholarCross RefCross Ref
  66. [66] Swami S., Khandelwal A., Singh V., Akhtar S. S., and Shrivastava M.. 2018. A corpus of english-hindi code-mixed tweets for sarcasm detection. In Proceedings of the 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing).Google ScholarGoogle Scholar
  67. [67] Ankush Khandelwal, Sahil Swami, Syed S. Akhtar, and Manish Shrivastava. 2018. Humor detection in english-hindi code-mixed social media content: Corpus and baseline system. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC), (2018).Google ScholarGoogle Scholar
  68. [68] Alatabi Hayder A. and Abbas Ayad R.. 2020. Sentiment analysis in social media using machine learning techniques. Iraqi Journal of Science 61, 1 (2020), 193201.Google ScholarGoogle ScholarCross RefCross Ref
  69. [69] Otter D. W., Medina J. R., and Kalita J. K.. 2020. A survey of the usages of deep learning for natural language processing. IEEE Transactions on Neural Networks and Learning Systems 32, 2 (2020), 604624.Google ScholarGoogle ScholarCross RefCross Ref
  70. [70] S. Dargan, M. Kumar, and M. R. Ayyagari. 2020. A survey of deep learning and its applications: A new paradigm to machine learning. Archives of Computational Methods in Engineering 27, 4 (2020), 1071--1092. DOI: https://doi.org/10.13871007/s11831-019-09344-wGoogle ScholarGoogle Scholar
  71. [71] Seshadri S., Madasamy A. K., and Padannayil S. K.. 2016. Analyzing sentiment in Indian languages micro text using recurrent neural network. Institute of Integrative Omics and Applied Biotechnology (IIOAB Journal) 7 (2016), 313318.Google ScholarGoogle Scholar
  72. [72] Li Wenling, Jin Bo, and Quan Yu. 2020. Review of research on text sentiment analysis based on deep learning. Open Access Library Journal 7, 3 (2020), 18.Google ScholarGoogle Scholar
  73. [73] Akhtar M. S., Sawant P., Sen S., Ekbal A., and Bhattacharyya P.. 2018. Solving data sparsity for aspect-based sentiment analysis using cross-linguality and multi-linguality. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Vol. 1, 572582.Google ScholarGoogle ScholarCross RefCross Ref
  74. [74] Rani S. and Kumar P.. 2018. Deep learning-based sentiment analysis using convolution neural network. Arabian Journal for Science and Engineering 44, 4 (2018), 33053314.Google ScholarGoogle ScholarCross RefCross Ref
  75. [75] Mathur Puneet, Shah Rajiv Ratn, Sawhney Ramit, and Mahata Debanjan. 2018. Detecting offensive tweets in hindi-english code-switched language. In Proceedings of the 6th International Workshop on Natural Language Processing for Social Media, 1826.Google ScholarGoogle ScholarCross RefCross Ref
  76. [76] Sane Sushmitha Reddy, Tripathi Suraj, Sane Koushik Reddy, and Mamidi Radhika. 2019. Deep learning techniques for humor detection in hindi-english code-mixed tweets. In Proceedings of the 10th Workshop on Computational Approaches to Subjectivity, Sentiment, and Social Media Analysis, 5761.Google ScholarGoogle ScholarCross RefCross Ref
  77. [77] Singh Pranaydeep and Lefever Els. 2020. Sentiment analysis for hinglish code-mixed tweets by means of cross-lingual word embeddings. In Proceedings of the LREC—4th Workshop on Computational Approaches to Code Switching, 4551.Google ScholarGoogle Scholar
  78. [78] Santosh T. Y. S. S. and Aravind K. V. S.. 2019. Hate speech detection in hindi-english code-mixed social media text. Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, Cods-COMAD 2019, 310313.Google ScholarGoogle ScholarDigital LibraryDigital Library
  79. [79] Joshi Aditya, Prabhu Ameya, Shrivastava Manish, and Varma Vasudeva. 2016. Towards sub-word level compositions for sentiment analysis of hindi-english code-mixed text. In Proceedings of 26th International Conference on Computational Linguistics (COLING), 24822491.Google ScholarGoogle Scholar
  80. [80] Singhal P. and Bhattacharyya P.. 2016. Borrow a little from your rich cousin: Using embeddings and polarities of english words for multilingual sentiment classification. In Proceedings of the International Conference on Computational Linguistics (COLING).Google ScholarGoogle Scholar
  81. [81] Bhargava R., Arora S., and Sharma Y.. 2018. Neural network-based architecture for sentiment analysis in Indian languages. Journal of Intelligent Systems 28, 3 (2018), 361375.Google ScholarGoogle ScholarCross RefCross Ref
  82. [82] Akhtar Md Shad, Kumar Abhishek, Ekbal Asif, Biemann Chris, and Bhattacharyya Pushpak. 2019. Language-agnostic model for aspect-based sentiment analysis. In Proceedings of the 13th International Conference on Computational Semantics-Long Papers, 154164.Google ScholarGoogle ScholarCross RefCross Ref
  83. [83] Akhtar Md Shad, Garg Tarun, and Ekbal Asif. 2020. Multi-task learning for aspect term extraction and aspect sentiment classification. Neurocomputing 398 (2020), 247256.Google ScholarGoogle ScholarCross RefCross Ref
  84. [84] Mukherjee Siddhartha. 2020. Deep learning technique for sentiment analysis of hindi-english code-mixed text using late fusion of character and word features. In Proceedings of the IEEE 16th India Council International Conference (INDICON), 2020.Google ScholarGoogle Scholar
  85. [85] Sonali Rajesh Shah and Abhishek Kaushik. 2019. Sentiment analysis on Indian indigenous languages: A review on multilingual opinion mining. CoRR abs/1911.12848 (2019). DOI: 10.20944/preprints201911.0338.v1Google ScholarGoogle Scholar
  86. [86] Siddhartha Bhattacharyya Koyel. 2018. Relook into sentiment analysis performed on Indian languages using deep learning. In Proceedings of the 4th International Conference on Research in Computational Intelligence and Communication Networks.Google ScholarGoogle Scholar
  87. [87] Joshi Ramchandra, Goel Purvi, and Joshi Raviraj. 2019. Deep learning for Hindi text classification: A comparison. In Proceedings of the International Conference on Intelligent Human Computer Interaction.Google ScholarGoogle Scholar
  88. [88] Akhtar M. S., Kumar A., Ekbal A., and Bhattacharyya P.. 2016. A hybrid deep learning architecture for sentiment analysis. In Proceedings of the 26th International Conference on Computational Linguistics, 482493.Google ScholarGoogle Scholar
  89. [89] Madan Gopal Jhanwar and Arpita Das. 2018. An ensemble model for sentiment analysis of hindi-english code-mixed data. CoRR abs/1806.04450 (2018).Google ScholarGoogle Scholar
  90. [90] Garg Kanika and Lobiyal D. K.. 2018. Multi-class classification of sentiments in Hindi sentences based on intensities. Towards Extensible and Adaptable Methods in Computing. Chakraverty S., Goel A., and Misra S. (Eds.), Springer Nature, 2018.Google ScholarGoogle ScholarCross RefCross Ref
  91. [91] Tarwani Shrikant, Jethanandani Manan, and Kant Vibhor. 2019. Cyberbullying detection in hindi-english code-mixed language using sentiment classification. Advances in Computing and Data Sciences. Singh M., Gupta P., Tyagi V., Flusser J., Ören T., and Kashyap R. (Eds.), Springer.Google ScholarGoogle ScholarCross RefCross Ref
  92. [92] Surbhi Maheshwari, Pallavi Gupta, and Ritu Dhabhai. 2017. Localized sentiment analysis using random walk algorithm in hindi. Review of Business and Technology Research 14, 1 (2017), 70--76.Google ScholarGoogle Scholar
  93. [93] Pravalika A., Oza Vishvesh, Meghana N. P., and Sowmya Kamath S.. 2017. Domain-specific sentiment analysis approaches for code-mixed social network data. In Proceedings of the 8th International Conference on Computing Communication and Networking Technologies.Google ScholarGoogle ScholarCross RefCross Ref
  94. [94] Ubale Sumedha, Sarang Ankita, Wadye Kajol, and Patil Nita. 2018. Hindi sentiment analysis. International Journal on Future Revolution in Computer Science & Communication Engineering 4, 4 (2018), 536540.Google ScholarGoogle Scholar
  95. [95] Sitaram Dinkar, Murthy Savitha, Ray Debraj, Sharma Devansh, and Dhar Kashyap. 2015. Sentiment analysis of mixed language employing Hindi-English code switching. In Proceedings of the International Conference on Machine Learning and Cybernetics.Google ScholarGoogle ScholarCross RefCross Ref
  96. [96] Pundlik Sumitra, Kasbekar Prachi, Gaikwad Gajanan, Dasare Prasad, Gawade Akshay, and Pundlik Purushottam. 2016. Multiclass classification and class based sentiment analysis for Hindi language. In Proceedings of the International Conference on Advances in Computing, Communications and Informatics.Google ScholarGoogle ScholarCross RefCross Ref
  97. [97] Pandey P. and Govilkar S.. 2015. A framework for sentiment analysis in Hindi using HSWN. International Journal of Computer Applications 119, 19 (2015), 2326.Google ScholarGoogle ScholarCross RefCross Ref
  98. [98] Mittal Namita, Agarwal Basant, Chouhan Garvit, Pareek Prateek, and Bania Nitin. 2013. Discourse based sentiment analysis for Hindi reviews. In Proceedings of the International Conference on Pattern Recognition and Machine Intelligence, 720725.Google ScholarGoogle ScholarCross RefCross Ref
  99. [99] Rai Vartika, Vijay Sakshee, and Sharma Dipti Misra. 2017. Transfer of polarity score for sentiment classification in Hindi. In Proceedings of the 14th International Conference on Natural Language Processing, 373382.Google ScholarGoogle Scholar
  100. [100] Surbhi Maheshwari, Pallavi Gupta, and Ritu Dhabhai. 2017. Localized sentiment analysis using random walk algorithm in hindi. Review of Business and Technology Research 14, 1 (2017), 70--76.Google ScholarGoogle Scholar
  101. [101] Ansari Mohammed Arshad and Govilkar. Prof. Sharvari 2016. Sentiment analysis of transliterated hindi and marathi script. In Proceedings of the 6th International Conference on Computational Intelligence and Information Technology.Google ScholarGoogle Scholar
  102. [102] Komal Garg and Buttar Preetpal Kaur. 2017. Aspect based sentiment analysis of Hindi text review. International Journal of Advanced Research in Computer Science 8, 7 (2017), 831836.Google ScholarGoogle Scholar
  103. [103] C. Dalal, S. Tandon, and A. Mukerjee. 2014. Insult detection in Hindi. Technical Report on Artificial Intelligence 18, 1 (2014), 1--8.Google ScholarGoogle Scholar
  104. [104] Bharti Santosh Kumar, Babu Korra Sathya, and Jena Sanjay Kumar. 2017. Harnessing online news for sarcasm detection in Hindi tweets. In Proceedings of the 7th International Conference on Pattern Recognition and Machine Intelligence.Google ScholarGoogle ScholarCross RefCross Ref
  105. [105] Bharti Santosh Kumar, Babu Korra Sathya, and Raman Rahul. 2017. Context-based sarcasm detection in Hindi tweets. In Proceedings of the 9th International Conference on Advances in Pattern Recognition.Google ScholarGoogle ScholarCross RefCross Ref
  106. [106] Jain A., Yadav D., and Tayal D. K.. 2014. NER for Hindi language using association rules. In Proceedings of the International Conference on Data Mining and Intelligent Computing.Google ScholarGoogle ScholarCross RefCross Ref
  107. [107] Jhaa Vikas Kumar, Pa Hrudya, Na Vinu P., Vijayana Vishnu, and Pa Prabaharan. 2020. DHOT-Repository and classification of offensive tweets in the Hindi Language. In Proceedings of the 3rd International Conference on Computing and Network Communications.Google ScholarGoogle ScholarCross RefCross Ref
  108. [108] Kumari Archana and Lobiyal D. K.. 2020. Word2vec's distributed word representation for hindi word sense disambiguation. In Distributed Computing and Internet Technology. Hung D., D´Souza M. (Eds.), Lecture Notes in Networks and Systems, Vol. 11969, Springer.Google ScholarGoogle Scholar
  109. [109] Sharmis Balamurali A., Joshi A., and Bhattacharyya P.. 2012. Cross-lingual sentiment analysis for Indian languages using linkedWordnets. In Proceedings of 24th International Conference on Computational Linguistics, 7382.Google ScholarGoogle Scholar
  110. [110] Ananthakrishnan Ramanathan, and Rao Durgesh D.. 2003. A lightweight stemmer for Hindi. In Proceedings of the 10th conference on European Chapter of the Association for Computational Linguistics.Google ScholarGoogle Scholar
  111. [111] Ghosh Aanusha and Dutta Indranil. 2014. Real Time Sentiment Analysis of HindiTweets.In Proceedings of the 35th Conference of the Linguistic Society of Nepal.Google ScholarGoogle Scholar
  112. [112] Nhu Viet-Ha, Shirzadi Ataollah, Shahabi Himan, Singh Sushant K., Al-Ansari Nadhir, Clague John J., Jaafari Abolfazl, Chen Wei, Miraki Shaghayegh, Dou Jie, Luu Chinh, Górski Krzysztof, Pham Binh Thai, Nguyen Huu Duy, and Ahmad Baharin Bin. 2020. Shallow landslide susceptibility mapping: A comparison between logistic model tree, logistic regression, naïve bayes tree, artificial neural network, and support vector machine algorithms. International Journal of Environmental Research and Public Health 17 8, (2020), 2749.Google ScholarGoogle ScholarCross RefCross Ref
  113. [113] Saroj Anita, kumar Munodtiya Rajesh, and Pal Sukomal. 2018. Rule based event extraction system from newswires and social media text in Indian Languages (EventXtract-IL) for English and Hindi data. In Proceedings of the 10th Meeting of Forum for Information Retrieval Evaluation (FIRE) 2018.Google ScholarGoogle Scholar
  114. [114] S. R. Narang, M. K. Jindal, and M. Kumar. 2019. Devanagari ancient character recognition using DCT features with adaptive boosting and bootstrap aggregating. Soft Computing 23, 1 (2019), 13603--13614. DOI: https://doi.org/10.1007/s00500-019-03897-5Google ScholarGoogle Scholar
  115. [115] Tao Jie and Fang Xing. 2020. Toward multilabel sentiment analysis: a transfer learning-based approach. Journal of Big Data 7, 1, (2020), 126.Google ScholarGoogle ScholarCross RefCross Ref
  116. [116] Shyamasunda L. B. and Jhansi Rani P.. 2020. A multiple-layer machine learning architecture for improved accuracy in sentiment analysis. The Computer Journal 63, 3 (2020), 395409.Google ScholarGoogle ScholarCross RefCross Ref
  117. [117] Dang Nhan Cach, Moreno-García María N., and De la Prieta Fernando. 2020. Sentiment analysis based on deep learning: A comparative study. Electronics Journal 9, 3 (2020), 483.Google ScholarGoogle ScholarCross RefCross Ref
  118. [118] Nankani H., Dutta H., Shrivastava H., Krishna Rama P. V. N. S., Mahata D., and Shah R. R.. 2020. Multilingual sentiment analysis. Deep Learning-Based Approaches for Sentiment Analysis. Algorithms for Intelligent Systems, Basant Agarwal, Richi Nayak, Namita Mittal, and Srikanta Patnaik (Eds.). Springer.Google ScholarGoogle ScholarCross RefCross Ref
  119. [119] Sharma Sanur and Jain Anurag. 2020. Hybrid ensemble learning with feature selection for sentiment classification in social media. International Journal of Information Retrieval Research 10, 2 (2020), 4058.Google ScholarGoogle ScholarCross RefCross Ref
  120. [120] Phani S., IIEST S., Lahiri S., and Biswas A.. 2016. Sentiment analysis of tweets in three Indian languages. In Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing.Google ScholarGoogle Scholar
  121. [121] Hindi Polarity Labeled Corpora (Domain: Movie) (Domain Tourism) dataset. 2010. Last accessed:2020, Retrieved from http://www.cfilt.iitb.ac.in/Sentiment_Analysis_Resources.html.Google ScholarGoogle Scholar
  122. [122] SAIL 2015 dataset. Retrieved 2020 from http://amitavadas.com/SAIL/data.html.Google ScholarGoogle Scholar
  123. [123] Md Shad Akhtar, Asif Ekbal, Pushpak Bhattacharyya. 2016. Aspect Based Sentiment Analysis in Hindi: Resource Creation and Evaluation. Retrieved 2021 from https://www.iitp.ac.in/∼ai-nlp-ml/resources.html.Google ScholarGoogle Scholar
  124. [124] Deepanshu Vijay, Aditya Bohra, Vinay Singh, Syed S. Akthar, and Manish Shrivastava. 2018. Irony-Detection-Hindi-English-Code-Mixed Dataset. Retrieved 2018 from https://github.com/deepanshu1995/Irony-Detection-Hindi-English-Code-Mixed.Google ScholarGoogle Scholar
  125. [125] Yaman Kumar, Debanjan Mahata, Sagar Aggarwal, Anmol Chugh, Rajat Maheshwari, Rajiv Ratn Shah, BHAAV dataset. 2019. Retrieved 2019 from https://github.com/midas-research/bhaav.Google ScholarGoogle Scholar
  126. [126] Towards Sub-Word Level Compositions for Sentiment Analysis of Hi-En Code Mixed Text, COLING 2016. 2017. Retrieved 2018 from https://github.com/DrImpossible/Sub-word-LSTM.Google ScholarGoogle Scholar
  127. [127] Hindi news dataset. 2017. Retrieved 2021 from https://github.com/sbharti1984/Hindi-News.Google ScholarGoogle Scholar
  128. [128] Hindi tweets dataset. 2017. Retrieved 2021 from https://github.com/sbharti1984/Hindi-Tweets.Google ScholarGoogle Scholar
  129. [129] Sahil Swami, Ankush Khandelwal, Vinay Singh, Syed Sarfaraz Akhtar, and Manish Shrivastava. 2018. A Corpus of English-Hindi Code-Mixed Tweets for Sarcasm Detection. Retrieved 2018 from https://github.com/sahilswami96/SarcasmDetectionCodeMixed.Google ScholarGoogle Scholar
  130. [130] Ankush Khandelwal, Sahil Swami, Syed S. Akhtar, Manish Shrivastava. 2018. Humor detection corpus dataset. Retrieved 2021 from https://github.com/Ankh2295/humor-detection-corpus.Google ScholarGoogle Scholar
  131. [131] Vikas Kumar Jhaa, Hrudya Pa, Vinu P. Na, Vishnu Vijayana, Prabaharan Pa. 2019. * DHOT-Repository and Classification of Offensive Tweets in the Hindi Language, DHOT dataset. Retrieved 2021 from https://github.com/vikaskumarjha9/hindi_abusive_dataset.Google ScholarGoogle Scholar
  132. [132] Hindi shallow parser. 2018. Retrieved 2020 from http://ltrc.iiit.ac.in/showfile.php?filename=downloads/shallow_parser.php.Google ScholarGoogle Scholar
  133. [133] Shabdakosh. 2003. Retrieved 2021 from http://www.shabdkosh.com/.Google ScholarGoogle Scholar
  134. [134] Shabdanjali. 2000. Retrieved 2021 from https://ltrc.iiit.ac.in/Dictionaries/Shabdanjali/dict-README.html.Google ScholarGoogle Scholar
  135. [135] Hindi wordnet. 2010. Retrieved 2021 from http://www.cfilt.iitb.ac.in/wordnet/webhwn/.Google ScholarGoogle Scholar
  136. [136] API for accessing Hindi wordnet. 2010. Retrieved 2021 from http://www.cfilt.iitb.ac.in/wordnet/webhwn/API_downloaderInfo.php.Google ScholarGoogle Scholar
  137. [137] Hindi POS tagger by CDAC. 2014. Retrieved 2016 from http://nlp.cdacmumbai.in/tools.html.Google ScholarGoogle Scholar
  138. [138] Hindi dependency parser. 2017. Retrieved 2021 from http://sivareddy.in/downloads#hindi_tools.Google ScholarGoogle Scholar
  139. [139] Tokenizer. 2019. Retrieved 2019 from http://docs.cltk.org/en/latest/hindi.html.Google ScholarGoogle Scholar
  140. [140] Fasttext7 word embeddings. 2019. Retrieved 2021 from https://github.com/facebookresearch/.Google ScholarGoogle Scholar
  141. [141] Hindi--English Dictionary. 2014. Retrieved 2020 from http://www.cfilt.iitb.ac.in/hdict/webinterface_user/.Google ScholarGoogle Scholar
  142. [142] Lightweight stemmer for Hindi. 2015. Retrieved 2021 from research.variancia.com/hindi_stemmer.Google ScholarGoogle Scholar
  143. [143] Hindi POS tagger. 2014. Retrieved 2017 from http://nltr.org/snltr-software/.Google ScholarGoogle Scholar
  144. [144] Distributional Thesaurus (DT). 2017. Retrieved 2021 from http://ltmaggie.informatik.uni-hamburg.de/jobimtext/documentation/calculate-a-distributional-thesaurus-dt/.Google ScholarGoogle Scholar

Index Terms

  1. Sentiment Analysis in Hindi—A Survey on the State-of-the-art Techniques

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 21, Issue 1
      January 2022
      442 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3494068
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 November 2021
      • Accepted: 1 June 2021
      • Revised: 1 May 2021
      • Received: 1 August 2020
      Published in tallip Volume 21, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!