skip to main content
research-article

Rule Based Fuzzy Computing Approach on Self-Supervised Sentiment Polarity Classification with Word Sense Disambiguation in Machine Translation for Hindi Language

Authors Info & Claims
Published:09 May 2023Publication History
Skip Abstract Section

Abstract

With increasing globalization, communication among people of diverse cultural backgrounds is also taking place to a very large extent in the present era. Issues like language diversity in various parts of the world can lead to hindrance in communication. The usage of social media and user-generated material has grown at an exponential rate and existing supervised sentiment polarity classification techniques need labelling for the training dataset. In this study, two problems have been analyzed. First, sentiment analysis of the Twitter dataset and sense disambiguation of morphologically rich Hindi language. A rule-based fuzzy logics-based system for self-supervised sentiment classification was used to compute and analyze the self-supervised or completely unsupervised sentiment categorization of a social-media dataset using three types of lexicons.  The combination of fuzzy with three different types of lexicons gives sentiment analysis a new path. The unsupervised fuzzy rules integrate the fuzziness of both negative as well as positive scores, and fuzzy logic-based systems can cope with ambiguity and vagueness. The fuzzy-system uses an unsupervised/self-supervised fuzzy rule-based technique to identify text using natural language processing (NLP) and sense of word. We compared the results of fuzzy rule based self-supervised sentiment classification by using three types of lexicons on five different datasets, with unsupervised as well as supervised sentiment classification techniques. Second, using cross-lingual sense embedding rather than cross-lingual word embedding resolves the ambiguity issue. The word sense embeddings are produced for the source languages to learn multiple or various senses of the words. Different evaluation metrics depict an improved performance for English-Hindi language.

REFERENCES

  1. Abdi A., Shamsuddin S. M., Hasan S., and Piran J.. 2019. Deep learning-based sentiment classification of evaluative text based on multi-feature fusion. Information Processing & Management 56, 4 (2019), 12451259.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Agarwal B., Mittal N., Bansal P., and Garg S.. 2015. Sentiment analysis using common-sense and context information. Computational Intelligence and Neuroscience (2015).Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Akhtar S., Ghosal D., Ekbal A., Bhattacharyya P., and Kurohashi S.. 2019. All-in-one: Emotion, sentiment and intensity prediction using a multi-task ensemble framework. IEEE Transactions on Affective Computing (2019).Google ScholarGoogle Scholar
  4. Altrabsheh N., Cocea M., and Fallahkhair S.. 2014. Sentiment analysis: Towards a tool for analysing real-time students feedback. In 2014 IEEE 26th International Conference on Tools with Artificial Intelligence. IEEE. 419423.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Amazon Dataset: https://www.kaggle.com/c/sentiment-analysis-on-movie-reviews/data.Google ScholarGoogle Scholar
  6. Artetxe M., Labaka G., Agirre E., and Cho K.. 2017. Unsupervised neural machine translation. arXiv preprint arXiv:1710.11041.Google ScholarGoogle Scholar
  7. Artetxe M., Labaka G., and Agirre E.. 2018. A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings. arXiv preprint arXiv:1805.06297.Google ScholarGoogle Scholar
  8. Asghar Muhammad Z. et al. 2021. Senti-eSystem: A sentiment-based eSystem-using hybridized fuzzy and deep neural network for measuring customer satisfaction. Software: Practice and Experience 51, 3 (2021), 571594.Google ScholarGoogle ScholarCross RefCross Ref
  9. Baccianella S., Esuli A., and Sebastiani F.. 2010. SentiWordNet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In LREC 10, 2010, 22002204.Google ScholarGoogle Scholar
  10. Barbosa L. and Feng J.. 2010. Robust sentiment detection on Twitter from biased and noisy data. In Coling 2010: Posters. 3644.Google ScholarGoogle Scholar
  11. Banerjee S. and Lavie A.. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization. 6572.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Bengio Y. and LeCun Y.. 2007. Scaling learning algorithms towards AI. Large-Scale Kernel Machines 34, 5 (2007), 141.Google ScholarGoogle Scholar
  13. Baruah R., Mundotiya R. K., and Singh A. K.. 2021. Low resource neural machine translation: Assamese to/from other Indo-Aryan (Indic) languages. Transactions on Asian and Low-Resource Language Information Processing 21, 1 (2021), 132.Google ScholarGoogle Scholar
  14. Cambria E., Poria S., Hazarika D., and Kwok K.. 2018. SenticNet 5: Discovering conceptual primitives for sentiment analysis by means of context embeddings. In Proceedings of the AAAI Conference on Artificial Intelligence 32, 1 (2018).Google ScholarGoogle ScholarCross RefCross Ref
  15. Devlin J., Chang M. W., Lee K., and Toutanova K.. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.Google ScholarGoogle Scholar
  16. Doddington G.. 2002. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In Proceedings of the Second International Conference on Human Language Technology Research. 138145.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Domingos P. and Pazzani M.. 1997. On the optimality of the simple Bayesian classifier under zero-one loss. Machine Learning 29, 2 (1997), 103130.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Duţu L. C., Mauris G., and Bolon P.. 2017. A fast and accurate rule-base generation method for Mamdani fuzzy systems. IEEE Transactions on Fuzzy Systems 26, 2 (2017), 715733.Google ScholarGoogle ScholarCross RefCross Ref
  19. Go A., Bhayani R., and Huang L.. 2009. Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford 1, 12 (2009), 2009.Google ScholarGoogle Scholar
  20. Godara J., Aron R., and Shabaz M.. 2022. Sentiment analysis and sarcasm detection from social network to train health-care professionals. World Journal of Engineering 19 1 (2022), 124133.Google ScholarGoogle ScholarCross RefCross Ref
  21. Hutto C. and Gilbert E.. 2014. VADER: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the International AAAI Conference on Web and Social Media 8, 1 (2014).Google ScholarGoogle ScholarCross RefCross Ref
  22. IMDB dataset: https://www.kaggle.com/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews.Google ScholarGoogle Scholar
  23. Jang J. S. R., Sun C. T., and Mizutani E.. 1997. Neuro-fuzzy and soft computing-a computational approach to learning and machine intelligence [Book Review]. IEEE Transactions on Automatic Control 42, 10 (1997), 14821484.Google ScholarGoogle ScholarCross RefCross Ref
  24. Joulin A., Grave E., Bojanowski P., and Mikolov T.. 2016. Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759.Google ScholarGoogle Scholar
  25. Jurek A., Mulvenna M. D., and Bi Y.. 2015. Improved lexicon-based sentiment analysis for social media analytics. Security Informatics 4, 1 (2015), 113.Google ScholarGoogle ScholarCross RefCross Ref
  26. Kim Y., Geng J., and Ney H.. 2019. Improving unsupervised word-by-word translation with language model and denoising autoencoder. arXiv preprint arXiv:1901.01590.Google ScholarGoogle Scholar
  27. Kingma D. P. and Ba J.. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.Google ScholarGoogle Scholar
  28. Lin C. Y.. 2004. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out. 7481.Google ScholarGoogle Scholar
  29. Liu B.. 2012. Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5, 1 (2012), 1167.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. López V., Del Río S., Benítez J. M., and Herrera F.. 2015. Cost-sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data. Fuzzy Sets and Systems 258 (2015), 538.Google ScholarGoogle ScholarCross RefCross Ref
  31. Jain S. S., Gupta R., Tiwari C., and Kaur N.. 2019. Behaviour of players on IPL based on fuzzy C means. International Journal of Innovative Technology and Exploring Engineering 8, 9S (2019), 150--154.Google ScholarGoogle Scholar
  32. Mamdani E. H. and Assilian S.. 1975. An experiment in linguistic synthesis with a fuzzy logic controller. International Journal of Man-Machine Studies 7, 1 (1975), 113.Google ScholarGoogle ScholarCross RefCross Ref
  33. Montoro A., Olivas J. A., Peralta A., Romero F. P., and Serrano-Guerrero J.. 2018. An ANEW based fuzzy sentiment analysis model. In 2018 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). IEEE. 17.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Moraes R., Valiati João Francisco, and Gavião Neto Wilson P.. 2013. Document-level sentiment classification: An empirical comparison between SVM and ANN. Expert Syst. Appl. 40, 2 (2013), 621633.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Neviarouskaya A., Prendinger H., and Ishizuka M.. 2011. SentiFul: A lexicon for sentiment analysis. IEEE Transactions on Affective Computing 2, 1 (2011), 2236.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Nielsen F.. 2011. AFINN. Richard Petersens Plads, Building 321, DK-2800 Kgs. Lyngby.Google ScholarGoogle Scholar
  37. Ortigosa A., Martín J. M., and Carro R. M.. 2014. Sentiment analysis in Facebook and its application to e-learning. Computers in Human Behavior 31 (2014), 527541.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Oyebode O., Alqahtani F., and Orji R.. 2020. Using machine learning and thematic analysis methods to evaluate mental health apps based on user reviews. IEEE Access 8 (2020), 111141111158.Google ScholarGoogle ScholarCross RefCross Ref
  39. Papineni K., Roukos S., Ward T., and Zhu W. J.. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. 311318.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Park C. W. and Seo D. R.. 2018. Sentiment analysis of Twitter corpus related to artificial intelligence assistants. In 2018 5th International Conference on Industrial Engineering and Applications (ICIEA). IEEE. 495498.Google ScholarGoogle ScholarCross RefCross Ref
  41. Patwa P., Aguilar G., Kar S., Pandey S., PYKL S., Gambäck B.,… and Das A.. 2020. SemEval-2020 task 9: Overview of sentiment analysis of code-mixed tweets. arXiv e-prints, arXiv-2008.Google ScholarGoogle Scholar
  42. Pelevina M., Arefyev N., Biemann C., and Panchenko A.. 2017. Making sense of word embeddings. arXiv preprint arXiv:1708.03390.Google ScholarGoogle Scholar
  43. Perikos I. and Hatzilygeroudis I.. 2013. Recognizing emotion presence in natural language sentences. In International Conference on Engineering Applications of Neural Networks. Springer, Berlin. 3039.Google ScholarGoogle ScholarCross RefCross Ref
  44. Qin Y., Wang X., and Xu Z.. 2022. Ranking tourist attractions through online reviews: A novel method with intuitionistic and hesitant fuzzy information based on sentiment analysis. International Journal of Fuzzy Systems 24, 2 (2022), 755777.Google ScholarGoogle ScholarCross RefCross Ref
  45. Sahoo A. K., Sarangi P. K., and Yadav C. S.. 2022. Indian sign language recognition using ensemble based classifier combination. Macromolecular Symposia 401, 1 (2022), 2100286.Google ScholarGoogle ScholarCross RefCross Ref
  46. Saleena N.. 2018. An ensemble classification system for Twitter sentiment analysis. Procedia Computer Science 132 (2018), 937946.Google ScholarGoogle ScholarCross RefCross Ref
  47. Sanz J. A., Fernández A., Bustince H., and Herrera F.. 2013. IVTURS: A linguistic fuzzy rule-based classification system based on a new interval-valued fuzzy reasoning method with tuning and rule selection. IEEE Transactions on Fuzzy Systems 21, 3 (2013), 399411.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Serrano-Guerrero Jesus, Romero Francisco P., and Olivas Jose A.. 2021. Fuzzy logic applied to opinion mining: A review. Knowledge-Based Systems 222 (2021), 107018.Google ScholarGoogle ScholarCross RefCross Ref
  49. Shabaz M. and Garg U.. 2020. Clustering Yelp's sentiment data through various approaches and estimating the error rate. Materials Today: Proceedings (2020). .Google ScholarGoogle ScholarCross RefCross Ref
  50. Shelke N. M., Deshpande S., and Thakre V.. 2017. Exploiting expectation maximization algorithm for sentiment analysis of product reviews. In 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT). IEEE. 390396.Google ScholarGoogle ScholarCross RefCross Ref
  51. Shelke Y. and Chakraborty C.. 2021. An end-to-end shape-preserving point completion network. IEEE Computer Graphics and Applications 41, 3 (2021), 124138.Google ScholarGoogle ScholarCross RefCross Ref
  52. Shi S., Wu X., Su R., and Huang H.. 2022. Low-resource neural machine translation: Methods and trends. Transactions on Asian and Low-Resource Language Information Processing.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Simionescu C., Stoleru I., Lucaci D., Balan G., Bute I., and Iftene A.. 2019. UAIC at SemEval-2019 Task 3: Extracting much from little. In Proceedings of the 13th International Workshop on Semantic Evaluation. 355359.Google ScholarGoogle ScholarCross RefCross Ref
  54. Sivakumar M. and Srinivasulu R. U.. 2021. Aspect-based sentiment analysis of mobile phone reviews using LSTM and fuzzy logic. International Journal of Data Science and Analytics 12, 4 (2021), 355367.Google ScholarGoogle ScholarCross RefCross Ref
  55. Snover M., Madnani N., Dorr B., and Schwartz R.. 2009. Fluency, adequacy, or HTER? Exploring different human judgments with a tunable MT metric. In Proceedings of the Fourth Workshop on Statistical Machine Translation. 259268.Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Stock Market: https://www.kaggle.com/yash612/stockmarket-sentiment-dataset.Google ScholarGoogle Scholar
  57. Strapparava C. and Mihalcea R.. 2007. SemEval-2007 task 14: Affective text. In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). 7074.Google ScholarGoogle ScholarCross RefCross Ref
  58. Sun H., Wang R., Utiyama M., Marie B., Chen K., Sumita E., and Zhao T.. 2021. Unsupervised neural machine translation for similar and distant language pairs: An empirical study. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 20, 1 (2021), 117.Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Tan S., Li Y., Sun H., Guan Z., Yan X., Bu J.,… and He X.. 2013. Interpreting the public sentiment variations on Twitter. IEEE Transactions on Knowledge and Data Engineering 26, 5 (2013), 11581170.Google ScholarGoogle Scholar
  60. Thelwall M., Buckley K., and Paltoglou G.. 2012. Sentiment strength detection for the social web. Journal of the American Society for Information Science and Technology 63, 1 (2012), 163173.Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Tripathy A., Agrawal A., and Rath S. K.. 2016. Classification of sentiment reviews using n-gram machine learning approach. Expert Systems with Applications 57 (2016), 117126.Google ScholarGoogle ScholarDigital LibraryDigital Library
  62. Adrià Torrens Urrutia, M. Dolores Jiménez-López, and Vilém Novák. 2021. Fuzzy natural logic for sentiment analysis: A proposal. International Symposium on Distributed Computing and Artificial Intelligence. Springer, Cham, 2021.Google ScholarGoogle Scholar
  63. Vashishtha S. and Susan S.. 2019. Fuzzy rule based unsupervised sentiment analysis from social media posts. Expert Systems with Applications 138 (2019), 112834.Google ScholarGoogle ScholarCross RefCross Ref
  64. Vashishtha S. and Susan S.. 2020. Fuzzy interpretation of word polarity scores for unsupervised sentiment analysis. In 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT). IEEE. 16.Google ScholarGoogle ScholarCross RefCross Ref
  65. Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A. N., ... and Polosukhin I.. 2017. Attention is all you need. Advances in Neural Information Processing Systems (2017), 30.Google ScholarGoogle Scholar
  66. Windasari I. P., F. N. Uzzi , and Satoto K. I.. 2017. Sentiment analysis on Twitter posts: An analysis of positive or negative opinion on GoJek. In 2017 4th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE). IEEE. 266269.Google ScholarGoogle ScholarCross RefCross Ref
  67. Wöllmer M., Weninger F., Knaup T., Schuller B., Sun C., Sagae K., and Morency L. P.. 2013. YouTube movie reviews: Sentiment analysis in an audio-visual context. IEEE Intelligent Systems 28, 3 (2013), 4653.Google ScholarGoogle ScholarDigital LibraryDigital Library
  68. Xu F., Pan Z., and Xia R.. 2020. E-commerce product review sentiment classification based on a naïve Bayes continuous learning framework. Information Processing & Management 57, 5 (2020), 102221.Google ScholarGoogle ScholarCross RefCross Ref
  69. Yadav A., Jha C. K., Sharan A., and Vaish V.. 2020. Sentiment analysis of financial news using unsupervised approach. Procedia Computer Science 167 (2020), 589598.Google ScholarGoogle ScholarCross RefCross Ref
  70. Yan Y., Yang H., and Wang H. M. 2017. Two simple and effective ensemble classifiers for Twitter sentiment analysis. In 2017 Computing Conference. IEEE. 13861393.Google ScholarGoogle ScholarCross RefCross Ref
  71. Yang L., Li Y., Wang J., and Sherratt R. S.. 2020. Sentiment analysis for E-commerce product reviews in Chinese based on sentiment lexicon and deep learning. IEEE Access 8 (2020), 2352223530.Google ScholarGoogle ScholarCross RefCross Ref
  72. Yoo S., Song J., and Jeong O.. 2018. Social media contents based sentiment analysis and prediction system. Expert Systems with Applications 105 (2018), 102111.Google ScholarGoogle ScholarCross RefCross Ref
  73. Zadeh L. A.. 2015. Fuzzy logic—a personal perspective. Fuzzy Sets and Systems, 281, 420.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Rule Based Fuzzy Computing Approach on Self-Supervised Sentiment Polarity Classification with Word Sense Disambiguation in Machine Translation for Hindi Language

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 5
      May 2023
      653 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3596451
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 9 May 2023
      • Online AM: 22 February 2023
      • Accepted: 6 November 2022
      • Revised: 7 October 2022
      • Received: 2 June 2022
      Published in tallip Volume 22, Issue 5

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)138
      • Downloads (Last 6 weeks)10

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!