skip to main content
research-article

Vietnamese Sentiment Analysis: An Overview and Comparative Study of Fine-tuning Pretrained Language Models

Authors Info & Claims
Published:16 June 2023Publication History
Skip Abstract Section

Abstract

Sentiment Analysis (SA) is one of the most active research areas in the Natural Language Processing (NLP) field due to its potential for business and society. With the development of language representation models, numerous methods have shown promising efficiency in fine-tuning pre-trained language models in NLP downstream tasks. For Vietnamese, many available pre-trained language models were also released, including the monolingual and multilingual language models. Unfortunately, all of these models were trained on different architectures, pre-trained data, and pre-processing steps; consequently, fine-tuning these models can be expected to yield different effectiveness. In addition, there is no study focusing on evaluating the performance of these models on the same datasets for the SA task up to now. This article presents a fine-tuning approach to investigate the performance of different pre-trained language models for the Vietnamese SA task. The experimental results show the superior performance of the monolingual PhoBERT model and ViT5 model in comparison with previous studies and provide new state-of-the-art performances on five benchmark Vietnamese SA datasets. To the best of our knowledge, our study is the first attempt to investigate the performance of fine-tuning Transformer-based models on five datasets with different domains and sizes for the Vietnamese SA task.

REFERENCES

  1. [1] Farha Ibrahim Abu and Magdy Walid. 2021. A comparative study of effective approaches for Arabic sentiment analysis. Inf. Process. Manage. 58, 2 (2021), 102438. Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Agüero-Torales Marvin M., Salas José I. Abreu, and López-Herrera Antonio G.. 2021. Deep learning and multilingual sentiment analysis on social media data: An overview. Appl. Soft Comput. 107 (2021), 107373. Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Al-Ayyoub Mahmoud, Khamaiseh Abed Allah, Jararweh Yaser, and Al-Kabi Mohammed N.. 2019. A comprehensive survey of arabic sentiment analysis. Inf. Process. Manage. 56, 2 (2019), 320342.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Bach Ngo Xuan and Phuong Tu Minh. 2015. Leveraging user ratings for resource-poor sentiment classification. Proc. Comput. Sci. 60 (2015), 322331.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Bang Tran Sy, Haruechaiyasak Choochart, and Sornlertlamvanich Virach. 2015. Vietnamese sentiment analysis based on term feature selection approach. In Proceedings of the 10th International Conference on Knowledge Information and Creativity Support Systems. Springer, 196204.Google ScholarGoogle Scholar
  6. [6] Beltagy Iz, Peters Matthew E., and Cohan Arman. 2020. Longformer: The long-document transformer. CoRR abs/2004.05150 (2020). arXiv:2004.05150 https://arxiv.org/abs/2004.05150Google ScholarGoogle Scholar
  7. [7] Birjali Marouane, Kasri Mohammed, and Beni-Hssane Abderrahim. 2021. A comprehensive survey on sentiment analysis: Approaches, challenges and trends. Knowl.-Bas. Syst. 226 (2021), 107134. Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Brodersen Kay Henning, Ong Cheng Soon, Stephan Klaas Enno, and Buhmann Joachim M.. 2010. The balanced accuracy and its posterior distribution. In Proceedings of the 20th International Conference on Pattern Recognition. IEEE, 31213124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Bui The Viet, Tran Thi Oanh, and Le-Hong Phuong. 2020. Improving sequence tagging for vietnamese text using transformer-based neural models. In Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation. Association for Computational Linguistics, Hanoi, Vietnam, 1320.Google ScholarGoogle Scholar
  10. [10] Clark Kevin, Luong Minh-Thang, Le Quoc, and Manning Christopher D.. 2020. Pre-training transformers as energy-based cloze models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, 285294. Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Conneau Alexis, Khandelwal Kartikay, Goyal Naman, Chaudhary Vishrav, Wenzek Guillaume, Guzmán Francisco, Grave Edouard, Ott Myle, Zettlemoyer Luke, and Stoyanov Veselin. 2020. Unsupervised cross-lingual representation learning at scale. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 84408451. Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Dang Thin, Nguyen Vu, Kiet Nguyen, and Ngan Nguyen. 2019. A transformation method for aspect-based sentiment analysis. J. Comput. Sci. Cybernet. 34, 4 (2019), 323333.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Devlin Jacob, Chang Ming-Wei, Lee Kenton, and Toutanova Kristina. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, 41714186. Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Ding Ming, Zhou Chang, Yang Hongxia, and Tang Jie. 2020. CogLTX: Applying BERT to long texts. In Advances in Neural Information Processing Systems, Larochelle H., Ranzato M., Hadsell R., Balcan M. F., and Lin H. (Eds.), Vol. 33. Curran Associates, Inc., 1279212804.Google ScholarGoogle Scholar
  15. [15] Dodge Jesse, Ilharco Gabriel, Schwartz Roy, Farhadi Ali, Hajishirzi Hannaneh, and Smith Noah A.. 2020. Fine-tuning pretrained language models: Weight initializations, data orders, and early stopping. arXiv:2002.06305. Retrieved from https://arxiv.org/abs/2002.06305.Google ScholarGoogle Scholar
  16. [16] Duong Huu-Thanh and Thi Tram-Anh Nguyen,. 2021. A review: Preprocessing techniques and data augmentation for sentiment analysis. Comput. Soc. Netw. 8, 1 (2021), 116.Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Duyen Nguyen Thi, Bach Ngo Xuan, and Phuong Tu Minh. 2014. An empirical study on sentiment analysis for vietnamese. In Proceedings of the International Conference on Advanced Technologies for Communications. IEEE, 309314. Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Feng Steven Y., Gangal Varun, Wei Jason, Chandar Sarath, Vosoughi Soroush, Mitamura Teruko, and Hovy Eduard. 2021. A survey of data augmentation approaches for NLP. In Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP’21). Association for Computational Linguistics, 968988. Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Gao Zhengjie, Feng Ao, Song Xinyu, and Wu Xi. 2019. Target-dependent sentiment classification with BERT. IEEE Access 7 (2019), 154290154299. Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Gonzalez Jose Angel, Hurtado Lluís-F., and Pla Ferran. 2021. TWilBert: Pre-trained deep bidirectional transformers for spanish twitter. Neurocomputing 426 (2021), 5869.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Grandini Margherita, Bagli Enrico, and Visani Giorgio. 2020. Metrics for multi-class classification: An overview. arxiv:2008.05756. Retrieved from https://arxiv.org/abs/2008.05756.Google ScholarGoogle Scholar
  22. [22] Guo Demi, Rush Alexander, and Kim Yoon. 2021. Parameter-efficient transfer learning with diff pruning. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, 48844896. Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Ha Quang-Thuy, Vu Tien-Thanh, Pham Huyen-Trang, and Luu Cong-To. 2011. An upgrading feature-based opinion mining model on vietnamese product reviews. In Active Media Technology, Zhong Ning, Callaghan Vic, Ghorbani Ali A., and Hu Bin (Eds.). Springer, Berlin, 173185.Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Ha Quang-Vinh, Nguyen-Hoang Bao-Dai, and Nghiem Minh-Quoc. 2016. Lifelong learning for cross-domain vietnamese sentiment classification. In Computational Social Networks, Nguyen Hien T. and Snasel Vaclav (Eds.). Springer International Publishing, Cham, 298308.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Ho Vong Anh, Nguyen Duong Huynh-Cong, Nguyen Danh Hoang, Pham Linh Thi-Van, Nguyen Duc-Vu, Nguyen Kiet Van, and Nguyen Ngan Luu-Thuy. 2020. Emotion recognition for vietnamese social media text. In Computational Linguistics, Nguyen Le-Minh, Phan Xuan-Hieu, Hasida Kôiti, and Tojo Satoshi (Eds.). Springer, Singapore, Singapore, 319333.Google ScholarGoogle Scholar
  26. [26] Hoang Suong N., Nguyen Linh V., Huynh Tai, and Pham Vuong T.. 2019. An efficient model for sentiment analysis of electronic product reviews in vietnamese. In Future Data and Security Engineering, Dang Tran Khanh, Küng Josef, Takizawa Makoto, and Bui Son Ha (Eds.). Springer International Publishing, Cham, 132142.Google ScholarGoogle Scholar
  27. [27] Houlsby Neil, Giurgiu Andrei, Jastrzebski Stanislaw, Morrone Bruna, Laroussilhe Quentin De, Gesmundo Andrea, Attariyan Mona, and Gelly Sylvain. 2019. Parameter-efficient transfer learning for NLP. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research), Chaudhuri Kamalika and Salakhutdinov Ruslan (Eds.), Vol. 97. PMLR, 27902799.Google ScholarGoogle Scholar
  28. [28] Huang Yong, Liu Siwei, Qu Liangdong, and Li Yongsheng. 2020. Effective vietnamese sentiment analysis model using sentiment word embedding and transfer learning. In Data Science, Qin Pinle, Wang Hongzhi, Sun Guanglu, and Lu Zeguang (Eds.). Springer, Singapore, Singapore, 3646.Google ScholarGoogle Scholar
  29. [29] Huong Thien Ho and Hoang Vinh Truong. 2020. A data augmentation technique based on text for vietnamese sentiment analysis. In Proceedings of the 11th International Conference on Advances in Information Technology (IAIT2020). Association for Computing Machinery, New York, NY, Article 13, 5 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Huynh Huy Duc, Do Hang Thi-Thuy, Nguyen Kiet Van, and Nguyen Ngan Thuy-Luu. 2020. A simple and efficient ensemble classifier combining multiple neural network models on social media datasets in vietnamese. In Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation. Association for Computational Linguistics, 420429.Google ScholarGoogle Scholar
  31. [31] Katrekar Ashish and AVP Big Data Analytics. 2005. An Introduction to Sentiment Analysis. GlobalLogic Inc.Google ScholarGoogle Scholar
  32. [32] Tran Thien Khai and Phan Tuoi Thi. 2019. Deep learning application to ensemble learning-the simple, but effective, approach to sentiment classifying. Appl. Sci. 9, 13 (2019). Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Kieu Binh Thanh and Pham Son Bao. 2010. Sentiment analysis for vietnamese. In Proceedings of the 2nd International Conference on Knowledge and Systems Engineering. IEEE, 152157. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Le Lac Si, Thin Dang Van, Nguyen Ngan Luu-Thuy, and Trinh Son Quoc. 2020. A multi-filter BiLSTM-CNN architecture for vietnamese sentiment analysis. In Advances in Computational Collective Intelligence, Hernes Marcin, Wojtkiewicz Krystian, and Szczerbicki Edward (Eds.). Springer International Publishing, Cham, 752763.Google ScholarGoogle Scholar
  35. [35] Li Mingzheng, Chen Lei, Zhao Jing, and Li Qiang. 2021. Sentiment analysis of chinese stock reviews based on BERT model. Appl. Intell. 51, 7 (2021), 19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. [36] Li Menggang, Li Wenrui, Wang Fang, Jia Xiaojun, and Rui Guangwei. 2021. Applying BERT to analyze investor sentiment in stock market. Neural Comput. Appl. 33, 10 (2021), 46634676.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. [37] Liu Bing. 2012. Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5, 1 (2012), 1167.Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Liu Yinhan, Ott Myle, Goyal Naman, Du Jingfei, Joshi Mandar, Chen Danqi, Levy Omer, Lewis Mike, Zettlemoyer Luke, and Stoyanov Veselin. 2019. RoBERTa: A robustly optimized BERT pretraining approach. arXiv e-prints, arXiv.1907.Google ScholarGoogle Scholar
  39. [39] Loshchilov Ilya and Hutter Frank. 2019. Decoupled weight decay regularization. In Proceedings of the 7th International Conference on Learning Representations (ICLR’19, New Orleans, LA, USA, May 6-9, 2019). OpenReview.net. https://openreview.net/forum?id=Bkg6RiCqY7.Google ScholarGoogle Scholar
  40. [40] Luu Son, Nguyen Kiet, and Nguyen Ngan. 2020. Empirical study of text augmentation on social media text in vietnamese. In Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation. Association for Computational Linguistics, 462470.Google ScholarGoogle Scholar
  41. [41] Munikar Manish, Shakya Sushil, and Shrestha Aakash. 2019. Fine-grained sentiment classification using BERT. In Proceedings of the Artificial Intelligence for Transforming Business and Society (AITB’19), Vol. 1. IEEE, 15. Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Nguyen Hong Nam, Le Thanh Van, Le Hai Son, and Pham Tran Vu. 2014. Domain specific sentiment dictionary for opinion mining of vietnamese text. In Multi-disciplinary Trends in Artificial Intelligence. Springer International Publishing, Cham, 136148.Google ScholarGoogle Scholar
  43. [43] Nguyen Cuong, Le Khiem, Tran Anh, and Nguyen Binh. 2020. Knowledge innovation through Intelligent software methodologies, tools and techniques. In An Efficient Framework for Vietnamese Sentiment Classification, Vol. 327. IOS Press, 343354. Google ScholarGoogle ScholarCross RefCross Ref
  44. [44] Nguyen Dat Quoc and Nguyen Anh Tuan. 2020. PhoBERT: Pre-trained language models for vietnamese. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’20). Association for Computational Linguistics, 10371042. Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Nguyen Huyen, Nguyen Hung, Ngo Quyen, Vu Luong, Tran Vu, Ngo Bach, and Le Cuong. 2019. VLSP shared task: Sentiment analysis. J. Comput. Sci. Cybernet. 34, 4 (2019), 295310. Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Nguyen H. and Nguyen Q.. 2018. An ensemble of shallow and deep learning algorithms for vietnamese sentiment analysis. In Proceedings of the 5th NAFOSTED Conference on Information and Computer Science. IEEE, 165170. Google ScholarGoogle ScholarCross RefCross Ref
  47. [47] Nguyen Hien D., Huynh Tai, Hoang Suong N., Pham Vuong T., and Zelinka Ivan. 2020. Language-oriented sentiment analysis based on the grammar structure and improved self-attention network. In Proceedings of the Evaluation of Novel Approaches to Software Engineering (ENASE’20). 339346.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Nguyen Khang Phuoc-Quy and Kiet Nguyen Van. 2020. Exploiting vietnamese social media characteristics for textual emotion recognition in vietnamese. In Proceedings of the International Conference on Asian Language Processing. IEEE, 276281. Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Nguyen Kiet Van, Nguyen Vu Duc, Nguyen Phu Xuan Vinh, and Nguyen Tham Thi Hong Truong; Ngan Luu-Thuy. 2018. UIT-VSFC: Vietnamese students’ feedback corpus for sentiment analysis. In Proceedings of the 10th International Conference on Knowledge and Systems Engineering. IEEE, 1924. Google ScholarGoogle ScholarCross RefCross Ref
  50. [50] Nguyen Phu X. V., Hong Tham T. T., Nguyen Kiet Van, and Nguyen Ngan Luu-Thuy. 2018. Deep learning versus traditional classifiers on vietnamese students’ feedback corpus. In Proceedings of the 5th NAFOSTED Conference on Information and Computer Science (NICS). IEEE, 7580. Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Nguyen Quan, Vu Ly, and Nguyen Quang Uy. 2020. A two-channel model for representation learning in vietnamese sentiment classification problem. J. Comput. Sci. Cybernet. 36, 4 (2020), 305323. Google ScholarGoogle ScholarCross RefCross Ref
  52. [52] Nguyen Quoc Thai, Nguyen Thoai Linh, Luong Ngoc Hoang, and Ngo Quoc Hung. 2020. Fine-tuning BERT for sentiment analysis of vietnamese reviews. In Proceedings of the 7th NAFOSTED Conference on Information and Computer Science. IEEE, 302307. Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Nguyen Vu Duc, Nguyen Kiet Van, and Nguyen Ngan Luu-Thuy. 2018. Variants of long short-term memory for sentiment analysis on vietnamese students’ feedback corpus. In Proceedings of the 10th International Conference on Knowledge and Systems Engineering. IEEE, 306311.Google ScholarGoogle ScholarCross RefCross Ref
  54. [54] Nguyen Vu Duc, Nguyen Kiet Van, and Nguyen Ngan Luu-Thuy. 2018. Variants of long short-term memory for sentiment analysis on vietnamese students’ feedback corpus. In Proceedings of the 10th International Conference on Knowledge and Systems Engineering. IEEE, 306311. Google ScholarGoogle ScholarCross RefCross Ref
  55. [55] Nguyen-Nhat Dang-Khoa and Duong Huu-Thanh. 2019. One-document training for vietnamese sentiment analysis. In Computational Data and Social Networks, Tagarelli Andrea and Tong Hanghang (Eds.). Springer International Publishing, Cham, 189200.Google ScholarGoogle Scholar
  56. [56] Nguyen-Thanh Thuy and Tran Giang Tran Cong. 2019. Vietnamese sentiment analysis for hotel review based on overfitting training and ensemble learning. In Proceedings of the 10th International Symposium on Information and Communication Technology. Association for Computing Machinery, 147153. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. [57] Nguyen-Thi Bich-Tuyen and Duong Huu-Thanh. 2019. A vietnamese sentiment analysis system based on multiple classifiers with enhancing lexicon features. In Industrial Networks and Intelligent Systems, Duong Trung Quang, Vo Nguyen-Son, Nguyen Loi K., Vien Quoc-Tuan, and Nguyen Van-Dinh (Eds.). Springer International Publishing, Cham, 240249.Google ScholarGoogle Scholar
  58. [58] Pereira Denilson Alves. 2021. A survey of sentiment analysis in the portuguese language. Artif. Intell. Rev. 54, 2 (2021), 10871115.Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. [59] Phan Long, Tran Hieu, Nguyen Hieu, and Trinh Trieu H.. 2022. ViT5: Pretrained text-to-text transformer for vietnamese language generation. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop. Association for Computational Linguistics, 136142. Google ScholarGoogle ScholarCross RefCross Ref
  60. [60] Phu Vo Ngoc, Chau Vo Thi Ngoc, Tran Vo Thi Ngoc, and Dat Nguyen Duy. 2018. A vietnamese adjective emotion dictionary based on exploitation of Vietnamese language characteristics. Artif. Intell. Rev. 50, 1 (2018), 93159.Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. [61] Phu Vo Ngoc, Chau Vo Thi Ngoc, Tran Vo Thi Ngoc, Duy Dat Nguyen, and Duy Khanh Ly Doan. 2019. A valence-totaling model for vietnamese sentiment classification. Evolv. Syst. 10, 3 (2019), 453499.Google ScholarGoogle ScholarCross RefCross Ref
  62. [62] Poria Soujanya, Hazarika Devamanyu, Majumder Navonil, and Mihalcea Rada. 2023. Beneath the tip of the iceberg: Current challenges and new directions in sentiment analysis research. IEEE Trans. Affect. Comput. 14, 1 (2023), 108132. Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. [63] Pota Marco, Ventura Mirko, Catelli Rosario, and Esposito Massimo. 2021. An effective BERT-based pipeline for twitter sentiment analysis: A case study in italian. Sensors 21, 1 (2021), 133.Google ScholarGoogle ScholarCross RefCross Ref
  64. [64] Raffel Colin, Shazeer Noam, Roberts Adam, Lee Katherine, Narang Sharan, Matena Michael, Zhou Yanqi, Li Wei, Liu Peter J, et al. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer.J. Mach. Learn. Res. 21, 140 (2020), 167.Google ScholarGoogle Scholar
  65. [65] Ray Biswarup, Garain Avishek, and Sarkar Ram. 2021. An ensemble-based hotel recommender system using sentiment analysis and aspect categorization of hotel reviews. Appl. Soft Comput. 98 (2021), 106935.Google ScholarGoogle ScholarCross RefCross Ref
  66. [66] Rust Phillip, Pfeiffer Jonas, Vulić Ivan, Ruder Sebastian, and Gurevych Iryna. 2021. How good is your tokenizer? On the monolingual performance of multilingual language models. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, 31183135. Google ScholarGoogle ScholarCross RefCross Ref
  67. [67] Singh Mrityunjay, Jakhar Amit Kumar, and Pandey Shivam. 2021. Sentiment analysis on the impact of coronavirus in social life using the BERT model. Soc. Netw. Anal. Min. 11, 1 (2021), 111. Google ScholarGoogle ScholarCross RefCross Ref
  68. [68] Sun Chi, Qiu Xipeng, Xu Yige, and Huang Xuanjing. 2019. How to fine-tune BERT for text classification? In Chinese Computational Linguistics, Sun Maosong, Huang Xuanjing, Ji Heng, Liu Zhiyuan, and Liu Yang (Eds.). Springer International Publishing, Cham, 194206.Google ScholarGoogle ScholarDigital LibraryDigital Library
  69. [69] Tran Thien Khai and Phan Tuoi Thi. 2015. Constructing sentiment ontology for vietnamese reviews. In Proceedings of the 17th International Conference on Information Integration and Web-Based Applications and Services (iiWAS’15). Association for Computing Machinery, New York, NY, Article 36, 5 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  70. [70] Tran Thien Khai and Phan Tuoi Thi. 2016. Computing sentiment scores of adjective phrases for vietnamese. In Multi-disciplinary Trends in Artificial Intelligence, Sombattheera Chattrakul, Stolzenburg Frieder, Lin Fangzhen, and Nayak Abhaya (Eds.). Springer International Publishing, Cham, 288296.Google ScholarGoogle ScholarCross RefCross Ref
  71. [71] Tran Thien Khai and Phan Tuoi Thi. 2016. Multi-class opinion classification for Vietnamese hotel reviews. Int. J. Intell. Technol. Appl. Stat. 9, 1 (2016), 718.Google ScholarGoogle Scholar
  72. [72] Tran Thien Khai and Phan Tuoi Thi. 2018. A hybrid approach for building a Vietnamese sentiment dictionary. J. Intell. Fuzzy Syst. 35, 1 (2018), 967978.Google ScholarGoogle ScholarDigital LibraryDigital Library
  73. [73] Tran Thien Khai and Phan Tuoi Thi. 2018. Towards a sentiment analysis model based on semantic relation analysis. Int. J. Synth. Emot. 9, 2 (2018), 5475.Google ScholarGoogle ScholarDigital LibraryDigital Library
  74. [74] Tran Thien Khai and Phan Tuoi Thi. 2020. Capturing contextual factors in sentiment classification: An ensemble approach. IEEE Access 8 (2020), 116856116865.Google ScholarGoogle ScholarCross RefCross Ref
  75. [75] Trinh Son, Nguyen Luu, and Vo Minh. 2018. Combining Lexicon-Based and Learning-Based Methods for Sentiment Analysis for Product Reviews in Vietnamese Language. Springer International Publishing, Cham, 5775. Google ScholarGoogle ScholarCross RefCross Ref
  76. [76] Trinh Son, Nguyen Luu, Vo Minh, and Do Phuc. 2016. Lexicon-Based Sentiment Analysis of Facebook Comments in Vietnamese Language. Springer International Publishing, Cham, 263276. Google ScholarGoogle ScholarCross RefCross Ref
  77. [77] Truong Trong-Loc, Le Hanh-Linh, and Dang Thien-Phuc Le. 2020. Sentiment analysis implementing BERT-based pre-trained language model for vietnamese. In Proceedings of the 7th NAFOSTED Conference on Information and Computer Science. IEEE, 362367. Google ScholarGoogle ScholarCross RefCross Ref
  78. [78] Vo Huynh Quoc Viet and Yamamoto Kazuhide. 2018. VietSentiLex: A sentiment dictionary that considers the polarity of ambiguous sentiment words. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation. Association for Computational Linguistics.Google ScholarGoogle Scholar
  79. [79] Vo Hung T., Lam Hai C., Nguyen Duc Dung, and Tuong Nguyen Huynh. 2016. Topic classification and sentiment analysis for vietnamese education survey system. As. J. Comput. Sci. Inf. Technol. 6, 3 (2016), 2734.Google ScholarGoogle Scholar
  80. [80] Vo Quan, Nguyen Huy, Le Bac, and Nguyen Minh. 2017. Multi-channel LSTM-CNN model for vietnamese sentiment analysis. In Proceedings of the 9th International Conference on Knowledge and Systems Engineering. IEEE, 2429. Google ScholarGoogle ScholarCross RefCross Ref
  81. [81] Vo Thanh Hung, Nguyen Thien Tin, Pham Hoang Anh, and Le Thanh Van. 2017. An efficient hybrid model for vietnamese sentiment analysis. In Intelligent Information and Database Systems, Nguyen Ngoc Thanh, Tojo Satoshi, Nguyen Le Minh, and Trawiński Bogdan (Eds.). Springer International Publishing, Cham, 227237.Google ScholarGoogle ScholarCross RefCross Ref
  82. [82] Vu Thanh, Nguyen Dat Quoc, Nguyen Dai Quoc, Dras Mark, and Johnson Mark. 2018. VnCoreNLP: A vietnamese natural language processing toolkit. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations. Association for Computational Linguistics, 5660. Google ScholarGoogle ScholarCross RefCross Ref
  83. [83] Vu Xuan-Son and Park Seong-Bae. 2014. Construction of vietnamese sentiwordnet by using Vietnamese dictionary. In Proceedings of the Korea Information Processing Society Conference. Korea Information Processing Society, 745748.Google ScholarGoogle Scholar
  84. [84] Wei Jason and Zou Kai. 2019. EDA: Easy data augmentation techniques for boosting performance on text classification tasks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Association for Computational Linguistics, Hong Kong, China, 63826388. Google ScholarGoogle ScholarCross RefCross Ref
  85. [85] Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Remi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Platen Patrick von, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, and Rush Alexander. 2020. Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Association for Computational Linguistics, Online, 3845. Google ScholarGoogle ScholarCross RefCross Ref
  86. [86] Xue Linting, Constant Noah, Roberts Adam, Kale Mihir, Al-Rfou Rami, Siddhant Aditya, Barua Aditya, and Raffel Colin. 2021. mT5: A massively multilingual pre-trained text-to-text transformer. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Online, 483498. Google ScholarGoogle ScholarCross RefCross Ref
  87. [87] Yadav Ashima and Vishwakarma Dinesh Kumar. 2020. Sentiment analysis using deep learning architectures: A review. Artificial Intelligence Review 53, 6 (2020), 43354385.Google ScholarGoogle ScholarDigital LibraryDigital Library
  88. [88] Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, and Wang Jianyong. 2022. Prompt tuning for discriminative pre-trained language models. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL’22). Association for Computational Linguistics, Dublin, Ireland, 34683473. Google ScholarGoogle ScholarCross RefCross Ref
  89. [89] Zhang Lei, Wang Shuai, and Liu Bing. 2018. Deep learning for sentiment analysis: A survey. Data Min. Knowl. Discov. 8, 4 (2018), e1253.Google ScholarGoogle Scholar

Index Terms

  1. Vietnamese Sentiment Analysis: An Overview and Comparative Study of Fine-tuning Pretrained Language Models

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 6
      June 2023
      635 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3604597
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 June 2023
      • Online AM: 4 April 2023
      • Accepted: 17 March 2023
      • Revised: 17 May 2022
      • Received: 22 December 2021
      Published in tallip Volume 22, Issue 6

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)226
      • Downloads (Last 6 weeks)44

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!