Abstract
Emoji are often used in social media to enrich users’ emotions, and they play an important role in the task of social media sentiment analysis. In practice, researchers are more likely to consider emoji as special symbols and treat them separately from the text. Some existing methods use emoji as a dictionary for matching or converting emoji into text. However, these methods disregard the relationship between emoji and context, blue and they do not reflect the emotions that users are expected to express. It is challenging to incorporate the original emotions of emoji in social media sentiment analysis. In this article, we propose the EPE model: Emoji Pre-trained feature Enhanced sentiment analysis. Specifically, we collected 8 million tweets and selected 5 million tweets with pre-trained emoji with context using the BERT model. We labeled 20,000 tweets as a three-category dataset and used Bi-LSTM with an attention layer to extract text features. Emoji were retained as key emotion information and combined with text features in the final layer as a connected vector for final prediction. Experimental results with our dataset showed that the proposed EPE model achieved better performance than other baseline models.
- [1] . 2018. Emojis-based sentiment classification of arabic microblogs using deep recurrent neural networks. In Proceedings of the International Conference on Computing Sciences and Engineering. 1–6.Google Scholar
Cross Ref
- [2] . 2018. Interpretable emoji prediction via label-wise attention LSTMs. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 4766–4771.Google Scholar
Cross Ref
- [3] . 2009. The myth of impoverished signal: Dispelling the spoken language fallacy for emoticons in online communication. In Proceedings of the Electronic Emotion: The Mediation of Emotion via Information and Communication Technologies. 107–135.Google Scholar
- [4] . 2017. Recurrent attention network on memory for aspect sentiment analysis. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 452–461.Google Scholar
Cross Ref
- [5] . 2018. Twitter sentiment analysis via bi-sense emoji embedding and attention-based LSTM. In Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference. ACM, 117–125.Google Scholar
Digital Library
- [6] . 2018. Twitter sentiment analysis via bisense emoji embedding and attention-based LSTM. In Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference. 117–125.Google Scholar
Digital Library
- [7] . 2019. Emoji-powered representation learning for cross-lingual sentiment classification. In Proceedings of the World Wide Web Conference, WWW 2019. ACM, 251–262.Google Scholar
Digital Library
- [8] . 2010. Enhanced sentiment learning using twitter hashtags and smileys. In Proceedings of the COLING 2010, 23rd International Conference on Computational Linguistics. Chinese Information Processing Society of China, 241–249.Google Scholar
- [9] . 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019. Association for Computational Linguistics, 4171–4186.Google Scholar
- [10] . 2017. Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017. Association for Computational Linguistics, 1615–1625.Google Scholar
Cross Ref
- [11] . 2013. Unsupervised sentiment analysis with emotional signals. In Proceedings of the 22nd International World Wide Web Conference. International World Wide Web Conferences Steering Committee/ACM, 607–618.Google Scholar
Digital Library
- [12] . 2017. Automatic sarcasm detection: A survey. ACM Computing Surveys 50, 5 (2017), 73:1–73:22.Google Scholar
- [13] . 2014. A convolutional neural network for modelling sentences. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014. The Association for Computer Linguistics, 655–665.Google Scholar
Cross Ref
- [14] . 2014. Sentiment analysis of short informal texts. Journal of Artificial Intelligence Research 50, 2014 (2014), 723–762. Google Scholar
Cross Ref
- [15] . 2016. Sentiment analysis for low resource languages: A study on informal indonesian tweets. In Proceedings of the 12th Workshop on Asian Language Resources., , , and (Eds.), The COLING 2016 Organizing Committee, 123–131.Google Scholar
- [16] . 2018. Emoticon-aware recurrent neural network model for chinese sentiment analysis. In Proceedings of the 9th International Conference on Awareness Science and Technology. IEEE, 161–166.Google Scholar
Cross Ref
- [17] . 2019. Emoji-aware attention-based bi-directional GRU network model for chinese sentiment analysis. In Joint Proceedings of the Workshops on Linguistic and Cognitive Approaches to Dialog Agents (LaCATODA 2019) and on Bridging the Gap Between Human and Automated Reasoning (BtG 2019) co-located with 28th International Joint Conference on Artificial Intelligence (IJCAI 2019).CEUR-WS.org, 11–18.Google Scholar
- [18] . 2019. A novel machine learning-based sentiment analysis method for chinese social media considering chinese slang lexicon and emoticons. In Proceedings of the 2nd Workshop on Affective Content Analysis (AffCon 2019) Co-located with 33rd AAAI Conference on Artificial Intelligence . CEUR-WS.org, 88–103.Google Scholar
- [19] . 2012. Sentiment Analysis and Opinion Mining. Morgan and Claypool Publishers.Google Scholar
Digital Library
- [20] . 2020. Emoji-based sentiment analysis using attention networks. ACM Transactions on Asian and Low-Resource Language Information Processing 19, 5 (2020), 64:1–64:13.Google Scholar
Digital Library
- [21] . 2013. NRC-Canada: Building the state-of-the-art in sentiment analysis of tweets. In Proceedings of the 7th International Workshop on Semantic Evaluation. The Association for Computer Linguistics, 321–327.Google Scholar
- [22] . 2016. The “face with tears of joy” emoji. a socio-semiotic and multimodal insight into a japan-america mash-up. HERMES-Journal of Language and Communication in Business 55, 2016 (2016), 11–25.Google Scholar
- [23] . 2012. Language-independent twitter sentiment analysis. In Proceedings of the Learning, Knowledge, and Adaption Conference. 12–14.Google Scholar
- [24] . 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing. 79–86.Google Scholar
Digital Library
- [25] . 2018. Context-augmented convolutional neural networks for twitter sarcasm detection. Neurocomputing 308, 2018 (2018), 1–7.Google Scholar
Digital Library
- [26] . 2019. Exploiting emojis for sarcasm detection. In Proceedings of the Social, Cultural, and Behavioral Modeling—12th International Conference, SBP-BRiMS 2019. Springer, 70–80.Google Scholar
Cross Ref
- [27] . 2012. Sentiment strength detection for the social web. Journal of the Association for Information Science and Technology 63, 1 (2012), 163–173.Google Scholar
Digital Library
- [28] . 2016. Hierarchical attention networks for document classification. In Proceedings of the NAACL HLT 2016, The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. The Association for Computational Linguistics, 1480–1489.Google Scholar
Cross Ref
- [29] . 2014. Semantic parsing for single-relation question answering. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. The Association for Computer Linguistics, 643–648.Google Scholar
Cross Ref
- [30] . 2016. Attention-based bidirectional long short-term memory networks for relation classification. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. The Association for Computer Linguistics.Google Scholar
Cross Ref
Index Terms
Fusion Pre-trained Emoji Feature Enhancement for Sentiment Analysis
Recommendations
Emoji-Based Sentiment Analysis Using Attention Networks
Emojis are frequently used to express moods, emotions, and feelings in social media. There has been much research on emojis and sentiments. However, existing methods mainly face two limitations. First, they treat emojis as binary indicator features and ...
Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM
MM '18: Proceedings of the 26th ACM international conference on MultimediaSentiment analysis on large-scale social media data is important to bridge the gaps between social media contents and real world activities including political election prediction, individual and public emotional status monitoring and analysis, and so ...
Evaluation of online emoji description resources for sentiment analysis purposes
AbstractEmoji sentiment analysis is a relevant research topic nowadays, for which emoji sentiment lexica are key assets. Manual annotation affects directly their quality (where high quality usually corresponds to high self-...






Comments