Abstract
The Asian social networking market dominates the world landscape with the highest consumer penetration rate. Businesses and investors often look for winning strategies to attract consumers to increase revenues from sales, advertisements, and other services offered on social media platforms. Social media engagement and online relational cohesion have often been defined within the frameworks of social psychology and personality identification is a possible way in which social psychology can inform, engage, and learn from social media. Personality profiling has many real-world applications, including preference-based recommendation systems, relationship building, and career counseling. This research puts forward a novel kernel-based soft-voting ensemble model for personality detection from natural language, KBSVE-P. The KBSVE-P model is built by first evaluating the performance of various Support Vector Machine (SVM) kernels, namely radial basis function (RBF), linear, sigmoidal, and polynomial, to find the best-suited kernel for automatic personality detection in natural language text. Next, an ensemble of SVM kernels is implemented with a variety of voting techniques, such as soft voting, hard voting, and weighted hard voting. The model is evaluated on the publicly available Kaggle_MBTI dataset and a novel South Asian, Indian, low-resource Hindi language
_MBTI (pronounced as vishesh charitr, meaning personality in Hindi) dataset for detecting a user's personality across four personality traits, namely introvert/extrovert (IE), thinking/feeling (TF), sensing/intuitive (SI), and judging/perceiving (JP). The proposed kernel-based ensemble with soft voting, KBSVE-P, outperforms the existing models on English Kaggle-MBTI dataset with an average F-score of 85.677 and achieves an accuracy of 66.89 for the Hindi
_MBTI dataset.
- [1] . 2020. Personality identification based on handwritten signature using convolutional neural networks. In Proceedings of the 5th NA International Conference on Industrial Engineering and Operations Management Detroit.Google Scholar
- [2] . 2018. Personality identification of palmprint using convolutional neural networks. In Proceedings of the 2018 International Symposium on Advanced Intelligent Informatics. IEEE, 90–95.Google Scholar
Cross Ref
- [3] . 2020. Bottom-up and top-down: Predicting personality with psycholinguistic and language model features. In Proceedings of the 2020 IEEE International Conference on Data Mining. IEEE, 1184–1189.Google Scholar
Cross Ref
- [4] 2020. Recent trends in deep learning based personality detection. Artificial Intelligence Review 53, 4 (2020), 2313–2339.Google Scholar
Cross Ref
- [5] 2018. Persona traits identification based on Myers–Briggs Type Indicator (MBTI)-a text classification approach. In Proceedings of the 2018 International Conference on Advances in Computing, Communications, and Informatics. IEEE, 1076–1082.Google Scholar
Cross Ref
- [6] . 2021. A multimodal deep framework for derogatory social media post identification of a recognized person. Transactions on Asian and Low-Resource Language Information Processing 21, 1 (2021), 1–19.Google Scholar
Digital Library
- [7] . 2017. Survey Analysis of Machine Learning Methods for Natural Language Processing for MBTI Personality Type Prediction.Google Scholar
- [8] . 1993. Personality and occupational behavior: Myers–Briggs type indicator correlates of managerial practices in two cultures. Human Relations 46, 7 (1993), 827–848.Google Scholar
Cross Ref
- [9] . 2011. Manifestations of personality in online social networks: Self-reported Facebook-related behaviors and observable profile information. Cyberpsychology, Behavior, and Social Networking 14, 9 (2011), 483–488.Google Scholar
Cross Ref
- [10] . 2019. Using textual data for personality prediction: A machine learning approach. In Proceedings of the 2019 4th International Conference on Information Systems and Computer Networks. IEEE, 529–533.Google Scholar
- [11] . 2009. Personality and motivations associated with Facebook use. Computers in Human Behavior 25, 2 (2009), 578–586.Google Scholar
Digital Library
- [12] . 2021. Personality classification of facebook users according to big five personality using SVM (support vector machine) method. Procedia Computer Science 179 (2021), 177–184.Google Scholar
Cross Ref
- [13] . 2020. Pandora talks: Personality and demographics on reddit. arXiv:2004.04460. Retrieved from https://arxiv.org/abs/2004.04460.Google Scholar
- [14] . 2015. Personality classification based on Twitter text using Naive Bayes, KNN and SVM. In Proceedings of the 2015 International Conference on Data and Software Engineering. IEEE, 170–174.Google Scholar
Cross Ref
- [15] . 2021. Detecting Arabic spam reviews in social networks based on classification algorithms. Transactions on Asian and Low-Resource Language Information Processing 21, 1 (2021), 1–13.Google Scholar
- [16] . 2020. Denigration bullying resolution using wolf search optimized online reputation rumour detection. Procedia Computer Science 173, (2020), 305–314.Google Scholar
Cross Ref
- [17] . 2008. Social psychology and social networks: Individuals and social systems. Asian Journal of Social Psychology 11, 1 (2008), 1–12.Google Scholar
Cross Ref
- [18] . 2018. A survey of deep learning techniques in speech recognition. In Proceedings of the International Conference on Advances in Computing, Communication Control, and Networking. IEEE.Google Scholar
Cross Ref
- [19] . 2016. ASCERTAIN: Emotion and personality recognition using commercial sensors. IEEE Transactions on Affective Computing 9, 2 (2016), 147–160.Google Scholar
Cross Ref
- [20] . 1997. A critique of the Myers–Briggs Type Indicator and its operationalization of Carl Jung's psychological types. Psychological Reports 80, 2 (1997), 611–625.Google Scholar
Cross Ref
- [21] . 2022. The 12 most spoken languages in the world. Retrieved Jan 07, 2022 from https://blog.busuu.com/most-spoken-languages-in-the-world/.Google Scholar
- [22] . 2020. Sarcasm detection in mash-up language using soft-attention based bi-directional LSTM and feature-rich CNN. Applied Soft Computing 91 (2020), 106198.Google Scholar
Cross Ref
- [23] 2022. Hybrid deep learning model for sarcasm detection in Indian indigenous language using word-emoji embeddings. Transactions on Asian and Low-Resource Language Information Processing (2022).Google Scholar
- [24] . 2022. TANA: The amalgam neural architecture for sarcasm detection in indian indigenous language combining LSTM and SVM with word-emoji embeddings. Pattern Recognition Letters (2022).Google Scholar
Digital Library
- [25] . 2021. Sentiment analysis using XLM-R transformer and zero-shot transfer learning on resource-poor indian language. Transactions on Asian and Low-Resource Language Information Processing 20, 5 (2021), 1–13.Google Scholar
Digital Library
- [26] . 2021. Denigrate comment detection in low-resource Hindi language using attention-based residual networks. Transactions on Asian and Low-Resource Language Information Processing 21, 1 (2021), 1–14.Google Scholar
- [27] . 2015. The development and psychometric properties of LIWC2015.Google Scholar
- [28] . 2012. Evolutionary algorithm for a genetic robot's personality based on the Myers–Briggs Type Indicator. Robotics and Autonomous Systems 60, 7 (2012), 941–961.Google Scholar
Digital Library
- [29] . 2017. Predicting Myers–Briggs type indicator with text. In Proceedings of the 31st Conference on Neural Information Processing Systems.Google Scholar
- [30] . 2019. Myers–Briggs personality classification and personality-specific language generation using pre-trained language models. arXiv:1907.06333. Retrieved from https://arxiv.org/abs/1907.06333.Google Scholar
- [31] . 2021. A sentiment-aware deep learning approach for personality detection from text. Information Processing & Management 58, 3 (2021), 102532.Google Scholar
Digital Library
- [32] . 2021. A method for MBTI classification based on impact of class components. IEEE Access 9 (2021), 146550–146567.Google Scholar
Cross Ref
- [33] . 2021. Extending the abstraction of personality types based on MBTI with machine learning and natural language processing. arXiv:2105.11798. Retrieved from https://arxiv.org/abs/2105.11798.Google Scholar
- [34] . 2021. A machine learning approach for personality type identification using MBTI framework. Journal of Independent Studies and Research Computing 19, 2 (2021), 6–10.Google Scholar
- [35] . 2018. Optimization for automatic personality recognition on Twitter in Bahasa Indonesia. Procedia Computer Science 135 (2018), 473–480.Google Scholar
Cross Ref
- [36] . 2017. Personality prediction based on twitter information in bahasa indonesia. In Proceedings of the 2017 Federated Conference on Computer Science and Information Systems. Prague, Czech Republic, 367–372.Google Scholar
- [37] . 2021. Predicting judging-perceiving of Myers–Briggs Type Indicator (MBTI) in online social forum. PeerJ 9 (2021), e11382.Google Scholar
Cross Ref
- [38] . A Weighted Voting Classifier Based on Differential Evolution.Google Scholar
Index Terms
Personality Detection using Kernel-based Ensemble Model for Leveraging Social Psychology in Online Networks
Recommendations
My Tweets Bring All the Traits to the Yard: Predicting Personality and Relational Traits in Online Social Networks
Users in Online Social Networks (OSNs,) leave traces that reflect their personality characteristics. The study of these traces is important for several fields, such as social science, psychology, marketing, and others. Despite a marked increase in ...
Predicting personality with social media
CHI EA '11: CHI '11 Extended Abstracts on Human Factors in Computing SystemsSocial media is a place where users present themselves to the world, revealing personal details and insights into their lives. We are beginning to understand how some of this information can be utilized to improve the users' experiences with interfaces ...
Personality classification based on profiles of social networks' users and the five-factor model of personality
Online social networks have become demanded ways for users to show themselves and connect and share information with each other among these social networks. Facebook is the most popular social network. Personality recognition is one of the new ...






Comments