skip to main content
research-article

Personality Detection using Kernel-based Ensemble Model for Leveraging Social Psychology in Online Networks

Authors Info & Claims
Published:09 May 2023Publication History
Skip Abstract Section

Abstract

The Asian social networking market dominates the world landscape with the highest consumer penetration rate. Businesses and investors often look for winning strategies to attract consumers to increase revenues from sales, advertisements, and other services offered on social media platforms. Social media engagement and online relational cohesion have often been defined within the frameworks of social psychology and personality identification is a possible way in which social psychology can inform, engage, and learn from social media. Personality profiling has many real-world applications, including preference-based recommendation systems, relationship building, and career counseling. This research puts forward a novel kernel-based soft-voting ensemble model for personality detection from natural language, KBSVE-P. The KBSVE-P model is built by first evaluating the performance of various Support Vector Machine (SVM) kernels, namely radial basis function (RBF), linear, sigmoidal, and polynomial, to find the best-suited kernel for automatic personality detection in natural language text. Next, an ensemble of SVM kernels is implemented with a variety of voting techniques, such as soft voting, hard voting, and weighted hard voting. The model is evaluated on the publicly available Kaggle_MBTI dataset and a novel South Asian, Indian, low-resource Hindi language _MBTI (pronounced as vishesh charitr, meaning personality in Hindi) dataset for detecting a user's personality across four personality traits, namely introvert/extrovert (IE), thinking/feeling (TF), sensing/intuitive (SI), and judging/perceiving (JP). The proposed kernel-based ensemble with soft voting, KBSVE-P, outperforms the existing models on English Kaggle-MBTI dataset with an average F-score of 85.677 and achieves an accuracy of 66.89 for the Hindi _MBTI dataset.

REFERENCES

  1. [1] Aulia M. R., Djamal E. C., and Bon A. T.. 2020. Personality identification based on handwritten signature using convolutional neural networks. In Proceedings of the 5th NA International Conference on Industrial Engineering and Operations Management Detroit.Google ScholarGoogle Scholar
  2. [2] Ariyanto A., Djamal E. C., and Ilyas R.. 2018. Personality identification of palmprint using convolutional neural networks. In Proceedings of the 2018 International Symposium on Advanced Intelligent Informatics. IEEE, 9095.Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Mehta Y., Fatehi S., Kazameini A., Stachl C., Cambria E., and Eetemadi S.. 2020. Bottom-up and top-down: Predicting personality with psycholinguistic and language model features. In Proceedings of the 2020 IEEE International Conference on Data Mining. IEEE, 11841189.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Mehta Y., Majumder N., Gelbukh A., and Cambria E.. 2020. Recent trends in deep learning based personality detection. Artificial Intelligence Review 53, 4 (2020), 23132339.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Bharadwaj S., Sridhar S., Choudhary R., and Srinath R.. 2018. Persona traits identification based on Myers–Briggs Type Indicator (MBTI)-a text classification approach. In Proceedings of the 2018 International Conference on Advances in Computing, Communications, and Informatics. IEEE, 10761082.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Bhowmick R. S., Ganguli I., Paul J., and Sil J.. 2021. A multimodal deep framework for derogatory social media post identification of a recognized person. Transactions on Asian and Low-Resource Language Information Processing 21, 1 (2021), 119.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Cui B. and Qi C.. 2017. Survey Analysis of Machine Learning Methods for Natural Language Processing for MBTI Personality Type Prediction.Google ScholarGoogle Scholar
  8. [8] Furnham A. and Stringfield P.. 1993. Personality and occupational behavior: Myers–Briggs type indicator correlates of managerial practices in two cultures. Human Relations 46, 7 (1993), 827848.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Gosling S. D., Augustine A. A., Vazire S., Holtzman N., and Gaddis S.. 2011. Manifestations of personality in online social networks: Self-reported Facebook-related behaviors and observable profile information. Cyberpsychology, Behavior, and Social Networking 14, 9 (2011), 483488.Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Kunte A. V. and Panicker S.. 2019. Using textual data for personality prediction: A machine learning approach. In Proceedings of the 2019 4th International Conference on Information Systems and Computer Networks. IEEE, 529533.Google ScholarGoogle Scholar
  11. [11] Ross C., Orr E. S., Sisic M., Arseneault J. M., Simmering M. G., and Orr R. R.. 2009. Personality and motivations associated with Facebook use. Computers in Human Behavior 25, 2 (2009), 578586.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Utami N. A., Maharani W., and Atastina I.. 2021. Personality classification of facebook users according to big five personality using SVM (support vector machine) method. Procedia Computer Science 179 (2021), 177184.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Gjurković M., Karan M., Vukojević I., Bošnjak M., and Šnajder J.. 2020. Pandora talks: Personality and demographics on reddit. arXiv:2004.04460. Retrieved from https://arxiv.org/abs/2004.04460.Google ScholarGoogle Scholar
  14. [14] Pratama B. Y. and Sarno R.. 2015. Personality classification based on Twitter text using Naive Bayes, KNN and SVM. In Proceedings of the 2015 International Conference on Data and Software Engineering. IEEE, 170174.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Najadat H., Alzubaidi M. A., and Qarqaz I.. 2021. Detecting Arabic spam reviews in social networks based on classification algorithms. Transactions on Asian and Low-Resource Language Information Processing 21, 1 (2021), 113.Google ScholarGoogle Scholar
  16. [16] Sangwan Saurabh Raj and Bhatia M. P. S.. 2020. Denigration bullying resolution using wolf search optimized online reputation rumour detection. Procedia Computer Science 173, (2020), 305314.Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Robins G. and Kashima Y.. 2008. Social psychology and social networks: Individuals and social systems. Asian Journal of Social Psychology 11, 1 (2008), 112.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Kumar Akshi, Verma Sukriti, and Mangla Himanshu. 2018. A survey of deep learning techniques in speech recognition. In Proceedings of the International Conference on Advances in Computing, Communication Control, and Networking. IEEE.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Subramanian R., Wache J., Abadi M. K., Vieriu R. L., Winkler S., and Sebe N.. 2016. ASCERTAIN: Emotion and personality recognition using commercial sensors. IEEE Transactions on Affective Computing 9, 2 (2016), 147160.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Barbuto J. E.. 1997. A critique of the Myers–Briggs Type Indicator and its operationalization of Carl Jung's psychological types. Psychological Reports 80, 2 (1997), 611625.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Klappenbach Anna. 2022. The 12 most spoken languages in the world. Retrieved Jan 07, 2022 from https://blog.busuu.com/most-spoken-languages-in-the-world/.Google ScholarGoogle Scholar
  22. [22] Jain D., Kumar A., and Garg G.. 2020. Sarcasm detection in mash-up language using soft-attention based bi-directional LSTM and feature-rich CNN. Applied Soft Computing 91 (2020), 106198.Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Kumar A., Sangwan S. R., Singh A. K., and Wadhwa G.. 2022. Hybrid deep learning model for sarcasm detection in Indian indigenous language using word-emoji embeddings. Transactions on Asian and Low-Resource Language Information Processing (2022).Google ScholarGoogle Scholar
  24. [24] Jain D. K., Kumar A., and Sangwan S. R.. 2022. TANA: The amalgam neural architecture for sarcasm detection in indian indigenous language combining LSTM and SVM with word-emoji embeddings. Pattern Recognition Letters (2022).Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Kumar A. and Albuquerque V. H. C.. 2021. Sentiment analysis using XLM-R transformer and zero-shot transfer learning on resource-poor indian language. Transactions on Asian and Low-Resource Language Information Processing 20, 5 (2021), 113.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Sangwan S. R. and Bhatia M. P. S.. 2021. Denigrate comment detection in low-resource Hindi language using attention-based residual networks. Transactions on Asian and Low-Resource Language Information Processing 21, 1 (2021), 114.Google ScholarGoogle Scholar
  27. [27] Pennebaker J. W., Boyd R. L., Jordan K., and Blackburn K.. 2015. The development and psychometric properties of LIWC2015.Google ScholarGoogle Scholar
  28. [28] Lee K. H., Choi Y., and Stonier D. J.. 2012. Evolutionary algorithm for a genetic robot's personality based on the Myers–Briggs Type Indicator. Robotics and Autonomous Systems 60, 7 (2012), 941961.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Hernandez R. K. and Scott I.. 2017. Predicting Myers–Briggs type indicator with text. In Proceedings of the 31st Conference on Neural Information Processing Systems.Google ScholarGoogle Scholar
  30. [30] Keh S. S. and Cheng I.. 2019. Myers–Briggs personality classification and personality-specific language generation using pre-trained language models. arXiv:1907.06333. Retrieved from https://arxiv.org/abs/1907.06333.Google ScholarGoogle Scholar
  31. [31] Ren Z., Shen Q., Diao X., and Xu H.. 2021. A sentiment-aware deep learning approach for personality detection from text. Information Processing & Management 58, 3 (2021), 102532.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Cerkez N., Vrdoljak B., and Skansi S.. 2021. A method for MBTI classification based on impact of class components. IEEE Access 9 (2021), 146550146567.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Basto C.. 2021. Extending the abstraction of personality types based on MBTI with machine learning and natural language processing. arXiv:2105.11798. Retrieved from https://arxiv.org/abs/2105.11798.Google ScholarGoogle Scholar
  34. [34] Shafi H., Sikander A., Jamal I. M., Ahmad J., and Aboamer M. A.. 2021. A machine learning approach for personality type identification using MBTI framework. Journal of Independent Studies and Research Computing 19, 2 (2021), 610.Google ScholarGoogle Scholar
  35. [35] Adi G. Y. N., Tandio M. H., Ong V., and Suhartono D.. 2018. Optimization for automatic personality recognition on Twitter in Bahasa Indonesia. Procedia Computer Science 135 (2018), 473480.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Ong V., Rahmanto A. D. S., Williem Suhartono D., Nugroho A. E., Andangsari E. W., and Suprayogi M. N.. 2017. Personality prediction based on twitter information in bahasa indonesia. In Proceedings of the 2017 Federated Conference on Computer Science and Information Systems. Prague, Czech Republic, 367372.Google ScholarGoogle Scholar
  37. [37] Choong E. J. and Varathan K. D.. 2021. Predicting judging-perceiving of Myers–Briggs Type Indicator (MBTI) in online social forum. PeerJ 9 (2021), e11382.Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Zhang Yong, Zhang Hongrui, Cai Jing, and Yang Binbin. A Weighted Voting Classifier Based on Differential Evolution.Google ScholarGoogle Scholar

Index Terms

  1. Personality Detection using Kernel-based Ensemble Model for Leveraging Social Psychology in Online Networks

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 5
      May 2023
      653 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3596451
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 9 May 2023
      • Online AM: 18 January 2023
      • Accepted: 6 November 2022
      • Revised: 20 October 2022
      • Received: 29 May 2022
      Published in tallip Volume 22, Issue 5

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)329
      • Downloads (Last 6 weeks)51

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!