Deep Learning-Based Document Modeling for Personality Detection from Turkish Texts

  • Tuncay Yılmaz
  • Abdullah Ergil
  • Bahar İlgenEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1069)


The usage of social media is increasing exponentially since it has been the easiest and fastest way to share information between people or organizations. As a result of this broad usage and activity of people on social networks, considerable amount of data is generated continuously. The availability of user generated data makes it possible to analyze personality of people. Personality is the most distinctive feature for an individual. The results of these analyses can be utilized in several ways. They provide support for human resources recruitment units to consider suitable candidates. Similar products and services can be offered to people who share the similar personality characteristics. Personality traits help in diagnosis of certain mental illnesses. It is also helpful in forensics to use personality traits on suspects to clarify the forensic case. With the rapid dissemination of online documents in many different languages, the classification of these documents has become an important requirement. Machine Learning (ML) and Natural Language Processing (NLP) methods have been used to classify these digitized data. In this study, current ML techniques and methodologies have been used to classify text documents and analyze person characteristics from these datasets. As a result of classification, detailed information about the personality traits of the writer could be obtained. It was seen that the frequency-based analysis and the use of the emotional words at the word level are crucial in the textual personality analysis.


Big five personality traits Deep neural network Natural Language Processing Text mining RNN LSTM 


  1. 1.
    Ong, V., Rahmanto, A.D., Williem, W., Suhartono, D.: Exploring personality prediction from text on social media: a literature review. Internetworking Indonesia 9(1), 65–70 (2017)Google Scholar
  2. 2.
    Majumder, N., Poria, S., Gelbukh, A., Cambria, E.: Deep learning-based document modeling for personality detection from text. IEEE Intell. Syst. 32(2), 74–79 (2017)CrossRefGoogle Scholar
  3. 3.
    John, O.P., Donahue, E.M., Kentle, R.L.: The big five inventory—versions 4a and 54 (1991)Google Scholar
  4. 4.
    Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRefGoogle Scholar
  5. 5.
    Khan, A., Baharudin, B., Lee, L.H., Khan, K.: A review of machine learning algorithms for text-documents classification. J. Adv. Inf. Technol. 1(1), 4–20 (2010)Google Scholar
  6. 6.
    Agarwal, B.: Personality detection from text: a review. Int. J. Comput. Syst. 1(1), 1–4 (2014)Google Scholar
  7. 7.
    Barroso, A.S., da Silva, J.S.M., Souza, T.D., Bryanne, S.D.A., Soares, M.S., do Nascimento, R.P.: Relationship between personality traits and software quality-big five model vs. object-oriented software metrics. In: ICEIS, no. 3, pp. 63–74 (2017)Google Scholar
  8. 8.
    Wei, H., et al.: Beyond the words: predicting user personality from heterogeneous information. In: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, pp. 305–314. ACM 2017Google Scholar
  9. 9.
    Zhou, L., Twitchell, D.P., Qin, T., Burgoon, J.K., Nunamaker, J.F.: An exploratory study into deception detection in text-based computer-mediated communication. In: Proceedings of the 36th Annual Hawaii International Conference on System Sciences, p. 10. IEEE (2003)Google Scholar
  10. 10.
    Plank, B., Hovy, D.: Personality traits on twitter—or—how to get 1,500 personality tests in a week. In: Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 92–98 (2015)Google Scholar
  11. 11.
    Sewwandi, D., Perera, K., Sandaruwan, S., Lakchani, O., Nugaliyadde, A., Thelijjagoda, S.: Linguistic features based personality recognition using social media data. In: 6th National Conference on Technology and Management (NCTM), pp. 63–68. IEEE (2017)Google Scholar
  12. 12.

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Istanbul Kültür UniversityIstanbulTurkey

Personalised recommendations