Judgment of Slang Based on Character Feature and Feature Expression Based on Slang’s Context Feature

  • Kazuyuki MatsumotoEmail author
  • Seiji Tsuchiya
  • Minoru Yoshida
  • Kenji Kita
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 652)


Our research aim was to develop the means to automatically identify a particular character string as slang and then connect the detected slang word to words with similar meaning in order to successfully process the sentence in which the word appears. By recognizing a slang word in this way, one can apply different processing to the word and avoid the distinctive problems associated with processing slang words. This paper proposes a method to distinguish standard words from slang words using information from the characters comprising the character string. An experiment testing the effectiveness of our method showed a 30 % or more improvement in classification accuracy compared to the baseline method. We also use a contextual feature related to emotion to expand the unregistered slang word in the training data into other expressions and propose an emotion estimation method based on the expanded expressions. In our experiment, successful emotion estimation was obtained in nearly 54 % of the cases, a notably higher rate than with the baseline method. Our proposed method was shown to have validity.


Slang Character feature Context feature Unknown expression 



This research was partially supported by JSPS KAKENHI Grant Numbers 15K16077, 15K00425, 15K00309.


  1. 1.
    Matsumoto, K., Ren, F.: Construction of Wakamono Kotoba emotion dictionary and its application. In: Gelbukh, A.F. (ed.) CICLing 2011, Part I. LNCS, vol. 6608, pp. 405–416. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  2. 2.
    Matsumoto, K., Kita, K., Ren, F.: Emotional vector distance based sentiment analysis of Wakamono Kotoba. China Commun. 9(3), 87–98 (2012)Google Scholar
  3. 3.
    Matsumoto, K., Akita, K., Keranmu, X., Yoshida, M., Kita, K.: Extraction Japanese slang from weblog data based on script type and stroke count. Procedia Comput. Sci. 35(2014), 464–473 (2014)CrossRefGoogle Scholar
  4. 4.
    Yonekawa, A.: Wakamonogo wo kagakusuru. (Meiji Shoin) (1998). (in Japanese)Google Scholar
  5. 5.
  6. 6.
    Amano, N., Kondo, K.: NTT Database Series Nihongo-no Goitokusei: Lexical Properties of Japanese, CD-ROM version, Sanseido (2008)Google Scholar
  7. 7.
  8. 8.
    Mikolov, T., Chen, K., Corrado, G. and Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of the Workshop at ICLR (2013)Google Scholar
  9. 9.
    Matsumoto, K., Kita, K. Ren, F.: Emotion estimation from sentence using relation between Japanese slangs and emotion expressions. In: Proceeding of the 26th Pacific Asia Conference on Language, Information, and Computation (PACLIC 2012), pp. 343–350 (2012)Google Scholar
  10. 10.
    Ren, F., Matsumoto, K.: Semi-automatic creation of youth slang corpus and its application to affective computing. IEEE Trans. Affect. Comput. 7(2), 176–189 (2016)CrossRefGoogle Scholar
  11. 11.
    Twitter Website.

Copyright information

© Springer Nature Singapore Pte Ltd. 2016

Authors and Affiliations

  • Kazuyuki Matsumoto
    • 1
    Email author
  • Seiji Tsuchiya
    • 2
  • Minoru Yoshida
    • 1
  • Kenji Kita
    • 1
  1. 1.Faculty of Science and EngineeringTokushima UniversityTokushimaJapan
  2. 2.Faculty of Science and EngineeringDoshisha UniversityKyo-TanabeJapan

Personalised recommendations