Judgment of Slang Based on Character Feature and Feature Expression Based on Slang’s Context Feature
Our research aim was to automatically identify a given character string as slang and then link the detected slang word to words with similar meanings, so that the sentence in which the word appears can be processed successfully. Once a word is recognized as slang, it can be handled with different processing, which avoids the distinctive problems associated with processing slang words. This paper proposes a method that distinguishes standard words from slang words using information about the characters that make up the character string. In an experiment testing its effectiveness, the method improved classification accuracy by 30% or more compared to the baseline method. We also use a contextual feature related to emotion to expand slang words that are not registered in the training data into other expressions, and we propose an emotion estimation method based on the expanded expressions. In our experiment, emotion estimation succeeded in nearly 54% of the cases, a notably higher rate than with the baseline method. These results support the validity of the proposed method.
Keywords: Slang, Character feature, Context feature, Unknown expression
This research was partially supported by JSPS KAKENHI Grant Numbers 15K16077, 15K00425, 15K00309.