Abstract
Social media content is brief and often carries little meaning. To address this, we propose a text classification algorithm based on word vectors that quickly and effectively classifies short social media texts automatically. Because the Word2vec model does not account for word order or position, we combine Word2vec-trained word vectors with a convolutional neural network (CNN) model and propose two classification algorithms, SW-CNN and WW-CNN. The methods are evaluated on three different datasets. The experimental results show that our approach outperforms existing text classification methods based on convolutional neural networks (CNNs) or recurrent neural networks (RNNs) on short text classification.
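The motivation above, that word vectors alone ignore word order while a convolution over consecutive words does not, can be illustrated with a minimal sketch. This is a generic convolution with max-over-time pooling, not the SW-CNN or WW-CNN algorithms, which the abstract does not define. The embedding values, the filter weights, and the function names are all hypothetical and chosen only for illustration.

```python
# Toy 2-dimensional word vectors: hypothetical values standing in for
# vectors that a trained Word2vec model would supply.
EMBEDDINGS = {
    "not":  [1.0, 0.0],
    "good": [0.0, 1.0],
    "bad":  [0.0, -1.0],
    "very": [0.5, 0.5],
}

def embed(tokens):
    """Look up each token's vector, giving a sentence matrix (one row per word)."""
    return [EMBEDDINGS[t] for t in tokens]

def average_vector(tokens):
    """Order-insensitive baseline: the mean of the word vectors."""
    rows = embed(tokens)
    dim = len(rows[0])
    return [sum(r[d] for r in rows) / len(rows) for d in range(dim)]

def conv_max_feature(tokens, kernel, bias=0.0):
    """Slide one filter over windows of consecutive words, apply ReLU,
    then keep the largest response (max-over-time pooling)."""
    rows = embed(tokens)
    width = len(kernel)
    responses = []
    for start in range(len(rows) - width + 1):
        window = rows[start:start + width]
        z = bias
        for k_row, x_row in zip(kernel, window):
            z += sum(w * x for w, x in zip(k_row, x_row))
        responses.append(max(0.0, z))
    return max(responses)

# Hypothetical width-2 filter that responds most strongly to "not" followed by "good".
KERNEL = [[1.0, 0.0], [0.0, 1.0]]

# Two sentences with the same words in a different order.
a = ["not", "good", "very", "bad"]
b = ["not", "bad", "very", "good"]

print(average_vector(a) == average_vector(b))  # averaging cannot tell them apart
print(conv_max_feature(a, KERNEL), conv_max_feature(b, KERNEL))  # the filter can
```

In this toy setting the averaged representation is identical for both sentences, while the convolutional feature differs because the filter sees which words are adjacent. A real model would learn many such filters and feed the pooled features to a classifier.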
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhao, D., Chang, Z., Du, N., Guo, S. (2018). Classification for Social Media Short Text Based on Word Distributed Representation. In: Meng, X., Li, R., Wang, K., Niu, B., Wang, X., Zhao, G. (eds) Web Information Systems and Applications. WISA 2018. Lecture Notes in Computer Science, vol 11242. Springer, Cham. https://doi.org/10.1007/978-3-030-02934-0_24
DOI: https://doi.org/10.1007/978-3-030-02934-0_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02933-3
Online ISBN: 978-3-030-02934-0
eBook Packages: Computer Science, Computer Science (R0)