WordNet-Based Text Categorization Using Convolutional Neural Networks

Premchander, K.; Sarma, S. S. V. N.; Vaishali, K.; Vijaypal Reddy, P.; Anjaneyulu, M.; Nagaprasad, S.

doi:10.1007/978-981-10-8198-9_25

K. Premchander⁷,
S. S. V. N. Sarma⁸,
K. Vaishali⁹,
P. Vijaypal Reddy¹⁰,
M. Anjaneyulu⁷ &
…
S. Nagaprasad¹¹

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 34))

1037 Accesses
3 Citations

Abstract

Text Categorization is a task of assigning documents to a fixed number of predefined categories. Concept is the grouping of semantically related items under a unique name. Dimensionality space and sparsity of the document representation can be reduced using concept generation. Conceptual representation of a text can be generated using WordNet. In this paper, an empirical evolution using Convolutional Neural Networks (CNN) for text categorization has been performed. The Convolutional Neural Networks exploit the one-dimensional structures of the text such as words, concepts, word embeddings, and concept embeddings to improve the categorical label prediction. The Reuter’s dataset is evaluated with Convolutional Neural Networks on four categories of data. The representation of a text with word embeddings and concept embeddings together results to a better classification performance using CNN compared with word embeddings and concept embeddings individually.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Thorsten: TC with SVM and Learn relevant features, ECML (1998)
Google Scholar
Yang, E.T.: Semi supervised RNN classification of text with word embedding. JMLR Res. 5, 361–397 (2004)
Google Scholar
Dai, A.M., Le, Q.V.: Semi-supervised sequence learning. Adv. Neural Inf. Process. Syst. 3079–3087 (2015)
Google Scholar
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. Adv. Neural Inf. Process. Syst. 649–657 (2015)
Google Scholar
Johnson, R., Zhang, T.: Semi-supervised convolutional neural networks for text categorization via region embedding. Adv. Neural Inf. Process. Syst. 919–927 (2015)
Google Scholar
Aggarwal, C.C., Zhai, C.: A survey of text classification algorithms. Mining text data, 163–222 (2012)
Google Scholar
Dinu, G.: Predict a systematic compare of context counting using context predict semantic vector. ACL, 238–247 (2012)
Google Scholar
Vincent, P.: ANN probabilistic model of a language. JMLR 3, 1137, 1155 (2003)
Google Scholar
Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)
Article Google Scholar
Bottou, L.: Learning of gradient in networks using CNN. In: Proceedings on Neuro-Nımes, vol. 91 (1999)
Google Scholar
Bloehdorn, S., Hotho, A.: Boosting for text classification with semantic features. In: WebKDD, pp. 149–166 (2004)
Google Scholar
Johnson, M.: Maxent discriminative re-ranking and Coarse-to-fine n-best parsing. In: Association for Computational Linguistics, pp. 173–180 (2005)
Google Scholar
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)
Google Scholar
Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley (2012)
Google Scholar
Glänzel, Wolfgang, Thijs, Bart: Using ‘core documents’ for detecting and labelling new emerging topics. Scientometrics 91(2), 399–416 (2012)
Article Google Scholar
Hinton, G.E, Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science, 313(5786), 504–507 (2006)
Article MathSciNet Google Scholar
Huang, E.H., Socher, R., Manning, C.D., Ng, A.Y.: Improving word representations via global context and multiple word prototypes. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol. 1, pp. 873–882. Association for Computational Linguistics (2012)
Google Scholar
Kalchbrenner, N., Blunsom, P.: Recurrent convolutional neural networks for discourse compositionality. arXiv preprint arXiv:1306.3584 (2013)
Klementiev, A., Titov, I., Bhattarai, B.: Inducing crosslingual distributed representations of words (2012)
Google Scholar
Mikonos, T.: Distributed representations of sentences and docs. ICML (2014)
Google Scholar
Sutskever, I.: Distributional representations of words and phrases and their composite. NIPS, 3111–3119 (2013)
Google Scholar
Mikolov, T., Yih, W., Zweig, G.: Linguistic regularities in continuous space word representations. In hlt-Naacl 13, 746–751 (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Dravidian University, Kuppam, India
K. Premchander & M. Anjaneyulu
Department of CSE, Vaagdevi College of Engineering, Warangal, India
S. S. V. N. Sarma
Department of CSE, Jyothismathi Institute of Technology and Sciences, Karimnagar, India
K. Vaishali
Department of CSE, Matrusri Engineering College, Hyderabad, India
P. Vijaypal Reddy
S.R.R. Government Arts & Science College, Karimnagar, India
S. Nagaprasad

Authors

K. Premchander
View author publications
You can also search for this author in PubMed Google Scholar
S. S. V. N. Sarma
View author publications
You can also search for this author in PubMed Google Scholar
K. Vaishali
View author publications
You can also search for this author in PubMed Google Scholar
P. Vijaypal Reddy
View author publications
You can also search for this author in PubMed Google Scholar
M. Anjaneyulu
View author publications
You can also search for this author in PubMed Google Scholar
S. Nagaprasad
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. Premchander .

Editor information

Editors and Affiliations

Computer Science and Engineering, Technocrats Institute of Technology, Bhopal, Madhya Pradesh, India
Basant Tiwari
Computer Science and Engineering, DSPM IIIT, Naya Raipur, Chhattisgarh, India
Vivek Tiwari
Department of Mathematics, Sungkyunkwan University, Suwon, Korea (Republic of)
Kinkar Chandra Das
Microsoft Innovation Academy, Computer Science and Engineering, Sri Aurobindo Institute of Technology, Indore, Madhya Pradesh, India
Durgesh Kumar Mishra
Department of Mathematics, South Asian University, New Delhi, Delhi, India
Jagdish C. Bansal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Premchander, K., Sarma, S.S.V.N., Vaishali, K., Vijaypal Reddy, P., Anjaneyulu, M., Nagaprasad, S. (2018). WordNet-Based Text Categorization Using Convolutional Neural Networks. In: Tiwari, B., Tiwari, V., Das, K., Mishra, D., Bansal, J. (eds) Proceedings of International Conference on Recent Advancement on Computer and Communication . Lecture Notes in Networks and Systems, vol 34. Springer, Singapore. https://doi.org/10.1007/978-981-10-8198-9_25

Download citation

DOI: https://doi.org/10.1007/978-981-10-8198-9_25
Published: 19 April 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8197-2
Online ISBN: 978-981-10-8198-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics