Adding Morphological Information to a Connectionist Part-Of-Speech Tagger

Zamora-Martínez, Francisco; Castro-Bleda, María José; España-Boquera, Salvador; Tortajada-Velert, Salvador

doi:10.1007/978-3-642-14264-2_20

Francisco Zamora-Martínez²²,
María José Castro-Bleda²³,
Salvador España-Boquera²³ &
…
Salvador Tortajada-Velert²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5988))

Included in the following conference series:

Conference of the Spanish Association for Artificial Intelligence

608 Accesses

Abstract

In this paper, we describe our recent advances on a novel approach to Part-Of-Speech tagging based on neural networks. Multilayer perceptrons are used following corpus-based learning from contextual, lexical and morphological information. The Penn Treebank corpus has been used for the training and evaluation of the tagging system. The results show that the connectionist approach is feasible and comparable with other approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Voutilainen, A.: Handcrafted rules. In: Syntactic Wordclass Tagging, pp. 217–246. H. van Halteren (1999)
Google Scholar
Merialdo, B.: Tagging English Text with a Probabilistic Model. Comp. Linguistics 20(2), 155–171 (1994)
Google Scholar
Brants, T.: TnT: a statistical part-of-speech tagger. In: Proc. of the 6th Conf. on Applied Natural Language Processing, pp. 224–231 (2000)
Google Scholar
Pla, F., Molina, A.: Improving Part-of-Speech Tagging using Lexicalized HMMs. Natural Language Engineering 10(2), 167–190 (2004)
Article Google Scholar
Daelemans, W., et al.: MBT: A Memory-Based Part-of-Speech Tagger Generator. In: Proc. of the 4th Workshop on Very Large Corpora, pp. 14–27 (1996)
Google Scholar
Ratnaparkhi, A.: A Maximum Entropy Part-Of-Speech Tagger. In: 1st Conf. on Empirical Methods in Natural Language Processing, pp. 133–142 (1996)
Google Scholar
Brill, E.: Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging. Comp. Linguistics 21(4) (1995)
Google Scholar
Giménez, J., Márquez, L.: Fast and accurate Part-of-Speech tagging: the SVM approach revisited. In: Proc. of the 4th RANLP, pp. 153–163 (2003)
Google Scholar
Giménez, J., Márquez, L.: SVMTool: A general POS tagger generator based on Support Vector Machines. In: Proc. of the Fourth Conf. on Language Resources and Evaluation, pp. 43–46 (2004)
Google Scholar
Schmid, H.: Part-of-Speech tagging with neural networks. In: Proc. of COLING 1994, pp. 172–176 (1994)
Google Scholar
Benello, J., Mackie, A., Anderson, J.: Syntactic category disambiguation with neural networks. Computer Speech and Language 3, 203–217 (1989)
Article Google Scholar
Martín Valdivia, M.: Algoritmo LVQ aplicado a tareas de Procesamiento del Lenguaje Natural. PhD thesis, Universidad de Málaga (2004)
Google Scholar
Marques, N., Pereira, G.: A POS-Tagger generator for Unknown Languages. Procesamiento del Lenguaje Natural 27, 199–207 (2001)
Google Scholar
Pérez-Ortiz, J., Forcada, M.: Part-of-speech tagging with recurrent neural networks. In: Proc. of the Int. Joint Conf. on Neural Networks, pp. 1588–1592 (2001)
Google Scholar
Ahmed Raju, S., Chandrasekhar, P., Prasad, M.: Application of multilayer perceptron network for tagging parts-of-speech. In: Language Engineering Conf. (2002)
Google Scholar
Tortajada, S., Castro, M.J., Pla, F.: Part-of-Speech tagging based on artificial neural networks. In: 2nd Language & Technology Conf. Proc., pp. 414–418 (2005)
Google Scholar
Zamora-Martínez, F., et al.: A connectionist approach to Part-Of-Speech Tagging. In: Int. Conf. on Neural Computation (2009)
Google Scholar
Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: The Penn Treebank. Comp. Linguistics 19(2), 313–330 (1993)
Google Scholar
Jurafsky, D., Martin, J.H.: Speech and Language Processing. Prentice Hall, Englewood Cliffs (2000)
Google Scholar
Zamora, F., Castro, M., España, S.: Fast evaluation of connectionist language models. In: Cabestany, J., Sandoval, F., Prieto, A., Corchado, J.M. (eds.) IWANN 2009. LNCS, vol. 5517, pp. 144–151. Springer, Heidelberg (2009)
Google Scholar
Gascó, G., Sánchez, J.A.: Part-of-Speech Tagging Based on Machine Translation Techniques. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds.) IbPRIA 2007. LNCS, vol. 4477, pp. 257–264. Springer, Heidelberg (2007)
Chapter Google Scholar
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)
Article Google Scholar
Mollineda, R.A., Vidal, E.: A relative approach to hierarchical clustering. In: Pattern Recognition and Applications, vol. 56, pp. 19–28. IOS Press, Amsterdam (2000)
Google Scholar
Zamora-Martínez, F., España-Boquera, S., Castro-Bleda, M.: Behaviour-based Clustering of Neural Networks applied to Document Enhancement. In: Sandoval, F., Prieto, A.G., Cabestany, J., Graña, M. (eds.) IWANN 2007. LNCS, vol. 4507, pp. 144–151. Springer, Heidelberg (2007)
Chapter Google Scholar
Rumelhart, D., Hinton, G., Williams, R.: Learning internal representations by error propagation. In: Rumelhart, D.E., McClelland, J.L. (eds.) PDP: Computational models of cognition and perception, I, pp. 319–362. MIT Press, Cambridge (1986)
Google Scholar
España, S., et al.: Efficient BP Algorithms for General Feedforward Neural Networks. In: Mira, J., Álvarez, J.R. (eds.) IWINAC 2007. LNCS, vol. 4527, pp. 327–336. Springer, Heidelberg (2007)
Chapter Google Scholar
Cutting, D., Kupiec, J., Pedersen, J., Sibun, P.: A practical part-of-speech tagger. In: Proc. of the 3th Conf. on Applied Natural Language Processing, pp. 133–140 (1992)
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Ciencias Físicas, Matemáticas y de la Computación, Universidad CEU-Cardenal Herrera, 46115, Alfara del Patriarca (Valencia), Spain
Francisco Zamora-Martínez
Departamento de Sistemas Informáticos y Computación, Spain
María José Castro-Bleda & Salvador España-Boquera
IBIME, Instituto de Aplicaciones de Tecnologías de la Información y de las Comunicaciones Avanzadas (ITACA), Universidad Politécnica de Valencia, Camino de Vera s/n, 46022, Valencia, Spain
Salvador Tortajada-Velert

Authors

Francisco Zamora-Martínez
View author publications
You can also search for this author in PubMed Google Scholar
María José Castro-Bleda
View author publications
You can also search for this author in PubMed Google Scholar
Salvador España-Boquera
View author publications
You can also search for this author in PubMed Google Scholar
Salvador Tortajada-Velert
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IIIA - CSIC, Campus UAB s/n, 08193, Bellaterra, Spain
Pedro Meseguer
Dpto. Lenguajes y Ciencias de la Computación, Universidad de Málaga, Campus de Teatinos, 29071, Málaga, Spain
Lawrence Mandow
Dpto. Lenguajes y Sistemas Informáticos, ETS Ingeniería Informática, University of Seville, Av. Reina Mercedes S/N, 41012, Sevilla, Spain
Rafael M. Gasca

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zamora-Martínez, F., Castro-Bleda, M.J., España-Boquera, S., Tortajada-Velert, S. (2010). Adding Morphological Information to a Connectionist Part-Of-Speech Tagger. In: Meseguer, P., Mandow, L., Gasca, R.M. (eds) Current Topics in Artificial Intelligence. CAEPIA 2009. Lecture Notes in Computer Science(), vol 5988. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14264-2_20

Download citation

DOI: https://doi.org/10.1007/978-3-642-14264-2_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14263-5
Online ISBN: 978-3-642-14264-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics