Abstract
In this paper, we describe our recent advances on a novel approach to Part-Of-Speech tagging based on neural networks. Multilayer perceptrons are used following corpus-based learning from contextual, lexical and morphological information. The Penn Treebank corpus has been used for the training and evaluation of the tagging system. The results show that the connectionist approach is feasible and comparable with other approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Voutilainen, A.: Handcrafted rules. In: Syntactic Wordclass Tagging, pp. 217–246. H. van Halteren (1999)
Merialdo, B.: Tagging English Text with a Probabilistic Model. Comp. Linguistics 20(2), 155–171 (1994)
Brants, T.: TnT: a statistical part-of-speech tagger. In: Proc. of the 6th Conf. on Applied Natural Language Processing, pp. 224–231 (2000)
Pla, F., Molina, A.: Improving Part-of-Speech Tagging using Lexicalized HMMs. Natural Language Engineering 10(2), 167–190 (2004)
Daelemans, W., et al.: MBT: A Memory-Based Part-of-Speech Tagger Generator. In: Proc. of the 4th Workshop on Very Large Corpora, pp. 14–27 (1996)
Ratnaparkhi, A.: A Maximum Entropy Part-Of-Speech Tagger. In: 1st Conf. on Empirical Methods in Natural Language Processing, pp. 133–142 (1996)
Brill, E.: Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging. Comp. Linguistics 21(4) (1995)
Giménez, J., Márquez, L.: Fast and accurate Part-of-Speech tagging: the SVM approach revisited. In: Proc. of the 4th RANLP, pp. 153–163 (2003)
Giménez, J., Márquez, L.: SVMTool: A general POS tagger generator based on Support Vector Machines. In: Proc. of the Fourth Conf. on Language Resources and Evaluation, pp. 43–46 (2004)
Schmid, H.: Part-of-Speech tagging with neural networks. In: Proc. of COLING 1994, pp. 172–176 (1994)
Benello, J., Mackie, A., Anderson, J.: Syntactic category disambiguation with neural networks. Computer Speech and Language 3, 203–217 (1989)
Martín Valdivia, M.: Algoritmo LVQ aplicado a tareas de Procesamiento del Lenguaje Natural. PhD thesis, Universidad de Málaga (2004)
Marques, N., Pereira, G.: A POS-Tagger generator for Unknown Languages. Procesamiento del Lenguaje Natural 27, 199–207 (2001)
Pérez-Ortiz, J., Forcada, M.: Part-of-speech tagging with recurrent neural networks. In: Proc. of the Int. Joint Conf. on Neural Networks, pp. 1588–1592 (2001)
Ahmed Raju, S., Chandrasekhar, P., Prasad, M.: Application of multilayer perceptron network for tagging parts-of-speech. In: Language Engineering Conf. (2002)
Tortajada, S., Castro, M.J., Pla, F.: Part-of-Speech tagging based on artificial neural networks. In: 2nd Language & Technology Conf. Proc., pp. 414–418 (2005)
Zamora-Martínez, F., et al.: A connectionist approach to Part-Of-Speech Tagging. In: Int. Conf. on Neural Computation (2009)
Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: The Penn Treebank. Comp. Linguistics 19(2), 313–330 (1993)
Jurafsky, D., Martin, J.H.: Speech and Language Processing. Prentice Hall, Englewood Cliffs (2000)
Zamora, F., Castro, M., España, S.: Fast evaluation of connectionist language models. In: Cabestany, J., Sandoval, F., Prieto, A., Corchado, J.M. (eds.) IWANN 2009. LNCS, vol. 5517, pp. 144–151. Springer, Heidelberg (2009)
Gascó, G., Sánchez, J.A.: Part-of-Speech Tagging Based on Machine Translation Techniques. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds.) IbPRIA 2007. LNCS, vol. 4477, pp. 257–264. Springer, Heidelberg (2007)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)
Mollineda, R.A., Vidal, E.: A relative approach to hierarchical clustering. In: Pattern Recognition and Applications, vol. 56, pp. 19–28. IOS Press, Amsterdam (2000)
Zamora-Martínez, F., España-Boquera, S., Castro-Bleda, M.: Behaviour-based Clustering of Neural Networks applied to Document Enhancement. In: Sandoval, F., Prieto, A.G., Cabestany, J., Graña, M. (eds.) IWANN 2007. LNCS, vol. 4507, pp. 144–151. Springer, Heidelberg (2007)
Rumelhart, D., Hinton, G., Williams, R.: Learning internal representations by error propagation. In: Rumelhart, D.E., McClelland, J.L. (eds.) PDP: Computational models of cognition and perception, I, pp. 319–362. MIT Press, Cambridge (1986)
España, S., et al.: Efficient BP Algorithms for General Feedforward Neural Networks. In: Mira, J., Álvarez, J.R. (eds.) IWINAC 2007. LNCS, vol. 4527, pp. 327–336. Springer, Heidelberg (2007)
Cutting, D., Kupiec, J., Pedersen, J., Sibun, P.: A practical part-of-speech tagger. In: Proc. of the 3th Conf. on Applied Natural Language Processing, pp. 133–140 (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zamora-Martínez, F., Castro-Bleda, M.J., España-Boquera, S., Tortajada-Velert, S. (2010). Adding Morphological Information to a Connectionist Part-Of-Speech Tagger. In: Meseguer, P., Mandow, L., Gasca, R.M. (eds) Current Topics in Artificial Intelligence. CAEPIA 2009. Lecture Notes in Computer Science(), vol 5988. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14264-2_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-14264-2_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14263-5
Online ISBN: 978-3-642-14264-2
eBook Packages: Computer ScienceComputer Science (R0)