Abstract
This paper presents an overview of work on inducing part-of-speech taggers using Inductive Logic Programming. Constraint Grammar inspired rules have been induced for several languages (English, Hungarian, Slovene, Swedish) using Progol. This overview focuses on a Swedish tagger, but other work is discussed as well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brill, E. (1994). Some advances in transformation-based part of speech tagging. In Proceedings of the Twelfth National Conference on Artificial Intelligence.
Carlberger, J. and Kann, V. (1999). Implementing an efficient part-of-speech tagger. In press. Available at http://www.nada.kth.se/theory/projects/granska/.
Cussens, J. (1997). Part of speech tagging using Progol. In Proceedings of the Seventh International Workshop on Inductive Logic Programming, 93–108, Prague, Czech Republic.
Cussens, J., Džzeroski, S., and Erjavec, T. (1999). Morphosyntactic tagging of Slovene using Progol. In Proceedings of the Ninth International Workshop on Inductive Logic Programming, 68–79.
Cutting, D., Kupiec, J., Pedersen, J., and Sibun, P. (1992). A practical part-of-speech tagger. In Proceedings of the Third Conference on Applied Natural Language Processing, 133–140.
Eineborg, M. and Lindberg, N. (1998). Induction of Constraint Grammar-rules using Progol. In Proceedings of The Eighth International Conference on Inductive Logic Programming.
Ejerhed, E., Källgren, G., Wennstedt, O., and Åström, M. (1992). The Linguistic Annotation System of the Stockholm-Umeå Project. Department of General Linguistics, University of Umeå.
Horváth, T., Alexin, Z., Gyimóthy, T., and Wrobel, S. (1999). Application of different learning methods to Hungarian part-of-speech tagging. In Džeroski, S. and Flach, P., editors, Proceedings of the Ninth International Workshop on Inductive Logic Programming, 128–139.
Karlsson, F., Voutilainen, A., Heikkilä, J., and Anttila, A., editors (1995). Constraint Grammar: A language-independent system for parsing unrestricted text. Mouton de Gruyter, Berlin and New York.
Lindberg, N. and Eineborg, M. (1998). Learning Constraint Grammar-style disambiguation rules using Inductive Logic Programming. In Proceedings of the Seventeenth International Conference on Computational Linguistics and the thirty-sixth Annual Meeting of the Association for Computational Linguistics, volume II, 775–779.
Lindberg, N. and Eineborg, M. (1999). Improving part of speech disambiguation rules by adding linguistic knowledge. In Džeroski, S. and Flach, P., editors, Proceedings of the Ninth International Workshop on Inductive Logic Programming, 186–197.
Mason, O. (1997). QTAG—A portable probabilistic tagger. Corpus Research, The University of Birmingham, U.K.
Megyesi, B. (1999). Improving Brill’s PoS tagger for an agglutinative language. In Joint Sigdat Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 21–22.
Muggleton, S. (1991). Inductive logic programming. New Generation Computing, 8(4):295–318.
Muggleton, S. (1995). Inverse entailment and Progol. New Generation Computing Journal, 13:245–286.
Popelínský, L., Pavelek, T., and Ptáčník, T. (1999). Towards disambiguation in Czech corpora. In Cussens, J., editor, In Proceedings of the First Learning Language in Logic Workshop, 106–116.
Ratnaparkhi, A. (1996). A maximum entropy model for part-of-speech tagging. In Proceedings of Conference on Empirical Methods in Natural Language Processing, University of Pennsylvania.
Ridings, D. (1998). SUC and the Brill tagger. GU-ISS-98-1 (Research Reports from the Department of Swedish, Göteborg University).
Samuelsson, C., Tapanainen, P., and Voutilainen, A. (1996). Inducing Constraint Grammars. In Laurent, M. and de la Higuera, C., editors, Grammatical Inference: Learning Syntax from Sentences, 146–155. Springer-Verlag.
Tapanainen, P. (1996). The Constraint Grammar Parser CG-2. Department of General Linguistics, University of Helsinki.
Zavrel, J. and Daelemans, W. (1999). Recent advances in memory-based part-of-speech tagging. In VI Simposio Internacional de Comunicacion Social, 590–597.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Eineborg, M., Lindberg, N. (2000). ILP in Part-of-Speech Tagging — An Overview. In: Cussens, J., Džeroski, S. (eds) Learning Language in Logic. LLL 1999. Lecture Notes in Computer Science(), vol 1925. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40030-3_10
Download citation
DOI: https://doi.org/10.1007/3-540-40030-3_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41145-1
Online ISBN: 978-3-540-40030-1
eBook Packages: Springer Book Archive