Abstract
In the presented paper we deal with the issue of semantic tagging of the (Czech) corpus texts. An attempt has been made to take advantage of the grammatical tagging and relabel some of the tags as semantic and pragmatic. Then the notion of the enriched valency frame is introduced - we call it lexical valency frame.
This research has been partially supported by the Czech Ministry of Education under the grant VS97028.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Charles J. Fillmore. The Case for Case. Universals in Linguistic Theory, New York, 1968, pp 1–88.
Jan Hajič, Barbara Hladká. Probabilistic and Rule-Based tagging of an Inflective Language — a Comparison. Technical Report No.1, UFAL MFF UK, November 1996, Prague.
Nancy Ide, Jean Véronis. Word Sense Disambiguation: The State of Art. Computational Linguistics, Vol.24, No.1, March 1998, pp 1–40.
Claudia Leacock, Martin Chodorow, George A. Miller. Using Corpus Statistics and WordNet Relations for Sense Identification. Computational Linguistics, Vol.24, No.1, March 1998, pp 147–167.
Klára Osolsobě. Algorithmic Description of Czech Morphology. Dissertation, Brno 1995.
Karel Pala, Pavel Rychlý, and Pavel Smrž. DESAM — Annotated Corpus for Czech. In Proceedings of SOFSEM’97. Springer-Verlag, 1997.
Karel Pala, Pavel Ševeček. Valencies of Czech Verbs. Studia Minora Facultatis Philosophicae Universitatis Brunensis, A45, 1997.
Prague Dependency Treebank. Technical Report, UFAL MFF UK, Prague 1998.
Pavel Ševeček. LEMMA — a lemmatizer for Czech. Brno, 1996. (manuscript, programme in C).
Piek Vossen (ed.). EuroWordNet General Document — Version 2. Amsterdam, June 1999. (Draft of the resulting CD).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pala, K. (1999). Semantic Annotation of (Czech) Corpus Texts. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_10
Download citation
DOI: https://doi.org/10.1007/3-540-48239-3_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66494-9
Online ISBN: 978-3-540-48239-0
eBook Packages: Springer Book Archive