Semantic Annotation of (Czech) Corpus Texts

Pala, Karel

doi:10.1007/3-540-48239-3_10

Karel Pala³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1692))

Included in the following conference series:

International Workshop on Text, Speech and Dialogue

473 Accesses
2 Citations

Abstract

In the presented paper we deal with the issue of semantic tagging of the (Czech) corpus texts. An attempt has been made to take advantage of the grammatical tagging and relabel some of the tags as semantic and pragmatic. Then the notion of the enriched valency frame is introduced - we call it lexical valency frame.

This research has been partially supported by the Czech Ministry of Education under the grant VS97028.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Charles J. Fillmore. The Case for Case. Universals in Linguistic Theory, New York, 1968, pp 1–88.
Google Scholar
Jan Hajič, Barbara Hladká. Probabilistic and Rule-Based tagging of an Inflective Language — a Comparison. Technical Report No.1, UFAL MFF UK, November 1996, Prague.
Google Scholar
Nancy Ide, Jean Véronis. Word Sense Disambiguation: The State of Art. Computational Linguistics, Vol.24, No.1, March 1998, pp 1–40.
Google Scholar
Claudia Leacock, Martin Chodorow, George A. Miller. Using Corpus Statistics and WordNet Relations for Sense Identification. Computational Linguistics, Vol.24, No.1, March 1998, pp 147–167.
Google Scholar
Klára Osolsobě. Algorithmic Description of Czech Morphology. Dissertation, Brno 1995.
Google Scholar
Karel Pala, Pavel Rychlý, and Pavel Smrž. DESAM — Annotated Corpus for Czech. In Proceedings of SOFSEM’97. Springer-Verlag, 1997.
Google Scholar
Karel Pala, Pavel Ševeček. Valencies of Czech Verbs. Studia Minora Facultatis Philosophicae Universitatis Brunensis, A45, 1997.
Google Scholar
Prague Dependency Treebank. Technical Report, UFAL MFF UK, Prague 1998.
Google Scholar
Pavel Ševeček. LEMMA — a lemmatizer for Czech. Brno, 1996. (manuscript, programme in C).
Google Scholar
Piek Vossen (ed.). EuroWordNet General Document — Version 2. Amsterdam, June 1999. (Draft of the resulting CD).
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Botanická 68a, 602 00, Brno, Czech Republic
Karel Pala

Authors

Karel Pala
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineerig, Faculty of Applied Sciences, University of West Bohemia in Plzeň, Universitní 22, 306 14, Pizeň, Czech Republic
Václav Matousek , Pavel Mautner & Jana Ocelíková , &
Department of Programming Systems and Communication, Faculty of Informatics, Masaryk University Brno, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pala, K. (1999). Semantic Annotation of (Czech) Corpus Texts. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_10

Download citation

DOI: https://doi.org/10.1007/3-540-48239-3_10
Published: 01 October 1999
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66494-9
Online ISBN: 978-3-540-48239-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics