Abstract
Semantically tagged corpora are urgently needed for training and evaluation in many applications. They are also the natural complement of semantic lexicons: they serve both as a testbed for evaluating a lexicon's adequacy and as a repository of corpus examples for the attested senses. It is therefore essential to define sound criteria for their construction and to set up a specific methodology for the treatment of the various semantic phenomena. We present observations and results concerning the lexical-semantic tagging of an Italian corpus within the framework of two projects: the ELSNET feasibility study, part of a preparatory phase started with Senseval/Romanseval, and an Italian National Project (TAL), one component of which is the lexical-semantic annotation of larger quantities of text for an Italian syntactic-semantic Treebank. The results of the ELSNET experiment have been of the utmost importance in defining the technical guidelines for the lexical-semantic level of annotation of the Treebank.
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Calzolari, N., Corazzari, O., Zampolli, A. (2001). Lexical-Semantic Tagging of an Italian Corpus. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. CICLing 2001. Lecture Notes in Computer Science, vol 2004. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44686-9_30
DOI: https://doi.org/10.1007/3-540-44686-9_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41687-6
Online ISBN: 978-3-540-44686-6
eBook Packages: Springer Book Archive