Abstract
Thiswork presents the results of the application of a technique for automatic extraction of semantic relations among words from a corpus. The technique used is the one proposed by Grefenstette in [1]. We brought contributions to the syntactic context notion in [1], aiming to improve the identification of semantically related words. Then, we carried on three different experiments using a Portuguese language corpus: the first one compares the original Grefenstette’s technique with the technique modified with our contributions, the second experiment investigates which syntactic relation is more relevant when identifying semantic relations, and the last experiment investigates the influence of the parser errors on the quality of the extracted semantic relations. Results and their analyses are detailed in this article.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Grefenstette, G.: Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, USA (1994)
Lin, D.: Automatic retrieval and clustering of similar words. In: Proceedings of the COLINGACL’ 98, Montreal (1998)
Ruge, G.: Automatic detection of thesaurus relations for information retrieval. In: Foundations of Computer Science. Springer, Berlin (1997) 499–506
Park, Y.C., Han, Y.S., Choi, K.S.: Automatic thesaurus construction using bayesian networks. In: Proceedings of the 1995 International Conference on Information and Knowledge Management, Baltimore (1995) 212–217
Pacey, M.: The use of clustering techniques to reveal semantic relations between words. In Renouf, A., ed.: Explorations in Corpus Linguistics. Rodopi, Amsterdam (1998) 269–280
Thanopoulos, A., Fakotakis, N., Kokkinakis, G.: Automatic extraction of semantic relations from specialized corpora. In: Proceedings of COLING’2000, Saarbrücken (2000)
Bick, E.: The Parsing System Palavras: Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. PhD thesis, Århus University, Århus (2000)
Bolshakov, I., Gelbukh, A.: Heuristics-based replenishment of collocation databases. In Ranchhod, E., Mamede, N., eds.: Lecture Notes in Computer Science N 2389: Advances in Natural Language Processing. Springer-Verlag (2002)
Gasperin, C.V.: Extração automática de relações semânticas a partir de relações sintáticas. Master’s thesis, PUCRS, Porto Alegre (2001)
A. Gelbukh, G. Sidorov, L.C.H.: Compilation of a spanish representative corpus. In: Lecture Notes in Computer Science N 2276: Computational Linguistics and Intelligent Text Processing. Springer-Verlag (2002) 285–288
Bolshakov, I., Gelbukh, A.: A very large database of collocations and semantic links. In et al., M.B., ed.: Lecture Notes in Computer Science N 1959: Natural Language Processing and Information Systems. Springer-Verlag (2001) 103–114
Lee, L.: Measures of distributional similarity. In: 37th Annual Meeting of the ACL. (1999) 25–32
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gasperin, C.V., de Lima, V.L.S. (2003). Experiments on Extracting Semantic Relations from Syntactic Relations. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2003. Lecture Notes in Computer Science, vol 2588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36456-0_32
Download citation
DOI: https://doi.org/10.1007/3-540-36456-0_32
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00532-2
Online ISBN: 978-3-540-36456-6
eBook Packages: Springer Book Archive