Abstract
This paper focuses especially on two problems that are crucial for retrieval performance in information retrieval (IR) systems: the lack of information caused by document pre-processing and the difficulty caused by homonymous and synonymous words in natural language. Author argues that traditional IR methods, i. e. methods based on dealing with individual terms without considering their relations, can be overcome using natural language processing (NLP). In order to detect the relations among terms in sentences and make use of lemmatisation and morphological and syntactic tagging of Czech texts, author proposes a method for construction of dependency word microcontexts fully automatically extracted from texts, and several ways how to exploit the microcontexts for the sake of increasing retrieval performance.
This study has been supported by MŠMT (the FRVŠ grant no 1909).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
E. Brill, R. J. Mooney: An Overview of Empirical Natural Language Processing. In: AI Magazine, Vol. 18 (1997), No. 4.
M. Holub, A. Böhmová: Use of Dependency Tree Structures for the Microcontext Extraction. Accepted for the ACL’2000 conference. 350
R. Krovetz, W. B. Croft: Lexical ambiguity and information retrieval. In: ACM Transactions on Information Systems, 10(2), 1992, pp 115–141. 350
C. Leacock, G. Towell, E. M. Voorhees: Toward building contextual representations of word senses using statistical models. In: B. Boguraev and J. Pustejovsky (editors), Corpus Processing for Lexical Acquisitions, 1996, pp 97–113, MIT Press. 350
D. Lin: Extracting Collocations from Text Corpora. In: Computerm’ 98. Proceedings of the First Workshop on Computational Terminology. Montreal, 1998. 352
G. A. Miller, W. G. Charles: Contextual correlates of semantic similarity. In: Language and cognitive processes, 6(1), 1991. 350
H. Schütze, J. O. Pedersen: Information Retrieval Based on Word Senses. In: Proceedings of the Fourth Annual Symposium on Document Analysis and Information retrieval, pp 161–175, Las Vegas, NV, 1995. 350
G. Towell, E. M. Voorhees: Disambiguating Highly Ambiguous Words. In: Computational Linguistics, March 1998, Vol. 24, Number 1, pp 125–145. 350
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Holub, M. (2000). Use of Dependency Microcontexts in Information Retrieval. In: Hlaváč, V., Jeffery, K.G., Wiedermann, J. (eds) SOFSEM 2000: Theory and Practice of Informatics. SOFSEM 2000. Lecture Notes in Computer Science, vol 1963. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44411-4_23
Download citation
DOI: https://doi.org/10.1007/3-540-44411-4_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41348-6
Online ISBN: 978-3-540-44411-4
eBook Packages: Springer Book Archive