Use of Dependency Microcontexts in Information Retrieval

Holub, Martin

doi:10.1007/3-540-44411-4_23

Martin Holub⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1963))

Included in the following conference series:

International Conference on Current Trends in Theory and Practice of Computer Science

409 Accesses
1 Citations

Abstract

This paper focuses especially on two problems that are crucial for retrieval performance in information retrieval (IR) systems: the lack of information caused by document pre-processing and the difficulty caused by homonymous and synonymous words in natural language. Author argues that traditional IR methods, i. e. methods based on dealing with individual terms without considering their relations, can be overcome using natural language processing (NLP). In order to detect the relations among terms in sentences and make use of lemmatisation and morphological and syntactic tagging of Czech texts, author proposes a method for construction of dependency word microcontexts fully automatically extracted from texts, and several ways how to exploit the microcontexts for the sake of increasing retrieval performance.

This study has been supported by MŠMT (the FRVŠ grant no 1909).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

E. Brill, R. J. Mooney: An Overview of Empirical Natural Language Processing. In: AI Magazine, Vol. 18 (1997), No. 4.
Google Scholar
M. Holub, A. Böhmová: Use of Dependency Tree Structures for the Microcontext Extraction. Accepted for the ACL’2000 conference. 350
Google Scholar
R. Krovetz, W. B. Croft: Lexical ambiguity and information retrieval. In: ACM Transactions on Information Systems, 10(2), 1992, pp 115–141. 350
Article Google Scholar
C. Leacock, G. Towell, E. M. Voorhees: Toward building contextual representations of word senses using statistical models. In: B. Boguraev and J. Pustejovsky (editors), Corpus Processing for Lexical Acquisitions, 1996, pp 97–113, MIT Press. 350
Google Scholar
D. Lin: Extracting Collocations from Text Corpora. In: Computerm’ 98. Proceedings of the First Workshop on Computational Terminology. Montreal, 1998. 352
Google Scholar
G. A. Miller, W. G. Charles: Contextual correlates of semantic similarity. In: Language and cognitive processes, 6(1), 1991. 350
Google Scholar
H. Schütze, J. O. Pedersen: Information Retrieval Based on Word Senses. In: Proceedings of the Fourth Annual Symposium on Document Analysis and Information retrieval, pp 161–175, Las Vegas, NV, 1995. 350
Google Scholar
G. Towell, E. M. Voorhees: Disambiguating Highly Ambiguous Words. In: Computational Linguistics, March 1998, Vol. 24, Number 1, pp 125–145. 350
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague, Czech republic
Martin Holub

Authors

Martin Holub
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Cybernetics, Czech Technical University, Karlovo nám. 13, 121 35, Prague, Czech Republic
Václav Hlaváč
Information Technology Department, CLRC RAL, Chilton, Didcot, Oxfordshire, UK
Keith G. Jeffery
Insitute of Computer Science, Academy of Sciences of the Czech Republic, Pod vodárenskou věží 2, 182 07, Prague, Czech Republic
Jiří Wiedermann

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Holub, M. (2000). Use of Dependency Microcontexts in Information Retrieval. In: Hlaváč, V., Jeffery, K.G., Wiedermann, J. (eds) SOFSEM 2000: Theory and Practice of Informatics. SOFSEM 2000. Lecture Notes in Computer Science, vol 1963. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44411-4_23

Download citation

DOI: https://doi.org/10.1007/3-540-44411-4_23
Published: 22 January 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41348-6
Online ISBN: 978-3-540-44411-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics