Abstract
We show that most approaches in probabilistic information retrieval can be regarded as a combination of the three concepts representation, model and abstraction. First, documents and queries have to be represented in a certain form, e.g. as a sets of terms. Probabilistic models use certain assumptions about the distribution of the elements of the representation in relevant and nonrelevant documents in order to estimate the probability of relevance of a document w.r.t. a query. Older approaches based on query-specific relevance feedback are restricted to simple representations and models. Using abstractions from specific documents, terms and queries, more powerful approaches can be realized.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Biebricher, P.; Fuhr, N.; Knorz, G.; Lustig, G.; Schwantner, M. (1988). The Automatic Indexing System AIR/PHYS-from Research to Application. In: Chiaramella, Y. (ed.): 11th International Conference on Research and Development in Information Retrieval, pages 333–342. Presses Universitaires de Grenoble, Grenoble, France.
Cooper, W. (1991). Some Inconsistencies and Misnomers in Probabilistic IR. In: Bookstein, A.; Chiaramella, Y.; Salton, G.; Raghavan, V. (eds.): Proceedings of the Fourteenth Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, pages 57–61. ACM, New York.
Croft, W.; Harper, D. (1979). Using Probabilistic Models of Document Retrieval without Relevance Information. Journal of Documentation 35, pages 285–295.
Croft, W. B.; Thompson, R. H. (1987). I3R: A New Approach to the Design of Document Retrieval Systems. Journal of the American Society for Information Science 38(6), pages 389–404.
Fuhr, N.; Hüther, H. (1989). Optimum Probability Estimation from Empirical Distributions. Information Processing and Management 25(5), pages 493–507.
Fuhr, N.; Pfeifer, U. (1991). Combining Model-Oriented and Description-Oriented Approaches for Probabilistic Indexing. In: Bookstein, A.; Chiaramella, Y.; Salton, G.; Raghavan, V. (eds.): Proceedings of the Fourteenth Annual International ACM/Sigir Conference on Research and Development in Information Retrieval, pages 46–56. ACM, New York.
Fuhr, N. (1989a). Models for Retrieval with Probabilistic Indexing. Information Processing and Management 25(1), pages 55–72.
Fuhr, N. (1989b). Optimum Polynomial Retrieval Functions Based on the Probability Ranking Principle. ACM Transactions on Information Systems 7(3), pages 183–204.
Fuhr, N. (1992). Probabilistic Models in Information Retrieval. The Computer Journal 35.
Fuhr, N.; Hartmann, S.; Knorz, G.; Lustig, G.; Schwantner, M.; Tzeras, K. (1991). AIR/X-a Rule-Based Multistage Indexing System for Large Subject Fields. In: Proceedings of the RIAO’91, Barcelona, Spain, April 2–5, 1991, pages 606-623.
Lam, K.; Yu, C. (1982). A Clustered Search Algorithm Incorporating Arbitrary Term Dependencies. ACM Transactions on Database Systems 7.
Maron, M.; Kuhns, J. (1960). On Relevance, Probabilistic Indexing, and Information Retrieval. Journal of the ACM 7, pages 216–244.
Pejtersen, A. (1989). A Library System for Information Retrieval Based on a Cognitive Task Analysis and Supported by a Icon-Based Interface. In: Belkin, N.; van Rijsbergen, C. (eds.): Proceedings of the Twelfth Annual International Acmsigir Conference on Research and Development in Information Retrieval, pages 40–47. ACM, New York.
Van Rijsbergen, C. (1977). A Theoretical Basis for the Use of Co-Occurrence Data in Information Retrieval. Journal of Documentation 33, pages 106–119.
Robertson, S.; Sparck Jones, K. (1976). Relevance Weighting of Search Terms. Journal of the American Society for Information Science 27, pages 129–146.
Robertson, S. (1977). The Probability Ranking Principle in IR. Journal of Documentation 33, pages 294–304.
Salton, G.; Buckley, C.; Yu, C. (1983). An Evaluation of Term Dependence Models in Information Retrieval. In: Salton, G.; Schneider, H.-J. (eds.): Research and Development in Information Retrieval, pages 151–173. Springer, Berlin et al.
Wong, S.; Yao, Y. (1990). A Generalized Binary Probabilistic Independence Model. Journal of the American Society for Information Science 41(5), pages 324–329.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1993 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Fuhr, N. (1993). Representations, Models and Abstractions in Probabilistic Information Retrieval. In: Opitz, O., Lausen, B., Klar, R. (eds) Information and Classification. Studies in Classification, Data Analysis and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-50974-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-50974-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-56736-3
Online ISBN: 978-3-642-50974-2
eBook Packages: Springer Book Archive