Skip to main content

Representations, Models and Abstractions in Probabilistic Information Retrieval

  • Conference paper
Information and Classification

Part of the book series: Studies in Classification, Data Analysis and Knowledge Organization ((STUDIES CLASS))

  • 526 Accesses

Abstract

We show that most approaches in probabilistic information retrieval can be regarded as a combination of the three concepts representation, model and abstraction. First, documents and queries have to be represented in a certain form, e.g. as a sets of terms. Probabilistic models use certain assumptions about the distribution of the elements of the representation in relevant and nonrelevant documents in order to estimate the probability of relevance of a document w.r.t. a query. Older approaches based on query-specific relevance feedback are restricted to simple representations and models. Using abstractions from specific documents, terms and queries, more powerful approaches can be realized.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Biebricher, P.; Fuhr, N.; Knorz, G.; Lustig, G.; Schwantner, M. (1988). The Automatic Indexing System AIR/PHYS-from Research to Application. In: Chiaramella, Y. (ed.): 11th International Conference on Research and Development in Information Retrieval, pages 333–342. Presses Universitaires de Grenoble, Grenoble, France.

    Google Scholar 

  • Cooper, W. (1991). Some Inconsistencies and Misnomers in Probabilistic IR. In: Bookstein, A.; Chiaramella, Y.; Salton, G.; Raghavan, V. (eds.): Proceedings of the Fourteenth Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, pages 57–61. ACM, New York.

    Google Scholar 

  • Croft, W.; Harper, D. (1979). Using Probabilistic Models of Document Retrieval without Relevance Information. Journal of Documentation 35, pages 285–295.

    Article  Google Scholar 

  • Croft, W. B.; Thompson, R. H. (1987). I3R: A New Approach to the Design of Document Retrieval Systems. Journal of the American Society for Information Science 38(6), pages 389–404.

    Article  Google Scholar 

  • Fuhr, N.; Hüther, H. (1989). Optimum Probability Estimation from Empirical Distributions. Information Processing and Management 25(5), pages 493–507.

    Article  Google Scholar 

  • Fuhr, N.; Pfeifer, U. (1991). Combining Model-Oriented and Description-Oriented Approaches for Probabilistic Indexing. In: Bookstein, A.; Chiaramella, Y.; Salton, G.; Raghavan, V. (eds.): Proceedings of the Fourteenth Annual International ACM/Sigir Conference on Research and Development in Information Retrieval, pages 46–56. ACM, New York.

    Google Scholar 

  • Fuhr, N. (1989a). Models for Retrieval with Probabilistic Indexing. Information Processing and Management 25(1), pages 55–72.

    Article  Google Scholar 

  • Fuhr, N. (1989b). Optimum Polynomial Retrieval Functions Based on the Probability Ranking Principle. ACM Transactions on Information Systems 7(3), pages 183–204.

    Article  Google Scholar 

  • Fuhr, N. (1992). Probabilistic Models in Information Retrieval. The Computer Journal 35.

    Google Scholar 

  • Fuhr, N.; Hartmann, S.; Knorz, G.; Lustig, G.; Schwantner, M.; Tzeras, K. (1991). AIR/X-a Rule-Based Multistage Indexing System for Large Subject Fields. In: Proceedings of the RIAO’91, Barcelona, Spain, April 2–5, 1991, pages 606-623.

    Google Scholar 

  • Lam, K.; Yu, C. (1982). A Clustered Search Algorithm Incorporating Arbitrary Term Dependencies. ACM Transactions on Database Systems 7.

    Google Scholar 

  • Maron, M.; Kuhns, J. (1960). On Relevance, Probabilistic Indexing, and Information Retrieval. Journal of the ACM 7, pages 216–244.

    Article  Google Scholar 

  • Pejtersen, A. (1989). A Library System for Information Retrieval Based on a Cognitive Task Analysis and Supported by a Icon-Based Interface. In: Belkin, N.; van Rijsbergen, C. (eds.): Proceedings of the Twelfth Annual International Acmsigir Conference on Research and Development in Information Retrieval, pages 40–47. ACM, New York.

    Google Scholar 

  • Van Rijsbergen, C. (1977). A Theoretical Basis for the Use of Co-Occurrence Data in Information Retrieval. Journal of Documentation 33, pages 106–119.

    Article  Google Scholar 

  • Robertson, S.; Sparck Jones, K. (1976). Relevance Weighting of Search Terms. Journal of the American Society for Information Science 27, pages 129–146.

    Article  Google Scholar 

  • Robertson, S. (1977). The Probability Ranking Principle in IR. Journal of Documentation 33, pages 294–304.

    Article  Google Scholar 

  • Salton, G.; Buckley, C.; Yu, C. (1983). An Evaluation of Term Dependence Models in Information Retrieval. In: Salton, G.; Schneider, H.-J. (eds.): Research and Development in Information Retrieval, pages 151–173. Springer, Berlin et al.

    Chapter  Google Scholar 

  • Wong, S.; Yao, Y. (1990). A Generalized Binary Probabilistic Independence Model. Journal of the American Society for Information Science 41(5), pages 324–329.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1993 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Fuhr, N. (1993). Representations, Models and Abstractions in Probabilistic Information Retrieval. In: Opitz, O., Lausen, B., Klar, R. (eds) Information and Classification. Studies in Classification, Data Analysis and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-50974-2_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-50974-2_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-56736-3

  • Online ISBN: 978-3-642-50974-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics