Skip to main content

Document Filtering for Fast Ranking

  • Conference paper
SIGIR ’94

Abstract

Ranking techniques are effective for finding answers in document collections but the cost of evaluation of ranked queries can be unacceptably high. We propose an evaluation technique that reduces both main memory usage and query evaluation time. based on early recognition of which documents are likely to be highly ranked. Our experiments show that, for our test data, the proposed technique evaluates queries in 20% of the time and 2% of the memory taken by the standard inverted file implementation, without degradation in retrieval effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. C. Buckley and A.F. Lewit. Optimisation of inverted vector searches. In Proc. ACM-SIGIR International Conference on Research and Development in Information Retrieval, pages 97–110, Montreal, Canada, June 1985.

    Google Scholar 

  2. W.B. Frakes and R. Baeza-Yates, editors. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, New Jersey, 1992.

    Google Scholar 

  3. D. Harman and C. Candela. Retrieving records from a gigabyte of text on a minicomputer using statistical ranking. Journal of the American Society for Information Science, 41 (8): 581–589, 1990.

    Article  Google Scholar 

  4. D. Lucarella. A document retrieval system based upon nearest neighbour searching. Journal of Information Science, 14: 25–33, 1988.

    Article  Google Scholar 

  5. A. Moffat and J. Zobel. Parameterised compression for sparse bitmaps. ln Proc. ACM-SIGIR International Conference on Research and Development in Information. Retrieval, pages 274–285, Copenhagen, Denmark, June 1992. ACM Press.

    Google Scholar 

  6. A. Moffat and J. Zobel. Fast ranking in limited space. Technical Report 93/11, Department of Computer Science, The University of Melbourm, 1993. Submitted to the 1994 Data Engineering conference.

    Google Scholar 

  7. National Institute of Standards and Technology. Proc. Text Retrieval Conference (TRAC),Washington, November 1992. Special Publication 500–207.

    Google Scholar 

  8. S.A. Perry and P. Willett. A reniew of the use of inverted files for best matchs searching in information retrieval systems. Journal of Information Science, 6: 59–66, 1983.

    Article  Google Scholar 

  9. M. Persin, J. Zobel, and R. Sacks-Davis. Fast document ranking for large scale information retrieval. Technical Report 94/1, Collaborative Information Technology Research Institute, Department of Computer Science, Royal Melbourne Institute of Technology, Australia, 1994.

    Google Scholar 

  10. G. Salton. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, Reading, MA, 1989.

    Google Scholar 

  11. G. Salton and M.J. McGill. Introduction to Modern Information Retrieval. McGraw-Hill, New York, 1983.

    MATH  Google Scholar 

  12. J. Zobel, A. Moffat, and R. Sacks-Davis. An efficient indexing technique for full-text database systems. In Proc. International Conference on Very Large Databases, pages 352–362, Vancouver, Canada, August 1992.

    Google Scholar 

  13. J. Zobel, A. Moffat, and R. Sacks-Davis. Memory-efficient ranking of document collections. Technical Report TR-92–53, Collaborative Information Technology Research Institute, RMIT and The University of Melbourne, Melbourne, Australia, August 1992.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag London Limited

About this paper

Cite this paper

Persin, M. (1994). Document Filtering for Fast Ranking. In: Croft, B.W., van Rijsbergen, C.J. (eds) SIGIR ’94. Springer, London. https://doi.org/10.1007/978-1-4471-2099-5_35

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-2099-5_35

  • Publisher Name: Springer, London

  • Print ISBN: 978-3-540-19889-5

  • Online ISBN: 978-1-4471-2099-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics