Skip to main content

Decision-Theoretic Resource Selection for Different Data Types in MIND

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2924))

Abstract

In a federated digital library system, it is too expensive to query every accessible library. Resource selection is the task to decide to which libraries a query should be routed. In this paper, we describe a novel technique that is used in the MIND project. Our approach, decision-theoretic framework (DTF), differs from existing algorithms like CORI in two ways: It computes a selection which minimises the overall costs (e.g. retrieval quality, time, money) of the distributed retrieval. And it allows for other data types beside text (e.g., names, years, images), whereas other resource selection techniques are restricted to text.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Callan, J., Connell, M.: Query-based sampling of text databases. ACM Transactions on Information Systems 19(2), 97–130 (2001)

    Article  Google Scholar 

  2. Callan, J., Croft, W., Harding, S.: The INQUERY retrieval system. In: Proceedings of DEXA 1992, 3rd International Conference on Database and Expert Systems Applications, pp. 78–83. Springer, Heidelberg (1992)

    Google Scholar 

  3. Callan, J., Lu, Z., Croft, W.: Searching distributed collections with inference networks. In: Fox, E., Ingwersen, P., Fidel, R. (eds.) Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New York, pp. 21–29. ACM, New York (1995) ISBN 0-89791-714-6

    Chapter  Google Scholar 

  4. Fienberg, S.: The Analysis of Cross-Classified Categorical Data, 2nd edn. MIT Press, Cambridge (1980)

    MATH  Google Scholar 

  5. Freeman, D.: Applied Categorical Data Analysis. Dekker, New York (1987)

    MATH  Google Scholar 

  6. French, J., Powell, A., Callan, J., Viles, C., Emmitt, T., Prey, K., Mou, Y.: Comparing the performance of database selection algorithms. In: Proceedings of the 22nd International Conference on Research and Development in Information Retrieval, New York, pp. 238–245. ACM, New York (1999)

    Google Scholar 

  7. Fuhr, N.: A decision-theoretic approach to database selection in networked IR. ACM Transactions on Information Systems 17(3), 229–249 (1999)

    Article  Google Scholar 

  8. Gravano, L., Garcia-Molina, H.: Generalizing GIOSS to vector-space databases and broker hierarchies. In: Dayal, U., Gray, P., Nishio, S. (eds.) VLDB 1995, Proceedings of 21th International Conference on Very Large Data Bases, Los Altos, California, pp. 78–89. Morgan Kaufman, San Francisco (1995)

    Google Scholar 

  9. Harman, D. (ed.): The Second Text REtrieval Conference (TREC-2), Gaithersburg, Md. 20899. National Institute of Standards and Technology (1994)

    Google Scholar 

  10. Kullback, S.: Information theory and statistics. Dover, New York (1968)

    Google Scholar 

  11. Niblack, W., Barber, R., Equitz, W., Flickner, M., Glasman, E., Petkovic, D., Yanker, P., Faloutsos, C., Taubin, G.: The QBIC project: Querying images by content using color, texture, and shape, vol. 1908, pp. 173–181. SPIE, Bellingham (1993)

    Google Scholar 

  12. Nottelmann, H., Fuhr, N.: Combining DAML+OIL, XSLT and probabilistic logics for uncertain schema mappings in MIND. In: Koch, T., Sølvberg, I.T. (eds.) ECDL 2003. LNCS, vol. 2769, pp. 194–206. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  13. Nottelmann, H., Fuhr, N.: Evaluating different methods of estimating retrieval quality for resource selection. In: Proceedings of the 26st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New York, ACM, New York (2003)

    Google Scholar 

  14. Nottelmann, H., Fuhr, N.: From uncertain inference to probability of relevance for advanced IR applications. In: Sebastiani, F. (ed.) ECIR 2003. LNCS, vol. 2633, pp. 235–250. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  15. Press, W.H., Teukolsky, S.A., Vetterling, W.T., Flannery, B.P. (eds.): Nested Relations and Complex Objects in Databases. Cambridge University Press, Cambridge (1992)

    Google Scholar 

  16. Robertson, S.E., Walker, S., Hancock-Beaulieu, M., Gull, A., Lau, M.: Okapi at TREC. In: Text REtrieval Conference, pp. 21–30 (1992)

    Google Scholar 

  17. Swain, M., Ballard, D.: Color indexing. International Journal of Computer Vision 7(1), 11–32 (1991)

    Article  Google Scholar 

  18. van Rijsbergen, C.J.: A non-classical logic for information retrieval. The Computer Journal 29(6), 481–485 (1986)

    Article  MATH  Google Scholar 

  19. Wong, S., Yao, Y.: On modeling information retrieval with probabilistic inference. ACM Transactions on Information Systems 13(1), 38–68 (1995)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nottelmann, H., Fuhr, N. (2004). Decision-Theoretic Resource Selection for Different Data Types in MIND. In: Callan, J., Crestani, F., Sanderson, M. (eds) Distributed Multimedia Information Retrieval. DIR 2003. Lecture Notes in Computer Science, vol 2924. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24610-7_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24610-7_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20875-4

  • Online ISBN: 978-3-540-24610-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics