A Grid-Based Infrastructure for Distributed Retrieval

  • Conference paper
Research and Advanced Technology for Digital Libraries (ECDL 2007)

Abstract

In large-scale distributed retrieval, challenges of latency, heterogeneity, and dynamicity emphasise the importance of infrastructural support in reducing the development costs of state-of-the-art solutions. We present a service-based infrastructure for distributed retrieval which blends middleware facilities and a design framework to ‘lift’ the resource sharing approach and the computational services of a European Grid platform into the domain of e-Science applications. In this paper, we give an overview of the Diligent Search Framework and illustrate its exploitation in the field of Earth Science.

Editor information

László Kovács, Norbert Fuhr, Carlo Meghini

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Simeoni, F. et al. (2007). A Grid-Based Infrastructure for Distributed Retrieval. In: Kovács, L., Fuhr, N., Meghini, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2007. Lecture Notes in Computer Science, vol 4675. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74851-9_14

  • DOI: https://doi.org/10.1007/978-3-540-74851-9_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74850-2

  • Online ISBN: 978-3-540-74851-9

  • eBook Packages: Computer Science, Computer Science (R0)
