Abstract
In large-scale distributed retrieval, challenges of latency, heterogeneity, and dynamicity emphasise the importance of infrastructural support in reducing the development costs of state-of-the-art solutions. We present a service-based infrastructure for distributed retrieval which blends middleware facilities and a design framework to ‘lift’ the resource sharing approach and the computational services of a European Grid platform into the domain of e-Science applications. In this paper, we give an overview of the Diligent Search Framework and illustrate its exploitation in the field of Earth Science.
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Simeoni, F. et al. (2007). A Grid-Based Infrastructure for Distributed Retrieval. In: Kovács, L., Fuhr, N., Meghini, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2007. Lecture Notes in Computer Science, vol 4675. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74851-9_14
DOI: https://doi.org/10.1007/978-3-540-74851-9_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74850-2
Online ISBN: 978-3-540-74851-9
eBook Packages: Computer Science, Computer Science (R0)