Advertisement

Advanced Resource Selection for Federated Enterprise Search

  • Matthias Wauer
  • Daniel Schuster
  • Alexander Schill
Part of the Lecture Notes in Business Information Processing book series (LNBIP, volume 97)

Abstract

Distributed information retrieval is a well-known approach for accessing heterogeneous, highly autonomous sources of unstructured information. Selecting and querying only a number of relevant sources can help improve its performance, but most resource selection algorithms are limited to syntactic comparisons.

We present a framework for applying resource selection in the context of a semantic federated product information system, and evaluate the performance of the well-known CORI resource selection algorithm in this context.

Keywords

Resource selection distributed information retrieval federated search enterprise search enterprise information systems 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Wauer, M., Schuster, D., Meinecke, J.: Aletheia: an architecture for semantic federation of product information from structured and unstructured sources. In: Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services, iiWAS 2010, pp. 325–332. ACM, New York (2010)Google Scholar
  2. 2.
    Callan, J.: Distributed information retrieval. In: Advances in Information Retrieval, pp. 127–150. Kluwer Academic Publishers (2000)Google Scholar
  3. 3.
    Callan, J.P., Lu, Z., Croft, W.B.: Searching distributed collections with inference networks. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1995, pp. 21–28. ACM, New York (1995)Google Scholar
  4. 4.
    Si, L., Lu, J., Callan, J.: Distributed information retrieval with skewed database size distributions. In: Proceedings of the 2003 Annual National Conference on Digital Government Research. dg.o 2003, pp. 1–6. Digital Government Society of North America (2003)Google Scholar
  5. 5.
    Si, L., Callan, J.: Relevant document distribution estimation method for resource selection. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2003, pp. 298–305. ACM, New York (2003)Google Scholar
  6. 6.
    Shokouhi, M.: Central-rank-based collection selection in uncooperative distributed information retrieval. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECIR 2007. LNCS, vol. 4425, pp. 160–172. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  7. 7.
    Thomas, P., Shokouhi, M.: SUSHI: scoring scaled samples for server selection. In: SIGIR 2009: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 419–426. ACM, New York (2009)Google Scholar
  8. 8.
    Hong, D., Si, L., Bracke, P., Witt, M., Juchcinski, T.: A joint probabilistic classification model for resource selection. In: Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, pp. 98–105. ACM, New York (2010)Google Scholar
  9. 9.
    Arguello, J., Callan, J., Diaz, F.: Classification-based resource selection. In: Proceeding of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, pp. 1277–1286. ACM, New York (2009)Google Scholar
  10. 10.
    Ferrucci, D., Lally, A.: UIMA: an architectural approach to unstructured information processing in the corporate research environment. Nat. Lang. Eng. 10(3-4), 327–348 (2004)CrossRefGoogle Scholar
  11. 11.
    Clark, J., DeRose, S.: XML Path Language (XPath) version 1.0. Recommendation, World Wide Web Consortium (November 1999), http://www.w3.org/TR/xpath.html
  12. 12.
    Broder, A.: A taxonomy of web search. SIGIR Forum 36, 3–10 (2002)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Matthias Wauer
    • 1
  • Daniel Schuster
    • 1
  • Alexander Schill
    • 1
  1. 1.Faculty of Computer Science, Chair for Computer NetworksTU DresdenDresdenGermany

Personalised recommendations