Multimedia Tools and Applications

, Volume 24, Issue 3, pp 215–232 | Cite as

Merging Results for Distributed Content Based Image Retrieval

  • Stefano Berretti
  • Alberto Del Bimbo
  • Pietro Pala


Searching information through the Internet often requires users to separately contact several digital libraries, use each library interface to author the query, analyze retrieval results and merge them with results returned by other libraries. Such a solution could be simplified by using a centralized server that acts as a gateway between the user and several distributed repositories: The centralized server receives the user query, forwards the user query to federated repositories—possibly translating the query in the specific format required by each repository—and fuses retrieved documents for presentation to the user. To accomplish these tasks efficiently, the centralized server should perform some major operations such as: resource selection, query transformation and data fusion.

In this paper we report on some aspects of MIND, a system for managing distributed, heterogeneous multimedia libraries (MIND, 2001, In particular, this paper focusses on the issue of fusing results returned by different image repositories. The proposed approach is based on normalization of matching scores assigned to retrieved images by individual libraries. Experimental results on a prototype system show the potential of the proposed approach with respect to traditional solutions.

digital libraries collection fusion distributed retrieval 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    S. Adali, P.A. Bonatti, M.L. Sapino, and V.S. Subrahmanian, "A multi-similarity algebra," in Proc. of ACM SIGMOD Conf. on Management of Data, Seattle, WA, June 1998, pp. 402-413.Google Scholar
  2. 2.
    S. Berretti, A. Del Bimbo, and P. Pala, "Using indexing structures for resource descriptors extraction from distributed image repositories," in Proc. IEEE Int. Conf. on Multimedia and Expo, Lousanne, Switzerland, Aug. 2002, Vol. 2, pp. 197-200.Google Scholar
  3. 3.
    J. Callan and M. Connell, "Query-based sampling of text databases," ACM Transactions on Information Systems, Vol. 19, No. 2, pp. 97-130, 2001.Google Scholar
  4. 4.
    W. Chang, D. Murthy, A. Zhang, and T. Syeda-Mahmood, "Metadatabase and search agent for multimedia database access over internet," Int. Conf. on Multimedia Computing and Systems (ICMCS'97), Ottawa, Canada, June 1997.Google Scholar
  5. 5.
    W. Chang, G. Sheikholaslami, A. Zhang, and T. Syeda-Mahmood, "Efficient resource selection in distributed visual information retrieval," in Proc. of ACM Multimedia'97, Seattle, 1997.Google Scholar
  6. 6.
    S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, and J.Widom, "The TSIMMIS project: Integration of heterogeneous information sources," in Proceedings of IPSJ Conference, Tokyo, Japan, Oct. 1994, pp. 7-18.Google Scholar
  7. 7.
    P. Ciaccia, M. Patella, and P. Zezula, "M-tree: An efficient access method for similarity search in metric spaces," in Proc. of the Int. Conf. on Very Large Databases, Athens, Greece, 1997.Google Scholar
  8. 8.
    W.F. Cody, L.M. Haas, W. Niblack, M. Arya, M.J. Carey, R. Fagin, M. Flickner, D. Lee, D. Petkovic, P.M. Schwarz, J. Thomas, M. Tork Roth, J.H. Williams, and E.L. Wimmers, "Querying multimedia data from multiple repositories by content: The garlic project," IFIP 2.6 Third Working Conference on Visual Database Systems (VDB-3), Lausanne, Switzerland, March 1995, pp. 17-35.Google Scholar
  9. 9.
    R. Fagin, R. Kumar, and D. Sivakumar, "Comparing top klists," in Proc. of the ACM-SIAM Symposium on Discrete Algorithms (SODA'03), Baltimore, MD, Jan. 2003, pp. 28-36.Google Scholar
  10. 10.
    N. Fuhr, "Optimum database selection in networked IR," in Proc. of the SIGIR'96 Workshop on Networked Information Retrieval, Zurich, Switzerland, Aug. 1996.Google Scholar
  11. 11.
    L. Gravano and H. Garcia-Molina, "Generalizing gloss to vector-space databases and broker hierarchies," in Proc. of the 21st Int. Conf. on Very Large Data Bases, 1995, pp. 78-89.Google Scholar
  12. 12.
    I.F. Ilyas, W.G. Aref, and A.K. Elmagarmid, "Joining ranked inputs in practice," in Proc. of the VLDB Conference, Hong Kong, China, 2002, pp. 950-961.Google Scholar
  13. 13.
    S.T. Kirsch, Document Retrieval Over Networks wherein Ranking and Relevance Scores are Computed at the Client for Multiple Database Documents. US patent 5659732, 1999.Google Scholar
  14. 14.
    K.L. Kwok, L. Grunfeld, and D.D. Lewis, "TREC-3 Ad-hoc, routing retrieval and thresholding experiment using PIRCS," in Proc. of TREC-3, 1995, pp. 247-255.Google Scholar
  15. 15.
    MIND: Resource Selection and Data Fusion for Multimedia International Digital Libraries. EU project IST-2000-26061. Scholar
  16. 16.
    N. Roussopoulos, S. Kelley, and F. Vincent, "Nearest neighbor queries," in Proc. of the 1995 ACM SIGMOD Int. Conf. on Management of data, San Jose, CA, May 1995, pp. 71-79.Google Scholar
  17. 17.
    L. Si and J. Callan, "Using sampled data and regression to merge search engine results," in Proc. of Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, Tampere, Finland, 2002, pp. 19-26.Google Scholar
  18. 18.
    L. Si and J. Callan, "A semisupervised learning method to merge search engine results," ACM Transactions on Information Systems, Vol. 21, No. 4, pp. 457-491, 2003.Google Scholar
  19. 19.
    V.S. Subrahmanian, S. Adali, A. Brink, R. Emery, J. Lu, A. Rajput, T.J. Rogers, R. Ross, and C. Ward, HERMES: Heterogeneous Reasoning and Mediator System. Scholar
  20. 20.
    A. Tomasic, L. Raschid, and P. Valduriez, "Scaling heterogeneous databases and the design of disco," in Proc. of the Int. Conf. on Distributed Computer Systems, Hong-Kong, 1996.Google Scholar
  21. 21.
    E.M. Vorhees, N.K. Gupta, and B. Johnson-Laird, "Learning collection fusion strategies," in Proc. ACMSIGIR'95, 1995, pp. 172–179.Google Scholar
  22. 22.
    E.M.Vorhees, N.K. Gupta, and B. Johnson-Laird, "The collection fusion problem," in The Third Text REtrieval Conference (TREC-3).Google Scholar

Copyright information

© Kluwer Academic Publishers 2004

Authors and Affiliations

  • Stefano Berretti
    • 1
  • Alberto Del Bimbo
    • 1
  • Pietro Pala
    • 1
  1. 1.Dipartimento Sistemi e InformaticaUniversità di FirenzeFirenzeItaly.

Personalised recommendations