Towards Large Scale Cross-Media Retrieval via Modeling Heterogeneous Information and Exploring an Efficient Indexing Scheme

  • Bo Lu
  • Guoren Wang
  • Ye Yuan
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7633)


With the rapid development of Internet and multimedia technology, cross-media retrieval is concerned to retrieve all the related media objects with multi-modality by submitting a query media object. In this paper, we propose a novel method which is dedicate to achieve effective and accurate cross-media retrieval. Firstly, a Multi-modality Semantic Relationship Graph (MSRG) is constructed by using the semantic correlation amongst the media objects with multi-modality. Secondly, all the media objects in MSRG are mapped onto an isomorphic semantic space. Further, an efficient indexing MK-tree based on heterogeneous data distribution is proposed to manage the media objects within the semantic space and improve the performance of cross-media retrieval. Extensive experiments on real large scale cross-media datasets indicate that our proposal dramatically improves the accuracy and efficiency of cross-media retrieval, outperforming the existing methods significantly.


Query Processing Canonical Correlation Analysis Average Precision Range Query Semantic Concept 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Lew, M., Sebe, N., Djeraba, C., Jain, R.: Content-based multimeida information retrieval: State-of-the-art and challenges. ACM Trans. Multimeida Comput., Commun., Applic. 2(1), 1–19 (2006)CrossRefGoogle Scholar
  2. 2.
    Smeaton, A.F., Over, P., Kraaij, W.: Evaluation campaigns and TRECVid. In: Proc. of MIR (2006)Google Scholar
  3. 3.
    Paramita, M., Sanderson, M., Clough, P.: Diversity in photo retrieval: overview of the Image CLEF photo task 2009. CLEF Working Notes (2009)Google Scholar
  4. 4.
    Naphade, M., Smith, J.R., Tesic, J., Chang, S.-F., Hsu, W., Kennedy, L., Hauptmann, A., Curtis, J.: Large-Scale Concept Ontology for Multimedia. IEEE Multimedia Magazine 13(3) (2006)Google Scholar
  5. 5.
    Hotelling, H.: Relations between two sets of variates. Biometrike 28, 321–377 (1936)zbMATHGoogle Scholar
  6. 6.
    Zhang, H., Zhuang, Y., Wu, F.: Cross-modal correlation learning for clustering on image-audio dataset. In: ACM Multimeida (2007)Google Scholar
  7. 7.
    Yang, Y., Zhuang, Y., Wu, F.: Harmonizing hierarchical manifolds for multimedia document semantics understanding and cross-media retrieval. IEEE Transactions on Multimedia 10, 437–446 (2008)CrossRefGoogle Scholar
  8. 8.
    Ciaccia, P., Patella, M., Zezula, P.: M-tree: An efficient access method for similarity search in metric spaces. In: Proc of the VLDB Conference, pp. 426–435 (1997)Google Scholar
  9. 9.
    Christian, B.: A cost model for query processing in high dimensional data spaces. ACM Transactions on Database Systems 25, 129–178 (2000)CrossRefGoogle Scholar
  10. 10.
    Zhuang, Y., Li, Q., Chen, L.: A Unified Indexing Structure for Efficient Cross-Media Retrieval. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds.) DASFAA 2009. LNCS, vol. 5463, pp. 677–692. Springer, Heidelberg (2009)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Bo Lu
    • 1
  • Guoren Wang
    • 1
  • Ye Yuan
    • 1
  1. 1.Key Laboratory of Medical Image Computing, Ministry of Education, School of Information Science & EngineeringNortheastern UniversityChina

Personalised recommendations