Towards Large Scale Cross-Media Retrieval via Modeling Heterogeneous Information and Exploring an Efficient Indexing Scheme
- 1.7k Downloads
With the rapid development of Internet and multimedia technology, cross-media retrieval is concerned to retrieve all the related media objects with multi-modality by submitting a query media object. In this paper, we propose a novel method which is dedicate to achieve effective and accurate cross-media retrieval. Firstly, a Multi-modality Semantic Relationship Graph (MSRG) is constructed by using the semantic correlation amongst the media objects with multi-modality. Secondly, all the media objects in MSRG are mapped onto an isomorphic semantic space. Further, an efficient indexing MK-tree based on heterogeneous data distribution is proposed to manage the media objects within the semantic space and improve the performance of cross-media retrieval. Extensive experiments on real large scale cross-media datasets indicate that our proposal dramatically improves the accuracy and efficiency of cross-media retrieval, outperforming the existing methods significantly.
KeywordsQuery Processing Canonical Correlation Analysis Average Precision Range Query Semantic Concept
Unable to display preview. Download preview PDF.
- 2.Smeaton, A.F., Over, P., Kraaij, W.: Evaluation campaigns and TRECVid. In: Proc. of MIR (2006)Google Scholar
- 3.Paramita, M., Sanderson, M., Clough, P.: Diversity in photo retrieval: overview of the Image CLEF photo task 2009. CLEF Working Notes (2009)Google Scholar
- 4.Naphade, M., Smith, J.R., Tesic, J., Chang, S.-F., Hsu, W., Kennedy, L., Hauptmann, A., Curtis, J.: Large-Scale Concept Ontology for Multimedia. IEEE Multimedia Magazine 13(3) (2006)Google Scholar
- 6.Zhang, H., Zhuang, Y., Wu, F.: Cross-modal correlation learning for clustering on image-audio dataset. In: ACM Multimeida (2007)Google Scholar
- 8.Ciaccia, P., Patella, M., Zezula, P.: M-tree: An efficient access method for similarity search in metric spaces. In: Proc of the VLDB Conference, pp. 426–435 (1997)Google Scholar