Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Cross-Modal Multimedia Information Retrieval

  • Qing LiEmail author
  • Yu Yang
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_90


Cross-media information retrieval; Multi-modal information retrieval


Multimedia information retrieval tries to find the distinctive multimedia documents that satisfy people’s needs within a huge dataset. Due to the vagueness on the representation of multimedia data, usually the user may only have some clues (e.g., a vague idea, a rough query object of the same or even different modality as that of the intended result) rather than concrete and indicative query objects. In such cases, traditional multimedia information retrieval techniques as Query-By-Example (QBE) fails to retrieve what users really want since their performance depends on a set of specifically defined features and carefully chosen query objects. The cross-modal multimedia information retrieval (CMIR) framework consists of a novel multifaceted knowledge base (which is embodied by a layered graph model) to discover the query results on multiple modalities. Such cross-modality paradigm leads to better...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Chang SF, Chen W, Meng HJ, Sundaram H, Zhong D. VideoQ: an automated content based video search system using visual cues. In: Proceedings of the 5th ACM International Conference on Multimedia; 1997.Google Scholar
  2. 2.
    Flickner M, Sawhney H, Niblack W, Ashley J. Query by image and video content: the QBIC system. IEEE Comput. 1995;28(9):23–32.CrossRefGoogle Scholar
  3. 3.
    Huang TS, Mehrotra S, Ramchandran K. Multimedia analysis and retrieval system (MARS) project. In: Proceedings of the 33rd Annual Clinic on Library Application of Data Processing-Digital Image Access and Retrieval; 1996.Google Scholar
  4. 4.
    Smith JR, Chang SF. VisualSEEk: a fully automated content-based image query system. In: Proceedings of the 4th ACM International Conference on Multimedia; 1996.Google Scholar
  5. 5.
    Smith JR, Chang SF. Visually searching the web for content. IEEE Multimedia Mag. 1997;4(3):12–20.CrossRefGoogle Scholar
  6. 6.
    Lu Y, Hu CH, Zhu XQ, Zhang HJ, Yang Q. A unified framework for semantics and feature based relevance feedback in image retrieval systems. In: Proceedings of the 8th ACM International Conference on Multimedia; 2000. p. 31–8.Google Scholar
  7. 7.
    Yang J, Zhuang YT, Li Q. Search for multi-modality data in digital libraries. In: Proceedings of the Second IEEE Pacific-Rim Conference on Multimedia; 2001.Google Scholar
  8. 8.
    Benitez AB, Smith JR, Chang SF. MediaNet: a multimedia information network for knowledge representation. In: Proceedings of the SPIE Conference on Internet Multimedia Management Systems; 2000. p. 1–12.Google Scholar
  9. 9.
    Tansley R. The multimedia thesaurus: an aid for multimedia information retrieval and navigation. Master thesis, Computer Science, University of Southampton; 1998.Google Scholar
  10. 10.
    Yang J, Li Q, Zhuang Y. Octopus: aggressive search of multi-modality data using multifaceted knowledge base. In: Proceedings of the 11th International World Wide Web Conference; 2002. p. 54–64.Google Scholar
  11. 11.
    Hofmann T, Buhmann JM. Pairwise data clustering by deterministic annealing. IEEE Trans Pattern Anal Mach Intell. 1997;19(1):1–14.CrossRefGoogle Scholar
  12. 12.
    Li Q, Yang J, Zhuang YT. Web-based multimedia retrieval: balancing out between common knowledge and personalized views. In: Proceedings of the 2nd International Conference on Web Information Systems Eng; 2001.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.City University of Hong KongHong KongChina

Section editors and affiliations

  • Jeffrey Xu Yu
    • 1
  1. 1.The Chinese University of Hong KongHong KongChina