Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Multimedia Tagging

  • Xiaofeng ZhuEmail author
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_80632


High-level feature extraction; Interactive tagging; Multimedia annotation; Multimedia concept detection; Multimedia labeling; Tag location; Tag recommendation; Tag refinement


Multimedia data indicates large amounts of multi-/rich media data, such as text, graphics, images, music, video, and their combination. The basic elements of multimedia data are text, images, audio, animation, and video. Multimedia tagging is referred to as the process by which a computer system automatically assigns metadata in the form of captions or keywords to multimedia data for describing their content on semantic or syntactic levels. With such metadata, the management, summarization, and retrieval of multimedia content can be easily accomplished. The tags (i.e., metadata or captions or keywords) can be directly used to index multimedia data. According to the semantic and syntactic content, multimedia data can be assigned one tag or multiple tags. For example, the video on gold fish in...
This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Chen M, Zheng A, Weinberger K. Fast image tagging. In: Proceedings of the 30th International Conference on Machine Learning; 2013. p. 1274–82.Google Scholar
  2. 2.
    Guillaumin M, Mensink T, Verbeek J, Schmid C. Tagprop: discriminative metric learning in nearest neighbor models for image auto-annotation. In: Proceedings of the 12th IEEE Conference on Computer Vision; 2009. p. 309–16.Google Scholar
  3. 3.
    Makadia A., Pavlovic V., Kumar S. A new baseline for image annotation. In: Proceedings of the 10th European Conference on Computer Vision; 2008. p. 316–29.Google Scholar
  4. 4.
    Tran HT, Fromont E, Jacquenet F, Jeudy B, Martins A, et al. Unsupervised video tag correction system. In: Extraction et gestion des connaissances; 2013. p. 461–6.Google Scholar
  5. 5.
    Wang M, Ni B, Hua XS, Chua TS. Assistive tagging: a survey of multimedia tagging with human-computer joint exploration. ACM Comput Surv. 2012;44(4):25.CrossRefGoogle Scholar
  6. 6.
    Yang J, Jiang YG, Hauptmann AG, Ngo CW. Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval; 2007. p. 197–206.Google Scholar
  7. 7.
    Zhao WL, Wu X, Ngo CW. On the annotation of web videos by efficient near-duplicate search. IEEE Trans Multimed. 2010;12(5):448–61.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Guangxi Normal UniversityGuilinPeople’s Republic of China

Section editors and affiliations

  • Jeffrey Xu Yu
    • 1
  1. 1.The Chinese University of Hong KongHong KongChina