Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Video Content Analysis

  • Alexander HauptmannEmail author
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_1018


Semantic analysis of video; Video analysis; Video content processing


Video content analysis deals with the extraction of metadata from raw video to be used as components for further processing in applications such as search, summarization, classification or event detection. The purpose of video content analysis is to provide extracted features and identification of structure that constitute building blocks for video retrieval, video similarity finding, summarization and navigation. Video content analysis transforms the audio and image stream into a set of semantically meaningful representations. The ultimate goal is to extract structural and semantic content automatically, without any human intervention, at least for limited types of video domains. Algorithms to perform content analysis include those for detecting objects in video, recognizing specific objects, persons, locations, detecting dynamic events in video, associating keywords with image regions or motion...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Chang S, Sundaram H. Structural and semantic analysis of video. In: Proceedings of the IEEE International Conference on Multimedia and Expo; 2000. p. 687–90.Google Scholar
  2. 2.
    Hanjalic A. Content-based analysis of digital video. Boston: Kluwer Academic; 2004.zbMATHGoogle Scholar
  3. 3.
    Jay KC. Video content analysis using multimodal information: for movie content extraction, indexing and representation. Norwell: Kluwer Academic; 2003.Google Scholar
  4. 4.
    Marques O, Furht B. Content-based image and video retrieval. Norwell: Kluwer Academic; 2002.zbMATHCrossRefGoogle Scholar
  5. 5.
    Guan L, Kung SY, Larsen J, editors. Multimedia image and video processing. Boca Raton: CRC; 1999.Google Scholar
  6. 6.
    Naphade MR, Smith JR. On the detection of semantic concepts at TRECVID. In: Proceedings of the 12th ACM International Conference on Multimedia; 2004. p. 660–7.Google Scholar
  7. 7.
    Smeulders AW, Worring M, Santini S, Gupta A, Jain R. Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell. 2000;22(12):1349–80.CrossRefGoogle Scholar
  8. 8.
    Smith MA, Kanade T. Multimodal video characterization and summarization. Kluwer, 2005. Series in video computing, Vol. 9.Google Scholar
  9. 9.
    Snoek C, Worring M, A.G H. Learning rich semantics from news video archives by style analysis. ACM Trans Multimedia Comp Comm Appl. 2006;2(2):91–108.CrossRefGoogle Scholar
  10. 10.
    The informedia digital video project. http://www.informedia.cs.cmu.edu.
  11. 11.
    Worring M, Snoek CG. Semantic indexing and retrieval of video. In: Proceedings of the 14th ACM International Conference on Multimedia; p. 13.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Carnegie Mellon UniversityPittsburghUSA

Section editors and affiliations

  • Vincent Oria
    • 1
  • Shin'ichi Satoh
    • 2
  1. 1.Dept. of Computer ScienceNew Jersey Inst. of TechnologyNewarkUSA
  2. 2.Digital Content and Media Sciences ReseaMultimedia Information Research DivisionNational Institute of InformaticsTokyoJapan