A Knowledge Engineering Approach for Complex Violence Identification in Movies

  • Thanassis Perperis
  • Sofia Tsekeridou
Part of the IFIP The International Federation for Information Processing book series (IFIPAICT, volume 247)


Along with the rapid increase of available multimedia data, comes the proliferation of objectionable content such as violence and pornography. We need efficient tools for automatically identifying, classifying and filtering out harmful or undesirable video content for the protection of sensitive user groups (e.g. children). In this paper we present a multimodal approach towards the identification and semantic analysis of violent content in video data. We propose a layered architecture and focus on ontological and knowledge engineering aspects of video analysis. We demonstrate the development of two ontologies defining violent hints hierarchy that low level analysis, in visual and audio modality, respectively should identify. Violence domain ontology, as a reality representation, defines higher-level semantics. Taking under consideration extracted violent hints, spatio-temporal relations and behavior patterns higher-level semantics automatic inference is possible.


Video Data Domain Ontology Late Fusion Violent Content Audio Modality 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Chandrasekaran B, Josephson J R, Benjamins R V: What Are Ontologies, and Why Do We Need Them? IEEE Intelligent Systems, 14,1 (1999), 20–26.CrossRefGoogle Scholar
  2. 2.
    Manjunath B S, Salembier P, Sikora T: Introduction to MPEG-7: Multimedia Content Description Interface. John Wiley and Sons / England (2002).Google Scholar
  3. 3.
    Datta A, Mubarak S, Lobo N: Person-on-Person Violence Detection in Video Data. Proc. of ICPR2002, Quebec City, Canada, Aug. (2002), 433–438.Google Scholar
  4. 4.
    Smith M K, Welty C, McGuinness D L: OWL Web Ontology Language Guide. W3C Recommendation 10 February 2004, Scholar
  5. 5.
    Hunter J: Enhancing the Semantic Interoperability of Multimedia through a Core Ontology. IEEE Transactions on Circuits and Systems for Video Technology. Special Issue on Conceptual and Dynamical Aspects of Multimedia Content Description, 13,1 (2003), 49–58.Google Scholar
  6. 6.
    Nam J, Tewfik A H: Event-driven video abstraction and visualisation. Multimedia Tools and Applications, 16(1–2), 55–77, 2002.MATHCrossRefGoogle Scholar
  7. 7.
    Vasconcelos N, Lippman A: Towards semantically meaningful feature spaces for the characterization of video content. Proc. of ICIP1997, Washington, DC, USA, Oct 1997, vol.1, 25–28.Google Scholar
  8. 8.
    Giannakopoulos T, Kosmopoulos D, Aristidou A, Theodoridis S: Violence Content Classification Using Audio Features. Proc. of 4th Hellenic Conference on Artificial Intelligence (SETN’06), Heraklion, Crete, Greece, May 18–20, 2006.Google Scholar
  9. 9.
    Pratikakis I, Tsekeridou S: Use Case: Semantic Media Analysis for Intelligent Retrieval. W3C Multimedia Semantics Incubator Group, Scholar
  10. 10.
    Makris A, Kosmopoulos D, Perantonis S, Theodoridis S: Hierarchical feature fusion for visual tracking. Accepted to be published in Proceedings of IEEE International Conference on Image Processing 2007 (ICIP2007).Google Scholar

Copyright information

© International Federation for Information Processing 2007

Authors and Affiliations

  • Thanassis Perperis
    • 1
  • Sofia Tsekeridou
    • 2
  1. 1.University of AthensGreece
  2. 2.Athens Information TechnologyGreece

Personalised recommendations