Violence Video Classification Performance Using Deep Neural Networks

  • Ashikin Ali
  • Norhalina Senan
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 700)


Violence, in this context, refers to content that one would not let children see in movies or web videos. Detecting it is a challenging problem due to strong content variations among the positive instances. To address this, a deep neural network is proposed to classify violent content in videos. Deep neural networks have recently proven effective in natural language processing, fraud detection, social media, text classification, and image classification. Although conventional methods have been applied to this problem, they appear insufficiently accurate and do not adapt well to particular websites or user needs. Therefore, the purpose of this study is to assess classification performance on violence videos using a Deep Neural Network (DNN). In this paper, different DNN architectures, varying in the number of hidden layers and hidden nodes, are implemented using a trial-and-error method and an equation-based method to examine the effect of the number of hidden layers and hidden nodes on classification performance. The results indicate an accuracy rate of 53% for the trial-and-error approach and 51% for the equation-based approach.
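The two ways of choosing hidden-layer sizes mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which equation the authors used, so `equation_based_nodes` below substitutes a common rule of thumb (the geometric mean of the input and output sizes) purely as an assumption. All function names and the candidate values are hypothetical, and the scoring function stands in for training and validating a real network.

```python
import itertools
import math


def equation_based_nodes(n_inputs, n_outputs):
    """Assumed rule of thumb: geometric mean of input and output sizes.

    This is a stand-in; the paper's actual equation is not given in the abstract.
    """
    return max(1, round(math.sqrt(n_inputs * n_outputs)))


def trial_and_error_candidates(layer_counts, node_counts):
    """Enumerate every (layers, nodes) pairing as a tuple of hidden-layer widths."""
    return [
        (nodes,) * layers
        for layers, nodes in itertools.product(layer_counts, node_counts)
    ]


def pick_best(candidates, evaluate):
    """Return the candidate with the highest score from evaluate().

    In practice evaluate() would train a DNN with the given hidden-layer
    widths and return its validation accuracy.
    """
    return max(candidates, key=evaluate)


if __name__ == "__main__":
    # Hypothetical sizes: 100 input features, 4 output units.
    print(equation_based_nodes(100, 4))
    print(trial_and_error_candidates([1, 2], [8, 16]))
```

The trial-and-error route searches a grid of architectures and keeps the best scorer, while the equation-based route fixes the width in a single step from the input and output dimensions.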


Violence video; Artificial neural network; Deep neural network; Classification



This research was funded by the Ministry of Higher Education (MOHE) under the Fundamental Research Grant Scheme (FRGS), Vot. No. 1608. It was also partially supported by the Office for Research, Innovation, Commercialization and Consultancy Management (ORICC), UTHM.



Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia (UTHM), Parit Raja, Batu Pahat, Malaysia
