Learning Motion Regularity for Temporal Video Segmentation and Anomaly Detection

Daha, Fatima Zohra; Shah, Shishir K.

doi:10.1007/978-3-030-41404-7_9

Fatima Zohra Daha¹² &
Shishir K. Shah¹²

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12046))

Included in the following conference series:

Asian Conference on Pattern Recognition

1471 Accesses

Abstract

Detecting anomalous events in a long video sequence is a challenging task due to the subjective definition of “anomalous” as well as the duration of such events. Anomalous events are usually short-lived and occur rarely. We propose a semi-supervised solution to detect such events. Our method is able to capture the video segment where the anomaly happens via the analyses of the interaction between the spatially co-located interest points. The evolution of their motion characteristics is modeled and abrupt changes are used to temporally segment the videos. Spatiotemporal and motion features are then extracted to model standard events and identify the anomalous segments using a one-class classifier. Quantitative and qualitative experiments on publicly available anomaly detection dataset capturing real-world scenarios show that the proposed method outperforms state-of-the-art approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Chasanis, V., Kalogeratos, A., Likas, A.: Movie segmentation into scenes and chapters using locally weighted bag of visual words (2009)
Google Scholar
Cui, X., Liu, Q., Gao, M., Metaxas, D.N.: Abnormal detection using interaction energy potentials. In: CVPR 2011, pp. 3161–3167 (2011)
Google Scholar
Daha, F., Hewavitharana, S.: Deep neural architecture with character embedding for semantic frame detection, pp. 302–307 (2019)
Google Scholar
Potapov, D., Douze, M., Harchaoui, Z., Schmid, C.: Category-specific video summarization. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8694, pp. 540–555. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10599-4_35
Chapter Google Scholar
Grundmann, M., Kwatra, V., Han, M., Essa, I.: Efficient hierarchical graph-based video segmentation. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2141–2148 (2010)
Google Scholar
Guo, Z., Wu, F., Chen, H., Yuan, J., Cai, C.: Pedestrian violence detection based on optical flow energy characteristics. In: 2017 4th International Conference on Systems and Informatics (ICSAI), pp. 1261–1265 (2017)
Google Scholar
Hanson, A., PNVR, K., Krishnagopal, S., Davis, L.: Bidirectional convolutional LSTM for the detection of violence in videos. In: Leal-Taixé, L., Roth, S. (eds.) ECCV 2018. LNCS, vol. 11130, pp. 280–295. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-11012-3_24
Chapter Google Scholar
Hasan, M., Choi, J., Neumann, J., Roy-Chowdhury, A.K., Davis, L.S.: Learning temporal regularity in video sequences. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 733–742 (2016)
Google Scholar
Kelathodi Kumaran, S., Dogra, D., Roy, P.: Anomaly detection in road traffic using visual surveillance: a survey (2019)
Google Scholar
Lea, C., Flynn, M.D., Vidal, R., Reiter, A., Hager, G.D.: Temporal convolutional networks for action segmentation and detection. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1003–1012 (2017)
Google Scholar
Lea, C., Reiter, A., Vidal, R., Hager, G.D.: Segmental spatiotemporal CNNs for fine-grained action segmentation. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9907, pp. 36–52. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46487-9_3
Chapter Google Scholar
Lezama, J., Alahari, K., Sivic, J., Laptev, I.: Track to the future: spatio-temporal video segmentation with long-range motion cues. In: CVPR, pp. 3369–3376 (2011)
Google Scholar
Li, W., Mahadevan, V., Vasconcelos, N.: Anomaly detection and localization in crowded scenes. IEEE Trans. Pattern Anal. Mach. Intell. 36(1), 18–32 (2014)
Article Google Scholar
Lu, C., Shi, J., Jia, J.: Abnormal event detection at 150 FPS in MATLAB. In: 2013 IEEE International Conference on Computer Vision, pp. 2720–2727, December 2013
Google Scholar
Lu, J., Xu, R., Corso, J.J.: Human action segmentation with hierarchical supervoxel consistency. In: CVPR, pp. 3762–3771 (2015)
Google Scholar
Mahadevan, V., Li, W., Bhalodia, V., Vasconcelos, N.: Anomaly detection in crowded scenes. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1975–1981 (2010)
Google Scholar
Mehran, R., Oyama, A., Shah, M.: Abnormal crowd behavior detection using social force model. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 935–942 (2009)
Google Scholar
Mohammadi, S., Perina, A., Kiani, H., Murino, V.: Angry crowds: detecting violent events in videos. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9911, pp. 3–18. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46478-7_1
Chapter Google Scholar
Schölkopf, B., Williamson, R., Smola, A., Shawe-Taylor, J., Platt, J.: Support vector method for novelty detection. In: Proceedings of the 12th International Conference on Neural Information Processing Systems, NIPS 1999, pp. 582–588 (1999)
Google Scholar
Simakov, D., Caspi, Y., Shechtman, E., Irani, M.: Summarizing visual data using bidirectional similarity. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Google Scholar
Soomro, K., Roshan Zamir, A., Shah, M.: UCF101: a dataset of 101 human actions classes from videos in the wild. CoRR (2012)
Google Scholar
Sultani, W., Chen, C., Shah, M.: Real-world anomaly detection in surveillance videos. In: Conference on Computer Vision and Pattern Recognition (2018)
Google Scholar
Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 4489–4497 (2015)
Google Scholar
Wang, S., Wang, D., Li, C., Li, Y., Ding, G.: Clustering by fast search and find of density peaks with data field. Chin. J. Electron. 25(3), 397–402 (2016)
Article Google Scholar
Wu, Y., Ye, Y., Zhao, C.: Coherent motion detection with collective density clustering. In: Proceedings of the 23rd ACM International Conference on Multimedia, MM 2015, New York, NY, USA, pp. 361–370 (2015)
Google Scholar
Xu, D., Ricci, E., Yan, Y., Song, J., Sebe, N.: Learning deep representations of appearance and motion for anomalous event detection. In: Computer Vision and Image Understanding (2015)
Google Scholar
Zhang, H., Kankanhalli, A., Smoliar, S.W.: Automatic partitioning of full-motion video. Multimedia Syst. 1(1), 10–28 (1993)
Article Google Scholar
Zhang, S., Zhu, Y., Roy-Chowdhury, A.K.: Context-aware surveillance video summarization. IEEE Trans. Image Process. 25(11), 5469–5478 (2016)
Article MathSciNet Google Scholar
Zhou, B., Tang, X., Zhang, H., Wang, X.: Measuring crowd collectiveness. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1586–1599 (2014)
Article Google Scholar
Zhu, Y., Nayak, M., Roy-Chowdhury, K.: Context-aware activity recognition and anomaly detection in video. IEEE J. Sel. Top. Signal Process. 7(1), 91–101 (2013)
Article Google Scholar

Download references

Acknowledgement

This work was performed in part through the financial assistance award, Multitiered Video Analytics for Abnormality Detection and Alerting to Improve Response Time for First Responder Communications and Operations (Grant No. 60NANB17D178), from U.S. Department of Commerce, National Institute of Standards and Technology.

Author information

Authors and Affiliations

University of Houston, 4800 Calhoun Road, Houston, TX, 77204, USA
Fatima Zohra Daha & Shishir K. Shah

Authors

Fatima Zohra Daha
View author publications
You can also search for this author in PubMed Google Scholar
Shishir K. Shah
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shishir K. Shah .

Editor information

Editors and Affiliations

University of Malaya, Kuala Lumpur, Malaysia
Shivakumara Palaiahnakote
Consiglio Nazionale delle Ricerche, ICAR, Naples, Italy
Gabriella Sanniti di Baja
Chinese Academy of Sciences, Beijing, China
Liang Wang
Auckland University of Technology, Auckland, New Zealand
Wei Qi Yan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Daha, F.Z., Shah, S.K. (2020). Learning Motion Regularity for Temporal Video Segmentation and Anomaly Detection. In: Palaiahnakote, S., Sanniti di Baja, G., Wang, L., Yan, W. (eds) Pattern Recognition. ACPR 2019. Lecture Notes in Computer Science(), vol 12046. Springer, Cham. https://doi.org/10.1007/978-3-030-41404-7_9

Download citation

DOI: https://doi.org/10.1007/978-3-030-41404-7_9
Published: 23 February 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-41403-0
Online ISBN: 978-3-030-41404-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics