Skip to main content

Video Event Detection as Matching of Spatiotemporal Projection

  • Conference paper
Advances in Visual Computing (ISVC 2010)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6455))

Included in the following conference series:

  • 1600 Accesses

Abstract

Detection of events and actions in video entails substantial processing of very large, even open-ended, video streams. Video data presents a unique challenge for the information retrieval community because it is hard to find a way to properly represent video events. We propose a novel approach to analyze temporal aspects of video data. We consider the video data as a sequence of images that form a 3-dimensional spatiotemporal structure, and multiview orthographic projection is performed to transform the video data into 2-dimensional representations. The projected views allow a unique way to represent video events, and we apply template matching using color moments to detect video events.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 2008 trecvid event detection evaluation plan (2008), http://www.itl.nist.gov/iad/mig/tests/trecvid/2008/doc/EventDet08-EvalPlan-v06.htm

  2. Aggarwal, J.K., Cai, Q.: Human motion analysis: a review. In: Proceedings of the IEEE workshop on Nonrigid and Articulated Motion (1997)

    Google Scholar 

  3. Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. In: Proceedings of the IEEE International Conference on Computer Vision, vol. 2 (2005)

    Google Scholar 

  4. Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(3) (2001)

    Google Scholar 

  5. Campbell, M., Haubold, A., Ebadollahi, S., Joshi, D., Naphade, M.R., Natsev, A., Seidl, J., Smith, J.R., Scheinberg, K., Tesic, J., Xie, L.: Ibm research trecvid-2006 video retrieval system. In: Proceedings of the TRECVID 2006 (2006)

    Google Scholar 

  6. Carlbom, I., Paciorek, J.: Planar geometric projections and viewing transformations. ACM Computing Surveys (1978)

    Google Scholar 

  7. Efros, A.A., Berg, A.C., Mori, G., Malik, J.: Recognizing action at a distance. In: Proceedings of the IEEE International Conference on Computer Vision (2003)

    Google Scholar 

  8. Evans, J.: The future of video indexing in the BBC. In: NIST TRECVID Workshop (2003)

    Google Scholar 

  9. Hongeng, S., Nevatia, R., Bremond, F.: Video-based event recognition: activity representation and probabilistic recognition methods. Computer Vision and Image Understanding 96(2) (2004)

    Google Scholar 

  10. Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: Proceedings of the IEEE International Conference on Computer Vision, vol. 1 (2005)

    Google Scholar 

  11. Laptev, I., Lindeberg, T.: Space-time interest points. In: Proceedings of the IEEE International Conference on Computer Vision (2003)

    Google Scholar 

  12. Moeslund, T.B., Hilton, A., Kruger, V.: A survey of advances in vision-based human motion capture and analysis. Computer Vision and Image Understanding 104(2-3) (2006)

    Google Scholar 

  13. Niebles, J., Wang, H., Fei-Fei, L.: Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision (2008)

    Google Scholar 

  14. Niyogi, S.A., Adelson, E.H.: Analyzing gait with spatiotemporal surfaces. In: Preceedings of the IEEE Workshop on Motion of Non-Rigid and Articulated Objects (1994)

    Google Scholar 

  15. Ren, W., Singh, S., Singh, M., Zhu, Y.S.: State-of-the-art on spatio-temporal information-based video retrieval. Pattern Recognition 42(2) (2009)

    Google Scholar 

  16. Ricquebourg, Y., Bouthemy, P.: Rela-time tracking of moving persons by exploiting spatio-temporal image slices. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8) (2000)

    Google Scholar 

  17. Robertson, N., Reid, I.: A general method for human activity recognition in video. Computer Vision and Image Understanding 104(2-3) (2006)

    Google Scholar 

  18. Shechtman, E., Irani, M.: Space-time behavior based correlation or how to tell if two underlying motion fields are similar without computing them? IEEE Transactions on Pattern Analysis and Machine Intelligence 29(11) (2007)

    Google Scholar 

  19. Stricker, M.A., Orengo, M.: Similarity of color images. Storage and Retrieval for Image and Video Databases (SPIE), 381–392 (1995)

    Google Scholar 

  20. Swain, M.J., Ballard, D.H.: Color indexing. Internation Journal of Computer Vision 7(1), 11–32 (1991)

    Article  Google Scholar 

  21. Turaga, P., Chellappa, R., Subrahmanian, V.S., Udrea, O.: Machine recognition of human activities: a survey. IEEE Transactions on Circuits and Systems for Video Technology 18(11) (2008)

    Google Scholar 

  22. Yamato, J., Ohya, J., Ishii, K.: Recognizing human action in time-sequential images using hidden markov model. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (1992)

    Google Scholar 

  23. Yilmaz, A., Shah, M.: A differential geometric approach to representing the human actions. Computer Vision and Image Understanding 109(3) (2008)

    Google Scholar 

  24. Zelnik-Manor, L., Irani, M.: Event-based analysis of video. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2001)

    Google Scholar 

  25. Zhao, T., Nevatia, R.: Tracking multiple humans in complex situations. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(9) (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Park, DJ., Eichmann, D. (2010). Video Event Detection as Matching of Spatiotemporal Projection. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2010. Lecture Notes in Computer Science, vol 6455. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17277-9_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17277-9_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17276-2

  • Online ISBN: 978-3-642-17277-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics