Abstract
Detection of events and actions in video entails substantial processing of very large, even open-ended, video streams. Video data presents a unique challenge for the information retrieval community because it is hard to find a way to properly represent video events. We propose a novel approach to analyze temporal aspects of video data. We consider the video data as a sequence of images that form a 3-dimensional spatiotemporal structure, and multiview orthographic projection is performed to transform the video data into 2-dimensional representations. The projected views allow a unique way to represent video events, and we apply template matching using color moments to detect video events.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
2008 trecvid event detection evaluation plan (2008), http://www.itl.nist.gov/iad/mig/tests/trecvid/2008/doc/EventDet08-EvalPlan-v06.htm
Aggarwal, J.K., Cai, Q.: Human motion analysis: a review. In: Proceedings of the IEEE workshop on Nonrigid and Articulated Motion (1997)
Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. In: Proceedings of the IEEE International Conference on Computer Vision, vol. 2 (2005)
Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(3) (2001)
Campbell, M., Haubold, A., Ebadollahi, S., Joshi, D., Naphade, M.R., Natsev, A., Seidl, J., Smith, J.R., Scheinberg, K., Tesic, J., Xie, L.: Ibm research trecvid-2006 video retrieval system. In: Proceedings of the TRECVID 2006 (2006)
Carlbom, I., Paciorek, J.: Planar geometric projections and viewing transformations. ACM Computing Surveys (1978)
Efros, A.A., Berg, A.C., Mori, G., Malik, J.: Recognizing action at a distance. In: Proceedings of the IEEE International Conference on Computer Vision (2003)
Evans, J.: The future of video indexing in the BBC. In: NIST TRECVID Workshop (2003)
Hongeng, S., Nevatia, R., Bremond, F.: Video-based event recognition: activity representation and probabilistic recognition methods. Computer Vision and Image Understanding 96(2) (2004)
Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: Proceedings of the IEEE International Conference on Computer Vision, vol. 1 (2005)
Laptev, I., Lindeberg, T.: Space-time interest points. In: Proceedings of the IEEE International Conference on Computer Vision (2003)
Moeslund, T.B., Hilton, A., Kruger, V.: A survey of advances in vision-based human motion capture and analysis. Computer Vision and Image Understanding 104(2-3) (2006)
Niebles, J., Wang, H., Fei-Fei, L.: Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision (2008)
Niyogi, S.A., Adelson, E.H.: Analyzing gait with spatiotemporal surfaces. In: Preceedings of the IEEE Workshop on Motion of Non-Rigid and Articulated Objects (1994)
Ren, W., Singh, S., Singh, M., Zhu, Y.S.: State-of-the-art on spatio-temporal information-based video retrieval. Pattern Recognition 42(2) (2009)
Ricquebourg, Y., Bouthemy, P.: Rela-time tracking of moving persons by exploiting spatio-temporal image slices. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8) (2000)
Robertson, N., Reid, I.: A general method for human activity recognition in video. Computer Vision and Image Understanding 104(2-3) (2006)
Shechtman, E., Irani, M.: Space-time behavior based correlation or how to tell if two underlying motion fields are similar without computing them? IEEE Transactions on Pattern Analysis and Machine Intelligence 29(11) (2007)
Stricker, M.A., Orengo, M.: Similarity of color images. Storage and Retrieval for Image and Video Databases (SPIE), 381–392 (1995)
Swain, M.J., Ballard, D.H.: Color indexing. Internation Journal of Computer Vision 7(1), 11–32 (1991)
Turaga, P., Chellappa, R., Subrahmanian, V.S., Udrea, O.: Machine recognition of human activities: a survey. IEEE Transactions on Circuits and Systems for Video Technology 18(11) (2008)
Yamato, J., Ohya, J., Ishii, K.: Recognizing human action in time-sequential images using hidden markov model. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (1992)
Yilmaz, A., Shah, M.: A differential geometric approach to representing the human actions. Computer Vision and Image Understanding 109(3) (2008)
Zelnik-Manor, L., Irani, M.: Event-based analysis of video. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2001)
Zhao, T., Nevatia, R.: Tracking multiple humans in complex situations. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(9) (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Park, DJ., Eichmann, D. (2010). Video Event Detection as Matching of Spatiotemporal Projection. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2010. Lecture Notes in Computer Science, vol 6455. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17277-9_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-17277-9_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17276-2
Online ISBN: 978-3-642-17277-9
eBook Packages: Computer ScienceComputer Science (R0)