Abstract
We propose a principled approach to video summarization using optimal reconstruction as a metric to guide the creation of the summary output. The spatio-temporal video patches included in the summary are viewed as observations about the local motion of the original input video and are chosen to minimize the reconstruction error of the missing observations under a set of learned predictive models. The method is demonstrated using fixed-viewpoint video sequences and shown to generalize to multiple camera systems with disjoint views, which can share activity already summarized in one view to inform the summary of another. The results show that this approach can significantly reduce or even eliminate the inclusion of patches in the summary that contain activities from the video that are already expected based on other summary patches, leading to a more concise output.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Wang, X., Tieu, K., Grimson, W.: Correspondence-free activity analysis and scene modeling in multiple camera views. PAMI (2009)
Wang, X., Ma, K., Ng, G., Grimson, W.: Trajectory analysis and semantic region modeling using a nonparametric bayesian model. In: CVPR (2008)
Piciarelli, C., Micheloni, C., Foresti, G.L.: Trajectory-based anomalous event detection. IEEE Trans. Circuits Systems Vid. Tech. 18, 1544–1554 (2008)
Breitenstein, M., Grabner, H., Gool, L.V.: Hunting nessie – real-time abnormality detection from webcams. In: ICCV WS on VS (2009)
Pritch, Y., Ratovitch, S., Hendel, A., Peleg, S.: Clustered synopsis of surveillance video. In: AVSS (2009)
Adam, A., Rivlin, E., Shimshoni, I., Reinitz, D.: Robust real-time unusual event detection using multiple fixed-location monitors. PAMI (2008)
Zhong, H., Shi, J., Visontai, M.: Detecting unusual activity in video. In: CVPR (2004)
Zhu, X., Wu, X., Fan, J., Elmagarmid, A., Aref, W.: Exploring video content structure for hierarchical summarization. Multimedia Systems 10, 98–115 (2004)
Chen, B., Sen, P.: Video carving. Eurographics (2008)
Simakov, D., Caspi, Y., Irani, M., Shechtman, E.: Summarizing visual data using bidirectional similarity. In: CVPR (2008)
Loy, C., Xiang, T., Gong, S.: Multi-camera activity correlation analysis. In: CVPR, pp. 1988–1995 (2009)
Zelnik-Manor, L., Perona, P.: Self-tuning spectral clustering. In: NIPS (2004)
Akaike, H.: A new look at the statistical model identification. IEEE Trans. Automatic Control 19, 716–723 (1974)
Loy, C., Xiang, T., Gong, S.: Modelling activity global temporal dependencies using time delayed probabilistic graphical model. In: ICCV (2009)
Eshelman, L.: The chc adaptive search algorithm. Foundations of Genetic Algorithms, 256–283 (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
De Leo, C., Manjunath, B.S. (2011). Multicamera Video Summarization from Optimal Reconstruction. In: Koch, R., Huang, F. (eds) Computer Vision – ACCV 2010 Workshops. ACCV 2010. Lecture Notes in Computer Science, vol 6468. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22822-3_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-22822-3_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22821-6
Online ISBN: 978-3-642-22822-3
eBook Packages: Computer ScienceComputer Science (R0)