Abstract
This chapter describes a method for automatic classification of video shots from a video database by using distance metrics derived from motion information only. The classification serves as the first step of the indexing process of a video scene and its retrieval from a large database in order to partition the database into more manageable sub-units according to the types of scenes, e.g. sport, drama, scenery, news reading. The method is intended for web-based and telecommunication applications and therefore the processing is carried out in the MPEG (compressed) domain making use of the spatio-temporal data already available in MPEG video files. The confidence of the MPEG motion vectors estimated by the block matching algorithm is evaluated using a block activity factor, for retaining or discarding the vectors from the classification distance measure by a filtering process of the MPEG motion vector fields. The chapter presents a robust regression technique, based on Least Median-of-Squares, to deal with the situation. A novel metrics called activity power flow is introduced to effectively capture the spatiotemporal evolution of scenes through the video sequence.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
W.A.C. Fernando et al., Scene change detection algorithms for content-based video indexing and retrieval, IEE Electronics and Communication Engineering Journal, 117–125 (June 2001).
D.T. Nguyen and W. Gillespie, A Video Retrieval System Based on Compressed Data from MPEG Files, Proceedings of IEEE TENCON 2003, Bangalore, India (October 2003)
MPEG-1, ISO/IEC 11172-2, ‘Information Technology — Coding of moving pictures and associated audio for digital storage media up to about 1.5 Mbit/s’, Part 2: Video, (1993).
W. Gillespie and D.T. Nguyen, Filtering of MPEG Motion Vector Fields for use in Motion-Based Video Indexing and Retrieval, Proceedings 7th International Symposium on Digital Signal Processing for Communication Systems, Gold Coast, Australia, 8–11 (December 2003).
J.M. Odobez and P. Bouthemy, Robust Multiresolution Estimation of Parametric Motion Models, Journal of Visual Communication and Image Representation, 6(4), 348–365 (December 1995).
K. Jinzenji, S. Ishibashi, and H. Kotera, Algorithm for automatically producing layered sprites by detecting camera movement, Proceedings IEEE International Conference on Image Processing ICIP 1997, 767–770 (November 1997)
P. J. Rousseeuw and A.M. Leroy, Robust Regression and Outlier Detection, (John Wiley, 1987).
P. Meer, D. Mintz, and A. Rosenfeld, “Robust Regression Methods for Computer Vision: A Review”, Int. Journ. of Computer Vision, 6(1), 59–70 (1991).
W. J. Gillespie and D.T. Nguyen, Classification of Video Shots Using Activity Power Flow, IEEE Consumer Communications and Networking Conference CCNC 2004, Las Vegas, USA, (Jannuary 2004).
T. A. Hoang, Wavelet-Based Techniques for Classification of Power Quality Disturbances, PhD Thesis, School of Engineering, University of Tasmania, Oct 2002
Sascha Spengenberg, The RBF Network Receiver, http://www.ee.ed.ac.uk/~ssp/project/html/, site last viewed Oct 2003.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science + Business Media, Inc.
About this chapter
Cite this chapter
Gillespie, W., Nguyen, T. (2005). Classification of Video Sequences in MPEG Domain. In: Wysocki, T.A., Honary, B., Wysocki, B.J. (eds) Signal Processing for Telecommunications and Multimedia. Multimedia Systems and Applications Series, vol 27. Springer, Boston, MA. https://doi.org/10.1007/0-387-22928-0_6
Download citation
DOI: https://doi.org/10.1007/0-387-22928-0_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-22847-1
Online ISBN: 978-0-387-22928-7
eBook Packages: EngineeringEngineering (R0)