Abstract
HD video content represents a tremendous quantity of information that all types of devices can not easily handle. Hence the scalability issues in its processing have become a focus of interest in HD video coding technologies. In this chapter, we focus on the natural scalability of hierarchical transforms to tackle video indexing and retrieval. In the first part of the chapter, we give an overview of the transforms used and then present the methods which aim at exploring the transform coefficients to extract meaningful features from video and embed metadata in the scalable code-stream. Statistical global object-based descriptor incorporating low frequency and high-frequency features is proposed. In the second part of the chapter, we introduce a video retrieval technique based on a multiscale description of the video content. Both spatial and temporal scalable descriptors are proposed on the basis of multi-scale patches. A statistical dissimilarity between videos is derived using Kullback-Leibler divergences to compare patch descriptors.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
JPEG 2000 image coding system: Motion JPEG 2000, ISO/IEC. 15444-3:2007. Information technology edn.
Morand, C., Benois-Pineau, J., Domenger, J.P., Mansencal, B.: Object-based indexing of compressed video content: From sd to hd video. In: International Conference on Image Analysis and Processing Workshops, Modena, Italy (2007)
Adami, N., Boschetti, A., Leonardi, R., Migliorati, P.: Embedded indexing in scalable video coding. In: 7th International Workshop on Content-Based Multimedia Indexing, Chania, Crete (2009)
Morand, C., Benois-Pineau, J., Domenger, J.P.: HD motion estimation in a wavelet pyramid in JPEG 2000 context. In: 5th International Conference on Image Processing, Chicago, IL, USA (2008)
JPEG 2000 image coding system: Core coding system, ISO/IEC 15444-1:2004. Information technology edn.
Digital Cinema Initiative, http://www.dcimovies.com/ (accessed November 9, 2009)
Pearson, G., Gill, M.: An evaluation of Motion JPEG 2000 for video archiving. In: Archiving, Washington, D.C., USA (2005)
Coding of audio-visual objects – Part 2: Visual (MPEG4), ISO/IEC 14496-2:2004. Information technology edn.
Wang, Y., Hannuksela, M., Gabbouj, M.: Error-robust inter/intra mode selection using isolated regions. In: Packet Video (PV), Nantes, France (2003)
Totozafiny, T., Patrouix, O., Luthon, F., Coutellier, J.M.: Dynamic background segmentation for remote reference image updating within motion detection JPEG 2000. In: IEEE International Symposium on Industrial Electronics (ISIE), Montreal, Canada (2006)
Manerba, F., Benois-Pineau, J., Leonardi, R.: Extraction of foreground objects from MPEG2 video stream in rough indexing framework. In: SPIE Storage and Retrieval Methods and Applications for Multimedia, San Jose, CA, USA (2004)
Mallat, S.: A theory for multiresolution signal decomposition: The wavelet representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 11(7), 674–693 (1989)
Ćalić, J., Mrak, M., Kondoz, A.: Dynamic layout of visual summaries for scalable video. In: 6th International Workshop on Content-Based Multimedia Indexing, London, UK (2008)
Chevalier, F., Domenger, J.P., Benois-Pineau, J., Delest, M.: Retrieval of objects in video by similarity based on graph matching. Pattern Recognition Letters 28(8), 939–949 (2007)
TREC Video Retrieval Evaluation, http://www-nlpir.nist.gov/projects/trecvid/ (accessed November 9, 2009)
Dufaux, F., Ansorge, M., Ebrahimi, T.: Overview of JPSearch. A standard for image search and retrieval. In: 5th International Workshop on Content-Based Multimedia Indexing, Bordeaux, France (2007)
Saraceno, C., Leonardi, R.: Indexing audio-visual databases through a joint audio and video processing. International Journal of Imaging Systems and Technology 9(5), 320–331 (1998)
Benois-Pineau, J., Dupuy, W., Barba, D.: Recovering visual scenarios in movies by motion analysis and grouping of spatio-temporal signatures of shots. In: 2nd International Conference in Fuzzy Logic and Technology (EUSFLAT), Leicester, UK (2001)
Nilsback, M.E., Zisserman, A.: A visual vocabulary for flower classification. In: IEEE Comp. Soc. Conf. on Computer Vision and Pattern Recognition (CVPR), New York, NY, USA (2006)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Tuytelaars, T., Lampert, C.H., Blaschko, M.B., Buntine, W.: Unsupervised object discovery: a comparison. International Journal on Computer Vision (2009) (Online First version)
Multimedia content description interface – Part 3: Visual (MPEG7), ISO/IEC 15938-3:2002. Information technology edn.
Benois-Pineau, J., Morier, F., Barba, D., Sanson, H.: Hierarchical segmentation of video sequences for content manipulation and adaptive coding. Signal Processing 66(2), 181–201 (1998)
Salembier, P., Marqués, F., Pardàs, M., Morros, J., Corset, I., Jeannin, S., Bouchard, L., Meyer, F., Marcotegui, B.: Segmentation-based video coding system allowing the manipulation of objects. IEEE transactions on circuits and systems for video technology 7(1), 60–74 (1997)
Jehan-Besson, S., Barlaud, M., Aubert, G.: A 3-step algorithm using region-based active contours for video objects detection. EURASIP Journal on Applied Signal Processing 2002(6), 572–581 (2002)
Morand, C., Benois-Pineau, J., Domenger, J.P., Zepeda, J., Kijak, E., Guillemot, C.: Scalable object-based video retrieval in HD video databases. Submitted to Signal Processing: Image Communication (2009)
Liu, Y., Ngi Ngan, K.: Fast multiresolution motion estimation algorithms for wavelet-based scalable video coding. Signal Processing: Image Communication 22, 448–465 (2007)
Liu, Y., Ngan, K.N.: Fast multiresolution motion estimation algorithms for wavelet-based scalable video coding. Signal Processing: Image Communication 22(5), 448–465 (2007)
Sturges, H.A.: The choice of a class interval. Journal of the American Statistical Association 21(153), 65–66 (1926)
Piro, P., Anthoine, S., Debreuve, E., Barlaud, M.: Image retrieval via kullback-leibler divergence of patches of multiscale coefficients in the knn framework. In: 6th International Workshop on Content-Based Multimedia Indexing, London, UK (2008)
Piro, P., Anthoine, S., Debreuve, E., Barlaud, M.: Sparse Multiscale Patches for Image Processing. In: Nielsen, F. (ed.) Emerging Trends in Visual Computing. LNCS, vol. 5416, pp. 284–304. Springer, Heidelberg (2009)
Fukunaga, K., Hostetler, L.: The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Transactions on Information Theory 21(1), 32–40 (1975)
Fukunaga, K.: Introduction to statistical pattern recognition, 2nd edn. Academic Press Professional, Inc., San Diego (1990)
Terrell, G.R., Scott, D.W.: Variable kernel density estimation. The Annals of Statistics 20(3), 1236–1265 (1992)
Kozachenko, L.F., Leonenko, N.: On statistical estimation of entropy of random vector. Problems of Information Transmission 23(2), 95–101 (1987)
Goria, M., Leonenko, N., Mergel, V., Novi Inverardi, P.: A new class of random vector entropy estimators and its applications in testing statistical hypotheses. Journal of Nonparametric Statistics 17(3), 277–298 (2005)
Leonenko, N., Pronzato, L., Savani, V.: A class of Rényi information estimators for multidimensional densities. Annals of Statistics 36(5), 2153–2182 (2008)
Hero, A.O., Ma, B., Michel, O., Gorman, J.: Alpha-divergence for classification, indexing and retrieval. Tech. Rep. CSPL-328, University of Michigan (2001)
Do, M., Vetterli, M.: Wavelet based texture retrieval using generalized Gaussian density and Kullback-Leibler distance. IEEE Transactions on Image Processing 11(2), 146–158 (2002)
Ahmad, I., Lin, P.E.: A nonparametric estimation of the entropy for absolutely continuous distributions. IEEE Transactions on Information Theory 22(3), 372–375 (1976)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Benois-Pineau, J. et al. (2010). Scalable Indexing of HD Video. In: Mrak, M., Grgic, M., Kunt, M. (eds) High-Quality Visual Experience. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12802-8_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-12802-8_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12801-1
Online ISBN: 978-3-642-12802-8
eBook Packages: EngineeringEngineering (R0)