Skip to main content

Scalable Indexing of HD Video

  • Chapter
  • 1227 Accesses

Part of the book series: Signals and Communication Technology ((SCT))

Abstract

HD video content represents a tremendous quantity of information that all types of devices can not easily handle. Hence the scalability issues in its processing have become a focus of interest in HD video coding technologies. In this chapter, we focus on the natural scalability of hierarchical transforms to tackle video indexing and retrieval. In the first part of the chapter, we give an overview of the transforms used and then present the methods which aim at exploring the transform coefficients to extract meaningful features from video and embed metadata in the scalable code-stream. Statistical global object-based descriptor incorporating low frequency and high-frequency features is proposed. In the second part of the chapter, we introduce a video retrieval technique based on a multiscale description of the video content. Both spatial and temporal scalable descriptors are proposed on the basis of multi-scale patches. A statistical dissimilarity between videos is derived using Kullback-Leibler divergences to compare patch descriptors.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   179.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   229.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   279.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. JPEG 2000 image coding system: Motion JPEG 2000, ISO/IEC. 15444-3:2007. Information technology edn.

    Google Scholar 

  2. Morand, C., Benois-Pineau, J., Domenger, J.P., Mansencal, B.: Object-based indexing of compressed video content: From sd to hd video. In: International Conference on Image Analysis and Processing Workshops, Modena, Italy (2007)

    Google Scholar 

  3. Adami, N., Boschetti, A., Leonardi, R., Migliorati, P.: Embedded indexing in scalable video coding. In: 7th International Workshop on Content-Based Multimedia Indexing, Chania, Crete (2009)

    Google Scholar 

  4. Morand, C., Benois-Pineau, J., Domenger, J.P.: HD motion estimation in a wavelet pyramid in JPEG 2000 context. In: 5th International Conference on Image Processing, Chicago, IL, USA (2008)

    Google Scholar 

  5. JPEG 2000 image coding system: Core coding system, ISO/IEC 15444-1:2004. Information technology edn.

    Google Scholar 

  6. Digital Cinema Initiative, http://www.dcimovies.com/ (accessed November 9, 2009)

  7. Pearson, G., Gill, M.: An evaluation of Motion JPEG 2000 for video archiving. In: Archiving, Washington, D.C., USA (2005)

    Google Scholar 

  8. Coding of audio-visual objects – Part 2: Visual (MPEG4), ISO/IEC 14496-2:2004. Information technology edn.

    Google Scholar 

  9. Wang, Y., Hannuksela, M., Gabbouj, M.: Error-robust inter/intra mode selection using isolated regions. In: Packet Video (PV), Nantes, France (2003)

    Google Scholar 

  10. Totozafiny, T., Patrouix, O., Luthon, F., Coutellier, J.M.: Dynamic background segmentation for remote reference image updating within motion detection JPEG 2000. In: IEEE International Symposium on Industrial Electronics (ISIE), Montreal, Canada (2006)

    Google Scholar 

  11. Manerba, F., Benois-Pineau, J., Leonardi, R.: Extraction of foreground objects from MPEG2 video stream in rough indexing framework. In: SPIE Storage and Retrieval Methods and Applications for Multimedia, San Jose, CA, USA (2004)

    Google Scholar 

  12. Mallat, S.: A theory for multiresolution signal decomposition: The wavelet representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 11(7), 674–693 (1989)

    Article  MATH  Google Scholar 

  13. Ćalić, J., Mrak, M., Kondoz, A.: Dynamic layout of visual summaries for scalable video. In: 6th International Workshop on Content-Based Multimedia Indexing, London, UK (2008)

    Google Scholar 

  14. Chevalier, F., Domenger, J.P., Benois-Pineau, J., Delest, M.: Retrieval of objects in video by similarity based on graph matching. Pattern Recognition Letters 28(8), 939–949 (2007)

    Article  Google Scholar 

  15. TREC Video Retrieval Evaluation, http://www-nlpir.nist.gov/projects/trecvid/ (accessed November 9, 2009)

  16. Dufaux, F., Ansorge, M., Ebrahimi, T.: Overview of JPSearch. A standard for image search and retrieval. In: 5th International Workshop on Content-Based Multimedia Indexing, Bordeaux, France (2007)

    Google Scholar 

  17. Saraceno, C., Leonardi, R.: Indexing audio-visual databases through a joint audio and video processing. International Journal of Imaging Systems and Technology 9(5), 320–331 (1998)

    Article  Google Scholar 

  18. Benois-Pineau, J., Dupuy, W., Barba, D.: Recovering visual scenarios in movies by motion analysis and grouping of spatio-temporal signatures of shots. In: 2nd International Conference in Fuzzy Logic and Technology (EUSFLAT), Leicester, UK (2001)

    Google Scholar 

  19. Nilsback, M.E., Zisserman, A.: A visual vocabulary for flower classification. In: IEEE Comp. Soc. Conf. on Computer Vision and Pattern Recognition (CVPR), New York, NY, USA (2006)

    Google Scholar 

  20. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  21. Tuytelaars, T., Lampert, C.H., Blaschko, M.B., Buntine, W.: Unsupervised object discovery: a comparison. International Journal on Computer Vision (2009) (Online First version)

    Google Scholar 

  22. Multimedia content description interface – Part 3: Visual (MPEG7), ISO/IEC 15938-3:2002. Information technology edn.

    Google Scholar 

  23. Benois-Pineau, J., Morier, F., Barba, D., Sanson, H.: Hierarchical segmentation of video sequences for content manipulation and adaptive coding. Signal Processing 66(2), 181–201 (1998)

    Article  MATH  Google Scholar 

  24. Salembier, P., Marqués, F., Pardàs, M., Morros, J., Corset, I., Jeannin, S., Bouchard, L., Meyer, F., Marcotegui, B.: Segmentation-based video coding system allowing the manipulation of objects. IEEE transactions on circuits and systems for video technology 7(1), 60–74 (1997)

    Article  Google Scholar 

  25. Jehan-Besson, S., Barlaud, M., Aubert, G.: A 3-step algorithm using region-based active contours for video objects detection. EURASIP Journal on Applied Signal Processing 2002(6), 572–581 (2002)

    Article  Google Scholar 

  26. Morand, C., Benois-Pineau, J., Domenger, J.P., Zepeda, J., Kijak, E., Guillemot, C.: Scalable object-based video retrieval in HD video databases. Submitted to Signal Processing: Image Communication (2009)

    Google Scholar 

  27. Liu, Y., Ngi Ngan, K.: Fast multiresolution motion estimation algorithms for wavelet-based scalable video coding. Signal Processing: Image Communication 22, 448–465 (2007)

    Article  Google Scholar 

  28. Liu, Y., Ngan, K.N.: Fast multiresolution motion estimation algorithms for wavelet-based scalable video coding. Signal Processing: Image Communication 22(5), 448–465 (2007)

    Article  Google Scholar 

  29. Sturges, H.A.: The choice of a class interval. Journal of the American Statistical Association 21(153), 65–66 (1926)

    Google Scholar 

  30. Piro, P., Anthoine, S., Debreuve, E., Barlaud, M.: Image retrieval via kullback-leibler divergence of patches of multiscale coefficients in the knn framework. In: 6th International Workshop on Content-Based Multimedia Indexing, London, UK (2008)

    Google Scholar 

  31. Piro, P., Anthoine, S., Debreuve, E., Barlaud, M.: Sparse Multiscale Patches for Image Processing. In: Nielsen, F. (ed.) Emerging Trends in Visual Computing. LNCS, vol. 5416, pp. 284–304. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  32. Fukunaga, K., Hostetler, L.: The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Transactions on Information Theory 21(1), 32–40 (1975)

    Article  MATH  MathSciNet  Google Scholar 

  33. Fukunaga, K.: Introduction to statistical pattern recognition, 2nd edn. Academic Press Professional, Inc., San Diego (1990)

    MATH  Google Scholar 

  34. Terrell, G.R., Scott, D.W.: Variable kernel density estimation. The Annals of Statistics 20(3), 1236–1265 (1992)

    Article  MATH  MathSciNet  Google Scholar 

  35. Kozachenko, L.F., Leonenko, N.: On statistical estimation of entropy of random vector. Problems of Information Transmission 23(2), 95–101 (1987)

    MATH  MathSciNet  Google Scholar 

  36. Goria, M., Leonenko, N., Mergel, V., Novi Inverardi, P.: A new class of random vector entropy estimators and its applications in testing statistical hypotheses. Journal of Nonparametric Statistics 17(3), 277–298 (2005)

    Article  MATH  MathSciNet  Google Scholar 

  37. Leonenko, N., Pronzato, L., Savani, V.: A class of Rényi information estimators for multidimensional densities. Annals of Statistics 36(5), 2153–2182 (2008)

    Article  MATH  MathSciNet  Google Scholar 

  38. Hero, A.O., Ma, B., Michel, O., Gorman, J.: Alpha-divergence for classification, indexing and retrieval. Tech. Rep. CSPL-328, University of Michigan (2001)

    Google Scholar 

  39. Do, M., Vetterli, M.: Wavelet based texture retrieval using generalized Gaussian density and Kullback-Leibler distance. IEEE Transactions on Image Processing 11(2), 146–158 (2002)

    Article  MathSciNet  Google Scholar 

  40. Ahmad, I., Lin, P.E.: A nonparametric estimation of the entropy for absolutely continuous distributions. IEEE Transactions on Information Theory 22(3), 372–375 (1976)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Benois-Pineau, J. et al. (2010). Scalable Indexing of HD Video. In: Mrak, M., Grgic, M., Kunt, M. (eds) High-Quality Visual Experience. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12802-8_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12802-8_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12801-1

  • Online ISBN: 978-3-642-12802-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics