Skip to main content
Log in

Combining spatial and temporal patches for scalable video indexing

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This paper tackles the problem of scalable video indexing. We propose a new framework combining spatial and motion patch descriptors. The spatial descriptors are based on a multiscale description of the image and are called Sparse Multiscale Patches. We propose motion patch descriptors based on block motion that describe the motion in a Group of Pictures. The distributions of these sets of patches are compared combining weighted Kullback-Leibler divergences between spatial and motion patches. These divergences are estimated in a non-parametric framework using a k-th Nearest Neighbor estimator. We evaluate this weighted dissimilarity measure on selected videos from the ICOS-HD ANR project. Experiments show that the spatial part of the measure is relevant to detect different sequences, while its motion part allows to detect clips within a sequence. Experiments combining the spatial and temporal parts of the dissimilarity measure show its robustness to resampling and compression; thus exhibiting the spatial scalability of the method on heterogeneous networks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. ICOS-HD (Scalable Joint Indexing and Compression for High-Definition Video Content) is a research project funded by ANR (French Research Agency).

  2. Original HD sequences ©Warner Bros issued from the Dolby 4-4-4 Film Content Kit One.

References

  1. Boltz S, Debreuve E, Barlaud M (2007) High-dimensional kullback-leibler distance for region-of-interest tracking: application to combining a soft geometric constraint with radiometry. In: CVPR. Minneapolis, USA

  2. Hero AO, Ma B, Michel O, Gorman J (2001) Alpha-divergence for classification, indexing and retrieval. Technical report CSPL-328, University of Michigan

  3. Laptev I, Pérez P (2007) Retrieving actions in movies. In: Proc. int. conf. comp. vis. (ICCV’07). Rio de Janeiro, Brazil, pp 1–8

  4. Loftsgaarden D, Quesenberry C (1965) A nonparametric estimate of a multivariate density function. AMS 36:1049–1051

    MATH  MathSciNet  Google Scholar 

  5. Mezaris V, Kompatsiaris I, Strintzis MG (2006) Object-based mpeg-2 video indexing and retrieval in a collaborative environment. Multimed Tools Appl 30:255–272

    Article  Google Scholar 

  6. Morand C, Benois-Pineau J, Domenger J-Ph, Mansencal B (2007) Object-based indexing of compressed video content: from sd to hd video. In: IEEE VMDL/ICIAP. Modena, Italy

  7. Piro P, Anthoine S, Debreuve E, Barlaud M (2008) Image retrieval via kullback-leibler divergence of patches of multiscale coefficients in the knn framework. In: CBMI. London, UK

  8. Piro P, Anthoine S, Debreuve E, Barlaud M (2009) Sparse multiscale patches for image processing. In: ETVC. LNCS, vol 5416/2009. Springer, New York

    Google Scholar 

  9. Rothganger F, Lazebnik S, Schmid C, Ponce J (2007) Segmenting, modeling, and matching video clips containing multiple moving objects. IEEE Trans Pattern Anal Mach Intell 29(3):477–491

    Article  Google Scholar 

  10. Schwarz H, Marpe D, Wiegand T (2007) Overview of the scalable video coding extension of the h.264/avc standard. IEEE Trans Circuits Syst Video Technol 17(9):1103–1120

    Article  Google Scholar 

  11. Terrell DW, Scott GR (1992) Variable kernel density estimation. Ann Stat 20(3):1236–1265

    Article  MATH  MathSciNet  Google Scholar 

  12. Zhai Y, Liu J, Cao X, Basharat A, Hakeem A, Ali S, Shah M (2005) Video understanding and content-based retrieval. In: TRECVID05

  13. Zong D, Chang SF (1999) An integrated approach for content-based video object segmentation and retrieval. IEEE Trans Circuits Syst Video Technol 9:1259–1268

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to acknowledge the contribution of W. Belhajali in the experiments conducted here.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sandrine Anthoine.

Additional information

This work is supported by the French ANR grant “ICOS-HD”.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Piro, P., Anthoine, S., Debreuve, E. et al. Combining spatial and temporal patches for scalable video indexing. Multimed Tools Appl 48, 89–104 (2010). https://doi.org/10.1007/s11042-009-0350-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-009-0350-4

Keywords

Navigation