Abstract
This paper presents a novel and efficient similarity measure between video segments. We consider local spatio-temporal descriptors. They are considered to be realizations of an unknown, but class-specific distribution. The similarity of two video segments is calculated by evaluating an appropriate statistical criterion issued from a rank test. It does not require any matching of the local features between the two considered video segments, and can deal with a different number of computed local features in the two segments. Furthermore, our measure is self-normalized which allows for simple cue integration, and even on-line adapted class-dependent combination of the different descriptors. Satisfactory results have been obtained on real video sequences for two motion event recognition problems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brunelli, R., Mich, O., Modena, C.M.: A survey on the automatic indexing of video data. Journal of Visual Communication and Image Representation 10(2), 78–112 (1999)
DeMenthon, D., Doerman, D.: Video retrieval using spatio-temporal descriptors. In: ACM Multimedia’03, Berkeley, Nov. 2003, ACM Press, New York (2003)
Dimitrova, N., et al.: Applications of video-content analysis and retrieval. IEEE Multimedia 9(3), 42–55 (2002)
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)
Haidar, S., Joly, P., Chebaro, B.: Style similarity measure for video documents comparison. In: Leow, W.-K., et al. (eds.) CIVR 2005. LNCS, vol. 3568, Springer, Heidelberg (2005)
Hájek, J., Šidák, Z.: Theory of rank tests. Academic Press, New York (1967)
Kokaram, A., et al.: Browsing sports video. IEEE Signal Processing Magazine 23(2), 47–58 (2006)
Laptev, I., Lindeberg, T.: Local descriptors for spatio-temporal recognition. In: MacLean, W.J. (ed.) SCVMA 2004. LNCS, vol. 3667, Springer, Heidelberg (2006)
Ma, Y.-F., Zhang, H.-J.: Motion pattern-based video classification and retrieval. EURASIP Journal on Applied Signal Processing 2, 199–208 (2003)
Moënne-Loccoz, N., Bruno, E., Marchand-Maillet, S.: Video content representation as salient regions of activity. In: Enser, P.G.B., et al. (eds.) CIVR 2004. LNCS, vol. 3115, Springer, Heidelberg (2004)
Piriou, G., Bouthemy, P., Yao, J.-F.: Extraction of semantic dynamic content from videos with probabilistic motion models. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, Springer, Heidelberg (2004)
Schapire, R.E.: Using output codes to boost multiclass learning problems. In: ICML ’97, Proc. of the Int. Conf. on Machine Learning (1997)
Viola, P., Jones, M.J., Snow, D.: Detecting pedestrians using patterns of motion and appearance. Int. Journal of Computer Vision 63(2), 153–161 (2005)
Zelnik-Manor, L., Irani, M.: Event-based video analysis. In: Proc. Int. Conf. on Computer Vision and Pattern Recognition, vol. 2, Kauai, Hawaii, December 2001, pp. 123–130 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Lehmann, A., Bouthemy, P., Yao, JF. (2007). Rank-Test Similarity Measure Between Video Segments for Local Descriptors. In: Marchand-Maillet, S., Bruno, E., Nürnberger, A., Detyniecki, M. (eds) Adaptive Multimedia Retrieval: User, Context, and Feedback. AMR 2006. Lecture Notes in Computer Science, vol 4398. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71545-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-71545-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71544-3
Online ISBN: 978-3-540-71545-0
eBook Packages: Computer ScienceComputer Science (R0)