Multi-level Fusion for Semantic Video Content Indexing and Retrieval

Benmokhtar, Rachid; Huet, Benoit

doi:10.1007/978-3-540-79860-6_13

Rachid Benmokhtar¹⁹ &
Benoit Huet¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4918))

Included in the following conference series:

International Workshop on Adaptive Multimedia Retrieval

428 Accesses
2 Citations
1 Altmetric

Abstract

In this paper, we present the results of our work on the analysis of an automatic semantic video content indexing and retrieval system based on fusing various low level visual descriptors. Global MPEG-7 features extracted from video shots, are described via IVSM signature (Image Vector Space Model) in order to have a compact description of the content. Both static and dynamic feature fusion are introduced to obtain effective signatures. Support Vector Machines (SVMs) are employed to perform classification (One classifier per feature). The task of the classifiers is to detect the video semantic content. Then, classifier outputs are fused using a neural network based on evidence theory (NNET) in order to provide a decision on the content of each shot. The experimental results are conducted in the framework of the TRECVid feature extraction task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Mottaleb, M.A., Krishnamachari, S.: Multimedia descriptions based on MPEG-7: Extraction and applications. Proceeding of IEEE Multimedia 6, 459–468 (2004)
Article Google Scholar
Spyrou, E., Leborgne, H., Mailis, T., Cooke, E., Avrithis, Y., O’Connor, N.: Fusing MPEG-7 visual descriptors for image classification. In: Duch, W., Kacprzyk, J., Oja, E., Zadrożny, S. (eds.) ICANN 2005. LNCS, vol. 3697, pp. 847–852. Springer, Heidelberg (2005)
Google Scholar
Rautiainen, M., Seppanen, T.: Comparison of visual features and fusion techniques in automatic detection of concepts from news video based on gabor filters. In: Proceeding of ICME, pp. 932–935 (2005)
Google Scholar
Souvannavong, F., Merialdo, B., Huet, B.: Latent semantic analysis for an effective region based video shot retrieval system. In: Proceedings of ACM MIR, pp. 243–250 (2004)
Google Scholar
Jolliffe, I.: Principle component analysis. Springer, Heidelberg (1986)
Book Google Scholar
Zhang, W., Shan, S., Gao, W., Chang, Y., Cao, B., Yang, P.: Information fusion in face identification. In: Proceedings of IEEE ICPR, vol. 3, pp. 950–953 (2004)
Google Scholar
Vapnik, V.: The nature of statistical learning theory. Springer, Heidelberg (1995)
Book MATH Google Scholar
Shafer, G.: A mathematical theory of evidence. Princeton University Press, Princeton (1976)
MATH Google Scholar
Benmokhtar, R., Huet, B.: Neural network combining classifier based on Dempster-Shafer theory. In: Cham, T.-J., Cai, J., Dorai, C., Rajan, D., Chua, T.-S., Chia, L.-T. (eds.) MMM 2007. LNCS, vol. 4351, pp. 196–205. Springer, Heidelberg (2006)
Chapter Google Scholar
TrecVid, Digital video retrieval at NIST, http://www-nlpir.nist.gov/projects/trecvid/

Download references

Author information

Authors and Affiliations

Département Communications Multimédias, Institut Eurécom, 2229, route des crêtes, 06904, Sophia-Antipolis, France
Rachid Benmokhtar & Benoit Huet

Authors

Rachid Benmokhtar
View author publications
You can also search for this author in PubMed Google Scholar
Benoit Huet
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

INRIA Rocquencourt, 78153, Le Chesnay, France
Nozha Boujemaa
University Pierrre and Marie Curie, CNRS - LIP6 , 75016, Paris, France
Marcin Detyniecki
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg , 39106, Magdeburg, Germany
Andreas Nürnberger

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Benmokhtar, R., Huet, B. (2008). Multi-level Fusion for Semantic Video Content Indexing and Retrieval. In: Boujemaa, N., Detyniecki, M., Nürnberger, A. (eds) Adaptive Multimedia Retrieval: Retrieval, User, and Semantics. AMR 2007. Lecture Notes in Computer Science, vol 4918. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79860-6_13

Download citation

DOI: https://doi.org/10.1007/978-3-540-79860-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-79859-0
Online ISBN: 978-3-540-79860-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics