Abstract
Automated detection of semantic concepts in multimedia documents has been attracting intensive research efforts over the last years. These efforts can be generally classified in two categories of methodologies: the ones that attempt to solve the problem using discriminative methods (classifiers) and those that build knowledge-based models, as driven by the W3C consortium. This paper proposes a methodology that tries to combine both approaches for multimedia retrieval. Our main contribution is the adoption of a formal model for defining concepts using logic and the incorporation of the output of concept classifiers to the computation of annotation scores. Our method does not require the computationally intensive training of new classifiers for the concepts defined. Instead, it employs a knowledge-based mechanism to combine the output score of existing classifiers and can be used for either detecting new concepts or enhancing the accuracy of existing detectors. Optimization procedures are employed to adapt the concept definitions to the multimedia corpus in hand, further improving the attained accuracy. Experiments using the TRECVID2005 video collection demonstrate promising results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Smeaton, A.F., Over, P., Kraaij, W.: Evaluation campaigns and trecvid. In: MIR 2006: Proceedings of the 8th ACM international workshop on Multimedia information retrieval, pp. 321–330. ACM, New York (2006)
Snoek, C.G.M., Worring, M., van Gemert, J.C., Geusebroek, J.-M., Smeulders, A.W.M.: The challenge problem for automated detection of 101 semantic concepts in multimedia. In: MULTIMEDIA 2006: Proceedings of the 14th annual ACM international conference on Multimedia, pp. 421–430. ACM, New York (2006)
Yanagawa, A., Chang, S.-F., Kennedy, L., Hsu, W.: Columbia university’s baseline detectors for 374 lscom semantic visual concepts. Technical report, Columbia University ADVENT Technical Report #222-2006-8 (March 2007)
Hauptmann, A., Yan, R., Lin, W.-H., Christel, M., Wactlar, H.: Filling the semantic gap in video retrieval: An exploration. Semantic Multimedia and Ontologies, 253–278 (2008)
Christel, M., Hauptmann, A.: The use and utility of high-level semantic features in video retrieval. Image and Video Retrieval, 134–144 (2005)
Volkmer, T., Natsev, A.: Exploring automatic query refinement for text-based video retrieval, July 2006, pp. 765–768 (2006)
Neo, S.-Y., Zhao, J., Kan, M.-Y., Chua, T.-S.: Video retrieval using high level features: Exploiting query matching and confidence-based weighting, pp. 143–152 (2006)
Baader, F., Calvanese, D., McGuinness, D.L., Nardi, D., Patel-Schneider, P.F. (eds.): The Description Logic Handbook: Theory, Implementation, and Applications. Cambridge University Press, Cambridge (2003)
Straccia, U.: Reasoning within fuzzy description logics. Journal of Artificial Intelligence Research (April 14, 2001)
Stoilos, G., Stamou, G., Tzouvaras, V., Pan, J.Z., Horrocks, I.: A fuzzy description logic for multimedia knowledge representation. In: Proc. of the International Workshop on Multimedia and the Semantic Web (2005)
Athanasiadis, T., Simou, N., Papadopoulos, G., Benmokhtar, R., Chandramouli, K., Tzouvaras, V., Mezaris, V., Phiniketos, M., Avrithis, Y., Kompatsiaris, Y., Huet, B., Izquierdo, E.: Integrating image segmentation and classification for fuzzy knowledge-based multimedia indexing. In: Huet, B., Smeaton, A., Mayer-Patel, K., Avrithis, Y. (eds.) MMM 2009. LNCS, vol. 5371, pp. 263–274. Springer, Heidelberg (2009)
Mosteller, F.: A k-sample slippage test for an extreme population. The Annals of Mathematical Statistics 19(1), 58–65 (1948)
Freund, Y., Iyer, R., Schapire, R.E., Singer, Y.: An efficient boosting algorithm for combining preferences. J. Mach. Learn. Res. 4, 933–969 (2003)
Naphade, M., Smith, J.R., Tesic, J., Chang, S.-F., Hsu, W., Kennedy, L., Hauptmann, A., Curtis, J.: Large-scale concept ontology for multimedia. IEEE MultiMedia 13(3), 86–91 (2006)
Klir, G.J., Yuan, B.: Fuzzy Sets and Fuzzy Logic; Theory and Applications. Prentice-Hall, Englewood Cliffs (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Falelakis, M., Karydas, L., Delopoulos, A. (2009). Knowledge-Based Concept Score Fusion for Multimedia Retrieval. In: Liu, J., Wu, J., Yao, Y., Nishida, T. (eds) Active Media Technology. AMT 2009. Lecture Notes in Computer Science, vol 5820. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04875-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-04875-3_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04874-6
Online ISBN: 978-3-642-04875-3
eBook Packages: Computer ScienceComputer Science (R0)