Abstract
In this paper, we address two main problems encountered in content-based image retrieval, namely the lack of image semantics that can be captured by extracting and indexing visual image features and the difficulty originating from the subjectivity and context dependency of user queries. This work proposes a new method for semantic browsing and retrieval of images by finding semantic coherence between words and image segments on three layers. The method is based on the matching of visual segment clusters with words on various levels of abstraction and is very promising for effective browsing and retrieval in large image databases. It supports various textual and/or visual query modes as well as both target- and category-type browsing and retrieval. Experiment conducted on a large set of natural images proved that step-by-step semantic inference on consecutive layers of image – word association helps to improve accuracy of retrieval and browsing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhou, X.S., Huang, T.S.: Unifying Keywords and Visual Contents in Image Retrieval. IEEE Multimedia 9(2), 23–33 (2002)
Zhao, R., et al.: Negotiating the semantic gap: from feature maps to semantic landscapes. Pattern Recognition 35, 593–600 (2002)
Barnard, K., et al.: Matching Words and Pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)
Wang, J.Z., Li, J.: Learning-based linguistic indexing of pictures with 2-D MHMMs. In: Proc. ACM Multimedia, pp. 436–445 (2002)
Lim, J.-H., Tian, Q., Mulhem, P.: Home Photo Content for Personalized Event-Based Retrieval. IEEE MultiMedia 9(2), 28–37 (2003)
Hofmann, T.: Learning and Representing Topic, A Hierarchical Mixture Model for Word Occurrences in Document Databases. In: Proc. of the Conference for Automated Learning and Discovery (CONALD), Pittsburgh (1998)
Kutics, A., et al.: An object-based image retrieval system using an inhomogeneous diffusion model. In: Proc. of the ICIP 1999, vol. II, pp. 590–594 (1999)
Kutics, A., et al.: Image retrieval via the inhomogeneous diffusion of luminance and texture features. Journal of Electronic Imaging 9(2), 159–169 (2000)
Mojsilovic, A.: A method for color naming and description of color composition in images. In: Proc. of the ICIP 2002 (2002)
Bhusnan, N., et al.: The texture lexicon: Understanding the categorization of visual texture terms and their relationship to texture images. Cognitive Science 21(2), 219–246 (1997)
Hendee, W.R., Wells, P.N.T.: The Perception of Visual Information. Springer, Heidelberg (1997)
Fellbaum, C., et al.: WordNet: An Electronic Lexical Database, May 15. MIT Press, Cambridge (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kutics, A., Nakagawa, A. (2004). Semantic Browsing and Retrieval in Image Libraries. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2004. Lecture Notes in Computer Science, vol 3211. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30125-7_91
Download citation
DOI: https://doi.org/10.1007/978-3-540-30125-7_91
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23223-0
Online ISBN: 978-3-540-30125-7
eBook Packages: Springer Book Archive