Abstract
The ability to learn from user interaction is an important asset for content-based image retrieval (CBIR) systems. Over short times scales, it enables the integration of information from successive queries assuring faster convergence to the desired target images. Over long time scales (retrieval sessions) it allows the retrieval system to tailor itself to the preferences of particular users. We address the issue of learning by formulating retrieval as a problem of Bayesian inference. The new formulation is shown to have various advantages over previous approaches: it leads to the minimization of the probability of retrieval error, enables region-based queries without prior image segmentation, and suggests elegant procedures for combining multiple user specifications. As a consequence of all this, it enables the design of short and long-term learning mechanisms that are simple, intuitive, and extremely efficient in terms of computational and storage requirements. We introduce two such algorithms and present experimental evidence illustrating the clear advantages of learning for CBIR.
Chapter PDF
References
B. Bhanu, J. Peng, and S. Qing. Learning Feature Relevance and Similarity Metrics in Image Databases. In Workshop in Content-based Access to Image and Video Libraries, pages 14–18, 1998, Santa Barbara, California.
I. Cox, M. Miller, S. Omohundro, and P. Yianilos. PicHunter: Bayesian Relevance Feedback for Image Retrieval. In Int. Conf. on Pattern Recognition, Vienna, Austria, 1996.
L. Devroye, L. Gyorfi, and G. Lugosi. A Probabilistic Theory of Pattern Recognition. Springer-Verlag, 1996.
T. Minka and R. Picard. Interactive learning using a “society of models”. Pattern Recognition, 30:565–582, 1997.
J. Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, 1988.
Yong Rui, T. Huang, M. Ortega, and S. Mehrotra. Relevance Feedback: A Power Tool for Interactive Content-Based Image Retrieval. IEEE Trans. on Circuits and Systems for Video Technology, 8(5):644–655, September 1998.
L. Taycher, M. Cascia, and S. Sclaroff. Image Digestion and Relevance Feedback in the Image Rover WWW Search Engine. In Visual 1997, San Diego, California.
N. Vasconcelos and A. Lippman. A Probabilistic Architecture for Content-based Image Retrieval. In Proc. IEEE Computer Vision and Pattern Recognition Conf., Hilton Head, North Carolina, 2000.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vasconcelos, N., Lippman, A. (2000). Learning over Multiple Temporal Scales in Image Databases. In: Computer Vision - ECCV 2000. ECCV 2000. Lecture Notes in Computer Science, vol 1842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45054-8_3
Download citation
DOI: https://doi.org/10.1007/3-540-45054-8_3
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67685-0
Online ISBN: 978-3-540-45054-2
eBook Packages: Springer Book Archive