Abstract
Existing content-based image retrieval paradigms almost never address the problem of starting the search, when the user has no starting example image but rather a mental image. We propose a new image retrieval system to allow the user to perform mental image search by formulating boolean composition of region categories. The query interface is a region photometric thesaurus which can be viewed as a visual summary of salient regions available in the database. It is generated from the unsupervised clustering of regions with similar visual content into categories. In this thesaurus, the user simply selects the types of regions which should and should not be present in the mental image (boolean composition). The natural use of inverted tables on the region category labels enables powerful boolean search and very fast retrieval in large image databases. The process of query and search of images relates to that of documents with Google. The indexing scheme is fully unsupervised and the query mode requires minimal user interaction (no example image to provide, no sketch to draw). We demonstrate the feasibility of such a framework to reach the user mental target image with two applications: a photo-agency scenario on Corel Photostock and a TV news scenario. Perspectives will be proposed for this simple and innovative framework, which should motivate further development in various research areas.
Similar content being viewed by others
Notes
Google: http://www.google.com
The idea of the “mental document” may be more or less precise in the user's mind: it may be an already seen document or more generally a document related to a particular topic.
References
Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. Addison-Wesley
Bezdek JC (1981) Pattern recognition with fuzzy objective functions. Plenum, New York NY
Boujemaa N, Fauqueur J, Ferecatu M, Fleuret F, Gouet V, Le Saux B, Sahbi H (2001) Ikona: Interactive generic and specific image retrieval. International workshop on Multimedia Content-Based Indexing and Retrieval (MMCBIR), Rocquencourt, France, pp 25–28
Carson C, et al (1999) Blobworld: A system for region-based image indexing and retrieval. Proc. of International Conference on Visual Information System, LNCS 1614:509–517
Cox IJ, Miller ML, Minka TP (2000) The bayesian image retrieval system, pichunter: Theory, implementation and psychological experiments. IEEE Trans Image Process 9(1):20–37
DelBimbo A, Pala P (1997, February) Visual image retrieval by elastic matching of user sketches. IEEE Trans Pattern Anal Mach Intell 19(2):121–132
DelBimbo A, Vicario E (1998, June) Using weighted spatial relationships in retrieval by visual contents. IEEE workshop on Image and Video Libraries
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. JR Stat Soc 39:1–38
Deng Y, Manjunath BS (1999, March) An efficient low-dimensional color indexing scheme for region based image retrieval. Proc. IEEE Int Conf Acoust Speech Signal Proc (ICASSP), Phoenix, Arizona
Egenhofer MJ (1997) Query processing in spatial query by sketch. J Vis Lang Comput (JVLC) 8(4):403–424
Fauqueur J (2003) Contributions to image retrieval by their visual components. PhD Thesis, UVSQ - INRIA, (in French).
Fauqueur J, Boujemaa N, (2003) Logical query composition from local visual feature thesaurus. International Workshop on Content-Based Multimedia Indexing (CBMI), Rennes, France
Fauqueur J, Boujemaa N (2004) Region-based image retrieval: Fast coarse segmentation and fine color description. J Vis Lang Comput (JVLC), special issue on Visual Information Systems 15(1):69–95
Flickner M, et al (1995) Query by image and video content: The qbic system. IEEE Computer 28(9):23–32
Frigui H, Krishnapuram R (1997) Clustering by competitive agglomeration. Pattern Recognition 30(7): 1109–1119
Fung CY, Loe KJ (1999) Learning primitive and scene semantics for image for classification and retrieval. ACM Multimedia
Gouet V, Boujemaa N (2001) Object-based queries using color points of interest. IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL)
Gupta A, et al (1996) The virage image search engine: an open framework for image management. SPIE Storage and Retrieval for Image and Video Databases, 2670
Harper DJ, Jose JM, Furner J (1998) Spatial querying for image retrieval: A user-oriented evaluation. International ACM SIGIR conference, pp 232–240
Hiroike A, Musha Y, Sugimoto A, Mori Y (1999) Visualization of information spaces to retrieve and browse image data. International Conference on Visual Information System (VIS)
Huang T, Mehrotra S, Ramchandran K (1996) Multimedia analysis and retrieval system (mars) project. Proc. of the 33rd Annual Clinic on Library Application of Data Processing—Digital Image Access and Retrieval
Huang T, Rui Y, Mehrotra S (1997) Content-based image retrieval with relevance feedback in mars. IEEE Int Conf Image Proc (ICIP)
Kohonen T (1997) Self-organizing maps. Springer Berlin Heidelberg New York
La Cascia M, Sethi S, Sclaroff S (1998, June ) Combining textual and visual cues for content-based image retrieval on the world wide web. IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL)
Laaksonen J, Oja E, Koskela M, Brandt S (2000) Analyzing low-level visual features using content-based image retrieval. International Conference on Neural Information Processing (ICONIP). Taejon, Korea
LeSaux B, Boujemaa N (2002) Unsupervised robust clustering for image database categorization. IAPR Int Conf Pattern Recognit (ICPR)
Lim JH (1999) Learnable visual keywords for image classification. ACM conference on Digital libraries, pp 139–145
Linde Y, Buzo A, Gray RM (1980) An algorithm for vector quantizer design. IEEE Trans Commun COM-28:84–95
Ma WY, Manjunath BS (1998) A texture thesaurus for browsing large aerial photographs. Journal of the American Society of Information Science 49(7):633–648
Ma WY, Manjunath BS (1999) Netra: A toolbox for navigating large image databases. Multimedia Syst 7(3):184–198
MacDonald S, Tait (2003) Search strategies in content-based image retrieval. International ACM SIGIR conference
MacQueen J (1967) Some methods for classification and analysis of multivariate observations. Proc. of the Fifth Berkeley Symp on Math Stat and Prob 1:281–296
Malki J, Boujemaa N, Nastar C, Winter A (1999) Region queries without segmentation for image retrieval by content. In Proc. of International Conference on Visual Information System (VIS), pp 115–122
Manjunath BS, Salembier P, Sikora T (2002) Introduction to MPEG-7: Multimedia Content Description Interface. Wiley, ISBN: 0-471-48678-7
Meiers T, Sikora T, Keller I (2002) Hierarchical image database browsing environment with embedded relevance feedback. IEEE Int Conf Image Proc (ICIP)
Meilhac C, Nastar C (1999) Relevance feedback and category search in image databases. IEEE International Conference on Multimedia Computing and Systems
Moghaddam B, Biermann H, Margaritis D (1999) Defining image content with multiple regions of interest. IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL)
Nastar C, Mitschke M, Meilhac C, Boujemaa N (1998) Surfimage: A flexible content-based image retrieval system. ACM Multimedia Conference Proceedings, Bristol, UK
Niblack W, Barber R, Equitz W, Flickner M, et al (1993) The QBic project: Querying images by content using color, texture, and shape. Proc. SPIE (Storage and Retrieval for Image and Video Databases) 1908:173–187
Pentland A, Picard R, Sclaroff S (1994, February) Photobook: content-based manipulation of image databases. SPIE Storage and Retrieval for Image and Video Databases, II(2185)
Picard RW (1995) Toward a visual thesaurus. MIT Technical Report TR358
Rissanen J (1978) Modeling by shortest data description. Automatica
Rodden K, Basalaj W, Sinclair D, Wood K (2001) Does organisation by similarity assist image browsing? International ACM SIGCHI conference, pp 190–197
Rubner Y (1999) Perceptual metrics for image database navigation. PhD Thesis, Stanford University
Sclaroff S, Taycher L, La Cascia M (1997, June) Imagerover: A content-based image browser for the world wide web. IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL)
Sivic J, Zisserman A (2003) Video google: A text retrieval approach to object matching in videos. Proceedings International Conference on Computer Vision (ICCV), pp 1470–1477
Smeulders A, Worring M, Santini S, Gupta A, Jain R (2000) Content based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell (PAMI) 22(12):1349–1380
Smith JR, Chang SF (1996) Tools and techniques for color image retrieval. IST/SPIE Proceedings, pp 426–437
Smith JR, Chang SF (1996) Visualseek: A fully automated content-based image query system. ACM Multimedia Conference Boston, MA, USA, pp 87–98
Squire D, Muller W, Muller H, Raki J (1999) Content-based query of image databases, inspirations from text retrieval: inverted files, frequency-based weights and relevance feedback. 11th Scandinavian Conference on Image Analysis (SCIA) Kangerlussuaq, Greenland
Swain M, Ballard D (1991) Color indexing. Int J Comput Vis (IJCV) 7(1):11–32
Town C, Sinclair D (2001) Content based image retrieval using semantic visual categories. ATT Technical Report
Wang JZ, Du Y (2001) Rf*ipf: A weighting scheme for multimedia information retrieval. IEEE International Conference on Image Analysis and Processing (ICIAP)
Witten IH, Moffat A, Bell TC (1994) Managing gigabytes: compressing and indexing documents and images. Van Nostrand Reinhold, 115 Fifth Avenue, New York, NY 10003, USA
Zhang HJ, Jing F, Li M, Zhang B (2002) An effective region-based image retrieval framework. Proceeding of ACM Multimedia, pp 456–465
Acknowledgments
We would like to thank Aicha El Golli for generating the Kohonen maps of the visual thesaurus. We would also like to thank TF1 Channel for providing TV news videos.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Fauqueur, J., Boujemaa, N. Mental image search by boolean composition of region categories. Multimed Tools Appl 31, 95–117 (2006). https://doi.org/10.1007/s11042-006-0033-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-006-0033-3