Abstract
We introduce in this chapter some fundamental theories for content-based image retrieval. Section 1.1 looks at the development of content-based image retrieval techniques. Then, as the emphasis of this chapter, we introduce in detail in Section 1.2 some widely used methods for visual content descriptions. After that, we briefly address similarity/distance measures between visual features, the indexing schemes, query formation, relevance feedback, and system performance evaluation in Sections 1.3, 1.4 and 1.5. Details of these techniques are discussed in subsequent chapters. Finally, we draw a conclusion in Section 1.6.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
K. Arbter, W. E. Snyder, H. Burkhardt, and G. Hirzinger, “Application of affine-invariant Fourier descriptors to recognition of 3D objects,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 12, pp. 640–647, 1990.
E. M. Arkin, L.P. Chew, D..P. Huttenlocher, K. Kedem, and J.S.B. Mitchell, “An efficiently computable metric for comparing polygonal shapes,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 3, pp. 209–226, 1991.
J. Assfalg, A. D. Bimbo, and P. Pala, “Using multiple examples for content-based retrieval,” Proc. Intl Conf. Multimedia and Expo, 2000.
J. R. Bach, C. Fuller, A. Gupta, A. Hampapur, B. Horowitz, R. Humphrey, R. Jain, and C. F. Shu, “The virage image search engine: An open framework for image management,” In Proc. SPIC’ Storage and Retrieval for Image and Video Database,Feb. 1996.
N. Beckmann, et al, “The R*-tree: An efficient robust access method for points and rectangles,” ACM SIGMOD Int. Conf on Management of Data, Atlantic City, May 1990.
A. Blaser, Database Techniques for Pictorial Applications, Lecture Notes in Computer Science, Vol.81, Springer Verlag GmbH, 1979.
P. Brodatz, “Textures: A photographic album for artists designers,” Dover, NY, 1966.
H. Burkhardt, and S. Siggelkow, “Invariant features for discriminating between equivalence classes,” Nonlinear Model-based Image Video Processing and Analysis, John Wiley and Sons, 2000.
C. Carson, M. Thomas, S. Belongie, J. M. Hellerstein, and J. Malik, “Blobworld: A system for region-based image indexing and retrieval,” In D. P. Huijsmans and A. W. M. Smeulders, ed. Visual Information and Information System, Proceedings of the Third International Conference VISUAL ‘89, Amsterdam, The Netherlands, June 1999, Lecture Notes in Computer Science 1614. Springer, 1999.
J.A. Catalan, and J.S. Jin, “Dimension reduction of texture features for image retrieval using hybrid associative neural networks,” IEEE International Conference on Multimedia and Expo, Vol. 2, pp. 1211–1214, 2000.
A. E. Cawkill, “The British Library’s Picture Research Projects: Image, Word, and Retrieval,” Advanced Imaging, Vol.8, No. 10, pp. 38–40, October 1993.
N. S. Chang, and K. S. Fu, “A relational database system for images,” Technical Report TR-EE 79–82, Purdue University, May 1979.
N. S. Chang, and K. S. Fu, “Query by pictorial example,” IEEE Trans. on Software Engineering, Vol. 6, No. 6, pp. 519–524, Nov.1980.
S. K. Chang, and A. Hsu, “Image information systems: where do we go from here’ Taos osr Knowledge and Data Engineering, Vol. 5, No. 5, pp. 431–442, Oct.1992.
S. K. Chang, E. Jungert, and Y. Li, “Representation and retrieval of symbolic pictures using generalized 2D string”, Technical Report, University of Pittsburgh, 1988.
S. K. Chang, and T. L. Kunii, “Pictorial database systems,” IEEE Computer Magazine, Vol. 14, No. 11, pp. 13–21, Nov.1981.
S. K. Chang, Q. Y. Shi, and C. Y. Yan, “Iconic indexing by 2-D strings,” IEEE Trans. on Pattern Anal. Machine Intell., Vol. 9, No. 3, pp. 413–428, May 1987.
S. K. Chang, C. W. Yan, D. C. Dimitroff, and T. Arndt, “An intelligent image database system,” IEEE Trans. on Software Engineering, Vol. 14, No. 5, pp. 681–688, May 1988.
T. Chang, and C.C.J. Kuo, “Texture analysis and classification with tree-structured wavelet transform,” IEEE Trans. on Image Processing, vol. 2, no. 4, pp. 429–441, October 1993.
I. J. Cox, M. L. Miller, T. P. Minka, T. Papathomas, and P. N. Yianilos, “The Bayesian image retrieval system, PicHunter: Theory, implementation, and psychophysical experiments,” IEEE Trans. on Image Processing, Vol. 9, No. 1, pp. 20–37, Jan. 2000.
I. Daubechies, “The wavelet transform, time-frequency localization and signal analysis,” IEEE Trans. on Information Theory, Vol. 36, pp. 961–1005, Sept. 1990.
J. G. Daugman, “Complete discrete 2D Gabor transforms by neural networks for image analysis and compression,” IEEE Trans. ASSP, vol. 36, pp. 1169–1179, July 1998.
J. Dowe, “Content-based retrieval in multimedia imaging,” In Proc. SPIE Storage and Retrieval for Image and Video Database,1993.
C. Faloutsos et al, “Efficient and effective querying by image content,” Journal of intelligent information systems, Vol. 3, pp. 231–262, 1994.
G D. Finlayson, “Color in perspective,” IEEE Trans on Pattern Analysis and Machine Intelligence, Vol. 8, No. 10, pp. 1034–1038, Oct. 1996.
M. Flickner, H. Sawhney, W. Niblack, J. Ashley, Q. Huang, B. Dom, M. Gorkani, J. Hafner, D. Lee, D. Petkovic, D. Steele, and P. Yanker, “Query by image and video content: The QBIC system.” IEEE Computer, Vol. 28, No. 9, pp. 23–32, Sept. 1995.
J. D. Foley, A. van Dam, S. K. Feiner, and J. F. Hughes, Computer graphics: principles and practice, 2“d ed., Reading, Mass, Addison-Wesley, 1990.
J. M. Francos. “Orthogonal decompositions of 2D random fields and their applications in 2D spectral estimation,” N. K. Bose and C. R. Rao, editors, Signal Processing and its Application, pp.20–227. North Holland, 1993.
J. M. Francos, A. A. Meiri, and B. Porat, “A unified texture model based on a 2d Wold like decomposition,” IEEE Trans on Signal Processing, pp. 2665–2678, Aug. 1993.
J. M. Francos, A. Narasimhan, and J. W. Woods, “Maximum likelihood parameter estimation of textures using a Wold-decomposition based model,” IEEE Trans. on Image Processing, pp. 1655–1666, Dec.1995.
B. Furht, S. W. Smoliar, and H.J. Zhang, Video and Image Processing in Multimedia Systems, Kluwer Academic Publishers, 1995.
J. E. Gary, and R. Mehrotra, “Shape similarity-based retrieval in image database systems,” Proc. of SPIE, Image Storage and Retrieval Systems, Vol. 1662, pp. 2–8, 1992.
T. Gevers, and A.W.M.Smeulders, “Pictoseek: Combining color and shape invariant features for image retrieval,” IEEE Trans. on image processing, Vol. 9, No. 1, pp 102–119, 2000.
T. Gevers, and A. W. M. Smeulders, “Content-based image retrieval by viewpoint-invariant image indexing,” Image and Vision Computing, Vol.17, No. 7, pp. 475–488, 1999.
Y. Gong, H. J. Zhang, and T. C. Chua, “An image database system with content capturing and fast image indexing abilities”, Proc. IEEE International Conference on Multimedia Computing and Systems, Boston, pp. 121–130, 14–19 May 1994.
W. I. Grosky, and R. Mehrotra, “Index based object recognition in pictorial data management,” CVGIP, Vol. 52, No. 3, pp. 416–436, 1990.
V. N. Gudivada, and V. V. Raghavan, “Design and evaluation of algorithms for image retrieval by spatial similarity,” ACM Trans. on Information Systems, Vol. 13, No. 2, pp. 115–144, April 1995.
F. Guo, J. Jin, and D. Feng, “Measuring image similarity using the geometrical distribution of image contents”, Proc. of ICSP, pp. 1108–1112, 1998.
A. Gupta, and R. Jain, “Visual information retrieval,” Communication of the ACM, Vol.40, No.5, pp.71–79, May, 1997.
J. Hafner, et al.,“Efficient color histogram indexing for quadratic form distance functions,” IEEE Trans. on Pattern Analysis and Machine Intelligence,Vol. 17, No. 7, pp. 729–736, July 1995.
M. K. Hu, “Visual pattern recognition by moment invariants,” in J. K. Aggarwal, R. O. Duda, and A. Rosenfeld, Computer Methods in Image Analysis, IEEE computer Society, Los Angeles, CA, 1977.
J. Huang, S. R. Kumar, and M. Metra, “Combining supervised learning with color correlograms for content-based image retrieval,” Proc. of ACMMultimedia’95, pp. 325–334, Nov. 1997.
J. Huang, S.R. Kumar, M. Metra, W. J., Zhu, and R. Zabith, “Spatial color indexing and applications,” Intl J. Computer Vision, Vol. 35, No. 3, pp. 245–268, 1999.
J. Huang, et al.,“Image indexing using color correlogram,” IEEE Int. Conf on Computer Vision and Pattern Recognition,pp. 762–768, Puerto Rico, June 1997.
M. Ioka, “A method of defining the similarity of images on the basis of color information,” Technical Report RT-0030, IBM Tokyo Research Laboratory, Tokyo, Japan, Nov. 1989.
H. V. Jagadish, “A retrieval technique for similar shapes,” Proc. of Int. Conf. on Management of Data, SIGMOID’91, Denver, CO, pp. 208–217, May 1991.
A. K. Jain, Fundamental of Digital Image Processing, Englewood Cliffs, Prentice Hall, 1989.
A. K. Jain, and F. Farroknia, “Unsupervised texture segmentation using Gabor filters,” Pattern Recognition, Vo. 24, No. 12, pp. 1167–1186, 1991.
R. Jain, Proc. US NSF Workshop Visual Information Management Systems, 1992.
R. Jain, A. Pentland, and D. Petkovic, Workshop Report: NSF-ARPA Workshop on Visual Information Management Systems, Cambridge, Mass, USA, June 1995.
A. Kankanhalli, H. J. Zhang, and C. Y. Low, “Using texture for image retrieval,” Third Int. Conf. on Automation, Robotics and Computer Vision, pp. 935–939, Singapore, Nov. 1994.
H. Kauppinen, T. Seppnäen, and M. Pietikäinen, “An experimental comparison of autoregressive and Fourier-based descriptors in 2D shape classification,” IEEE Trans. Pattern Anal. and Machine Intell., Vol. 17, No. 2, pp. 201–207, 1995.
W. J. Krzanowski, Recent Advances in Descriptive Multivariate Analysis, Chapter 2, Oxford science publications, 1995.
A. Laine, and J. Fan, “Texture classification by wavelet packet signatures,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 15, No. 11, pp. 1186–1191, Nov. 1993.
S. Y. Lee, and F. H. Hsu, “2D C-string: a new spatial knowledge representation for image database systems,” Pattern Recognition, Vol. 23, pp 1077–1087, 1990.
S. Y. Lee, M.C. Yang, and J. W. Chen, “2D B-string: a spatial knowledge representation for image database system,” Proc. ICSC’92 Second Int. computer Sci. Conf, pp. 609–615, 1992.
F. Liu, and R. W. Picard, “Periodicity, directionality, and randomness: Wold features for image modeling and retrieval,” IEEE Trans. on Pattern Analysis and Machine Learning, Vol. 18, No. 7, July 1996.
W. Y. Ma, and B. S. Manjunath, “A comparison of wavelet features for texture annotation,” Proc. of IEEE Int. Conf on Image Processing, Vol. II, pp. 256–259, Washington D.C., Oct. 1995.
W. Y. Ma, and B. S. Manjunath, “Image indexing using a texture dictionary,” Proc. of SPIE Conf. on Image Storage and Archiving System, Vol. 2606, pp. 288–298, Philadelphia, Pennsylvania, Oct. 1995.
W. Y. Ma, and B. S. Manjunath, “Netra: A toolbox for navigating large image databases,” Multimedia Systems, Vol.7, No.3, pp.: 184–198, 1999.
W. Y. Ma, and B. S. Manjunath, “Edge flow: a framework of boundary detection and image segmentation,” IEEE Int. Conf. on Computer Vision and Pattern Recognition, pp. 744–749, Puerto Rico, June 1997.
S. G. Mallat, “A theory for multiresolution signal decomposition: the wavelet representation,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 11, pp. 674–693, July 1989.
B. S. Manjunath, and W. Y. Ma, “Texture features for browsing and retrieval of image data,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, pp. 837–842, Aug. 1996.
J. Mao, and A. K. Jain, “Texture classification and segmentation using multiresolution simultaneous autoregressive models,” Pattern Recognition, Vol. 25, No. 2, pp. 173–188, 1992.
E. Mathias, “Comparing the influence of color spaces and metrics in content-based image retrieval,” Proceedings of International Symposium on Computer Graphics, Image Processing, and Vision, pp. 371–378, 1998.
T. P. Minka, and R. W. Picard, “Interactive learning using a ‘society of models’, ” IEEE Int. Conf. on Computer Vision and Pattern Recognition, pp. 447–452, 1996.
W. Niblack et al., “Querying images by content, using color, texture, and shape,” SPIE Conference on Storage and Retrieval for Image and Video Database, Vol. 1908, pp. 173–187, April 1993.
J. Nievergelt, H. Hinterberger, and K. C. Sevcik, “The grid file: an adaptable symmetric multikey file structure,” ACM Trans. on Database Systems, pp. 38–71, March 1984.
V. E. Ogle, and M. Stonebraker, “Chabot: Retrieval from a relational database of images,” IEEE Computer, Vol. 28, No. 9, pp. 40–48, Sept. 1995.
T. Ojala, M. Pietikainen, and D. Harwood, “A comparative study of texture measures with classification based feature distributions,” Pattern Recognition, Vol. 29, No. 1, pp. 51–59, 1996.
G.Pass, and R. Zabith, “Comparing images using joint histograms,” Multimedia Systems, Vol. 7, pp. 234–240, 1999.
G. Pass, and R. Zabith, “Histogram refinement for content-based image retrieval,” IEEE Workshop on Applications of Computer Vision, pp. 96–102, 1996.
A. Pentland, R.W. Picard and S. Sclaroff, “Photobook: Content-Based Manipulation of Image Databases,” Proc. Storage and Retrieval for Image and Video Databases II, Vol. 2185, San Jose, CA, USA February, 1994.
E. Persoon, and K. Fu, “Shape discrimination using Fourier descriptors,” IEEE Trans. Syst., Man, and Cybern., Vol. 7, pp. 170–179, 1977.
R. W. Picard, T. Kabir, and F. Liu, “Real-time recognition with the entire Brodatz texture database,” Proc. IEEE Int. Conf. on Computer Vision and Pattern Recognition, pp. 638–639, New York, June 1993.
J. T. Robinson, “The k-d-B-tree: a search structure for large multidimensional dynamic indexes,” Proc. of SIGMOD Conference, Ann Arbor, April 1981.
Y. Rui, T. S. Huang, and S. F. Chang, “Image retrieval: current techniques, promising directions and open issues, ” Journal of Visual Communication and Image Representation, Vol. 10, pp. 39–62, 1999.
Y. Rui, T.S.Huang, and S. Mehrotra, “Content-based image retrieval with relevance feedback in MARS,” Proceedings of International Conference on Image Processing, Vol. 2, pp. 815–818, 1997.
Y. Rui, T. S. Huang, M. Ortega, and S. Mehrotra, “Relevance feedback: a power tool for interactive content-based image retrieval,” IEEE Trans. on Circuits and Systems for Video Technology, 1998.
Y. Rui, et al, “A relevance feedback architecture in content-based multimedia information retrieval systems,” Proc of IEEE Workshop on Content-based Access of Image and Video Libraries, 1997.
G. Salton, and M. McGill, Introduction to Modern Information Retrieval. McGraw-Hill, New York, NY, 1983.
H. Samet, “The quadtree and related hierarchical data structures,” ACM Computing Surveys, Vol. 16, No. 2, pp. 187–260, 1984.
H. Samet, The Design and Analysis of Spatial Data Structures, Addison-Wesley, 1989.
S. Sclaroff, and A. Pentland, “Modal matching for correspondence and recognition,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 17, No. 6, pp. 545–561, June 1995.
S. Sclaroff, L. Taycher, and M. L. Cascia, “ImageRover: a content-based image browser for the World Wide Web,” Boston University CS Dept. Technical Report 97–005, 1997.
A. W. M. Smeulders, S. D. Olabariagga, R. van den Boomgaard, and M. Wowing, “Interactive segmentation,” Proc. Visual’97: Information Systems, pp. 5–12, 1997.
A. M. W. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, “Content-based image retrieval at the end of the early years, ” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 22, No. 12, pp. 1349–1380, Dec. 2000.
J. R. Smith, and S. F. Chang, “VisualSEEk: a fully automated content-based image query system,” ACM Multimedia 96, Boston, MA, Nov. 1996.
M. Stricker, and M. Orengo, “Similarity of color images,” SPIE Storage and Retrieval for Image and Video Databases Ill, vol. 2185, pp. 381–392, Feb. 1995.
M. Stricker, and M. Orengo, “Color indexing with weak spatial constraint,” Proc. SPIE Conf On Visual Communications, 1996.
M. J. Swain, and D. H. Ballard, “Color indexing,” International Journal of Computer Vision, Vol. 7, No. 1, pp. 11–32, 1991.
H. Tamura, S. Mori, and T. Yamawaki, “Texture features corresponding to visual perception,” IEEE Trans. On Systems, Man, and Cybernetics, vol. Smc-8, No. 6, June 1978.
H. Tamura, and N.Yokoya, “Image database systems: A survey, ” Pattern Recognition, Vol. 17, No. 1, pp. 29–43, 1984.
D. Tegolo, “Shape analysis for image retrieval,” Proc. of SPIE, Storage and Retrieval for Image and Video Databases -II, no. 2185, San Jose, CA, pp. 59–69, February 1994.
A. Vailaya, M. A. G. Figueiredo, A. K. Jain, and H. J. Zhang, “Image classification for content-based indexing,” IEEE Trans. on Image Processing, Vol. 10, No. 1, Jan. 2001.
N. Vasoncelos, and A. Lippman, “A probabilistic architecture for content-based image retrieval,” Proc. Computer vision and pattern recognition, pp. 216–221, 2000.
R. C. Veltkamp, and M. Hagedoorn, “State-of-the-art in shape matching,” Technical Report UU-CS-1999–27, Utrecht University, Department of Computer Science, Sept. 1999.
J. Vendrig, M. Worring, and A. W. M. Smeulders, “Filter image browsing: exploiting interaction in retrieval,” Proc. Viusl’99: Information and Information System, 1999.
H. Voorhees, and T. Poggio. “Computing texture boundaries from images,” Nature, 333: 364–367, 1988.
H. Wang, F. Guo, D. Feng, and J. Jin, “A signature for content-based image retrieval using a geometrical transform,” Proc. OfACMMM’98, Bristol, UK, 1998.
W. H. Wong, W. C. Siu, and K. M. Lam, “Generation of moment invariants and their uses for character recognition,” Pattern Recognition Letters, Vol. 16, pp. 115–123, Feb. 1995.
L. Yang, and F. Algregtsen, “Fast computation of invariant geometric moments: A new method giving correct results,” Proc. IEEE Int. Conf. on Image Processing, 1994.
H. J. Zhang, and D. Zhong, “A Scheme for visual feature-based image indexing,” Proc. of SPIE conf. on Storage and Retrieval for Image and Video Databases III, pp. 36–46, San Jose, Feb. 1995.
H. J. Zhang, et al, “Image retrieval based on color features: An evaluation study,” SPIE Conf. on Digital Storage and Archival, Pennsylvania, Oct. 25–27, 1995.
MPEG Video Group, Description of core experiments for MPEG-7 color/texture descriptors, ISO/MPEGJTCI/SC29/WG11 MPEG98/M2819, July 1999.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Long, F., Zhang, H., Feng, D.D. (2003). Fundamentals of Content-Based Image Retrieval. In: Feng, D.D., Siu, WC., Zhang, HJ. (eds) Multimedia Information Retrieval and Management. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-05300-3_1
Download citation
DOI: https://doi.org/10.1007/978-3-662-05300-3_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05533-1
Online ISBN: 978-3-662-05300-3
eBook Packages: Springer Book Archive