Abstract
In this paper, we examine the complexities involved in retrieving images from a database comprised of objects of very similar appearance. Such an operation requires a process that can discriminate among images at a very fine level, such as distinguishing among various species of fish. Furthermore, incidental environmental factors such as change in viewpoints and slight, nonessential shape deformation must be excluded from the similarity criteria. To this end, we propose a new method for content-based image retrieval and indexing, one that is well suited for discriminating among objects within the same class in a way that is insensitive to incidental environmental changes. The scheme comprises a global alignment and a local matching process. Affine transform is used to model the different viewpoints associated with positioning the camera, while multi-dimensional indexing techniques are used to make the global alignment scheme efficient. A local matching process based on dynamic programming allows the optimal matching of local structures using cost metrics that may ignore nonessential local shape deformation. Results show the method's ability to cancel out visual distortions caused by a changing viewpoint, and its tolerance to noise, occlusion, and slight deformations of the object.
Similar content being viewed by others
References
S.L. Adler and R. Krishnan, "Similarity and affine normalization of partially occluded planar curves using first and 2nd derivatives," Pattern Recognition, Vol. 31, No. 10, pp. 1551–1556,1998
R. Alferez and Y.F. Wang, "Geometric and illumination invariants for object recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, June1999
J.R. Bach, C. Fuller, A. Gupta, A. Hampapur, B. Horowitz, R. Jain, and C.F. Shu, "The Virage image search engine: An open framework for image management," in Proc. IS & T/SPIE Symposium on Electronic Imaging, 1996, Vol. 2670, pp.76–87.
I.A. Bachelder and S. Ullman, "Contour matching using local affine transformations," in Computer Vision and Pattern Recognition, pp. 798–801,1992
L. Bielski, "The image database of the future begins," Advanced Imaging, Vol. 10, No. 10, pp. 26–91,1995
E. Binaghi, I. Gagliardi, and R. Schettini, "Indexing and fuzzy logic-based retrieval of colour images," in Proc. IFIP Working Conf. on Visual Database Systems II, Elsevier Science Publishers, 1992, pp. 79–92.
S.K. Bose, K.K. Biswas, and S.K. Gupta, "Model based object recognition-the role of affine invariants," in Artificial Intelligence in Engineering, pp. 227-234,1996
R. Brunelli and O. Mich, in Proceedings of ICME 2000, IEEE International Conference on Multimedia and Expo, New York City, July 30-Aug. 2,2000
A. Califano and R. Mohan, "Multidimensional indexing for recognizing visual shapes," IEEE Trans. on Pattern Analysis and Mach. Intelligence, Vol. 16, No. 4, pp. 373–392,1994
M. La Cascia and E. Ardizzone, "JACOB: Just a content-based query system for video databases," in Pro-ceedings, ICASSP-96, Atlanta, Georgia,1996
S.K. Chang, C.W. Yan, D.C. Dimitroff, and T. Arndt, "An intelligent image database system," IEEE Trans., Vol. SE 14, pp. 681–688,1988
Corel Corporation, Ottawa, Canada.
G. Crane (Ed.), The Perseus Digital Library, Tufts University, Feb.2001
M. Flickner et al., "Query by image and video content: The QBIC system," IEEE Computer, Vol. 28, No. 9, pp. 23–32,1995
R. Froese and D. Pauly (Eds.), FishBase 2000, International Center for Living Aquatic Resources Management (ICLARM), Feb.2001
S. Frydrychowicz, "A new approach to affine transform invariant shape matching," in Visual Form, 1991, pp. 267–274.
Getty Images, Inc., Seattle, Washington.
N.J. Hastings and J.B. Peacock, Statistical Distribution, John Wiley & Sons: Toronto,1975
P.M. Kelly and T.M. Cannon, "Efficiency issues related to probability density function comparison," in Proc. IS & T/SPIE Symposium on Electronic Imaging, 1996, Vol. 2670, pp. 42–49.
Y. Lamdan, J.T. Schwartz, and H.J. Wolfson, "Affine invariant model-based object recognition," IEEE Trans. on Robotics and Automation, Vol. 6, No. 5,1990
B. Lamiroy and P. Gros, "Rapid object indexing and recognition using enhanced geometric hashing", in Proc. Of the 4th European Conf. on Computer Vision, England, April 1996, pp. 59–70.
J. Larish, "Kodak's still picture exchange for print and film use," Advanced Imaging, Vol. 10, No. 4, pp. 38–39, 1995.
W.Y. Ma and B.S. Manjunath, "NETRA: A toolbox for navigating large image databases," in Proc. IEEE Intl. Conf. Image Processing (ICIP), Santa Barbara, Oct.1997
B.S. Manjunath and W.Y. Ma, "Texture features for browsing and retrieval of large image data," IEEE Trans-actionson Pattern Analysis and Machine Intelligence (Special Issue on Digital Libraries), Vol. 18, No. 8, pp. 837–842,1996
M. Martucci, "Digital still marketing at PressLink," Advanced Imaging, Vol. 10, No. 4, pp. 34–36,1995
F. Mokhtarian and S. Abbasi, "Shape similarity retrieval under affine transforms," Pattern Recognition (special issue on Shape Representation and Similarity for Image Databases), Vol. 35, No. 1, pp. 31–41,2002
F. Mokhtarian, S. Abbasi, and J. Kittler, "Robust and efficient shape indexing through curvature scale space," in Proc. 6th Brit. Mach. Visual Conf., pp. 53–62, Sept.1996
J. Mundy and A. Zisserman (Eds.), Geometric Invariance in Computer Vision, MIT Press, Cambridge, MA, 1992.
A. Nagasaka and Y. Tanaka, "Automatic video indexing and full-video search for object appearances," Visual Database System, II, IFIP, Elsevier Science Publishers, 1992, pp. 113–127.
W. Niblack, R. Barber, W. Equitz, M. Flickner, E. Glasman, D. Petkovic, P. Yanker, C. Faloutsos, and G. Taubin, "The QBIC project: Querying images by content using color, texture and shape," Proc. SPIE, Vol. 1908, pp. 173–187,1993
V.E. Ogle and M. Stonebraker, "Chabot: Retrieval from a relational database of images," IEEE Computer, Vol. 28, pp. 40–48,1995
E.J. Pauwels, T. Moons, L.J. Van Gool, P. Kempenaers, and A. Oosterlinck, "Recognition of Planar Shapes Under Affine Distortion," International Journal of Computer Vision, Vol. 14, No. 1, pp. 49–65,1995
A. Pentland, R.W. Picard, and S. Sclaroff, "Photobook: Tools for content-based manipulation of image databases," Proc. SPIE, Vol. 2185, pp. 34–47,1994
R.W. Picard, T. Kabir, and F. Liu, "Real-time recognition with the entire Brodatz texture database," in Proc. IEEE Conf. on Computer Vision and Pattern Recognition, NY, 1993, pp. 638–639.
S. Proctor and J. Illingworth. "ForeSight: Fast object recognition using geometric hashing with edge-triple features," Inter. Conf. on Image Processing,1997
F. Rabitti and P. Stanchev, "GRIM-DBMS: A graphical image database management system," in Proc. IFIP Working Conf. on Visual Database Systems, 1989, pp. 415–430.
T. Reiss, Recognizing Planar Objects Using Invariant Image Features, Springer-Verlag: Berlin,1993
J.R. Smith and S.F. Chang, "Visualseek: Afully automated content-based image query system," in Proceedings of ACM Multimedia 96, Boston MA, USA, 1996, pp. 87–98.
S. Startchik, R. Milanese, C. Rauber, and T. Pun, "Planar shape databases with affine invariant search," First IAPR Int. Workshop on Image Databases and Multi-Media Search (IDB-MMS'96) Aug. 1996, Amsterdam, NL, pp. 202–209.
M.J. Swain, "Interactive indexing into image database," Proc. SPIE, 1993, Vol. 1908, pp. 193–197.
K. Tanabe and J. Ohya, "A similarity retrieval method for line drawing image database," Progress in Image Analysis and Processing, Chichester, Wiley, pp. 138–146,1989
J. Wang, W. Chang, and R. Acharya, "Efficient and effective similar shape retrieval," International Conference on Multimedia Computing and System, Vol. 1, pp. 875–879,1999
I. Weiss, "Geometric invariants and object recognition," International Journal of Computer Vision., Vol. 10, No. 3, pp. 207–231,1993
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Alferez, R., Wang, YF. & Jiao, L. An Affine-Invariant Tool for Retrieving Images from Homogeneous Databases. Multimedia Tools and Applications 25, 133–159 (2005). https://doi.org/10.1023/B:MTAP.0000046385.62974.be
Issue Date:
DOI: https://doi.org/10.1023/B:MTAP.0000046385.62974.be