Abstract
A neurally-inspired 3-D visual object recognition system is described called SEEMORE, whose goal is to reliably identify common objects from a large known set — independent of viewing angle, distance, and partial occlusion. Three distinctive features of this view-based approach are: (1) use of a shallow but very high- dimensional feedforward network of simple filtering operations, (2) simultaneous use of cues from several different visual sub-modalities, i.e. color, shape, texture, etc., and (3) inclusion of objects that are rigid, non-rigid, articulated, and/or statistical in nature. Preliminary results have been obtained using combinations of color and shape-based features. In response to a testset of 150 novel views of 50 known objects presented individually in color video images, SEEMORE currently identifies the object correctly 83% of the time (chance is 2%). Six non-rigid objects, including a telphone cord, a bicycle chain, two scarves, a grape cluster, and a maple-leaf cluster were included in the training set; all 18 novel test views of these objects were correctly recognized. Recognition time on a Sparc-2 is a non-optimized 60 seconds including all image processing. Extension to the case of multiple object scenes is discussed.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Oram, M., & D. Perrett (1992). Time course of neural responses discriminating different views of the face and head. Journal of Neurophysiology 68(1), 70–84.
Poggio, T., & S. Edelman (1990). A network that learns to recognize three-dimensional objects. Nature 343, 263–266.
Swain, M., & D. Ballard (1991). Color indexing. Internatiaonal Journal of. Computer Vision 7, 11–32.
Tanaka, K., H. Saito, Y. Fukada, & M. Moriya (1991). Coding visual images of objects in the inferotemporal cortex of the macaque monkey. Journal of Neurophysiology 66, 170–189.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Mel, B.W. (2000). A Neurally-Inspired Approach to 3-D Visual Object Recognition. In: Cruse, H., Dean, J., Ritter, H. (eds) Prerational Intelligence: Adaptive Behavior and Intelligent Systems Without Symbols and Logic, Volume 1, Volume 2 Prerational Intelligence: Interdisciplinary Perspectives on the Behavior of Natural and Artificial Systems, Volume 3. Studies in Cognitive Systems, vol 26. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-0870-9_15
Download citation
DOI: https://doi.org/10.1007/978-94-010-0870-9_15
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-3792-1
Online ISBN: 978-94-010-0870-9
eBook Packages: Springer Book Archive