On the Role of Object-Specific Features for Real World Object Recognition in Biological Vision

Serre, Thomas; Riesenhuber, Maximilian; Louie, Jennifer; Poggio, Tomaso

doi:10.1007/3-540-36181-2_39

Thomas Serre⁷,
Maximilian Riesenhuber⁷,
Jennifer Louie⁷ &
…
Tomaso Poggio⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2525))

Included in the following conference series:

International Workshop on Biologically Motivated Computer Vision

2885 Accesses
22 Citations

Abstract

Models of object recognition in cortex have so far been mostly applied to tasks involving the recognition of isolated objects presented on blank backgrounds. However, ultimately models of the visual system have to prove themselves in real world object recognition tasks. Here we took a first step in this direction: We investigated the performance of the HMAX model of object recognition in cortex recently presented by Riesenhuber & Poggio [1],[2] on the task of face detection using natural images. We found that the standard version of hmax performs rather poorly on this task, due to the low specificity of the hardwired feature set of C2 units in the model (corresponding to neurons in intermediate visual area V4) that do not show any particular tuning for faces vs. background. We show how visual features of intermediate complexity can be learned in HMAX using a simple learning rule. Using this rule, hmax outperforms a classical machine vision face detection system presented in the literature. This suggests an important role for the set of features in intermediate visual areas in object recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

M. Riesenhuber and T. Poggio. Hierarchical models of object recognition in cortex. Nat. Neurosci., 2(11):1019–25, 1999.
Article Google Scholar
M. Riesenhuber and T. Poggio. Models of object recognition. Nature Neuroscience, 3 supp.:1199–1204, 2000.
Google Scholar
B. Heisele, T. Serre, M. Pontil, and T. Poggio. Component-based face detection. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, volume 1, pages 657–62, Hawaii, 2001.
Google Scholar
K.-K. Sung. Learning and Example Selection for Object and Pattern Recognition. PhD thesis, MIT, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Cambridge, MA, 1996.
Google Scholar
D. Hubel and T. Wiesel. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J. Phys., 160:106–54, 1962.
Google Scholar
T. J. Gawne and J. M. Martin. Response of primate visual cortical V4 neurons to simultaneously presented stimuli. To appear in J. Neurophysiol., 2002.
Google Scholar
D. J. Freedman, M. Riesenhuber, T. Poggio, and E. K. Miller. Categorical representation of visual stimuli in the primate prefrontal cortex. Science, 291:312–16, 2001.
Article Google Scholar
V. Vapnik. The nature of statistical learning. Springer Verlag, 1995.
Google Scholar
T. Vetter. Synthesis of novel views from a single face. International Journal of Computer Vision, 28(2):103–116, 1998.
Article MathSciNet Google Scholar
S. Ullman, M. Vidal-Naquet, and E. Sali. Visual features of intermediate complexity and their use in classification. Nat. Neurosci., 5(7):682–87, 2002.
Google Scholar

Download references

Author information

Authors and Affiliations

Artificial Intelligence Lab, and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Center for Biological and Computational Learning, Mc Govern Institute for Brain Research, Cambridge, MA, USA
Thomas Serre, Maximilian Riesenhuber, Jennifer Louie & Tomaso Poggio

Authors

Thomas Serre
View author publications
You can also search for this author in PubMed Google Scholar
Maximilian Riesenhuber
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Louie
View author publications
You can also search for this author in PubMed Google Scholar
Tomaso Poggio
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Max Planck Institute for Biological Cybernetics, Spemannstraße 38, 72076, Tübingen, Germany
Heinrich H. Bülthoff & Christian Wallraven &
Department of Computer Science and Engineering, Korea University, Anam-dong, Seongbuk-ku, 136-701, Seoul, Korea
Seong-Whan Lee
Department of Brain and Cognitive Sciences, Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 45 Carleton Street, 02142, Cambridge, MA, USA
Tomaso A. Poggio

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Serre, T., Riesenhuber, M., Louie, J., Poggio, T. (2002). On the Role of Object-Specific Features for Real World Object Recognition in Biological Vision. In: Bülthoff, H.H., Wallraven, C., Lee, SW., Poggio, T.A. (eds) Biologically Motivated Computer Vision. BMCV 2002. Lecture Notes in Computer Science, vol 2525. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36181-2_39

Download citation

DOI: https://doi.org/10.1007/3-540-36181-2_39
Published: 21 November 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00174-4
Online ISBN: 978-3-540-36181-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics