Advertisement

Recent Psychophysical and Neural Research in Shape Recognition

  • Irving Biederman

Abstract

Shape is the major route by which we gain knowledge about our visual world. All contemporary theories of shape-based object representation, e.g., Hummel and Biederman (1992); Riesenhuber and Poggio (2002), assume a hierarchy of features by which the initial Gabor-like filtering that is characteristic of V1 cell-tuning is ultimately transformed through a series of stages to a point where cell tuning is better described by “moderately complex” features with receptive fields (r.f.s) that often cannot be analyzed into their linear components (Tanaka 1993; Kobatake and Tanaka 1994). These later stages are the inferior temporal cortex in the macaque (IT) and, in humans as determined by fMRI, likely the Lateral Occipital Complex (LOC). Along with the increase in r.f. nonlinearity in IT and LOC, the cells exhibit a high degree of invariance to changes in the conditions of presentation so the response is only moderately changed to variations in the viewing conditions. In this chapter we will review recent evidence, both behavioral and neural, that shed light on the nature of these object representations.

Keywords

Metric Property Depth Discontinuity Inferior Temporal Visual Object Recognition Lateral Occipital Complex 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baker C, Behrmann M, Olson C (2002) Influence of visual discrimination training of the representation of parts and whole in monkey inferotemporal cortex. Nat Neurosci 5:1210–1216PubMedCrossRefGoogle Scholar
  2. Behrmann M, Peterson M, Moscovitch M, Suzuki S (2006) Independent representation of parts and the relations between them: evidence from integrative agnosia. J Exp Psychol Hum Percept Perform (32:1169–1184)PubMedCrossRefGoogle Scholar
  3. Biederman I (1987) Recognition-by-components: a theory of human image understanding. Psychol Rev 94:115–147PubMedCrossRefGoogle Scholar
  4. Biederman I (1995) Visual object recognition. In: Kosslyn SM, Osherson DN (Eds) An invitation to cognitive science, 2nd edition, Vol. 2, visual cognition. MIT Press, Cambridge MA, pp 121–165Google Scholar
  5. Biederman I (2000) Recognizing depth-rotated objects: a review of recent research and theory. Spat Vis 13:241–253PubMedCrossRefGoogle Scholar
  6. Biederman I, Bar M (1998) Same-different matching of depth-rotated objects. Paper presented at the Meetings of the Association for Research in Vision and Ophthalmology, Ft. Lauderdale, FL., May. Invest Ophthalmol Vis Sci 39:1113Google Scholar
  7. Biederman I, Bar M (1999) One-shot viewpoint invariance in matching novel objects. Vision Res 39:2885–2899PubMedCrossRefGoogle Scholar
  8. Biederman I, Bar M (2000) Views on views: response to Hayward & Tarr (2000). Vision Res 40:3901–3905.PubMedCrossRefGoogle Scholar
  9. Biederman I, Cooper EE (1991) Evidence for complete translational and reflectional invariance in visual object priming. Perception 20:585–593PubMedCrossRefGoogle Scholar
  10. Biederman I, Cooper EE (1992) Size invariance in visual object priming. J Exp Psychol Hum Percept Perform 18:121–133CrossRefGoogle Scholar
  11. Biederman I, Gerhardstein PC (1993) Recognizing depth-rotated objects: evidence and conditions for 3D viewpoint invariance. J Exp Psychol Hum Percept Perform 19:1162–1182PubMedCrossRefGoogle Scholar
  12. Biederman I, Gerhardstein PC (1995) Viewpoint-dependent mechanisms in visual object recognition: reply to Tarr and Bülthoff (1995). J Exp Psychol Hum Percept Perform 21:1506–1514CrossRefGoogle Scholar
  13. Biederman I, Kalocsai P (1997) Neurocomputational bases of object and face recognition. Philos Trans R Soc London Biol Sci 352:1203–1219CrossRefGoogle Scholar
  14. Biederman I, Subramaniam S (1997) Predicting the shape similarity of objects without distinguishing viewpoint invariant properties (VIPs) or parts. Invest Ophthalmol Vis Sci 38:998Google Scholar
  15. Biederman I, Subramaniam S, Bar M, Kalocsai P, Fiser J (1999) Subordinate-level object classification reexamined. Psychol Res (62:131–153)PubMedCrossRefGoogle Scholar
  16. Cooper EE, Biederman I, Hummel JE (1992) Metric invariance in object recognition: a review and further evidence. Can J Psychol 46:191–214PubMedGoogle Scholar
  17. Cooper EE, Subramaniam S, Biederman I (1995) Recognizing objects with an irregular part. Invest Opthalmol Vis Sci 36:473Google Scholar
  18. Edelman S, Bülthoff HH (1992) Orientation dependence in the recognition of familiar and novel views of 3D objects. Vision Res 32:2385–4000PubMedCrossRefGoogle Scholar
  19. Esteky H, Tanaka K (1998) Effects of changes in aspect ratio of stimulus shape on responses of cells in the monkey inferotemporal cortex. Soc Neurosci Abstr 24:899Google Scholar
  20. Hayward WG, Tarr MJ (1997) Testing conditions for viewpoint invariance in object recognition. J Exp Psychol Hum Percept Perform 23:1511–1521PubMedCrossRefGoogle Scholar
  21. Hayworth KJ, Biederman I (2005) Differential EMRI activity produced by variation in parts and relations during object perception. J Vision 5(8):740.CrossRefGoogle Scholar
  22. Hummel JE, Biederman I (1992) Dynamic binding in a neural network for shape recognition. Psycholog Rev 99:480–517CrossRefGoogle Scholar
  23. Humphreys GW, Riddoch MJ (1987) Visual object processing: a cognitive neuropsychological approach. Lawrence Erlbaum Associates Ltd, LondonGoogle Scholar
  24. Kayaert G, Biederman I, Vogels R (2003) Shape tuning in macaque inferior temporal cortex. J Neurosci 23:3016–3027PubMedGoogle Scholar
  25. Kayaert G, Biederman I, Op De Beeck H, Vogels R (2005b) Tuning for shape dimensions in macaque inferior temporal cortex. Eur J Neurosci 22:212–224PubMedCrossRefGoogle Scholar
  26. Kayaert G, Biederman I, Vogels R (2005a) Representation of regular and irregular shapes in macaque inferotemporal cortex. Cereb Cortex 15:1308–1321PubMedCrossRefGoogle Scholar
  27. Kobatake E, Tanaka K (1994) Neuronal selectivities to complex object features in the ventral visual pathway of the macaque cerebral cortex. J Neurosci 71:856–867Google Scholar
  28. Kourtzi Z, Kanwisher N (2000) Cortical regions involved in perceiving object shape. J Neurosci 20:3310–3318PubMedGoogle Scholar
  29. Lades M, Vortbrüggen JC, Buhmann J, Lange J, von der Malsburg C, Würtz RP, Konen W (1993) Distortion invariant object recognition in the dynamic link architecture. IEEE Trans Comput 42:300–31CrossRefGoogle Scholar
  30. Logothetis NK (1999) Paper presented at a workshop on Visual Object Recognition by Humans and Machines, Bad Homburg, Germany, May.Google Scholar
  31. Logothetis NK, Pauls J, Bülthoff HH, Poggio T (1994) View-dependent object recognition by monkeys. Curr Biol 4:401–414PubMedCrossRefGoogle Scholar
  32. Marr, D. (1982). Vision. San Francisco: W.H. Freeman.Google Scholar
  33. Mel BW (1997) SEEMORE: combining color, shape, and texture histogramming in a neurally-inspired approach to visual object recognition. Neural Comput 9:777–804PubMedCrossRefGoogle Scholar
  34. Nederhouser M, Biederman I, Davidoff J, Yue X, Kayaert G, Vogels R (2005) The representation of shape in individuals from a culture with limited contact with regular, simple artifacts. Paper presented at the 5th Annual Meeting of the Vision Sciences Society. May.Google Scholar
  35. Nowak LG, Bullier J (1997) The timing of information transfer in the visual system. In: Kaas J, Rockland K, Peters A (Eds) Extrastriate cortex, cerebral cortex Vol. 12. Plenum, New York, pp 205–241Google Scholar
  36. Pasupathy A, Connor CE (1999) Response to contour features in macaque V4. J Neurophysiol 82:2490–2502PubMedGoogle Scholar
  37. Poggio T, Edelman S (1990) A network that learns to recognize three-dimensional objects. Nature 343:263–266PubMedCrossRefGoogle Scholar
  38. Riesenhuber M, Poggio T (2002) Neural mechanisms of object recognition. Curr Opin Neurobiol 12:162–168PubMedCrossRefGoogle Scholar
  39. Shepard RN, Cermak GW (1973) Perceptual-cognitive explorations of a toroidal set of free-form stimuli. Cognit Psychol 4:351–377CrossRefGoogle Scholar
  40. Tanaka K (1993) Neuronal mechanisms of object recognition. Science 262:685–688PubMedCrossRefGoogle Scholar
  41. Tanaka K (1996) Inferotemporal cortex and object vision. Annu Rev Neurosci 19:109–139PubMedCrossRefGoogle Scholar
  42. Tarr MJ, Bülthoff HH (1995) Is human object recognition better described be geon structural descriptions or by multiple views? Comment on Biederman and Gerhardstein (1995). J Exp Psychol Hum Percept Perform 21:1494–1505PubMedCrossRefGoogle Scholar
  43. Tarr MJ, Bülthoff HH, Zabinski M, Blanz V (1997) To what extent do unique parts influence recognition across viewpoint? Psychol Sci 8:282–289CrossRefGoogle Scholar
  44. Tarr MJ, Williams P, Hayward WG, Gauthier I (1998) Three-dimensional object recognition is viewpoint dependent. Nat Neurosci 1:275–277PubMedCrossRefGoogle Scholar
  45. Tsunoda K, Yamane Y, Nishizaki M, Tanifuji M (2001) Complex objects are represented in macaque inferotemporal cortex by the combination of feature columns. Nat Neurosci 4:832–838PubMedCrossRefGoogle Scholar
  46. Tversky A (1977) Features of similarity. Psychol Rev 84:327–352CrossRefGoogle Scholar
  47. Tversky B, Hemenway K (1984) Objects, parts, and categories. J Exp Psychol Gen 113:169–193PubMedCrossRefGoogle Scholar
  48. Vogels R, Biederman I (2002) Effects of illumination intensity and direction on object coding in macaque inferior temporal cortex. Cereb Cortex 12:756–766PubMedCrossRefGoogle Scholar
  49. Vogels R, Biederman I, Bar M, Lorincz A (2001) Sensitivity of macaque temporal neurons to differences in view-invariant vs metric properties of depth-rotated objects. J Cogn Neurosci 13:444–453PubMedCrossRefGoogle Scholar
  50. Winston PH (1975) Learning structural descriptions from examples. In: Winston PH (Ed) The psychology of computer vision. McGraw-Hill, New York, pp 157–209Google Scholar
  51. Wiscott L, Fellous J-M, Krüger N, von der Malsburg C (1997) Face recognition by elastic bunch graph matching. IEEE PAMI 19:775–779Google Scholar
  52. Zerroug M, Nevatia R (1996) Volumetric descriptions from a single intensity image. Int J Comput Vis 20:11–42CrossRefGoogle Scholar

Copyright information

© Springer 2007

Authors and Affiliations

  • Irving Biederman
    • 1
  1. 1.Departments of Psychology and Neuroscience ProgramUniversity of Southern CaliforniaLos AngelesUSA

Personalised recommendations