Abstract
When we look at the object in front of us, a specific pattern of activity is created in the ganglion cells of the retina. This pattern is relayed and transformed on its way via the thalamus and primary visual areas to higher cortical stages, where it may interact with and activate certain memories stored there. If this happens, we feel that we have recognized the object. While the recognition process as a whole is far from being understood, there is a wealth of details known about the individual anatomical subsystems involved in this process. Light entering the eye from the environment is focussed and projected by the lens as an inverted image onto the back of the eye. This concave surface is covered by the retina, the first outpost of the central nervous system (CNS) to be encountered by the light (see Figure 2.1a).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anderson, C.H., Van Essen, D.C.: Shifter circuits: A computational strategy for dynamic aspects of visual processing. Proceedings of the National Academy of Sciences of the United States of America 84, 6297–6301 (1987)
Arathorn, D.W.: Map-Seeking Circuits in Visual Cognition—A Computational Mechanism for Biological and Machine Vision. Stanford University Press, Stanford (2002)
Bak, P.: How nature works: the science of self-organized criticality. Springer, Heidelberg (1996)
Bar, M., Biederman, I.: Localizing the cortical region mediating visual awareness of object identity. PNAS 96, 1790–179 (1999)
Berg, A.C., Berg, T.L., Malik, J.: Shape matching and object recognition using low distortion correspondence. In: Proc. CVPR, pp. 26–33 (2005)
Biederman, I., Kalocsai, P.: Neurocomputational bases of object and face recognition. Phil. Trans. Roy. Soc. B 352, 1203–1219 (1997)
Bosch, A., Zisserman, A., Muoz, X.: Scene classification using a hybrid generative/discriminative approach. IEEE Trans Pattern Anal Mach Intell (PAMI) 30, 712–727 (2008)
Bundesen, C., Larsen, A.: Visual transformation of size. J. Exp. Psychol. Hum. Percept Perform 1(3), 214–220 (1975)
Chater, N., Tenenbaum, J.B., Yuille, A.: Probabilistic models of cognition: Conceptual foundations. Trends in Cognitive Sciences 10(7), 287–291 (2006)
Cox, D., Meier, P., Oertelt, N., DiCarlo, J.J.: Breaking position-invariant object recognition. Nature Neuroscience 8(9), 1145–1147 (2005)
Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Proc. ECCV workshop on Statistical Learning in Computer Vision, pp. 59–74 (2004)
Debruille, J.B., Guillem, F., Renault, B.: Erps and chronometry of face recognition: following-up seeck et al. and george et al. Neuroreport 9(15), 3349–3353 (1998)
Deco, G., Rolls, E.T.: A neurodynamical cortical model of visual attention and invariant object recognition. Vision Research 44(6), 621–642 (2004)
Duhamel, J.R., Colby, C.L., Goldberg, M.E.: The updating of the representation of visual space in parietal cortex by intended eye movements. Science 255(5040), 90–92 (1992)
Fei-Fei, L., Fergus, R., Perona, P.: A bayesian approach to unsupervised one-shot learning of object categories. In: Proc. of the Ninth IEEE Intern. Conf. Computer Vision, pp. 1134–1141 (2003)
Feldman, J.A.: Dynamic connections in neural networks. Biol. Cybern. 46(1), 27–39 (1982)
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (2003)
Fiser, J., Biederman, I.: Invariance of long-term visual priming to scale, reflection, translation, and hemisphere. Vision Research 41, 221–234 (2001)
Fukushima, K., Miyake, S., Ito, T.: Neocognitron: A neural network model for a mechanism of visual pattern recognition. IEEE Transactions on Systems, Man and Cybernetics 13(5), 826–834 (1983)
Graf, M.: Coordinate transformations in object recognition. Psychol Bull 132, 920–945 (2006)
Gray, C.M., Singer, W.: Stimulus-specific neuronal oscillations in orientation columns of cat visual cortex. Proceedings of the National Academy of Sciences of the USA 86(5), 1698–1702 (1989)
Greenberg, D.S., Houweling, A.R., Kerr, J.N.D.: Population imaging of ongoing neuronal activity in the visual cortex of awake rats. Nature Neuroscience (2008), http://dx.doi.org/10.1038/nn.2140
Hubel, D.H., Wiesel, T.N.: Receptive fields and functional architecture of monkey striate cortex. J. Physiol (Lond.) 195, 215–243 (1968)
Joachims, T.: Text categorization with support vector machines: Learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998)
Johnson, J.S., Olshausen, B.A.: Timecourse of neural signatures of object recognition. J. Vis. 3(7), 499–512 (2003), http://dx.doi.org/10:1167/3.7.4
Jolicoeur, P.: The time to name disoriented natural objects. Mem. Cognit. 13(4), 289–303 (1985)
Kandel, E.R., Jessell, T.M., Schwartz, J.: Principles of Neural Science, 4th edn. McGraw Hill, New York (2000)
Kanizsa, G.: Margini quasi-percettivi in campi con stimolazione omogenea. Rivista di Psicologia 49, 7–30 (1955)
Konen, C.S., Kastner, S.: Two hierarchically organized neural systems for object information in human visual cortex. Nature Neuroscience 11(2), 224–231 (2008), http://dx.doi.org/10.1038/nn2036
Kschischang, F.R., Frey, B.J., Loelinger, H.-A.: Factor graphs and the sum-product algorithm. IEEE Transactions on Information Theory 47, 498–519 (2001)
Kusunoki, M., Goldberg, M.E.: The time course of perisaccadic receptive field shifts in the lateral intraparietal area of the monkey. J. Neurophysiol. 89(3), 1519–1527 (2003)
Lamme, V.: Why visual attention and awareness are different. Trends in Cognitive Sciences 7(1), 12–18 (2003)
Lawson, R., Jolicoeur, P.: The effect of prior experience on recognition thresholds for plane-disoriented pictures of familiar objects. Mem. Cognit. 27(4), 751–758 (1999)
Lazebnik, S., Schmid, C., Ponce, J.: Affine-invariant local descriptors and neighborhood statistics for texture recognition. In: Proc. ICCV, pp. 649–655 (2003)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Proc. CVPR 2006, vol. 2, pp. 2169–2178 (2006)
LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Computation 1(4), 541–551 (1989)
LeCun, Y., Huang, F.J., Bottou, L.: Learning methods for generic object recognition with invariance to pose and lighting. In: CVPR, pp. 97–104. IEEE Computer Society, Los Alamitos (2004)
Leung, T., Malik, J.: Representing and recognizing the visual appearance of materials using three-dimensional textons. International Journal of Computer Vision 43, 29–44 (2001)
Luck, S.J., Chelazzi, L., Hillyard, S.A., Desimone, R.: Neural mechanisms of spatial selective attention in areas V1, V2, and V4 of macaque visual cortex. J Neurophysiol. 77(1), 24–42 (1997)
Marr, D., Poggio, T.: Cooperative computation of stereo disparity. Science 194(4262), 283–287 (1976)
Mel, B.W.: Seemore: Combining color, shape, and texture histogramming in a neurally inspired approach to visual object recognition. Neural Computation 9, 777–804 (1997)
Mel, B.W., Fiser, J.: Minimizing binding errors using learned conjunctive features. Neural Computation 12(4), 731–762 (2000)
Murray, J.F., Kreutz-Delgado, K.: Visual recognition and inference using dynamic overcomplete sparse learning. Neural Computation 19(9), 2301–2352 (2007), http://dx.doi.org/10.1162/neco.2007.19.9.2301
Murray, S.O., Boyaci, H., Kersten, D.: The representation of perceived angular size in human primary visual cortex. Nature Neuroscience 9, 429–434 (2006)
Oliva, A., Torralba, A.: Building the gist of a scene: the role of global image features in recognition. Prog Brain Res 155, 23–36 (2006) http://dx.doi.org/10.1016/S0079-6123(06)55002-2
Olshausen, B.A., Anderson, C.H., van Essen, D.C.: A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information. Journal of Neuroscience 13(11), 4700–4719 (1993)
Oram, M.W., Perret, D.I.: Modeling visual recognition from neurobiological constraints. Neural Networks 7, 945–972 (1994)
Pasupathy, A., Connor, C.E.: Responses to contour features in macaque area V4. J Neurophysiol. 82(5), 2490–2502 (1999)
Phillips, P., Grother, P., Micheals, R., Blackburn, D., Tabassi, E., Bone, J.: Frvt, evaluation report, Technical Report 6965, NISTIR. (2003), http://www.frvt.org/
Pinto, N., Cox, D.D., Dicarlo, J.J.: Why is real-world visual object recognition hard? PLoS Computational Biology 4(1), e27+ (2008), http://dx.doi.org/10.1371/journal.pcbi.0040027
Pitts, W., McCulloch, W.S.: How we know universals: the perception of auditory and visual forms. Bulletin of Mathematical Biophysics 9, 127–147 (1947)
Pollen, D., Lee, J., Taylor, J.: How does the striate cortex begin the reconstruction of the visual world? Science 173, 74–77 (1971)
Postma, E., van den Herik, H., Hudson, P.: SCAN: A Scalable Model of Attentional Selection. Neural Netw. 10(6), 993–1015 (1997)
Rao, R.P., Ballard, D.H.: Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nature Neuroscience 2(1), 79–87 (1999)
Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in cortex. Nature Neuroscience 2(11), 1019–1025 (1999)
Rosenblatt, F.: Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Spartan Books, Washington (1961)
Schiele, B., Crowley, J.: Recognition without correspondence using multidimensional receptive field histograms. International Journal of Computer Vision 36(1), 31–50 (2000)
Schwartz, E.L.: Spatial mapping in primate sensory projection: analytic structure and relevance to perception. Biological Cybernetics 25, 181–194 (1977)
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., Poggio, T.: Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach Intell. 29(3), 411–426 (2007) (Evaluation Studies)
Singer, W.: Synchronization, binding and expectancy. In: Arbib, M. (ed.) The Handbook of Brain Theory and Neural Networks, pp. 1136–1143. MIT Press, Cambridge (2003)
Sivic, J., Russell, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their location in images. In: Proc. ICCV 2005, pp. 370–377 (2005)
Song, Y., Goncalves, L., Perona, P.: Unsupervised learning of human motion. PAMI 25(25), 1–14 (2003)
Swain, M., Ballard, D.: Color indexing. International Journal of Computer Vision 7, 11–32 (1991)
Tanaka, K.: Inferotemporal cortex and object vision. Annu. Rev. Neurosci. 19, 109–139 (1996)
Thorpe, S.: Identification of rapidly presented images by the human visual system. Perception 17, A77 (1988)
Thorpe, S., Fize, D., Marlot, C.: Speed of processing in the human visual system. Nature 381(6582), 520–522 (1996)
Tootell, R.B., Silverman, M.S., Hamilton, S.L., Switkes, E., De Valois, R.L.: Functional anatomy of macaque striate cortex. V. Spatial frequency. J. Neurosci. 8, 1610–1624 (1988)
Torralba, A., Murphy, K.P., Freeman, W.T., Rubin, M.A.: Context-based vision system for place and object recognition. In: Proc. ICCV 2003, pp. 273–280 (2003)
van Vreeswijk, C., Sompolinsky, H.: Chaotic balanced state in a model of cortical circuits. Neural Computation 10, 1321–1372 (1998)
von der Heydt, R., Peterhans, E., Baumgartner, G.: Illusory contours and cortical neuron responses. Science 224, 1260–1262 (1984)
von der Malsburg, C.: The correlation theory of brain function, Internal report, 81-2, Max-Planck-Institut für Biophysikalische Chemie, Postfach 2841, 3400 Göttingen, FRG (1981); Reprinted in Domany, E., van Hemmen, J.L., Schulten, K. (eds.): Models of Neural Networks II, ch. 2, pp. 95–119. Springer, Berlin (1994)
Wang, D.: The time dimension for scene analysis. IEEE Transactions on Neural Networks 16(6), 1401–1426 (2005)
Wersing, H., Körner, E.: Learning optimized features for hierarchical models of invariant object recognition. Neural Computation 15(7), 1559–1588 (2003), http://dx.doi.org/10.1162/089976603321891800
Wiskott, L., Fellous, J.-M., Krüger, N., von der Malsburg, C.: Face recognition by elastic bunch graph matching. IEEE Trans. on Pattern Analysis and Machine Intelligence 19(7), 775–779 (1997), http://www.cnl.salk.edu/~wiskott/Abstracts/WisFelKrue97a.html
Wiskott, L., von der Malsburg, C.: Face recognition by dynamic link matching. In: Sirosh, J., Miikkulainen, R., Choe, Y. (eds.) Lateral Interactions in the Cortex: Structure and Function, Austin, TX. Electronic book, vol. 11, The UTCS Neural Networks Research Group (1996), http://www.cs.utexas.edu/users/nn/web-pubs/htmlbook96/ , http://www.cnl.salk.edu/~wiskott/Abstracts/WisMal96c.html
Womelsdorf, T., Anton-Erxleben, K., Pieper, F., Treue, S.: Dynamic shifts of visual receptive fields in cortical area MT by spatial attention. Nature Neuroscience 9(9), 1156–1160 (2006)
Yuille, A., Kersten, D.: Vision as Bayesian inference: analysis by synthesis? Trends in Cognitive Sciences 10(7), 301–308 (2006)
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Wolfrum, P. (2010). Background and Concepts. In: Information Routing, Correspondence Finding, and Object Recognition in the Brain. Studies in Computational Intelligence, vol 316. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15254-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-15254-2_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15253-5
Online ISBN: 978-3-642-15254-2
eBook Packages: EngineeringEngineering (R0)