Abstract
In the central visual pathway originating from the eye, a bridging is required between two hierarchical tasks, that of pixel based information recording by visual pathway at low level on one hand and that of object recognition at high level on the other. Such a bridge which may be designated as a mid-level block-grained integration has here been modeled by a multi-layer flexible cellular neural network (F-CNN). The proposed CNN architecture is validated by different intermediate level tasks involving rigid and deformable pattern recognition. Execution of such tasks by the proposed architecture, it has been shown, is capable of generating valid and significant inputs for the WHERE (dorsal) and WHAT (ventral) pathways in the brain. The model includes the proposal of a feedback (also by CNN architecture) to the lower mid-level from the higher mid-level dorsal and ventral pathways for flexible cell (physiological receptive field) size adjustment in the primary visual cortex towards successful ‘where’ and ‘what’ identifications for high-level vision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Marr, D.: Vision: A computational investigation into the human representation & processing of visual information. MIT Press (2010)
Ungerleider, L.G., Mishkin, M.: Two Cortical Visual Systems. In: Ingle, D.J., Goodale, M.A., Mansfield, R.J.W. (eds.) Analysis of Visual Behavior, pp. 549–586. The MIT Press, Cambridge (1982)
Rodieck, R.W., Stone, J.: Analysis of receptive fields of cat retinal ganglion cells. Journal of Neurophysiology 28, 833–849 (1965)
Chua, L.O., Roska, T.: Cellular Neural Networks and Visual Computing. Cambridge University Press (2002)
Itti, L., Koch, C., Niebur, E.: A model of saliency based visual attention for rapid scene analysis. IEEE Trans. on PAMI 20, 1254–1259 (1998)
Koenderink, J.J.: The structure of images. Biological Cybernetics 50, 363–396 (1984)
Lindeberg, T.: Scale-space theory: A basic tool for analyzing structures at different scales. Journal of Applied Statistics 21(2), 224–270 (1994)
Livingstone, M.S., Hubel, D.H.: Anatomy and physiology of a colour system in the primate visual cortex. J. Neurosci. 4, 309–356 (1984)
Livingstone, M.S., Hubel, D.H.: Segregation of form, colour, movement, and depth: anatomy, physiology, and perception. Science 240, 740–749 (1988)
Kandel, E.R., Schwartz, J.H., Jessel, T.M.: Principles of Neural Science, 3rd edn. Elsevier, New York (1991)
Lowe, D.G.: Object recognition from local scale-invariant features. Proceedings of the International Conference on Computer Vision 2, 1150–1157 (1999)
Lowe, D.G.: Distinctive Image Features from Scale-Invariant Key points. International Journal of Computer Vision 60(2), 91–110 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Das, A., Roy, A., Ghosh, K. (2011). Proposing a CNN Based Architecture of Mid-level Vision for Feeding the WHERE and WHAT Pathways in the Brain. In: Panigrahi, B.K., Suganthan, P.N., Das, S., Satapathy, S.C. (eds) Swarm, Evolutionary, and Memetic Computing. SEMCCO 2011. Lecture Notes in Computer Science, vol 7076. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27172-4_66
Download citation
DOI: https://doi.org/10.1007/978-3-642-27172-4_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27171-7
Online ISBN: 978-3-642-27172-4
eBook Packages: Computer ScienceComputer Science (R0)