Proposing a CNN Based Architecture of Mid-level Vision for Feeding the WHERE and WHAT Pathways in the Brain

Das, Apurba; Roy, Anirban; Ghosh, Kuntal

doi:10.1007/978-3-642-27172-4_66

Apurba Das²⁰,
Anirban Roy²¹ &
Kuntal Ghosh²²

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7076))

Included in the following conference series:

International Conference on Swarm, Evolutionary, and Memetic Computing

2218 Accesses
5 Citations

Abstract

In the central visual pathway originating from the eye, a bridging is required between two hierarchical tasks, that of pixel based information recording by visual pathway at low level on one hand and that of object recognition at high level on the other. Such a bridge which may be designated as a mid-level block-grained integration has here been modeled by a multi-layer flexible cellular neural network (F-CNN). The proposed CNN architecture is validated by different intermediate level tasks involving rigid and deformable pattern recognition. Execution of such tasks by the proposed architecture, it has been shown, is capable of generating valid and significant inputs for the WHERE (dorsal) and WHAT (ventral) pathways in the brain. The model includes the proposal of a feedback (also by CNN architecture) to the lower mid-level from the higher mid-level dorsal and ventral pathways for flexible cell (physiological receptive field) size adjustment in the primary visual cortex towards successful ‘where’ and ‘what’ identifications for high-level vision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Marr, D.: Vision: A computational investigation into the human representation & processing of visual information. MIT Press (2010)
Google Scholar
Ungerleider, L.G., Mishkin, M.: Two Cortical Visual Systems. In: Ingle, D.J., Goodale, M.A., Mansfield, R.J.W. (eds.) Analysis of Visual Behavior, pp. 549–586. The MIT Press, Cambridge (1982)
Google Scholar
Rodieck, R.W., Stone, J.: Analysis of receptive fields of cat retinal ganglion cells. Journal of Neurophysiology 28, 833–849 (1965)
Google Scholar
Chua, L.O., Roska, T.: Cellular Neural Networks and Visual Computing. Cambridge University Press (2002)
Google Scholar
Itti, L., Koch, C., Niebur, E.: A model of saliency based visual attention for rapid scene analysis. IEEE Trans. on PAMI 20, 1254–1259 (1998)
Article Google Scholar
Koenderink, J.J.: The structure of images. Biological Cybernetics 50, 363–396 (1984)
Article MathSciNet MATH Google Scholar
Lindeberg, T.: Scale-space theory: A basic tool for analyzing structures at different scales. Journal of Applied Statistics 21(2), 224–270 (1994)
Google Scholar
Livingstone, M.S., Hubel, D.H.: Anatomy and physiology of a colour system in the primate visual cortex. J. Neurosci. 4, 309–356 (1984)
Google Scholar
Livingstone, M.S., Hubel, D.H.: Segregation of form, colour, movement, and depth: anatomy, physiology, and perception. Science 240, 740–749 (1988)
Article Google Scholar
Kandel, E.R., Schwartz, J.H., Jessel, T.M.: Principles of Neural Science, 3rd edn. Elsevier, New York (1991)
Google Scholar
Lowe, D.G.: Object recognition from local scale-invariant features. Proceedings of the International Conference on Computer Vision 2, 1150–1157 (1999)
Google Scholar
Lowe, D.G.: Distinctive Image Features from Scale-Invariant Key points. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
http://www.klab.caltech.edu/codedata/codedata.shtml

Download references

Author information

Authors and Affiliations

Centre for Development of Advanced Computing (CDAC), Salt Lake, Kolkata, India
Apurba Das
Techno India, Salt Lake, Kolkata, India
Anirban Roy
Machine Intelligence Unit and Center for Soft Computing Research, Indian Statistical Institute, India
Kuntal Ghosh

Authors

Apurba Das
View author publications
You can also search for this author in PubMed Google Scholar
Anirban Roy
View author publications
You can also search for this author in PubMed Google Scholar
Kuntal Ghosh
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electrical Engineering, IIT Delhi, India
Bijaya Ketan Panigrahi
School of Electrical and Electronic Engineering, Nanyang Technological University, 639798, Singapore
Ponnuthurai Nagaratnam Suganthan
Department of Electronics and Telecommunications, Jadavpur University, 700032, Kolkata, India
Swagatam Das
ANITS, Visakhapatnam, India
Suresh Chandra Satapathy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Das, A., Roy, A., Ghosh, K. (2011). Proposing a CNN Based Architecture of Mid-level Vision for Feeding the WHERE and WHAT Pathways in the Brain. In: Panigrahi, B.K., Suganthan, P.N., Das, S., Satapathy, S.C. (eds) Swarm, Evolutionary, and Memetic Computing. SEMCCO 2011. Lecture Notes in Computer Science, vol 7076. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27172-4_66

Download citation

DOI: https://doi.org/10.1007/978-3-642-27172-4_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27171-7
Online ISBN: 978-3-642-27172-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics