Abstract
This paper presents a visual architecture able to identify salient regions in a visual scene and to use them to focus on interesting locations. It is inspired by the ability of natural vision systems to perform a differential processing of spatial frequencies in both time and space and to focus their attention on a local part of the visual scene. The present paper analyzes how this differential processing of spatial frequencies is able to provide an artificial system with the information required to perform an exploration of its visual world based on a center-surround distinction of the external scene. It shows how the salient locations can be gathered on the basis of their similarities to form a high level representation of the visual scene.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allport, A.: Visual attention. In: Foundations of cognitive science. M.I. Posner (ed). The MIT Press. (1989)
Aloimonos, Y. (ed) Active Perception. Lawrence Erlbaum, Hillsdale,NJ (1993)
Atick, J.J. and A.N. Redlich: Towards a Theory of Early Visual Processing. Neural Computation 2 (1990) 308ā320
Barlow, H.B.: Possible principles underlying the transformation of sensory messages. In: Sensory Communication. W. Rosenblith (ed). The MIT Press: Cambridge, MA. (1961) 217ā234
Field, D.J.: Relations between the statistics of natural images and the response properties of cortical cells. Journal of the Optical Society of America A 4 (1987) 2379ā2394
Bell, A.J. and T.J. Sejnowski: The āindependent componentsā of natural scenes are edge filters. Vision Research 37 (1997) 3327ā3338
Olshausen, B.A. and D.J. Field: Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381 (1996) 607ā609
Daugman, J. and C. Downing: Gabor wavelets for statistical pattern recognition. In: The Handbook of Brain Theory and Neural Networks. M.A. Arbib (ed). The MIT Press: Cambridge, MA. (1995) 414ā420
GuĆ©rin-DuguĆ©, A. and P.M. Palagi: Implantations de filtres de Gabor par pyramide d'images passe-bas. Traitement du signal 13 (1996) 1ā11
Field, D.J.: What is the goal of sensory coding? Neural Computation 6 (1994) 559ā601
HĆ©rault, J., A. Oliva, and A. GuĆ©rin-DuguĆ©: Scene categorisation by curvilinear component analysis of low frequency spectra. In: ESANN'97. Bruges (1997) 91ā96
Oliva, A. and P.G. Schyns: Coarse blobs or fine edges? Evidence that information diagnosticity changes the perception of complex visual stimuli. Cognitive Psychology 34 (1997) 72ā107
Andrey, P. and P. Tarroux: Unsupervised segmentation of Markov Random Field modeled textured images using selectionist relaxation. IEEE Transactions on Pattern Analysis and Machine Intelligence 20 (1998) 252ā263
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Machrouh, Y., LiƩnard, JS., Tarroux, P. (2001). Multiscale Feature Extraction from the Visual Environment in an Active Vision System. In: Arcelli, C., Cordella, L.P., di Baja, G.S. (eds) Visual Form 2001. IWVF 2001. Lecture Notes in Computer Science, vol 2059. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45129-3_35
Download citation
DOI: https://doi.org/10.1007/3-540-45129-3_35
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42120-7
Online ISBN: 978-3-540-45129-7
eBook Packages: Springer Book Archive