An Attentional System Combining Top-Down and Bottom-Up Influences

Rasolzadeh, Babak; Tavakoli Targhi, Alireza; Eklundh, Jan-Olof

doi:10.1007/978-3-540-77343-6_8

Babak Rasolzadeh³,
Alireza Tavakoli Targhi³ &
Jan-Olof Eklundh³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4840))

Included in the following conference series:

International Workshop on Attention in Cognitive Systems

1712 Accesses
11 Citations

Abstract

Attention plays an important role in human processing of sensory information as a mean of focusing resources toward the most important inputs at the moment. It has in particular been shown to be a key component of vision. In vision it has been argued that the attentional processes are crucial for dealing with the complexity of real world scenes. The problem has often been posed in terms of visual search tasks. It has been shown that both the use of prior task and context information - top-down influences - and favoring information that stands out clearly in the visual field - bottom-up influences - can make such search more efficient. In a generic scene analysis situation one presumably has a combination of these influences and a computational model for visual attention should therefore contain a mechanism for their integration. Such models are abundant for human vision, but relatively few attempts have been made to define any that apply to computer vision.

In this article we describe a model that performs such a combination in a principled way. The system learns an optimal representation of the influences of task and context and thereby constructs a biased saliency map representing the top-down information. This map is combined with bottom-up saliency maps in a process evolving over time as a function over the input. The system is applied to search tasks in single images as well as in real scenes, in the latter case using an active vision system capable of shifting its gaze. The proposed model is shown to have desired qualities and to go beyond earlier proposed systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Itti, L.: Models of Bottom-Up and Top-Down Visual Attention, Ph.D. thesis, California Institute of Technology (2000)
Google Scholar
Li, Z.: A saliency map in primary visual cortex. Trends in Cognitive Sciences 6(1), 9–16 (2002)
Article Google Scholar
Olshausen, B., Anderson, C., van Essen, D.: A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information. J. Neuroscience 13, 4700–4719 (1993)
Google Scholar
Treisman, A.M., Gelade, G.: A feature integration theory of attention. Cognitive Psychology 12, 97–136 (1980)
Article Google Scholar
Koch, C., Ullman, S.: Shifts in selective visual attention: Towards the underlying neural circuitry. Human Neurobiology 4, 219–227 (1985)
Google Scholar
Koike, T., Saiki, J.: Stochastic Guided Search Model for Search Asymmetries in Visual Search Tasks. Biologically Motivated Computer Vision, 408–417 (2002)
Google Scholar
Ramström, O., Christensen, H.I.: Object detection using background context. In: Proc. International Conference of Pattern Recognition, pp. 45–48 (2004)
Google Scholar
Choi, S.B., Ban, S.W., Lee, M.: Biologically motivated visual attention system using bottom-up saliency map and top-down inhibition. Neural Information Processing-Letters and Review 2 (2004)
Google Scholar
Itti, L., Koch, C., Niebur, E.: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 1254–1259 (1998)
Article Google Scholar
Navalpakkam, V., Itti, L.: Sharing Resources: Buy Attention, Get Recognition. In: Proc. International Workshop Attention and Performance in Computer Vision, Graz, Austria (July 2003)
Google Scholar
Lee, K., Buxton, H., Feng, J.: Selective attention for cueguided search using a spiking neural network. In: Proc. of the Int’l Workshop on Attention and Performance in Computer Vision, Graz, Austria, pp. 55–62 (2003)
Google Scholar
Frintrop, S.: VOCUS: A Visual Attention System for Object Detection and Goal-Directed Search. LNCS (LNAI), vol. 3899. Springer, Heidelberg (2006)
Book Google Scholar
Oliva, A., Torralba, A., Castelhano, M.S., Henderson, J.M.: Top-down control of visual attention in object detection. In: Proc. ICIP 2003, pp. 253–256 (2003)
Google Scholar
Theeuwes, J.: Stimulus-driven capture and attentional set: Selective search for colour and visual abrupt onsets. Journal of Experimental Psychology: Human Perception & Performance 1, 799–806 (1994)
Google Scholar
Itti, L., Koch, C.: Computational Modeling of Visual Attention. Nature Reviews Neuroscience 2, 194–203 (2001)
Article Google Scholar
Draper, B., Lionelle, A.: Evaluation of selective attention under similarity transforms. In: Proc. International Workshop on Attention and Performance in Computer Vision, pp. 31–38 (2003)
Google Scholar
Rasolzadeh, B.: Interaction of Bottom-up and Top-down influences for Attention in an Active Vision System, MSc-thesis, TRITA-CSC-E 2006:117, ISSN-1653-5715, KTH, Stockholm (2006)
Google Scholar
Rasolzadeh, B., Björkman, M., Eklundh, J.O.: An attentional system combining top-down and bottom-up influences. In: ICVW 2006. International Cognitive Vision Workshop, at ECCV (2006)
Google Scholar
Haykin, S.: Neural Networks: A Comprehensive Foundation. Prentice Hall, Upper Saddle River, NJ (1994)
MATH Google Scholar
Culhane, S.M., Tsotsos, J.K.: A Prototype for Data-Driven Visual Attention. In: Proc. International Conference on Pattern Recognition, vol. A, pp. 36–39 (1992)
Google Scholar
Hu, Y., Xie, X., Ma, W-Y., Chia, L-T., Rajan, D.: Salient region detection using weighted feature maps based on the Human Visual Attention Model. In: IEEE Pacific-Rum Conference on Multimedia (submitted)
Google Scholar
Wong, A.K.C., Sahoo, P.K: A gray-level threshold selection method based on maximum entropy principle. IEEE Trans. Systems Man and Cybernetics 19, 866–871 (1989)
Article Google Scholar
Malik, J., Belongie, S., Leung, T., Shi, J.: Contour and Texture Analysis for Image Segmentation. International Journal of Computer Vision 43, 7–27 (2001)
Article MATH Google Scholar
Varma, M., Zisserman, A.: Classifying Images of Materials: Achieving Viewpoint and Illumination Independence. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2350, pp. 255–271. Springer, Heidelberg (2002)
Chapter Google Scholar
Ojala, T., Pietikainen, M.: Unsupervised texture segmentation using feature distributions. Journal of Pattern Recognition 32, 477–486 (1999)
Article Google Scholar
Varma, M., Zisserman, A.: Texture classification: are filter banks necessary? In: Proc. CVPR, pp. 691–698 (2003)
Google Scholar
Tavakoli Targhi, A., Shademan, A.: Clustering of singular value decomposition of image data with applications to texture classification. In: Proc. VCIP, pp. 972–979 (2003)
Google Scholar
Tavakoli Targhi, A., Hayman, E., Eklundh, J.O., Shahshahani, M.: The Eigen-Transform and Applications. In: Proc. ACCV, pp. 70–79 (2006)
Google Scholar
Tavakoli Targhi, A., Shahshahani, M.: A simple set of numerical invariants for the analysis of images. International Journal of Imaging Systems and Technology 16, 240–248 (2007)
Google Scholar
Tavakoli Targhi, A., Rasolzadeh, B., Eklundh, J.O.: Texture for Multiple Cue Visual Analysis with Applications to Attention (Submitted, 2007)
Google Scholar
Tavakoli Targhi, A., Björkman, M., Hayman, E., Eklundh, J.O: Real-Time Texture Detection Using the LU-Transform (submitted, 2007)
Google Scholar
Björkman, M., Eklundh, J-O.: Foveated Figure-Ground Segmentation and Its Role in Recognition. In: Proc. British Machine Vision Conf., pp. 819–828 (September 2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Vision and Active Perception Laboratory, CSC, KTH, SE-100 44 Stockholm, Sweden
Babak Rasolzadeh, Alireza Tavakoli Targhi & Jan-Olof Eklundh

Authors

Babak Rasolzadeh
View author publications
You can also search for this author in PubMed Google Scholar
Alireza Tavakoli Targhi
View author publications
You can also search for this author in PubMed Google Scholar
Jan-Olof Eklundh
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Joanneum Research, Forschungsgesellschaft mbH, Computational Perception Group,, Institute of Digital Image Processing, Wastiangasse 6, 8010, Graz, Austria
Lucas Paletta
Autonomous Intelligent Systems (AIS), Autonomous Robots Department, Fraunhofer Institute, Schloss Birlinghoven, 53754, Sankt Augustin, Germany
Erich Rome

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rasolzadeh, B., Tavakoli Targhi, A., Eklundh, JO. (2007). An Attentional System Combining Top-Down and Bottom-Up Influences. In: Paletta, L., Rome, E. (eds) Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint. WAPCV 2007. Lecture Notes in Computer Science(), vol 4840. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77343-6_8

Download citation

DOI: https://doi.org/10.1007/978-3-540-77343-6_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77342-9
Online ISBN: 978-3-540-77343-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics