Perceptual Organization of Shape

Elder, James H.

doi:10.1007/978-1-4471-5195-1_5

Part of the book series: Advances in Computer Vision and Pattern Recognition (ACVPR)

Abstract

Humans are very good at rapidly detecting salient objects such as animals in complex natural scenes, and recent psychophysical results suggest that the fastest mechanisms underlying animal detection use contour shape as a principal discriminative cue. How does our visual system extract these contours so rapidly and reliably? While the prevailing computational model represents contours as Markov chains that use only first-order local cues to grouping, computer vision algorithms based on this model fall well below human levels of performance. Here we explore the possibility that the human visual system exploits higher-order shape regularities in order to segment object contours from cluttered scenes. In particular, we consider a recurrent architecture in which higher areas of the object pathway generate shape hypotheses that condition grouping processes in early visual areas. Such a generative model could help to guide local bottom-up grouping mechanisms toward globally consistent solutions. In constructing an appropriate theoretical framework for recurrent shape processing, a central issue is to ensure that shape topology remains invariant under all actions of the feedforward and feedback processes. This can be achieved by a promising new theory of shape representation based upon a family of local image deformations called formlets, shown to outperform alternative contour-based generative shape models on the important problem of visual shape completion.
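
To make the first-order Markov chain model mentioned above concrete, here is a minimal sketch in which a candidate contour is scored as a sum of pairwise terms between successive oriented elements. Everything in it is an illustrative assumption rather than the chapter's method: the function names, the two cues (gap length and change in orientation), the linear penalties, and the `gap_scale` and `angle_scale` parameters are all hypothetical.

```python
import math


def link_log_score(a, b, gap_scale=2.0, angle_scale=0.5):
    """Log-score for linking element a to element b.

    Each element is (x, y, theta). The penalties and scale values are
    hypothetical placeholders, not fitted to any data.
    """
    (xa, ya, ta), (xb, yb, tb) = a, b
    gap = math.hypot(xb - xa, yb - ya)  # proximity cue
    # Orientation change wrapped to [-pi, pi): good-continuation cue.
    dtheta = (tb - ta + math.pi) % (2 * math.pi) - math.pi
    return -gap / gap_scale - abs(dtheta) / angle_scale


def chain_log_score(elements, **params):
    """First-order chain score: a sum over adjacent pairs only."""
    return sum(
        link_log_score(a, b, **params)
        for a, b in zip(elements, elements[1:])
    )


# A smooth chain versus one with an abrupt change in orientation.
straight = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
jagged = [(0.0, 0.0, 0.0), (1.0, 0.0, math.pi / 2), (2.0, 0.0, 0.0)]

straight_score = chain_log_score(straight)
jagged_score = chain_log_score(jagged)
```

Because the score decomposes into terms over adjacent pairs, it cannot express properties of the whole contour, such as closure or the overall shape of an object. Those are the higher-order regularities the abstract proposes the visual system exploits.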

Acknowledgements

The author would like to thank Francisco Estrada, Tim Oleskiw, Gabriel Peyré, Ljiljana Velisavljević and Alexander Yakubovich for their contributions to the work reviewed in this chapter. This work was supported by NSERC, OCE and GEOIDE.

Author information

Authors and Affiliations

Centre for Vision Research, York University, Toronto, Canada
James H. Elder

Corresponding author

Correspondence to James H. Elder.

Editor information

Editors and Affiliations

Department of Computer Science, University of Toronto, King's College Road 6, Toronto, M5S 3G4, Ontario, Canada
Sven J. Dickinson
Department of Psychological Sciences, Purdue University, Third Street 703, West Lafayette, 47907, Indiana, USA
Zygmunt Pizlo

About this chapter

Cite this chapter

Elder, J.H. (2013). Perceptual Organization of Shape. In: Dickinson, S., Pizlo, Z. (eds) Shape Perception in Human and Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-5195-1_5

DOI: https://doi.org/10.1007/978-1-4471-5195-1_5
Publisher Name: Springer, London
Print ISBN: 978-1-4471-5194-4
Online ISBN: 978-1-4471-5195-1
eBook Packages: Computer Science (R0)
