Abstract
Humans are very good at rapidly detecting salient objects such as animals in complex natural scenes, and recent psychophysical results suggest that the fastest mechanisms underlying animal detection use contour shape as a principal discriminative cue. How does our visual system extract these contours so rapidly and reliably? While the prevailing computational model represents contours as Markov chains that use only first-order local cues to grouping, computer vision algorithms based on this model fall well below human levels of performance. Here we explore the possibility that the human visual system exploits higher-order shape regularities in order to segment object contours from cluttered scenes. In particular, we consider a recurrent architecture in which higher areas of the object pathway generate shape hypotheses that condition grouping processes in early visual areas. Such a generative model could help to guide local bottom-up grouping mechanisms toward globally consistent solutions. In constructing an appropriate theoretical framework for recurrent shape processing, a central issue is to ensure that shape topology remains invariant under all actions of the feedforward and feedback processes. This can be achieved by a promising new theory of shape representation based upon a family of local image deformations called formlets, shown to outperform alternative contour-based generative shape models on the important problem of visual shape completion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arbelaez P, Maire M, Fowlkes C, Malik J (2011) Contour detection and hierarchical image segmentation. IEEE Trans Pattern Anal Mach Intell 33(5):898–916. doi:10.1109/TPAMI.2010.161
Cavanagh P (1991) What’s up in top-down processing. In: Gorea A (ed) Representations of vision. Cambridge University Press, Cambridge, pp 295–304
Cohen LD, Deschamps T (2001) Multiple contour finding and perceptual grouping as a set of energy minimizing paths. In: Energy minimization methods workshop, CVPR
Crevier D (1999) A probabilistic method for extracting chains of collinear segments. Comput Vis Image Underst 76(1):36–53
Deschamps T, Cohen LD (2000) Minimal paths in 3d images and application to virtual endoscopy. In: Proceedings of the 6th European conference on computer vision
Dubinskiy A, Zhu SC (2003) A multiscale generative model for animate shapes and parts. In: Proc. 9th IEEE ICCV, vol 1, pp 249–256
Elder JH, Goldberg RM (2002) Ecological statistics of Gestalt laws for the perceptual organization of contours. J Vis 2(4):324–353
Elder JH, Krupnik A, Johnston LA (2003) Contour grouping with prior models. IEEE Trans Pattern Anal Mach Intell 25(25):661–674
Elder JH, Oleskiw TD, Yakubovich A, Peyré G (2013) On growth and formlets: sparse multi-scale coding of planar shape. Image Vis Comput 31:1–13
Elder JH, Velisavljević L (2009) Cue dynamics underlying rapid detection of animals in natural scenes. J Vis 9(7):1–20
Elder JH, Zucker SW (1996) Computing contour closure. In: Proceedings of the 4th European conference on computer vision. Springer, New York, pp 399–412
Estrada FJ, Elder JH (2006) Multi-scale contour extraction based on natural image statistics. In: IEEE conference on computer vision and pattern recognition workshop
Field DJ, Hayes A, Hess RF (1993) Contour integration by the human visual system: evidence for a local “association field”. Vis Res 33(2):173–193
Foxe JJ, Simpson GV (2002) Flow of activation from V1 to frontal cortex in humans. Exp Brain Res 142:139–150
Geisler WS, Perry JS, Super BJ, Gallogly DP (2001) Edge co-occurence in natural images predicts contour grouping performance. Vis Res 41(6):711–724
Gilbert CD, Wiesel TN (1989) Columnar specificity of intrinsic horizontal and corticocortical connections in cat visual cortex. J Neurosci 9(7):2432–2443
Grenander U, Srivastava A, Saini S (2007) A pattern-theoretic characterization of biological growth. IEEE Trans Med Imaging 26(5):648–659
Cox IJ, Rehg JM, Hingorani S (1993) A Bayesian multiple hypothesis approach to contour segmentation. Int J Comput Vis 11(1):5–24
Jain AK, Zhong Y, Lakshmanan S (1996) Object matching using deformable templates. IEEE Trans Pattern Anal Mach Intell 18(3):267–278
Kirchner H, Thorpe SJ (2006) Ultra-rapid object detection with saccadic eye movements: visual processing speed revisited. Vis Res 46:1762–1766
Kruger N (1998) Collinearity and parallelism are statistically significant second order relations of complex cell responses. Neural Process Lett 8:117–129
Lee TS, Mumford D (2003) Hierarchical Bayesian inference in the visual cortex. J Opt Soc Am A 20(7):1434–1448
Leyton M (1988) A process-grammar for shape. Artif Intell 34(2):213–247
Mahamud S, Thornber KK, Williams LR (1999) Segmentation of salient closed contours from real images. In: IEEE international conference on computer vision. IEEE Computer Society, Los Alamitos, pp 891–897
Mallat SG, Zhang Z (1993) Matching pursuits with time frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415
Martin D, Fowlkes C, Malik J (2004) Learning to detect natural image boundaries using local brightness, color and texture cues. IEEE Trans Pattern Anal Mach Intell 26(5):530–549
Oleskiw TD, Elder JH, Peyré G (2010) On growth and formlets. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR)
Parent P, Zucker SW (1989) Trace inference, curvature consistency, and curve detection. IEEE Trans Pattern Anal Mach Intell 11:823–839
Portman N, Grenander U, Vrscay ER (2009) Direct estimation of biological growth properties from image data using the “GRID” model. In: Kamel M, Campilho A (eds) Image analysis and recognition, proceedings. Lecture notes in computer science, vol 5627, pp 832–843. 6th international conference on image analysis and recognition, Halifax, Canada, Jul 06–08, 2009
Portman N, Grenander U, Vrscay ER (2011) GRID macroscopic growth law and it application to image inference. Q Appl Math 69(2):227–260
Portman N, Vrscay ER (2011) Existence and uniqueness of solutions to the grid macroscopic growth equation. Appl Math Comput 217(21):8318–8327. doi:10.1016/j.amc.2011.03.021. http://www.sciencedirect.com/science/article/pii/S0096300311003754
Thorpe S, Fize D, Marlot C (1996) Speed of processing in the human visual system. Nature 381:520–522
Sha’ashua A, Ullman S (1988) Structural saliency: the detection of globally salient structures using a locally connected network. In: Proceedings of the 2nd international conference on computer vision, pp 321–327
Sharon E, Mumford D (2004) 2d-shape analysis using conformal mapping. In: Computer vision and pattern recognition, IEEE Comp. Soc. Conf., vol 2, pp 350–357
Sigman M, Cecchi GA, Gilbert CD, Magnasco MO (2001) On a common circle: natural scenes and Gestalt rules. Proc Natl Acad Sci 98(4):1935–1940
Thompson DW (1961) On growth and form. Cambridge University Press, Cambridge
Thorpe S (2002) Ultra-rapid scene categorization with a wave of spikes. In: Bülthoff HH et al. (eds) Proceedings of the biologicaly-motivated computer vision conference. LNCS, vol 2525, pp 1–15
Tu Z, Chen X, Yuille AL, Zhu SC (2005) Image parsing: unifying segmentation, detection, and recognition. Int J Comput Vis 63(2):113–140
Ungerleider L (1995) Functional brain imaging studies of cortical mechanisms for memory. Science 270(5237):769–775
Van Essen DC, Olshausen B, Anderson CH, Gallant JL (1991) Pattern recognition, attention, and information processing bottlenecks in the primate visual search. SPIE 1473:17–28
Williams LR, Jacobs DW (1997) Stochastic completion fields: a neural model of illusory contour shape and salience. Neural Comput 9(4):837–858
Yuille A, Kersten D (2006) Vision as Bayesian inference: analysis by synthesis? Trends Cogn Sci 10(7):301–308
Zucker SW, Hummel R, Rosenfeld A (1977) An application of relaxation labeling to line and curve enhancement. IEEE Trans Comput 26:394–403
Acknowledgements
The author would like to thank Francisco Estrada, Tim Oleskiw, Gabriel Peyré, Ljiljana Velisavljević and Alexander Yakubovich for their contributions to the work reviewed in this chapter. This work was supported by NSERC, OCE and GEOIDE.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag London
About this chapter
Cite this chapter
Elder, J.H. (2013). Perceptual Organization of Shape. In: Dickinson, S., Pizlo, Z. (eds) Shape Perception in Human and Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-5195-1_5
Download citation
DOI: https://doi.org/10.1007/978-1-4471-5195-1_5
Publisher Name: Springer, London
Print ISBN: 978-1-4471-5194-4
Online ISBN: 978-1-4471-5195-1
eBook Packages: Computer ScienceComputer Science (R0)