Skip to main content

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

Abstract

Humans are very good at rapidly detecting salient objects such as animals in complex natural scenes, and recent psychophysical results suggest that the fastest mechanisms underlying animal detection use contour shape as a principal discriminative cue. How does our visual system extract these contours so rapidly and reliably? While the prevailing computational model represents contours as Markov chains that use only first-order local cues to grouping, computer vision algorithms based on this model fall well below human levels of performance. Here we explore the possibility that the human visual system exploits higher-order shape regularities in order to segment object contours from cluttered scenes. In particular, we consider a recurrent architecture in which higher areas of the object pathway generate shape hypotheses that condition grouping processes in early visual areas. Such a generative model could help to guide local bottom-up grouping mechanisms toward globally consistent solutions. In constructing an appropriate theoretical framework for recurrent shape processing, a central issue is to ensure that shape topology remains invariant under all actions of the feedforward and feedback processes. This can be achieved by a promising new theory of shape representation based upon a family of local image deformations called formlets, shown to outperform alternative contour-based generative shape models on the important problem of visual shape completion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arbelaez P, Maire M, Fowlkes C, Malik J (2011) Contour detection and hierarchical image segmentation. IEEE Trans Pattern Anal Mach Intell 33(5):898–916. doi:10.1109/TPAMI.2010.161

    Article  Google Scholar 

  2. Cavanagh P (1991) What’s up in top-down processing. In: Gorea A (ed) Representations of vision. Cambridge University Press, Cambridge, pp 295–304

    Google Scholar 

  3. Cohen LD, Deschamps T (2001) Multiple contour finding and perceptual grouping as a set of energy minimizing paths. In: Energy minimization methods workshop, CVPR

    Google Scholar 

  4. Crevier D (1999) A probabilistic method for extracting chains of collinear segments. Comput Vis Image Underst 76(1):36–53

    Article  Google Scholar 

  5. Deschamps T, Cohen LD (2000) Minimal paths in 3d images and application to virtual endoscopy. In: Proceedings of the 6th European conference on computer vision

    Google Scholar 

  6. Dubinskiy A, Zhu SC (2003) A multiscale generative model for animate shapes and parts. In: Proc. 9th IEEE ICCV, vol 1, pp 249–256

    Google Scholar 

  7. Elder JH, Goldberg RM (2002) Ecological statistics of Gestalt laws for the perceptual organization of contours. J Vis 2(4):324–353

    Article  Google Scholar 

  8. Elder JH, Krupnik A, Johnston LA (2003) Contour grouping with prior models. IEEE Trans Pattern Anal Mach Intell 25(25):661–674

    Article  Google Scholar 

  9. Elder JH, Oleskiw TD, Yakubovich A, Peyré G (2013) On growth and formlets: sparse multi-scale coding of planar shape. Image Vis Comput 31:1–13

    Article  Google Scholar 

  10. Elder JH, Velisavljević L (2009) Cue dynamics underlying rapid detection of animals in natural scenes. J Vis 9(7):1–20

    Article  Google Scholar 

  11. Elder JH, Zucker SW (1996) Computing contour closure. In: Proceedings of the 4th European conference on computer vision. Springer, New York, pp 399–412

    Google Scholar 

  12. Estrada FJ, Elder JH (2006) Multi-scale contour extraction based on natural image statistics. In: IEEE conference on computer vision and pattern recognition workshop

    Google Scholar 

  13. Field DJ, Hayes A, Hess RF (1993) Contour integration by the human visual system: evidence for a local “association field”. Vis Res 33(2):173–193

    Article  Google Scholar 

  14. Foxe JJ, Simpson GV (2002) Flow of activation from V1 to frontal cortex in humans. Exp Brain Res 142:139–150

    Article  Google Scholar 

  15. Geisler WS, Perry JS, Super BJ, Gallogly DP (2001) Edge co-occurence in natural images predicts contour grouping performance. Vis Res 41(6):711–724

    Article  Google Scholar 

  16. Gilbert CD, Wiesel TN (1989) Columnar specificity of intrinsic horizontal and corticocortical connections in cat visual cortex. J Neurosci 9(7):2432–2443

    Google Scholar 

  17. Grenander U, Srivastava A, Saini S (2007) A pattern-theoretic characterization of biological growth. IEEE Trans Med Imaging 26(5):648–659

    Article  Google Scholar 

  18. Cox IJ, Rehg JM, Hingorani S (1993) A Bayesian multiple hypothesis approach to contour segmentation. Int J Comput Vis 11(1):5–24

    Article  Google Scholar 

  19. Jain AK, Zhong Y, Lakshmanan S (1996) Object matching using deformable templates. IEEE Trans Pattern Anal Mach Intell 18(3):267–278

    Article  Google Scholar 

  20. Kirchner H, Thorpe SJ (2006) Ultra-rapid object detection with saccadic eye movements: visual processing speed revisited. Vis Res 46:1762–1766

    Article  Google Scholar 

  21. Kruger N (1998) Collinearity and parallelism are statistically significant second order relations of complex cell responses. Neural Process Lett 8:117–129

    Article  Google Scholar 

  22. Lee TS, Mumford D (2003) Hierarchical Bayesian inference in the visual cortex. J Opt Soc Am A 20(7):1434–1448

    Article  Google Scholar 

  23. Leyton M (1988) A process-grammar for shape. Artif Intell 34(2):213–247

    Article  Google Scholar 

  24. Mahamud S, Thornber KK, Williams LR (1999) Segmentation of salient closed contours from real images. In: IEEE international conference on computer vision. IEEE Computer Society, Los Alamitos, pp 891–897

    Chapter  Google Scholar 

  25. Mallat SG, Zhang Z (1993) Matching pursuits with time frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415

    Article  MATH  Google Scholar 

  26. Martin D, Fowlkes C, Malik J (2004) Learning to detect natural image boundaries using local brightness, color and texture cues. IEEE Trans Pattern Anal Mach Intell 26(5):530–549

    Article  Google Scholar 

  27. Oleskiw TD, Elder JH, Peyré G (2010) On growth and formlets. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR)

    Google Scholar 

  28. Parent P, Zucker SW (1989) Trace inference, curvature consistency, and curve detection. IEEE Trans Pattern Anal Mach Intell 11:823–839

    Article  Google Scholar 

  29. Portman N, Grenander U, Vrscay ER (2009) Direct estimation of biological growth properties from image data using the “GRID” model. In: Kamel M, Campilho A (eds) Image analysis and recognition, proceedings. Lecture notes in computer science, vol 5627, pp 832–843. 6th international conference on image analysis and recognition, Halifax, Canada, Jul 06–08, 2009

    Chapter  Google Scholar 

  30. Portman N, Grenander U, Vrscay ER (2011) GRID macroscopic growth law and it application to image inference. Q Appl Math 69(2):227–260

    MathSciNet  MATH  Google Scholar 

  31. Portman N, Vrscay ER (2011) Existence and uniqueness of solutions to the grid macroscopic growth equation. Appl Math Comput 217(21):8318–8327. doi:10.1016/j.amc.2011.03.021. http://www.sciencedirect.com/science/article/pii/S0096300311003754

    Article  MathSciNet  MATH  Google Scholar 

  32. Thorpe S, Fize D, Marlot C (1996) Speed of processing in the human visual system. Nature 381:520–522

    Article  Google Scholar 

  33. Sha’ashua A, Ullman S (1988) Structural saliency: the detection of globally salient structures using a locally connected network. In: Proceedings of the 2nd international conference on computer vision, pp 321–327

    Google Scholar 

  34. Sharon E, Mumford D (2004) 2d-shape analysis using conformal mapping. In: Computer vision and pattern recognition, IEEE Comp. Soc. Conf., vol 2, pp 350–357

    Google Scholar 

  35. Sigman M, Cecchi GA, Gilbert CD, Magnasco MO (2001) On a common circle: natural scenes and Gestalt rules. Proc Natl Acad Sci 98(4):1935–1940

    Article  Google Scholar 

  36. Thompson DW (1961) On growth and form. Cambridge University Press, Cambridge

    Google Scholar 

  37. Thorpe S (2002) Ultra-rapid scene categorization with a wave of spikes. In: Bülthoff HH et al. (eds) Proceedings of the biologicaly-motivated computer vision conference. LNCS, vol 2525, pp 1–15

    Chapter  Google Scholar 

  38. Tu Z, Chen X, Yuille AL, Zhu SC (2005) Image parsing: unifying segmentation, detection, and recognition. Int J Comput Vis 63(2):113–140

    Article  Google Scholar 

  39. Ungerleider L (1995) Functional brain imaging studies of cortical mechanisms for memory. Science 270(5237):769–775

    Article  Google Scholar 

  40. Van Essen DC, Olshausen B, Anderson CH, Gallant JL (1991) Pattern recognition, attention, and information processing bottlenecks in the primate visual search. SPIE 1473:17–28

    Article  Google Scholar 

  41. Williams LR, Jacobs DW (1997) Stochastic completion fields: a neural model of illusory contour shape and salience. Neural Comput 9(4):837–858

    Article  Google Scholar 

  42. Yuille A, Kersten D (2006) Vision as Bayesian inference: analysis by synthesis? Trends Cogn Sci 10(7):301–308

    Article  Google Scholar 

  43. Zucker SW, Hummel R, Rosenfeld A (1977) An application of relaxation labeling to line and curve enhancement. IEEE Trans Comput 26:394–403

    Article  Google Scholar 

Download references

Acknowledgements

The author would like to thank Francisco Estrada, Tim Oleskiw, Gabriel Peyré, Ljiljana Velisavljević and Alexander Yakubovich for their contributions to the work reviewed in this chapter. This work was supported by NSERC, OCE and GEOIDE.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to James H. Elder .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag London

About this chapter

Cite this chapter

Elder, J.H. (2013). Perceptual Organization of Shape. In: Dickinson, S., Pizlo, Z. (eds) Shape Perception in Human and Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-5195-1_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-5195-1_5

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-5194-4

  • Online ISBN: 978-1-4471-5195-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics