Skip to main content

Shape-Based Instance Detection Under Arbitrary Viewpoint

  • Chapter
Shape Perception in Human and Computer Vision

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

Abstract

Shape-based instance detection under arbitrary viewpoint is a very challenging problem. Current approaches for handling viewpoint variation can be divided into two main categories: invariant and non-invariant. Invariant approaches explicitly represent the structural relationships of high-level, view-invariant shape primitives. Non-invariant approaches, on the other hand, create a template for each viewpoint of the object, and can operate directly on low-level features. We summarize the main advantages and disadvantages of invariant and non-invariant approaches, and conclude that non-invariant approaches are well-suited for capturing fine-grained details needed for specific object recognition while also being computationally efficient. Finally, we discuss approaches that are needed to address ambiguities introduced by recognizing shape under arbitrary viewpoint.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arbeláez P, Maire M, Fowlkes C, Malik J (2009) From contours to regions: an empirical evaluation. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  2. Bao SYZ, Sun M, Savarese S (2010) Toward coherent object detection and scene layout understanding. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  3. Biederman I (1987) Recognition-by-components: a theory of human image understanding. Psychol Rev 94(2):115

    Article  Google Scholar 

  4. Biederman I (2000) Recognizing depth-rotated objects: a review of recent research and theory. Spat Vis 13(2–3):241–253

    Article  Google Scholar 

  5. Bilodeau GA, Bergevin R (2000) Generic modeling of 3d objects from single 2d images. In: Proceedings of international conference on pattern recognition

    Google Scholar 

  6. Cass TA (1998) Robust affine structure matching for 3d object recognition. IEEE Trans Pattern Anal Mach Intell 20(11):1265–1274

    Article  Google Scholar 

  7. Dalal N (2006) Finding people in images and videos. PhD thesis, Institut National Polytechnique de Grenoble / INRIA Grenoble

    Google Scholar 

  8. Dollar P, Tu Z, Belongie S (2006) Supervised learning of edges and object boundaries. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  9. Dollar P, Wojek C, Schiele B, Perona P (2009) Pedestrian detection: a benchmark. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  10. Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  11. Ferrari V, Fevrier L, Jurie F, Schmid C (2008) Groups of adjacent contour segments for object detection. In: IEEE transactions on pattern analysis and machine intelligence

    Google Scholar 

  12. Ferrari V, Tuytelaars T, Van Gool L (2006) Object detection by contour segment networks. In: Proceedings of European conference on computer vision

    Google Scholar 

  13. Flynn PJ, Jain AK (1991) Cad-based computer vision: from cad models to relational graphs. IEEE Trans Pattern Anal Mach Intell 13(2):114–132

    Article  Google Scholar 

  14. Fransens R, Strecha C, Van Gool L (2006) A mean field em-algorithm for coherent occlusion handling in map-estimation prob. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  15. Girshick RB, Felzenszwalb PF, McAllester D (2011) Object detection with grammar models. In: Proceedings of neural information processing systems

    Google Scholar 

  16. Grimson WEL, Lozano-Perez T (1987) Localizing overlapping parts by searching the interpretation tree. IEEE Trans Pattern Anal Mach Intell 9:469–482

    Article  Google Scholar 

  17. Grimson WEL, Lozano-Pérez T, Huttenlocher DP (1990) Object recognition by computer. MIT Press, Cambridge

    Google Scholar 

  18. Hinterstoisser S, Cagniart C, Ilic S, Sturm P, Navab N, Fua P, Lepetit V (2011) Gradient response maps for real-time detection of texture-less objects. IEEE Trans Pattern Anal Mach Intell

    Google Scholar 

  19. Hinterstoisser S, Lepetit V, Ilic S, Fua P, Navab N (2010) Dominant orientation templates for real-time detection of texture-less objects. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  20. Hoiem D, Efros AA, Hebert M (2008) Putting objects in perspective. Int J Comput Vis

    Google Scholar 

  21. Hsiao E, Collet A, Hebert M (2010) Making specific features less discriminative to improve point-based 3d object recognition. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  22. Hsiao E, Hebert M (2012) Occlusion reasoning for object detection under arbitrary viewpoint. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  23. Huttenlocher DP, Ullman S (1990) Recognizing solid objects by alignment with an image. Int J Comput Vis 5(2):195–212

    Article  Google Scholar 

  24. Ikeuchi K (1987) Generating an interpretation tree from a cad model for 3d-object recognition in bin-picking tasks. Int J Comput Vis 1(2):145–165

    Article  MathSciNet  Google Scholar 

  25. Koenderink JJ (1990) Solid shape. MIT Press, Cambridge

    Google Scholar 

  26. Leibe B, Leonardis A, Schiele B (2004) Combined object categorization and segmentation with an implicit shape model. In: Workshop on statistical learning in computer vision, proceedings of European conference on computer vision

    Google Scholar 

  27. Leordeanu M, Hebert M, Sukthankar R (2007) Beyond local appearance: category recognition from pairwise interactions of simple features. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  28. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis

    Google Scholar 

  29. Lowe DG (1984) Perceptual organization and visual recognition. PhD thesis, Stanford University

    Google Scholar 

  30. Malisiewicz T, Efros AA (2011) Ensemble of exemplar-svms for object detection and beyond. In: Proceedings of IEEE international conference on computer vision

    Google Scholar 

  31. Meger D, Wojek C, Schiele B, Little JJ (2011) Explicit occlusion reasoning for 3d object detection. In: Proceedings of British machine vision conference

    Google Scholar 

  32. Myronenko A, Song X (2010) Point set registration: coherent point drift. IEEE Trans Pattern Anal Mach Intell 32(12):2262–2275

    Article  Google Scholar 

  33. Payet N, Todorovic S (2011) From contours to 3d object detection and pose estimation. In: Proceedings of IEEE international conference on computer vision

    Google Scholar 

  34. Plantinga H, Dyer CR (1990) Visibility, occlusion, and the aspect graph. Int J Comput Vis

    Google Scholar 

  35. Poggio T, Edelman S (1990) A network that learns to recognize 3d objects. Nature 343(6255):263–266

    Article  Google Scholar 

  36. Prisacariu V, Reid I (2009) Fasthog—a real-time gpu implementation of hog. Department of Engineering Science, Oxford University, Technical Report

    Google Scholar 

  37. Rusinkiewicz S, Levoy M (2001) Efficient variants of the ICP algorithm. In: 3DIM

    Google Scholar 

  38. Scheirer WJ, Kumar N, Belhumeur PN, Boult TE (2012) Multi-attribute spaces: calibration for attribute fusion and similarity search. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  39. Srinivasan P, Zhu Q, Shi J (2010) Many-to-one contour matching for describing and discriminating object shape. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  40. Stevens MR, Beveridge JR (2000) Integrating graphics and vision for object recognition. Kluwer Academic, Amsterdam

    Google Scholar 

  41. Toshev A, Taskar B, Daniilidis K (2010) Object detection via boundary structure segmentation. In: Proceedings of IEEE conference on computer vision and pattern recognition

    Google Scholar 

  42. Wang N, Ai H (2011) Who blocks who: simultaneous clothing segmentation for grouping images. In: Proceedings of IEEE international conference on computer vision

    Google Scholar 

  43. Wang X, Han TX, Yan S (2009) An hog-lbp human detector with partial occlusion handling. In: Proceedings of IEEE international conference on computer vision

    Google Scholar 

  44. Witkin AP, Tenenbaum JM (1983) On the role of structure in vision. Human Mach Vis 1:481–543

    Google Scholar 

  45. Wu B, Nevatia R (2005) Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. In: Proceedings of IEEE international conference on computer vision

    Google Scholar 

  46. Zerroug M, Nevatia R (1999) Part-based 3d descriptions of complex objects from a single image. IEEE Trans Pattern Anal Mach Intell 21(9):835–848

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported in part by the National Science Foundation under ERC Grant No. EEEC-0540865.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Edward Hsiao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag London

About this chapter

Cite this chapter

Hsiao, E., Hebert, M. (2013). Shape-Based Instance Detection Under Arbitrary Viewpoint. In: Dickinson, S., Pizlo, Z. (eds) Shape Perception in Human and Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-5195-1_33

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-5195-1_33

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-5194-4

  • Online ISBN: 978-1-4471-5195-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics