Components for Object Detection and Identification

Heisele, Bernd; Riskov, Ivaylo; Morgenstern, Christian

doi:10.1007/11957959_12

Bernd Heisele^20,21,
Ivaylo Riskov²⁰ &
Christian Morgenstern²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4170))

2817 Accesses
6 Citations

Abstract

We present a component-based system for object detection and identification. From a set of training images of a given object we extract a large number of components which are clustered based on the similarity of their image features and their locations within the object image. The cluster centers build an initial set of component templates from which we select a subset for the final recognizer. The localization of the components is performed by normalized cross-correlation. Two types of components are used, gray value components and components consisting of the magnitudes of the gray value gradient.

In experiments we investigate how the component size, the number of the components, and the feature type affects the recognition performance. The system is compared to several state-of-the-art classifiers on three different data sets for object identification and detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bileschi, S., Wolf, L.: A unified system for object detection, texture recognition, and context analysis based on the standard model feature set. In: British Machine Vision Conference (BMVC) (2005)
Google Scholar
Bileschi, S.M., Heisele, B.: Advances in Component-Based Face Detection. In: Lee, S.-W., Verri, A. (eds.) SVM 2002. LNCS, vol. 2388, pp. 135–143. Springer, Heidelberg (2002)
Chapter Google Scholar
Crandall, D., Felzenszwalb, P., Huttenlocher, D.: Spatial priors for part-based recognition using statistical model. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10–17 (2005)
Google Scholar
Dorko, G., Schmid, C.: Selection of scale invariant neighborhoods for object class recognition. In: International Conference on Computer Vision (ICCV), pp. 634–640 (2003)
Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 264–271 (2003)
Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Tecnical report, Dept. of Statistics, Stanford University (1998)
Google Scholar
Heisele, B., Serre, T., Mukherjee, S., Poggio, T.: Hierarchical classification and feature reduction for fast face detection with support vector machines. Pattern Recognition 36(9), 2007–2017 (2003)
Article MATH Google Scholar
Heisele, B., Serre, T., Pontil, M., Vetter, T., Poggio, T.: Categorization by learning and combining object parts. In: Neural Information Processing Systems (NIPS), Vancouver (2001)
Google Scholar
Leibe, B., Leonardis, A., Schiele, B.: Combined object categorization and segmentation with an implicit model. In: ECCV 2004 Workshop on Statistical Learning in Computer Vision (2004)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE Transactions on Pattern Analysis and Machine Intelligence 23, 349–361 (2001)
Article Google Scholar
Morgenstern, C., Heisele, B.: Component-based recognition of objects in an office environment. A.I. Memo 232, Center for Biological and Computational Learning. MIT, Cambridge (2003)
Google Scholar
Oren, M., Papageorgiou, C., Sinha, P., Osuna, E., Poggio, T.: Pedestrian detection using wavelet templates. In: IEEE Conference on Computer Vision and Pattern Recognition, San Juan, pp. 193–199 (1997)
Google Scholar
Osuna, E.: Support Vector Machines: Training and Applications. Ph.D thesis. MIT, Department of Electrical Engineering and Computer Science, Cambridge, MA (1998)
Google Scholar
Poggio, T., Edelman, S.: A network that learns to recognize 3-D objects. Nature 343, 163–266 (1990)
Article Google Scholar
Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in cortex. Nature Neuroscience 2(11), 1019–1025 (1999)
Article Google Scholar
Riesenhuber, M., Poggio, T.: The individual is nothing, the class everything: Psychophysics and modeling of recognition in object classes. A.I. Memo 1682, Center for Biological and Computational Learning. MIT, Cambridge (2000)
Google Scholar
Rowley, H.A., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(1), 23–38 (1998)
Article Google Scholar
Schapire, R., Freund, Y., Bartlett, P., Lee, W.S.: Boosting the margin: A new explanation of effectiveness of voting methods. The Annals of Statistics 26(5), 1651–1686 (1998)
Article MATH MathSciNet Google Scholar
Serre, T., Riesenhuber, M., Louie, J., Poggio, T.A.: On the Role of Object-Specific Features for Real World Object Recognition in Biological Vision. In: Bülthoff, H.H., Lee, S.-W., Poggio, T.A., Wallraven, C. (eds.) BMCV 2002. LNCS, vol. 2525, pp. 387–397. Springer, Heidelberg (2002)
Chapter Google Scholar
Serre, T., Wolf, L., Poggio, T.: A new biologically motivated framework for robust object recognition. A.I. Memo 2004-26, Center for Biological and Computational Learning. MIT, Cambridge (2004)
Google Scholar
Sim, T., Baker, S., Bsat, M.: The CMU pose, illumination, and expression database. IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI) 25(12), 1615–1618 (2003)
Article Google Scholar
Sung, K.-K.: Learning and Example Selection for Object and Pattern Recognition. Ph.D thesis. MIT, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Cambridge, MA (1996)
Google Scholar
Ullman, S., Vidal-Naquet, M., Sali, E.: Visual features of intermdediate complexity and their use in classification. Nature Neuroscience 5(7), 682–687 (2002)
Google Scholar
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 511–518 (2001)
Google Scholar
Weber, M., Welling, W., Perona, P.: Towards automatic dscovery of object categories. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (June 2000)
Google Scholar
Weisberg, S.: Applied Linear Regression. Wiley, New York (1980)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Center for Biological and Computational Learning, M.I.T., Cambridge, MA, 02142, USA
Bernd Heisele, Ivaylo Riskov & Christian Morgenstern
Honda Research Institute US, Boston, MA, 02111, USA
Bernd Heisele

Authors

Bernd Heisele
View author publications
You can also search for this author in PubMed Google Scholar
Ivaylo Riskov
View author publications
You can also search for this author in PubMed Google Scholar
Christian Morgenstern
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Département d’Informatique, Ecole Normale Supérieure, P.O. Box, Paris, France
Jean Ponce
Carnegie Mellon University, Pittsburgh, USA
Martial Hebert
GRAVIR-INRIA, 655 avenue de l’Europe, P.O. Box, 38330, Montbonnot, France
Cordelia Schmid
Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK
Andrew Zisserman

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Heisele, B., Riskov, I., Morgenstern, C. (2006). Components for Object Detection and Identification. In: Ponce, J., Hebert, M., Schmid, C., Zisserman, A. (eds) Toward Category-Level Object Recognition. Lecture Notes in Computer Science, vol 4170. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11957959_12

Download citation

DOI: https://doi.org/10.1007/11957959_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68794-8
Online ISBN: 978-3-540-68795-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics