Abstract
We present a component-based system for object detection and identification. From a set of training images of a given object we extract a large number of components which are clustered based on the similarity of their image features and their locations within the object image. The cluster centers build an initial set of component templates from which we select a subset for the final recognizer. The localization of the components is performed by normalized cross-correlation. Two types of components are used, gray value components and components consisting of the magnitudes of the gray value gradient.
In experiments we investigate how the component size, the number of the components, and the feature type affects the recognition performance. The system is compared to several state-of-the-art classifiers on three different data sets for object identification and detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bileschi, S., Wolf, L.: A unified system for object detection, texture recognition, and context analysis based on the standard model feature set. In: British Machine Vision Conference (BMVC) (2005)
Bileschi, S.M., Heisele, B.: Advances in Component-Based Face Detection. In: Lee, S.-W., Verri, A. (eds.) SVM 2002. LNCS, vol. 2388, pp. 135–143. Springer, Heidelberg (2002)
Crandall, D., Felzenszwalb, P., Huttenlocher, D.: Spatial priors for part-based recognition using statistical model. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10–17 (2005)
Dorko, G., Schmid, C.: Selection of scale invariant neighborhoods for object class recognition. In: International Conference on Computer Vision (ICCV), pp. 634–640 (2003)
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 264–271 (2003)
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Tecnical report, Dept. of Statistics, Stanford University (1998)
Heisele, B., Serre, T., Mukherjee, S., Poggio, T.: Hierarchical classification and feature reduction for fast face detection with support vector machines. Pattern Recognition 36(9), 2007–2017 (2003)
Heisele, B., Serre, T., Pontil, M., Vetter, T., Poggio, T.: Categorization by learning and combining object parts. In: Neural Information Processing Systems (NIPS), Vancouver (2001)
Leibe, B., Leonardis, A., Schiele, B.: Combined object categorization and segmentation with an implicit model. In: ECCV 2004 Workshop on Statistical Learning in Computer Vision (2004)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE Transactions on Pattern Analysis and Machine Intelligence 23, 349–361 (2001)
Morgenstern, C., Heisele, B.: Component-based recognition of objects in an office environment. A.I. Memo 232, Center for Biological and Computational Learning. MIT, Cambridge (2003)
Oren, M., Papageorgiou, C., Sinha, P., Osuna, E., Poggio, T.: Pedestrian detection using wavelet templates. In: IEEE Conference on Computer Vision and Pattern Recognition, San Juan, pp. 193–199 (1997)
Osuna, E.: Support Vector Machines: Training and Applications. Ph.D thesis. MIT, Department of Electrical Engineering and Computer Science, Cambridge, MA (1998)
Poggio, T., Edelman, S.: A network that learns to recognize 3-D objects. Nature 343, 163–266 (1990)
Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in cortex. Nature Neuroscience 2(11), 1019–1025 (1999)
Riesenhuber, M., Poggio, T.: The individual is nothing, the class everything: Psychophysics and modeling of recognition in object classes. A.I. Memo 1682, Center for Biological and Computational Learning. MIT, Cambridge (2000)
Rowley, H.A., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(1), 23–38 (1998)
Schapire, R., Freund, Y., Bartlett, P., Lee, W.S.: Boosting the margin: A new explanation of effectiveness of voting methods. The Annals of Statistics 26(5), 1651–1686 (1998)
Serre, T., Riesenhuber, M., Louie, J., Poggio, T.A.: On the Role of Object-Specific Features for Real World Object Recognition in Biological Vision. In: Bülthoff, H.H., Lee, S.-W., Poggio, T.A., Wallraven, C. (eds.) BMCV 2002. LNCS, vol. 2525, pp. 387–397. Springer, Heidelberg (2002)
Serre, T., Wolf, L., Poggio, T.: A new biologically motivated framework for robust object recognition. A.I. Memo 2004-26, Center for Biological and Computational Learning. MIT, Cambridge (2004)
Sim, T., Baker, S., Bsat, M.: The CMU pose, illumination, and expression database. IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI) 25(12), 1615–1618 (2003)
Sung, K.-K.: Learning and Example Selection for Object and Pattern Recognition. Ph.D thesis. MIT, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Cambridge, MA (1996)
Ullman, S., Vidal-Naquet, M., Sali, E.: Visual features of intermdediate complexity and their use in classification. Nature Neuroscience 5(7), 682–687 (2002)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 511–518 (2001)
Weber, M., Welling, W., Perona, P.: Towards automatic dscovery of object categories. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (June 2000)
Weisberg, S.: Applied Linear Regression. Wiley, New York (1980)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Heisele, B., Riskov, I., Morgenstern, C. (2006). Components for Object Detection and Identification. In: Ponce, J., Hebert, M., Schmid, C., Zisserman, A. (eds) Toward Category-Level Object Recognition. Lecture Notes in Computer Science, vol 4170. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11957959_12
Download citation
DOI: https://doi.org/10.1007/11957959_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68794-8
Online ISBN: 978-3-540-68795-5
eBook Packages: Computer ScienceComputer Science (R0)