Abstract
Viola and Jones [VJ] demonstrate that cascade classification methods can successfully detect objects belonging to a single class, such as faces. Detecting and identifying objects that belong to any of a set of “classes”, many class detection, is a much more challenging problem. We show that objects from each class can form a “cluster” in a “classifier space” and illustrate examples of such clusters using images of real world objects. Our detection algorithm uses a “decision tree classifier” (whose internal nodes each correspond to a VJ classifier) to propose a class label for every sub-image W of a test image (or reject it as a negative instance). If this W reaches a leaf of this tree, we then pass W through a subsequent VJ cascade of classifiers, specific to the identified class, to determine whether W is truly an instance of the proposed class. We perform several empirical studies to compare our system for detecting objects of any of M classes, to the obvious approach of running a set of M learned VJ cascade classifiers, one for each class of objects, on the same image. We found that the detection rates are comparable, and our many-class detection system is about as fast as running a single VJ cascade, and scales up well as the number of classes increases.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR, pp. 264–271 (2003)
Fan, X.: Efficient multiclass object detection by a hierarchy of classifiers. In: CVPR (2003)
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. In: Computational Learning Theory: Eurocolt (1995)
Grossmann, E.: Adatree: boosting a weak classifier into a decision tree. In: IEEE Workshop on Learning in Computer Vision and Patter Recognition (2004)
Isukapalli, R., Greiner, R.: Use of Off-line Dynamic Programming for Efficient Image Interpretation. In: IJCAI, Acapulco, Mexico (August 2003)
Li, F.F., Fergus, R., Perona, P.: Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. In: Proceedings of the Workshop on Generative Model Based Vision, Washington, DC (June 2004)
Lin, Y., Liu, T.: Robust face detection with multi-class boosting. In: CVPR, pp. 680–687 (2005)
Liu, C., Shum, H.: Kullback-Leibler Boosting. In: CVPR, pp. 587–594 (2003)
Ong, E.-J., Bowden, R.: A boosted classifier tree for hand shape detection. In: International Conference on Automatic Face and Gesture Recognition, pp. 889–894 (2004)
Rowley, H., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Transactions on Patten Analysis and Machine Intelligence, PAMI (1998)
Roth, D., Yang, M., Ahuja, N.: A snowbased face detector. In: NIPS (2000)
Schneiderman, H., Kanade, T.: A statistical method for 3d object detection applied to faces and cars. In: ICCV (2000)
Sun, J., Rehg, J.M., Bobick, A.: Automatic cascade training with perturbation bias. In: CVPR (2004)
Torralba, A., Murphy, K., Freeman, W.T.: Sharing features: efficient boosting procedures for multiclass object detection. In: CVPR (2004)
Viola, P., Jones, M.: Fast and robust classification using asymmetric adaboost and a detector cascade. In: CVPR (2001)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: CVPR (2001)
Viola, P., Jones, M.: Robust real-time face detection. In: IJCV (2004)
Wu, J., Rehg, J.M., Mullin, M.D.: Learning a rare event detection cascade by direct feature selection. In: NIPS (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Isukapalli, R., Elgammal, A., Greiner, R. (2006). Learning to Detect Objects of Many Classes Using Binary Classifiers. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3951. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744023_28
Download citation
DOI: https://doi.org/10.1007/11744023_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33832-1
Online ISBN: 978-3-540-33833-8
eBook Packages: Computer ScienceComputer Science (R0)