Multi-class Object Detection with Hough Forests Using Local Histograms of Visual Words

Mühling, Markus; Ewerth, Ralph; Shi, Bing; Freisleben, Bernd

doi:10.1007/978-3-642-23672-3_47

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6854)

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

Abstract

Multi-class object detection is a promising approach for reducing the processing time of object recognition tasks. Recently, random Hough forests have been successfully used for single-class object detection. In this paper, we present an extension of random Hough forests for the purpose of multi-class object detection and propose local histograms of visual words as appropriate features. Experimental results on the Caltech-101 test set demonstrate that the proposed approach performs almost as well as a single-class object detector, even when detecting as many as 24 object classes at a time.
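The abstract names two ingredients, local histograms of visual words and multi-class Hough voting, without giving details. The following is only a minimal sketch of what these might look like, not the authors' implementation. It assumes that each pixel has already been assigned a visual-word index (the `word_map` array) and that a forest leaf stores votes as `(class_id, offset, weight)` tuples; the function names, the square patch shape, and the vote format are all hypothetical choices made for illustration.

```python
import numpy as np

def local_word_histogram(word_map, center, half_size, vocab_size):
    """Normalized histogram of visual-word indices in a square patch.

    Assumption: word_map holds one non-negative integer word index per pixel.
    The patch is clipped at the image border.
    """
    rows, cols = word_map.shape
    r, c = center
    r0, r1 = max(r - half_size, 0), min(r + half_size + 1, rows)
    c0, c1 = max(c - half_size, 0), min(c + half_size + 1, cols)
    patch = word_map[r0:r1, c0:c1]
    hist = np.bincount(patch.ravel(), minlength=vocab_size).astype(float)
    total = hist.sum()
    return hist / total if total > 0 else hist

def cast_votes(accumulators, patch_center, leaf_votes):
    """Add weighted votes for object centers, one accumulator per class.

    Assumption: leaf_votes is a list of (class_id, (d_row, d_col), weight)
    tuples; votes that fall outside the image are ignored.
    """
    for class_id, offset, weight in leaf_votes:
        r = patch_center[0] + offset[0]
        c = patch_center[1] + offset[1]
        acc = accumulators[class_id]
        if 0 <= r < acc.shape[0] and 0 <= c < acc.shape[1]:
            acc[r, c] += weight

# Toy example: a 3x3 map of word indices from a 4-word vocabulary.
word_map = np.array([[0, 1, 1],
                     [2, 1, 0],
                     [0, 0, 3]])
hist = local_word_histogram(word_map, center=(1, 1), half_size=1, vocab_size=4)

# Two classes, each with its own 3x3 Hough accumulator.
accumulators = np.zeros((2, 3, 3))
cast_votes(accumulators, (1, 1),
           [(0, (1, 0), 0.5), (1, (-1, -1), 0.25), (1, (5, 5), 1.0)])
```

In this sketch a single pass over the image patches fills the accumulators of all classes at once, which is one way to read the abstract's claim that multi-class detection can reduce processing time compared with running one detector per class.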

Author information

Authors and Affiliations

Department of Mathematics & Computer Science, University of Marburg, Hans-Meerwein-Str. 3, D-35032, Marburg, Germany
Markus Mühling, Ralph Ewerth, Bing Shi & Bernd Freisleben

Editor information

Editors and Affiliations

Dpto. Matemática Aplicada I, Escuela Técnica Superior de Ingeniería Informática, Universidad de Sevilla, Avda. Reina Mercedes, s/n, 41012, Sevilla, Spain
Pedro Real
Departamento de Matemática Aplicada I, Escuela Técnica Superior de Ingeniería Informática, University of Seville, Avenida Reina Mercedes s/n, 41012, Sevilla, Spain
Daniel Diaz-Pernil & Helena Molina-Abril
Departamento de Didáctica de la Matemática y de las CC. Experimentales, Universidad del País Vasco-Euskal Herriko Unibertsitatea, Escuela Universitaria de Magisterio, Ramón y Cajal, 72, 48014, Bilbao (Bizkaia), Spain
Ainhoa Berciano
Institute of Computer Graphics and Algorithms, Pattern Recognition and Image Processing Group, Vienna University of Technology, Favoritenstraße 9/186-3, 1040, Vienna, Austria
Walter Kropatsch

About this paper

Cite this paper

Mühling, M., Ewerth, R., Shi, B., Freisleben, B. (2011). Multi-class Object Detection with Hough Forests Using Local Histograms of Visual Words. In: Real, P., Diaz-Pernil, D., Molina-Abril, H., Berciano, A., Kropatsch, W. (eds) Computer Analysis of Images and Patterns. CAIP 2011. Lecture Notes in Computer Science, vol 6854. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23672-3_47

DOI: https://doi.org/10.1007/978-3-642-23672-3_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23671-6
Online ISBN: 978-3-642-23672-3
eBook Packages: Computer Science, Computer Science (R0)