Skip to main content

Multi-class Object Detection with Hough Forests Using Local Histograms of Visual Words

  • Conference paper
Computer Analysis of Images and Patterns (CAIP 2011)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6854))

Included in the following conference series:

Abstract

Multi-class object detection is a promising approach for reducing the processing time of object recognition tasks. Recently, random Hough forests have been successfully used for single-class object detection. In this paper, we present an extension of random Hough forests for the purpose of multi-class object detection and propose local histograms of visual words as appropriate features. Experimental results for the Caltech-101 test set demonstrate that the performance of the proposed approach is almost as good as the performance of a single-class object detector, even when detecting a large number of 24 object classes at a time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bosch, A., Zisserman, A., Muoz, X.: Image Classification using Random Forests and Ferns. In: Proc. of the IEEE Int. Conf. on Computer Vision, pp. 1–8 (2007)

    Google Scholar 

  2. Bradski, G.: The OpenCV Library. Dr. Dobb’s Journal of Software Tools (2000)

    Google Scholar 

  3. Breiman, L.: Random Forests. Machine Learning 45, 5–32 (2001)

    Article  MATH  Google Scholar 

  4. Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The Pascal Visual Object Classes (VOC) Challenge. International Journal of Computer Vision 88(2), 303–338 (2010)

    Article  Google Scholar 

  5. Fanelli, G., Gall, J., Van Gool, L.: Hough Transform-based Mouth Localization for Audio-Visual Speech Recognition. In: Proc. of the British Mach. Vis. Conf. (2009)

    Google Scholar 

  6. Fei-Fei, L., Fergus, R., Perona, P.: Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories. Computer Vision and Image Understanding 106(1), 59–70 (2007)

    Article  Google Scholar 

  7. Gall, J., Lempitsky, V.: Class-Specific Hough Forests for Object Detection. In: Proc. of the IEEE Conf. on Comp. Vis. and Pat. Recog., pp. 1022–1029 (2009)

    Google Scholar 

  8. Jiang, Y.G., Ngo, C.W., Yang, J.: Towards Optimal Bag-of-Features for Object Categorization and Semantic Video Retrieval. In: Proc. of the ACM Int. Conference on Image and Video Retrieval, pp. 494–501 (2007)

    Google Scholar 

  9. Kumar, V., Patras, I.: A Discriminative Voting Scheme for Object Detection using Hough Forests. In: Proc. of the British Machine Vision Conference Postgraduate Workshop, pp. 1–10 (2010)

    Google Scholar 

  10. Leibe, B., Leonardis, A., Schiele, B.: Robust Object Detection with Interleaved Categorization and Segmentation. Int. J. of Comp. Vis. 77(1-3), 259–289 (2008)

    Article  Google Scholar 

  11. Lowe, D.G.: Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  12. Maji, S., Malik, J.: Object Detection Using a Max-Margin Hough Transform. In: Proc. of the IEEE Conf. on Comp. Vis. and Pattern Recog., pp. 1038–1045 (2009)

    Google Scholar 

  13. Mühling, M., Ewerth, R., Freisleben, B.: Improving Semantic Video Retrieval via Object-Based Features. In: Proc. of the IEEE Int. Conference on Semantic Computing, pp. 109–115 (2009)

    Google Scholar 

  14. Torralba, A., Murphy, K.P., Freeman, W.T.: Sharing Visual Features for Multiclass and Multiview Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(5), 854–869 (2007)

    Article  Google Scholar 

  15. Vedaldi, A., Fulkerson, B.: VLFeat: An Open and Portable Library of Computer Vision Algorithms (2008), http://www.vlfeat.org/

  16. Yao, A., Gall, J., Van Gool, L.: A Hough Transform-Based Voting Framework for Action Recognition. In: Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, pp. 2061–2068 (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mühling, M., Ewerth, R., Shi, B., Freisleben, B. (2011). Multi-class Object Detection with Hough Forests Using Local Histograms of Visual Words. In: Real, P., Diaz-Pernil, D., Molina-Abril, H., Berciano, A., Kropatsch, W. (eds) Computer Analysis of Images and Patterns. CAIP 2011. Lecture Notes in Computer Science, vol 6854. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23672-3_47

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-23672-3_47

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23671-6

  • Online ISBN: 978-3-642-23672-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics