Object Detection by Estimating and Combining High-Level Features

  • Geoffrey Levine
  • Gerald DeJong
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5716)

Abstract

Many successful object detection systems characterize object classes with a statistical profile over a large number of local features. We present an enhancement to this method that learns to assemble local features into features that capture more global properties such as body shape and color distribution. The system then learns to combine these estimated global features to improve object detection accuracy. In our approach, each candidate object detection from an off-the-shelf gradient-based detection system is transformed into a conditional random field. This CRF is used to extract a most likely object silhouette, which is then processed into features based on color and shape. Finally, we show that on the difficult Pascal VOC 2007 data set, detection rates can be improved by combining these global features with the local features from a state-of-the-art gradient based approach.

References

  1. 1.
    Felzenszwalb, P., McAllester, D., Ramanan, D.: Discriminatively trained, multiscale, deformable part models. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)Google Scholar
  2. 2.
    Zhang, H., Berg, A., Maire, M., Malik, J.: Svm-knn: Discriminative nearest neighbor classification for visual category recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2 (2006)Google Scholar
  3. 3.
    Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2007 (VOC 2007) Results (2007), http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html (accessed November 1, 2008)
  4. 4.
    Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2008 (VOC 2008) Results (2008), http://www.pascal-network.org/challenges/VOC/voc2008/workshop/index.html (accessed November 1, 2008)
  5. 5.
    Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: International Conference on Machine Learning (2001)Google Scholar
  6. 6.
    Quattoni, A., Collins, M., Darrell, T.: Conditional random fields for object recognition. In: Neural Information Processing Systems (2004)Google Scholar
  7. 7.
    He, S., Zemel, R., M., C.P.: Multiscale conditional random fields for image labeling. In: IEEE Conference of Computer Vision and Pattern Recognition (2004)Google Scholar
  8. 8.
    Felzenszwalb, P., Huttenlocher, D.: Efficient graph-based image segmentation. International Journal on Computer Vision 59(2) (2004)Google Scholar
  9. 9.
    Forsyth, D., Ponce, J.: Computer Vision: A Modern Approach. Prentice Hall, Englewood Cliffs (2003)Google Scholar
  10. 10.
    Hinton, G.: Training products of experts by minimizing contrastive divergence. Neural Comp. 14, 1771–1800 (2002)CrossRefMATHGoogle Scholar
  11. 11.
    Felzenszwalb, P., McAllester, D., Ramanan, D.: Discriminatively trained mixtures of deformable part models. In: PASCAL Visual Object Challenge (2008)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Geoffrey Levine
    • 1
  • Gerald DeJong
    • 1
  1. 1.Department of Computer ScienceUniversity of Illinois at Champaign-UrbanaUrbanaUSA

Personalised recommendations