Mixture Models for Object Detection

Kuang, Xiaoqin; Sang, Nong; Chen, Feifei; Gao, Changxin; Wang, Runmin

doi:10.1007/978-3-662-48558-3_32

Xiaoqin Kuang¹⁴,
Nong Sang¹⁴,
Feifei Chen¹⁴,
Changxin Gao¹⁴ &
…
Runmin Wang¹⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 546))

Included in the following conference series:

CCF Chinese Conference on Computer Vision

2964 Accesses
1 Citations

Abstract

In this paper, we propose an approach based on mixture of multiple components and mid-level part models for object detection in natural scenes. It is difficult to represent an object category with a monolithic model as the intra-variance in the category. To solve this, we use multi-component models and part models to describe the global variation and local deformation respectively. We obtain multi-components by clustering to form visual similar object group and training discriminant model for each one. The mid-level part models are learned automatically. We apply max-pooling to generate the feature vector using all part models and then train the SVM classifier based on these feature vectors. When detecting in image, we first achieve object candidates using multi-component models, and then the performance is refined by using part models and SVM classifier. Experiments on standard benchmarks demonstrate this coarse-to-fine detection system performs competitively.

Download to read the full chapter text

Chapter PDF

Training Deformable Object Models for Human Detection Based on Alignment and Clustering

Hough Voting with Distinctive Mid-Level Parts for Object Detection

Feature Reduction for Efficient Object Detection via L1-norm Latent SVM

Keywords

References

Leibe, B., Leonardis, A., Schiele, B.: Robust object detection with interleaved categorization and segmentation. International Journal of Computer Vision 77(1–3), 259–289 (2008)
Article Google Scholar
Gall, J., Yao, A., Razavi, N., Van Gool, L.: Hough forests for object detection, tracking, and action recognitions. IEEE Transactions on Pattern Analysis and Machine Intelligence. 33(11), 2188–2202 (2011)
Article Google Scholar
Maji, S., Malik, J.: Object detection using a max-margin hough transform. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1038–1045. Miami, FL (2009)
Google Scholar
Kontschieder, P., Riemenschneider, H., Donoser, M., Bischof, H.: Discriminative learning of contour fragments for object detection. In: BMVC, pp. 1–12 (2011)
Google Scholar
Razavi, N., Gall, J., Van Gool, L.: Scalable multi-class object detection. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1505–1512. IEEE (2011)
Google Scholar
Razavi, N., Gall, J., Kohli, P., van Gool, L.: Latent hough transform for object detection. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part III. LNCS, vol. 7574, pp. 312–325. Springer, Heidelberg (2012)
Chapter Google Scholar
Maji, S., Shakhnarovich, G.: Part discovery from partial correspondence. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 931–938. Portland, OR (2013)
Google Scholar
Singh, S., Gupta, A., Efros, A.A.: Unsupervised discovery of mid-level discriminative patches. In: European Conference on Computer Vision, pp. 73–86 (2012)
Google Scholar
Juneja, M., Vedaldi, A., Jawahar, C.V., Zisserman, A.: Blocks that shout: distinctive parts for scene classification. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 923–930. Portland, OR (2013)
Google Scholar
Endres, I., Shih, K.J., Jiaa, J., Hoiem, D.: Learning collections of part models for object recognition. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 939–946. Portland, OR (2013)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection, In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 1, pp. 886–893. San Diego, CA, USA (2005)
Google Scholar
Malisiewicz, T., Gupta, A., Efros, A.A.: Ensemble of exemplar-SVMs for object detection and beyond, In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 89-96. Barcelona (2011)
Google Scholar
Gu, C., Arbeláez, P., Lin, Y., Yu, K., Malik, J.: Multi-component models for object detection. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part IV. LNCS, vol. 7575, pp. 445–458. Springer, Heidelberg (2012)
Chapter Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object Detection with Discriminatively Trained Part-Based Models. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(9), 1627–1645 (2010)
Article Google Scholar
Hariharan, B., Malik, J., Ramanan, D.: Discriminative decorrelation for clustering and classification. In: European Conference on Computer Vision, pp. 459-472 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Science and Technology on Multi-spectral Information Processing Laboratory, School of Automation, Huazhong University of Science and Technology, Wuhan, China
Xiaoqin Kuang, Nong Sang, Feifei Chen, Changxin Gao & Runmin Wang

Authors

Xiaoqin Kuang
View author publications
You can also search for this author in PubMed Google Scholar
Nong Sang
View author publications
You can also search for this author in PubMed Google Scholar
Feifei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Changxin Gao
View author publications
You can also search for this author in PubMed Google Scholar
Runmin Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nong Sang .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Honbin Zha
Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China
Xilin Chen
Chinese Academy of Sciences, Beijing, China
Liang Wang
Xidian University, Shaanxi, China
Qiguang Miao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kuang, X., Sang, N., Chen, F., Gao, C., Wang, R. (2015). Mixture Models for Object Detection. In: Zha, H., Chen, X., Wang, L., Miao, Q. (eds) Computer Vision. CCCV 2015. Communications in Computer and Information Science, vol 546. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48558-3_32

Download citation

DOI: https://doi.org/10.1007/978-3-662-48558-3_32
Published: 06 November 2015
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-48557-6
Online ISBN: 978-3-662-48558-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the China Computer Federation (CCF) (opens in a new tab)

Mixture Models for Object Detection

Abstract

Chapter PDF

Similar content being viewed by others

Training Deformable Object Models for Human Detection Based on Alignment and Clustering

Hough Voting with Distinctive Mid-Level Parts for Object Detection

Feature Reduction for Efficient Object Detection via L1-norm Latent SVM

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Mixture Models for Object Detection

Abstract

Chapter PDF

Similar content being viewed by others

Training Deformable Object Models for Human Detection Based on Alignment and Clustering

Hough Voting with Distinctive Mid-Level Parts for Object Detection

Feature Reduction for Efficient Object Detection via L1-norm Latent SVM

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation