Improved Object Detection and Pose Using Part-Based Models

Jiang, Fangyuan; Enqvist, Olof; Kahl, Fredrik; Åström, Kalle

doi:10.1007/978-3-642-38886-6_38

Improved Object Detection and Pose Using Part-Based Models

Fangyuan Jiang¹⁸,
Olof Enqvist¹⁸,
Fredrik Kahl¹⁸ &
…
Kalle Åström¹⁸

Conference paper

3399 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7944))

Abstract

Automated object detection is perhaps the most central task of computer vision and arguably the most difficult one. This paper extends previous work on part-based models by using accurate geometric models both in the learning phase and at detection. In the learning phase manual annotations are used to reduce perspective distortion before learning the part-based models. That training is performed on rectified images, leads to models which are more specific, reducing the risk of false positives. At the same time a set of representative object poses are learnt. These are used at detection to remove perspective distortion. The method is evaluated on the bus category of the Pascal dataset with promising results.

Download to read the full chapter text

Chapter PDF

References

Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Conf. Computer Vision and Pattern Recognition (2001)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Conf. Computer Vision and Pattern Recognition (2005)
Google Scholar
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Analysis and Machine Intelligence (2010)
Google Scholar
Lampert, C., Blaschko, M., Hofmann, T.: Efficient subwindow search: A branch and bound framework for object localization. IEEE Trans. Pattern Analysis and Machine Intelligence (2009)
Google Scholar
Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Int. Conf. Computer Vision (2009)
Google Scholar
Fischler, M., Elschlager, R.: The representation and matching of pictorial structures. Transactons on Computers (1973)
Google Scholar
Zhu, L., Chen, Y., Yuille, A.L., Freeman, W.T.: Latent hierarchical structural learning for object detection. In: Conf. Computer Vision and Pattern Recognition (2010)
Google Scholar
Vedaldi, A., Zisserman, A.: Structured output regression for detection with partial occulsion. In: Advances in Neural Information Processing Systems (2009)
Google Scholar
Torralba, A., Murphy, K., Freeman, W.: Sharing visual features for multiclass and multiview object detection. IEEE Trans. Pattern Analysis and Machine Intelligence (2007)
Google Scholar
Ott, P., Everingham, M.: Shared parts for deformable part-based models. In: Conf. Computer Vision and Pattern Recognition (2011)
Google Scholar
Bourdev, L., Malik, J.: Poselets: Body part detectors trained using 3d human pose annotations. In: Int. Conf. Computer Vision (2010)
Google Scholar
Xiang, Y., Savarese, S.: Estimating the aspect layout of object categories. In: Conf. Computer Vision and Pattern Recognition (2012)
Google Scholar
Thomas, A., Ferrari, V., Leibe, B., Tuytelaars, T., Schiele, B., Gool, L.V.: Towards multi-view object class detection. In: Conf. Computer Vision and Pattern Recognition (2006)
Google Scholar
Savarese, S., Fei-Fei, L.: 3d generic object categorization, localization and pose estimation. In: Int. Conf. Computer Vision (2007)
Google Scholar
Liebelt, J., Schmid, C.: Multi-view object class detection with a 3d geometric model. In: Conf. Computer Vision and Pattern Recognition (2010)
Google Scholar
Yang, Y., Ramanan, D.: Articulated pose estimation using flexible mixtures of parts. In: Conf. Computer Vision and Pattern Recognition (2011)
Google Scholar
Branson, S., Perona, P., Belongie, S.: Strong supervision from weak annotation: Interactive training of deformable part models. In: Int. Conf. Computer Vision (2011)
Google Scholar
Chiu, H., Kaelbling, L., Lozano-Pérez, T.: Virtual training for multi-view object class recognition. In: Conf. Computer Vision and Pattern Recognition (2007)
Google Scholar
Triggs, B.: Camera pose and calibration from 4 or 5 known 3d points. In: ICCV 8, pp. 278–284 (1999)
Google Scholar
Josephson, K., Byröd, M.: Pose estimation with radial distortion and unknown focal length. In: Conf. Computer Vision and Pattern Recognition (2009)
Google Scholar
Felzenszwalb, P., Girshick, R., McAllester, D.: Cascade object detection with deformable part models. In: Conf. Computer Vision and Pattern Recognition (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Mathematical Sciences, Lund University, Sweden
Fangyuan Jiang, Olof Enqvist, Fredrik Kahl & Kalle Åström

Authors

Fangyuan Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Olof Enqvist
View author publications
You can also search for this author in PubMed Google Scholar
Fredrik Kahl
View author publications
You can also search for this author in PubMed Google Scholar
Kalle Åström
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Signal Processing, Tampere University of Technology, P.O. Box 553, Tampere, Finland
Joni-Kristian Kämäräinen
Department of Information and Computer Science,, Aalto University, P.O. Box 15400, 00076, Espoo, Finland
Markus Koskela

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, F., Enqvist, O., Kahl, F., Åström, K. (2013). Improved Object Detection and Pose Using Part-Based Models. In: Kämäräinen, JK., Koskela, M. (eds) Image Analysis. SCIA 2013. Lecture Notes in Computer Science, vol 7944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38886-6_38

Download citation

DOI: https://doi.org/10.1007/978-3-642-38886-6_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38885-9
Online ISBN: 978-3-642-38886-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)