Abstract
Automated object detection is perhaps the most central task of computer vision and arguably the most difficult one. This paper extends previous work on part-based models by using accurate geometric models both in the learning phase and at detection. In the learning phase manual annotations are used to reduce perspective distortion before learning the part-based models. That training is performed on rectified images, leads to models which are more specific, reducing the risk of false positives. At the same time a set of representative object poses are learnt. These are used at detection to remove perspective distortion. The method is evaluated on the bus category of the Pascal dataset with promising results.
Chapter PDF
References
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Conf. Computer Vision and Pattern Recognition (2001)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Conf. Computer Vision and Pattern Recognition (2005)
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Analysis and Machine Intelligence (2010)
Lampert, C., Blaschko, M., Hofmann, T.: Efficient subwindow search: A branch and bound framework for object localization. IEEE Trans. Pattern Analysis and Machine Intelligence (2009)
Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Int. Conf. Computer Vision (2009)
Fischler, M., Elschlager, R.: The representation and matching of pictorial structures. Transactons on Computers (1973)
Zhu, L., Chen, Y., Yuille, A.L., Freeman, W.T.: Latent hierarchical structural learning for object detection. In: Conf. Computer Vision and Pattern Recognition (2010)
Vedaldi, A., Zisserman, A.: Structured output regression for detection with partial occulsion. In: Advances in Neural Information Processing Systems (2009)
Torralba, A., Murphy, K., Freeman, W.: Sharing visual features for multiclass and multiview object detection. IEEE Trans. Pattern Analysis and Machine Intelligence (2007)
Ott, P., Everingham, M.: Shared parts for deformable part-based models. In: Conf. Computer Vision and Pattern Recognition (2011)
Bourdev, L., Malik, J.: Poselets: Body part detectors trained using 3d human pose annotations. In: Int. Conf. Computer Vision (2010)
Xiang, Y., Savarese, S.: Estimating the aspect layout of object categories. In: Conf. Computer Vision and Pattern Recognition (2012)
Thomas, A., Ferrari, V., Leibe, B., Tuytelaars, T., Schiele, B., Gool, L.V.: Towards multi-view object class detection. In: Conf. Computer Vision and Pattern Recognition (2006)
Savarese, S., Fei-Fei, L.: 3d generic object categorization, localization and pose estimation. In: Int. Conf. Computer Vision (2007)
Liebelt, J., Schmid, C.: Multi-view object class detection with a 3d geometric model. In: Conf. Computer Vision and Pattern Recognition (2010)
Yang, Y., Ramanan, D.: Articulated pose estimation using flexible mixtures of parts. In: Conf. Computer Vision and Pattern Recognition (2011)
Branson, S., Perona, P., Belongie, S.: Strong supervision from weak annotation: Interactive training of deformable part models. In: Int. Conf. Computer Vision (2011)
Chiu, H., Kaelbling, L., Lozano-Pérez, T.: Virtual training for multi-view object class recognition. In: Conf. Computer Vision and Pattern Recognition (2007)
Triggs, B.: Camera pose and calibration from 4 or 5 known 3d points. In: ICCV 8, pp. 278–284 (1999)
Josephson, K., Byröd, M.: Pose estimation with radial distortion and unknown focal length. In: Conf. Computer Vision and Pattern Recognition (2009)
Felzenszwalb, P., Girshick, R., McAllester, D.: Cascade object detection with deformable part models. In: Conf. Computer Vision and Pattern Recognition (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jiang, F., Enqvist, O., Kahl, F., Åström, K. (2013). Improved Object Detection and Pose Using Part-Based Models. In: Kämäräinen, JK., Koskela, M. (eds) Image Analysis. SCIA 2013. Lecture Notes in Computer Science, vol 7944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38886-6_38
Download citation
DOI: https://doi.org/10.1007/978-3-642-38886-6_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38885-9
Online ISBN: 978-3-642-38886-6
eBook Packages: Computer ScienceComputer Science (R0)