Coupling-and-Decoupling: A Hierarchical Model for Occlusion-Free Car Detection

Li, Bo; Wu, Tianfu; Hu, Wenze; Pei, Mingtao

doi:10.1007/978-3-642-37331-2_13

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7724)

Included in the following conference series:

Asian Conference on Computer Vision

Abstract

Handling occlusions in object detection is a long-standing problem. This paper addresses X-to-X-occlusion-free object detection (e.g. car-to-car occlusions in our experiments) with an intuitive coupling-and-decoupling strategy. In the “coupling” stage, we model pairs of occluding X’s (e.g. car pairs) directly to account for their statistically strong co-occurrence (i.e. coupling). We then learn a hierarchical And-Or directed acyclic graph (AOG) model under the latent structural SVM (LSSVM) framework. The learned AOG consists of, from top to bottom, (i) a root Or-node representing the different compositions of occluding X pairs, (ii) a set of And-nodes, each of which represents a specific composition of an occluding X pair, (iii) another set of And-nodes representing the single X’s decomposed from the occluding X pairs, and (iv) a set of terminal-nodes representing the appearance templates for the X pairs, the single X’s, and the latent parts of the single X’s, respectively. The part appearance templates can also be shared among different single X’s. In detection, we use a dynamic programming (DP) algorithm and, as a natural consequence, decouple the two single X’s from each X-to-X occluding pair. In experiments, we test our method on roadside cars that we collected ourselves from a real traffic video surveillance environment. We compare our model with the state-of-the-art deformable part-based model (DPM) and obtain better detection performance.
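
To make the hierarchy concrete, the following is a minimal Python sketch of how a best-scoring parse could be found in an And-Or graph by memoized dynamic programming. It is an illustration under simplifying assumptions, not the authors' implementation: the node names (`pair_A`, `car_A1`, `t_part_front`, and so on), the graph layout, and the fixed terminal scores are all hypothetical, and the sketch ignores image locations, part deformations, and the weights learned under LSSVM. Or-nodes take the maximum over their children, And-nodes sum their children, and terminal-nodes return a given template score.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    kind: str                                   # "or", "and", or "terminal"
    children: List[str] = field(default_factory=list)


# Hypothetical toy AOG: one root Or-node, two pair compositions,
# four single-car And-nodes, and terminal templates (prefix "t_").
# The part templates t_part_front / t_part_rear are shared between cars.
GRAPH: Dict[str, Node] = {
    "root": Node("or", ["pair_A", "pair_B"]),
    "pair_A": Node("and", ["t_pair_A", "car_A1", "car_A2"]),
    "pair_B": Node("and", ["t_pair_B", "car_B1", "car_B2"]),
    "car_A1": Node("and", ["t_car_A1", "t_part_front", "t_part_rear"]),
    "car_A2": Node("and", ["t_car_A2", "t_part_front"]),
    "car_B1": Node("and", ["t_car_B1", "t_part_front", "t_part_rear"]),
    "car_B2": Node("and", ["t_car_B2", "t_part_rear"]),
}

# Made-up template responses; a real detector would compute these
# per image location from feature maps.
SCORES: Dict[str, float] = {
    "t_pair_A": 1.0, "t_pair_B": 0.5,
    "t_car_A1": 0.75, "t_car_A2": 0.25,
    "t_car_B1": 0.25, "t_car_B2": 0.5,
    "t_part_front": 0.5, "t_part_rear": 0.125,
}
for terminal in SCORES:
    GRAPH[terminal] = Node("terminal")


def best_parse(
    name: str,
    graph: Dict[str, Node],
    scores: Dict[str, float],
    memo: Optional[Dict[str, Tuple[float, List[str]]]] = None,
) -> Tuple[float, List[str]]:
    """Return (score, node names) of the best parse rooted at `name`."""
    if memo is None:
        memo = {}
    if name in memo:                            # reuse shared sub-results
        return memo[name]
    node = graph[name]
    if node.kind == "terminal":
        result = (scores[name], [name])
    elif node.kind == "and":                    # And-node: sum all children
        total, parse = 0.0, [name]
        for child in node.children:
            child_score, child_parse = best_parse(child, graph, scores, memo)
            total += child_score
            parse = parse + child_parse
        result = (total, parse)
    else:                                       # Or-node: pick best child
        child_score, child_parse = max(
            (best_parse(c, graph, scores, memo) for c in node.children),
            key=lambda r: r[0],
        )
        result = (child_score, [name] + child_parse)
    memo[name] = result
    return result


total, parse = best_parse("root", GRAPH, SCORES)
# "Decoupling": read the single-car And-nodes off the best parse.
cars = [n for n in parse if n.startswith("car_")]
print(total, cars)
```

In this toy graph the parse selects `pair_A`, and reading the `car_` And-nodes off the parse yields the two decoupled single cars. The shared terminal `t_part_front` is scored once and reused through the memo table, which is the property that makes the graph a DAG rather than a tree.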

Author information

Authors and Affiliations

Beijing Lab of Intelligent Information, School of Computer Science and Technology, Beijing Institute of Technology, Beijing, 100081, P.R. China
Bo Li & Mingtao Pei
BUPT-Seesoft Joint Lab of Visual Computing and Image Communication, Beijing University of Posts and Telecommunications (BUPT), Beijing, 100876, P.R. China
Bo Li & Tianfu Wu
Lotus Hill Research Institute, Ezhou, P.R. China
Bo Li, Tianfu Wu & Wenze Hu
Department of Statistics, University of California, Los Angeles, USA
Wenze Hu

Editor information

Editors and Affiliations

Department of Electrical and Computer Engineering, Seoul National University, 1 Gwanak-ro, 151-744, Gwanak-gu, Seoul, Korea
Kyoung Mu Lee
Microsoft Research Asia, No. 5, Danling st., Haidian district, 100080, Beijing, P.R. China
Yasuyuki Matsushita
School of Interactive Computing, Georgia Institute of Technology, 801 Atlantic Drive, CCB 315, 30332, Atlanta, GA, USA
James M. Rehg
Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Zhong Quan Cun East Road 95, Haidian District, 100190, Beijing, P.R. China
Zhanyi Hu

About this paper

Cite this paper

Li, B., Wu, T., Hu, W., Pei, M. (2013). Coupling-and-Decoupling: A Hierarchical Model for Occlusion-Free Car Detection. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37331-2_13

DOI: https://doi.org/10.1007/978-3-642-37331-2_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37330-5
Online ISBN: 978-3-642-37331-2
eBook Packages: Computer Science, Computer Science (R0)
