Iterative Maximum Clique Clustering Based Detection Filter

Zhang, Xinyu; Sheng, Hao; Zhang, Yang; Chen, Jiahui; Wu, Yubin; Xue, Guangtao; Wei, Quanrui

doi:10.1007/978-3-030-04212-7_13

Xinyu Zhang¹⁶,
Hao Sheng¹⁶,
Yang Zhang¹⁶,
Jiahui Chen¹⁶,
Yubin Wu¹⁶,
Guangtao Xue¹⁷ &
…
Quanrui Wei¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11304))

Included in the following conference series:

International Conference on Neural Information Processing

2186 Accesses

Abstract

Object detection is an important research field of computer vision, but getting accurate object detection from a large number of detection candidates has always been a challenge. The most current algorithms use an insufficient Greedy Non-Maximum Suppression (NMS) strategy which heavily relies on the confidence of the detection candidates. This paper proposes the Iterative Detection Filter (IDF) approach, which considers more information of the detection candidates, including overlapping, the confidence generated by the detector, and the ground position perception information of the scene. Through this approach, the detection candidates are mapped to more accurate detections. Our method achieves a significant improvement on the MOT16 and MOT17 datasets, which are widely used in video tracking and detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://motchallenge.net/data/MOT16/.

References

Bernardin, K., Stiefelhagen, R.: Evaluating multiple object tracking performance: the clear mot metrics. EURASIP J. Image Video Process. 2008(1), 1–10 (2008)
Article Google Scholar
Cheng, M., Zhang, Z., Lin, W., Torr, P.H.S.: BING: binarized normed gradients for objectness estimation at 300 fps. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3286–3293 (2014)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 886–893 (2005)
Google Scholar
Ellis, A., Ferryman, J.: Pets2010 and pets2009 evaluation of results using individual ground truthed single views. In: IEEE International Conference on Advanced Video and Signal Based Surveillance, pp. 135–142 (2010)
Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.A.: Cascade object detection with deformable part models. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2241–2248 (2010)
Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.A., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Article Google Scholar
Felzenszwalb, P.F., McAllester, D.A., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Google Scholar
Girshick, R.: Fast R-CNN. In: IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Google Scholar
Girshick, R.B., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)
Article Google Scholar
Henderson, P., Ferrari, V.: End-to-end training of object class detectors for mean average precision. In: Lai, S.-H., Lepetit, V., Nishino, K., Sato, Y. (eds.) ACCV 2016, Part V. LNCS, vol. 10115, pp. 198–213. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-54193-8_13
Chapter Google Scholar
Hosang, J.H., Benenson, R., Schiele, B.: Learning non-maximum suppression. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 6469–6477 (2017)
Google Scholar
Kim, C., Li, F., Ciptadi, A., Rehg, J.M.: Multiple hypothesis tracking revisited. In: IEEE International Conference on Computer Vision, pp. 4696–4704 (2015)
Google Scholar
Lin, T., Dollár, P., Girshick, R.B., He, K., Hariharan, B., Belongie, S.J.: Feature pyramid networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 936–944 (2017)
Google Scholar
Milan, A., Leal-Taixé, L., Reid, I.D., Roth, S., Schindler, K.: MOT16: A benchmark for multi-object tracking. CoRR abs/1603.00831 (2016)
Google Scholar
Ren, S., He, K., Girshick, R.B., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Annual Conference on Neural Information Processing Systems, pp. 91–99 (2015)
Google Scholar
van de Sande, K.E.A., Uijlings, J.R.R., Gevers, T., Smeulders, A.W.M.: Segmentation as selective search for object recognition. In: IEEE International Conference on Computer Vision, pp. 1879–1886 (2011)
Google Scholar
Stewart, R., Andriluka, M., Ng, A.Y.: End-to-end people detection in crowded scenes. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2325–2333 (2016)
Google Scholar
Viola, P.A., Jones, M.J.: Rapid object detection using a boosted cascade of simple features. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 511–518 (2001)
Google Scholar
Wan, L., Eigen, D., Fergus, R.: End-to-end integration of a convolutional network, deformable parts model and non-maximum suppression. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 851–859 (2015)
Google Scholar
Zitnick, C.L., Dollár, P.: Edge boxes: locating object proposals from edges. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 391–405. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_26
Chapter Google Scholar

Download references

Acknowledgement

This study is partially supported by the National Key R & D Program of China (No. 2016QY01W0200), the National Natural Science Foundation of China (No. 61472019), the Macao Science and Technology Development Fund (No. 138/2 016/A3), the Open Fund of the State Key Laboratory of Software Development Environment under grant SKLSDE-2017ZX-09, the Project of Experimental Verification of the Basic Commonness and Key Technical Standards of the Industrial Internet network architecture, and the Technology Innovation Fund of China Electronic Technology Group Corporation. Thank you for the support from HAWKEYE Group.

Author information

Authors and Affiliations

State Key Laboratory of Software Development Environment, School of Computer Science and Engineering, Beihang University, Beijing, People’s Republic of China
Xinyu Zhang, Hao Sheng, Yang Zhang, Jiahui Chen & Yubin Wu
Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, People’s Republic of China
Guangtao Xue
The 15th Research Institute of China Electronics Technology group Corporation, Beijing, People’s Republic of China
Quanrui Wei

Authors

Xinyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hao Sheng
View author publications
You can also search for this author in PubMed Google Scholar
Yang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jiahui Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yubin Wu
View author publications
You can also search for this author in PubMed Google Scholar
Guangtao Xue
View author publications
You can also search for this author in PubMed Google Scholar
Quanrui Wei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hao Sheng .

Editor information

Editors and Affiliations

The Chinese Academy of Sciences, Beijing, China
Long Cheng
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi Sing Leung
Kobe University, Kobe, Japan
Seiichi Ozawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, X. et al. (2018). Iterative Maximum Clique Clustering Based Detection Filter. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11304. Springer, Cham. https://doi.org/10.1007/978-3-030-04212-7_13

Download citation

DOI: https://doi.org/10.1007/978-3-030-04212-7_13
Published: 17 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04211-0
Online ISBN: 978-3-030-04212-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics