A Method of Detecting Human Head by Eliminating Redundancy in Dataset

Le, Chao; Ma, Huimin

doi:10.1007/978-981-13-1702-6_57

Chao Le¹¹ &
Huimin Ma¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 875))

Included in the following conference series:

Chinese Conference on Image and Graphics Technologies

1818 Accesses
1 Citations

Abstract

The method of constructing an image dataset by sampling images from videos with a short interval keeps the information in the video but also brings redundancy and increases the training costs significantly. In this paper, we propose a method to detect human heads with less training cost and higher performance, including: (1) A filtering standard to screen out the useless image in video-based image dataset with almost the same average precision. (2) An effective head detection model with the fusion of shoulder context. We evaluate our method on a human head dataset – HollywoodHeads and achieve reasonably good performance. This result shows that our method is very useful in human head detection task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aziz, K.: Head detection based on skeleton graph method for counting people in crowded environments. J. Electron. Imaging 25(1), 013012 (2016)
Article Google Scholar
Felzenszwalb, P., Mcallester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, pp. 1–8 (2008)
Google Scholar
Geiger, A.: Are we ready for autonomous driving? the kitti vision benchmark suite. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3354–3361 (2012)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2015)
Article Google Scholar
Jafari, O.H., Mitzel, D., Leibe, B.: Real-time RGB-D based people detection and tracking for mobile robots and head-worn cameras. In: IEEE International Conference on Robotics and Automation, pp. 5636–5643 (2014)
Google Scholar
Lin, T.Y., Dollr, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection (2017)
Google Scholar
Marin-Jimenez, M.J., Zisserman, A., Eichner, M., Ferrari, V.: Detecting people looking at each other in videos. Int. J. Comput. Vision 106(3), 282–296 (2014)
Article Google Scholar
Patronperez, A., Marszalek, M., Reid, I., Zisserman, A.: Structured learning of human interactions in tv shows. IEEE Trans. Pattern Anal. Mach. Intell. 34(12), 2441–2453 (2012)
Article Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Article Google Scholar
Schmid, C., Zisserman, A.: Human focused action localization in video. In: European Conference on Computer Vision, pp. 219–233 (2010)
Google Scholar
Shrivastava, A., Gupta, A., Girshick, R.: Training region-based object detectors with online hard example mining. In: Computer Vision and Pattern Recognition, pp. 761–769 (2016)
Google Scholar
Stewart, R., Andriluka, M., Ng, A.Y.: End-to-end people detection in crowded scenes. In: Computer Vision and Pattern Recognition, pp. 2325–2333 (2016)
Google Scholar
Vu, T.H., Osokin, A., Laptev, I.: Context-aware CNNs for person head detection. In: IEEE International Conference on Computer Vision, pp. 2893–2901 (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
Chao Le & Huimin Ma

Authors

Chao Le
View author publications
You can also search for this author in PubMed Google Scholar
Huimin Ma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Huimin Ma .

Editor information

Editors and Affiliations

Beijing Institute of Technology, Beijing, China
Yongtian Wang
Beihang University, Beijing, China
Zhiguo Jiang
Peking University, Beijing, China
Yuxin Peng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Le, C., Ma, H. (2018). A Method of Detecting Human Head by Eliminating Redundancy in Dataset. In: Wang, Y., Jiang, Z., Peng, Y. (eds) Image and Graphics Technologies and Applications. IGTA 2018. Communications in Computer and Information Science, vol 875. Springer, Singapore. https://doi.org/10.1007/978-981-13-1702-6_57

Download citation

DOI: https://doi.org/10.1007/978-981-13-1702-6_57
Published: 12 August 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1701-9
Online ISBN: 978-981-13-1702-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics