Abstract
The method of constructing an image dataset by sampling images from videos with a short interval keeps the information in the video but also brings redundancy and increases the training costs significantly. In this paper, we propose a method to detect human heads with less training cost and higher performance, including: (1) A filtering standard to screen out the useless image in video-based image dataset with almost the same average precision. (2) An effective head detection model with the fusion of shoulder context. We evaluate our method on a human head dataset – HollywoodHeads and achieve reasonably good performance. This result shows that our method is very useful in human head detection task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aziz, K.: Head detection based on skeleton graph method for counting people in crowded environments. J. Electron. Imaging 25(1), 013012 (2016)
Felzenszwalb, P., Mcallester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, pp. 1–8 (2008)
Geiger, A.: Are we ready for autonomous driving? the kitti vision benchmark suite. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3354–3361 (2012)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2015)
Jafari, O.H., Mitzel, D., Leibe, B.: Real-time RGB-D based people detection and tracking for mobile robots and head-worn cameras. In: IEEE International Conference on Robotics and Automation, pp. 5636–5643 (2014)
Lin, T.Y., Dollr, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection (2017)
Marin-Jimenez, M.J., Zisserman, A., Eichner, M., Ferrari, V.: Detecting people looking at each other in videos. Int. J. Comput. Vision 106(3), 282–296 (2014)
Patronperez, A., Marszalek, M., Reid, I., Zisserman, A.: Structured learning of human interactions in tv shows. IEEE Trans. Pattern Anal. Mach. Intell. 34(12), 2441–2453 (2012)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Schmid, C., Zisserman, A.: Human focused action localization in video. In: European Conference on Computer Vision, pp. 219–233 (2010)
Shrivastava, A., Gupta, A., Girshick, R.: Training region-based object detectors with online hard example mining. In: Computer Vision and Pattern Recognition, pp. 761–769 (2016)
Stewart, R., Andriluka, M., Ng, A.Y.: End-to-end people detection in crowded scenes. In: Computer Vision and Pattern Recognition, pp. 2325–2333 (2016)
Vu, T.H., Osokin, A., Laptev, I.: Context-aware CNNs for person head detection. In: IEEE International Conference on Computer Vision, pp. 2893–2901 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Le, C., Ma, H. (2018). A Method of Detecting Human Head by Eliminating Redundancy in Dataset. In: Wang, Y., Jiang, Z., Peng, Y. (eds) Image and Graphics Technologies and Applications. IGTA 2018. Communications in Computer and Information Science, vol 875. Springer, Singapore. https://doi.org/10.1007/978-981-13-1702-6_57
Download citation
DOI: https://doi.org/10.1007/978-981-13-1702-6_57
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1701-9
Online ISBN: 978-981-13-1702-6
eBook Packages: Computer ScienceComputer Science (R0)