Skip to main content

A Method of Detecting Human Head by Eliminating Redundancy in Dataset

  • Conference paper
  • First Online:
Image and Graphics Technologies and Applications (IGTA 2018)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 875))

Included in the following conference series:

Abstract

The method of constructing an image dataset by sampling images from videos with a short interval keeps the information in the video but also brings redundancy and increases the training costs significantly. In this paper, we propose a method to detect human heads with less training cost and higher performance, including: (1) A filtering standard to screen out the useless image in video-based image dataset with almost the same average precision. (2) An effective head detection model with the fusion of shoulder context. We evaluate our method on a human head dataset – HollywoodHeads and achieve reasonably good performance. This result shows that our method is very useful in human head detection task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aziz, K.: Head detection based on skeleton graph method for counting people in crowded environments. J. Electron. Imaging 25(1), 013012 (2016)

    Article  Google Scholar 

  2. Felzenszwalb, P., Mcallester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, pp. 1–8 (2008)

    Google Scholar 

  3. Geiger, A.: Are we ready for autonomous driving? the kitti vision benchmark suite. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3354–3361 (2012)

    Google Scholar 

  4. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2015)

    Article  Google Scholar 

  5. Jafari, O.H., Mitzel, D., Leibe, B.: Real-time RGB-D based people detection and tracking for mobile robots and head-worn cameras. In: IEEE International Conference on Robotics and Automation, pp. 5636–5643 (2014)

    Google Scholar 

  6. Lin, T.Y., Dollr, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection (2017)

    Google Scholar 

  7. Marin-Jimenez, M.J., Zisserman, A., Eichner, M., Ferrari, V.: Detecting people looking at each other in videos. Int. J. Comput. Vision 106(3), 282–296 (2014)

    Article  Google Scholar 

  8. Patronperez, A., Marszalek, M., Reid, I., Zisserman, A.: Structured learning of human interactions in tv shows. IEEE Trans. Pattern Anal. Mach. Intell. 34(12), 2441–2453 (2012)

    Article  Google Scholar 

  9. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)

    Article  Google Scholar 

  10. Schmid, C., Zisserman, A.: Human focused action localization in video. In: European Conference on Computer Vision, pp. 219–233 (2010)

    Google Scholar 

  11. Shrivastava, A., Gupta, A., Girshick, R.: Training region-based object detectors with online hard example mining. In: Computer Vision and Pattern Recognition, pp. 761–769 (2016)

    Google Scholar 

  12. Stewart, R., Andriluka, M., Ng, A.Y.: End-to-end people detection in crowded scenes. In: Computer Vision and Pattern Recognition, pp. 2325–2333 (2016)

    Google Scholar 

  13. Vu, T.H., Osokin, A., Laptev, I.: Context-aware CNNs for person head detection. In: IEEE International Conference on Computer Vision, pp. 2893–2901 (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Huimin Ma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Le, C., Ma, H. (2018). A Method of Detecting Human Head by Eliminating Redundancy in Dataset. In: Wang, Y., Jiang, Z., Peng, Y. (eds) Image and Graphics Technologies and Applications. IGTA 2018. Communications in Computer and Information Science, vol 875. Springer, Singapore. https://doi.org/10.1007/978-981-13-1702-6_57

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-1702-6_57

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-1701-9

  • Online ISBN: 978-981-13-1702-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics