Human Area Refinement for Human Detection

  • Rong XuEmail author
  • Satoshi Ueno
  • Tatsuya Kobayashi
  • Naoya Makibuchi
  • Sei Naito
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9280)


Human detection technologies are very useful tools to understand human activity for various purposes, such as surveillance. Recently, tracking-by-detection methods have also become popular for analyzing human activity, but their performance is greatly affected by the accuracy of detected human areas because they use online learning based on the detected results. In order to improve the performance of such tracking methods, the inclination of human bodies in the image is considered as a way to refine the detected human bounding boxes. Based on background subtraction and a novel scheme of estimating human foot position, a refinement scheme is proposed to estimate a bounding box more accurately, which can better fit the contours of inclined human bodies than the conventional method. Experimental results illustrated that the bounding boxes refined by the proposed algorithm achieved a higher cover rate of 92.7 % and a smaller mean angle error of 0.7° compared with the cover rate of 83.7 % and mean angle error of 3.8° obtained using the conventional method, as determined by comparison with the ground truth, and a real-time detection speed of 32.3 fps on a 640 × 480 video has been realized. Thus, tracking performance is significantly enhanced by refining the human areas, with a mean improvement of 42.4 % in the F-measure when compared with the conventional method.


Human detection Background subtraction Foot position estimation Refinement scheme Human tracking 


  1. 1.
    Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 886–893 (2005)Google Scholar
  2. 2.
    Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: BMVC, vol. 3, p. 5 (2009)Google Scholar
  3. 3.
    Mu, Y., Yan, S., Liu, Y., Huang, T., Zhou, B.: Discriminative local binary patterns for human detection in personal album. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)Google Scholar
  4. 4.
    Wu, J., Geyer, C., Rehg, J.M.: Real-time human detection using contour cues. In: 2011 IEEE International Conference on Robotics and Automation (ICRA), pp. 860–867 (2011)Google Scholar
  5. 5.
    Hare, S., Saffari, A., Torr, P.H.: Struck: structured output tracking with kernels. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 263–270 (2011)Google Scholar
  6. 6.
    Satoh, Y., Tanahashi, H., Wang, C., Kaneko, S.I., Niwa, Y., Yamamoto, K.: Robust event detection by radial reach filter (RRF). In: 16th International Conference on Pattern Recognition, pp. 623–626 (2002)Google Scholar
  7. 7.
    Papageorgiou, C., Poggio, T.: A trainable system for object detection. International Journal of Computer Vision 38(1), 15–33 (2000)CrossRefzbMATHGoogle Scholar
  8. 8.
    Wu, B., Nevatia, R.: Detection and tracking of multiple, partially occluded humans by bayesian combination of edgelet based part detectors. International Journal of Computer Vision 75(2), 247–266 (2007)CrossRefGoogle Scholar
  9. 9.
    Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)Google Scholar
  10. 10.
    Zhu, Q., Yeh, M.-C., Cheng, K.-T., Avidan, S.: Fast human detection using a cascade of histograms of oriented gradients. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1491–1498 (2006)Google Scholar
  11. 11.
    Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: IEEE 12th International Conference on Computer Vision, pp. 32–39 (2009)Google Scholar
  12. 12.
    Swain, M.J., Ballard, D.H.: Color indexing. International Journal of Computer Vision 7(1), 11–32 (1991)CrossRefGoogle Scholar
  13. 13.
    Shapiro, R.: Direct linear transformation method for three-dimensional cinematography. Research Quarterly American Alliance for Health, Physical Education and Recreation 49(2), 197–205 (1978)Google Scholar
  14. 14.
    Smith, K., Gatica-Perez, D., Odobez, J.-M., Ba, S.: Evaluating multi-object tracking. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition-Workshops, pp. 36–43 (2005)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Rong Xu
    • 1
    Email author
  • Satoshi Ueno
    • 1
  • Tatsuya Kobayashi
    • 1
  • Naoya Makibuchi
    • 1
  • Sei Naito
    • 1
  1. 1.KDDI R&D Laboratories Inc.Fujimino-shi, SaitamaJapan

Personalised recommendations