Abstract
Two-dimensional image based human detection methods have been widely used in surveillance system. However, detecting human in the presence of occlusion is still a challenge for such image based systems. In this paper, a human detection method aiming to handle occlusions by using the depth data obtained from 3D imaging methods, such as those easily acquired from the Microsoft Kinect depth sensor, is proposed. In the context of surveillance setting, background subtraction on the depth data can be used to extract foreground regions which may correspond to humans. The proposed method analyzes the 3D data of the foreground regions using a “split-merge” approach. Over-segmentation and clustering are preformed on foreground regions followed by the height validation. Experimental results demonstrate that the proposed method outperforms two state-of-art human detection methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ren, X., Malik, J.: Learning a classification model for segmentation. In: IEEE International Conference on Computer Vision, pp. 10–17 (2003)
Viola, P.A., Jones, M.J., Snow, D.: Detecting pedestrians using patterns of motion and appearance. In: IEEE International Conference on Computer Vision, pp. 734–741 (2003)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 886–893 (2005)
Wu, B., Nevatia, R.: Detection of multiple, partially occluded humans in a single image by bayesian combination of edgelet part detectors. In: IEEE International Conference on Computer Vision, pp. 90–97 (2005)
Lin, Z., Davis, L.S., Doermann, D.S., DeMenthon, D.: Hierarchical part-template matching for human detection and segmentation. In: IEEE International Conference on Computer Vision, pp. 1–8 (2007)
Wang, X., Han, T.X., Yan, S.: An hog-lbp human detector with partial occlusion handling. In: IEEE International Conference on Computer Vision, pp. 32–39 (2009)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.A., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Transactions on Pattern Analysis and Machine Intelligence 32, 1627–1645 (2010)
Xu, F., Fujimura, K.: Human detection using depth and gray images. In: IEEE Conference on Advanced Video and Signal Based Surveillance, pp. 115–121 (2003)
Gavrila, D.M., Munder, S.: Multi-cue pedestrian detection and tracking from a moving vehicle. International Journal of Computer Vision 73, 41–59 (2007)
Ikemura, S., Fujiyoshi, H.: Real-Time Human Detection Using Relational Depth Similarity Features. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part IV. LNCS, vol. 6495, pp. 25–38. Springer, Heidelberg (2011)
Spinello, L., Arras, K.O.: People detection in rgb-d data. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 3838–3843 (2011)
Choi, W., Pantofaru, C., Savarese, S.: Detecting and tracking people using an rgb-d camera via multiple detector fusion. In: IEEE International Conference on Computer Vision Workshops, pp. 1076–1083 (2011)
Gill, T., Keller, J.M., Anderson, D.T., Luke III, R.H.: A system for change detection and human recognition in voxel space using the microsoft kinect sensor. In: IEEE Applied Imagery Pattern Recognition Workshop, pp. 1–8 (2011)
Xia, L., Chen, C.C., Aggarwal, J.K.: Human detection using depth information by kinect. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 15–22 (2011)
Gonzalez, R.C., Woods, R.E.: Digital image processing. Prentice-Hall, Inc., Upper Saddle River (2006)
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 888–905 (2000)
Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and an algorithm. In: Advances in Neural Information Processing Systems, pp. 849–856 (2001)
Chung, F.R.K.: Spectral graph theory. In: CBMS Regional Conference Series in Mathematics, vol. 92, pp. 1–212. American Mathematical Society (1997)
Everingham, M., Van Gool, L.J., Williams, C.K.I., Winn, J.M., Zisserman, A.: The pascal visual object classes (voc) challenge. International Journal of Computer Vision 88, 303–338 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, L., Chan, K.L., Wang, G. (2013). Human Detection with Occlusion Handling by Over-Segmentation and Clustering on Foreground Regions. In: Park, JI., Kim, J. (eds) Computer Vision - ACCV 2012 Workshops. ACCV 2012. Lecture Notes in Computer Science, vol 7729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37484-5_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-37484-5_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37483-8
Online ISBN: 978-3-642-37484-5
eBook Packages: Computer ScienceComputer Science (R0)