Abstract
People counting is an important task in video surveillance applications. It can provide statistic information for shopping centers and other public buildings or knowledge of the current number of people in a building in a case of an emergency. This paper describes a real-time people counting system based on a vertical Kinect depth sensor. Processing pipeline of the system includes depth map improvement, a novel approach to head segmentation, and continuous tracking of head segments. The head segmentation is based on an adaptation of the region-growing segmentation approach with thresholding. The tracking of segments combines minimum-weighted bipartite graph matchings and prediction of object movement to eliminate inaccuracy of segmentation. Results of evaluatation realized on datasets from a shopping center (more than 23 hours of recordings) show that the system can handle almost all real-world situations with high accuracy.
The original version of this chapter was revised: The copyright line was incorrect. This has been corrected. The Erratum to this chapter is available at DOI: 10.1007/978-3-319-02895-8_64
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bevilacqua, A., Di Stefano, L., Azzari, P.: People tracking using a time-of-flight depth sensor. In: IEEE International Conference on Video and Signal Based Surveillance, AVSS 2006, pp. 89–89 (2006)
Camplani, M., Salgado, L.: Efficient spatio-temporal hole filling strategy for kinect depth maps. In: Proc. SPIE 8290, Three-Dimensional Image Processing (3DIP) and Applications II (2012)
Chowdhury, A.S., Chatterjee, R., Ghosh, M., Ray, N.: Cell tracking in video microscopy using bipartite graph matching. In: 2010 20th International Conference on Pattern Recognition (ICPR), pp. 2456–2459 (2010)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 1, pp. 886–893 (2005)
Desai, C., Ramanan, D., Fowlkes, C.: Discriminative models for multi-class object layout. International Journal of Computer Vision 95, 1–12 (2011)
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. Pattern Analysis and Machine Intelligence 32(9), 1627–1645 (2010)
Han, J., Shao, L., Xu, D., Shotton, J.: Enhanced computer vision with microsoft kinect sensor: A review. IEEE Transactions on Cybernetics (2013)
Kong, D., Gray, D., Tao, H.: Counting pedestrians in crowds using viewpoint invariant training. In: British Machine Vision Conf. Citeseer (2005)
Kuhn, H.W.: The hungarian method for the assignment problem. Naval Research Logistics Quarterly 2(1-2), 83–97 (1955)
Marana, A., Costa, L.F., Lotufo, R., Velastin, S.: On the efficacy of texture analysis for crowd monitoring. In: International Symposium on Proceedings of the Computer Graphics, Image Processing, and Vision, SIBGRAPI 1998, pp. 354–361 (1998)
Qi, F., Han, J., Wang, P., Shi, G., Li, F.: Structure guided fusion for depth map inpainting. Pattern Recognition Letters 34(1), 70–76 (2013)
Tanner, R., Studer, M., Zanoli, A., Hartmann, A.: People detection and tracking with tof sensor. In: IEEE Fifth International Conference on Advanced Video and Signal Based Surveillance, AVSS 2008, pp. 356–361 (2008)
Viola, P., Jones, M., Snow, D.: Detecting pedestrians using patterns of motion and appearance. In: Proceedings of the Ninth IEEE International Conference on Computer Vision, vol. 2, pp. 734–741 (2003)
Yu, Y., Song, Y., Zhang, Y., Wen, S.: A shadow repair approach for kinect depth maps. In: Lee, K., Matsushita, Y., Rehg, J., Hu, Z. (eds.) ACCV 2012, Part IV. LNCS, vol. 7727, pp. 615–626. Springer, Heidelberg (2013)
Zhang, X., Yan, J., Feng, S., Lei, Z., Yi, D., Li, S.: Water filling: Unsupervised people counting via vertical kinect sensor. In: 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance (AVSS), pp. 215–220 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Galčík, F., Gargalík, R. (2013). Real-Time Depth Map Based People Counting. In: Blanc-Talon, J., Kasinski, A., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2013. Lecture Notes in Computer Science, vol 8192. Springer, Cham. https://doi.org/10.1007/978-3-319-02895-8_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-02895-8_30
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02894-1
Online ISBN: 978-3-319-02895-8
eBook Packages: Computer ScienceComputer Science (R0)