Abstract
Video surveillance based crowd counting is important for crowd management and public security. It is a challenge task due to the cluttered background, ambiguous foreground and diverse crowd distributions. In this paper, we propose an end-to-end crowd counting method with convolutional neural networks, which integrates original frames and motion cues for learning a deep crowd counting regressor. The original frames and motion cues are complementary to each other for counting the stationary and moving pedestrians. Experimental results on two widely-used crowd counting datasets demonstrate the effectiveness of our method, and achieve the state-of-the-art performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wu, B., Nevatia, R.: Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. In: ICCV (2005)
Li, M., Zhang, Z., Huang, K., Tan, T.: Estimating the number of people in crowded scenes by MID based foreground segmentation and head-shoulder detection. In: ICPR (2008)
Wang, M., Wang, X.: Automatic adaptation of a generic pedestrian detector to a specific traffic scene. In: CVPR (2011)
Lin, Z., Davis, L.S.: Shape-based human detection and segmentation via hierarchical part-template matching. IEEE Trans. Pattern Anal. Mach. Intell. 32, 604–618 (2010)
Rabaud, V., Belongie, S.: Counting crowded moving objects. In: CVPR (2006)
Brostow, G.J., Cipolla, R.: Unsupervised Bayesian detection of independent motion in crowds. In: CVPR (2006)
Chan, A.B., Liang, Z.S.J., Vasconcelos, N.: Privacy preserving crowd monitoring: counting people without people models or tracking. In: CVPR (2008)
Chan, A.B., Vasconcelos, N.: Bayesian poisson regression for crowd counting. In: ICCV (2009)
Chen, K., Loy, C.C., Gong, S., Xiang, T.: Feature mining for localised crowd counting. In: BMVC (2012)
Chen, K., Gong, S., Xiang, T., Mary, Q., Loy, C.C.: Cumulative attribute space for age and crowd density estimation. In: CVPR (2013)
Liu, B., Vasconcelos, N.: Bayesian model adaptation for crowd counts. In: ICCV (2015)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: NIPS (2012)
Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., LeCun, Y.: Overfeat: integrated recognition, localization and detection using convolutional networks. arXiv:1312.6229 (2013)
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: DeCAF: a deep convolutional activation feature for generic visual recognition. In: ICML (2014)
Zhang, C., Li, H., Wang, X., Yang, X.: Cross-scene crowd counting via deep convolutional neural networks. In: CVPR (2015)
Zhang, Y., Zhou, D., Chen, S., Gao, S., Ma, Y.: Single-image crowd counting via multi-column convolutional neural network. In: CVPR (2016)
Kang, K., Wang, X.: Fully convolutional neural networks for crowd segmentation. arXiv:1411.4464 (2014)
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: Convolutional architecture for fast feature embedding. arXiv:1408.5093 (2014)
Saunders, C., Gammerman, A., Vvovk, V.: Ridge regression learning algorithm in dual variables. In: ICML (2015)
An, S., Liu, W., Venkatesh, S.: Face recognition using kernel ridge regression. In: CVPR (2007)
Wu, X., Liang, G., Lee, K.K., Xu, Y.: Crowd density estimation using texture analysis and learning. In: IEEE International Conference on Robotics and Biomimetics (2006)
Acknowledgment
This work is funded by the National Natural Science Foundation of China (Grant No. 61602433 and Grant No. 61472386), and the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant XDA06040103). The two Titan X GPUs used for this research were donated by the NVIDIA Corporation.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, D., Li, Z., Liu, P. (2016). Surveillance Based Crowd Counting via Convolutional Neural Networks. In: Zhang, Z., Huang, K. (eds) Intelligent Visual Surveillance. IVS 2016. Communications in Computer and Information Science, vol 664. Springer, Singapore. https://doi.org/10.1007/978-981-10-3476-3_17
Download citation
DOI: https://doi.org/10.1007/978-981-10-3476-3_17
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3475-6
Online ISBN: 978-981-10-3476-3
eBook Packages: Computer ScienceComputer Science (R0)