Deep Learning Based Pedestrian Detection at Distance in Smart Cities

Dinakaran, Ranjith K; Easom, Philip; Bouridane, Ahmed; Zhang, Li; Jiang, Richard; Mehboob, Fozia; Rauf, Abdul

doi:10.1007/978-3-030-29513-4_43

Ranjith K Dinakaran¹⁷,
Philip Easom¹⁷,
Ahmed Bouridane¹⁷,
Li Zhang¹⁷,
Richard Jiang¹⁹,
Fozia Mehboob¹⁸ &
…
Abdul Rauf¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1038))

Included in the following conference series:

Proceedings of SAI Intelligent Systems Conference

2383 Accesses
9 Citations

Abstract

Generative adversarial networks (GANs) have been promising for many computer vision problems due to their powerful capabilities to enhance the data for training and test. In this paper, we leveraged GANs and proposed a new architecture with a cascaded Single Shot Detector (SSD) for pedestrian detection at distance, which is yet a challenge due to the varied sizes of pedestrians in videos at distance. To overcome the low-resolution issues in pedestrian detection at distance, DCGAN is employed to improve the resolution first to reconstruct more discriminative features for a SSD to detect objects in images or videos. A crucial advantage of our method is that it learns a multi-scale metric to distinguish multiple objects at different distances under one image, while DCGAN serves as an encoder-decoder platform to generate parts of an image that contain better discriminative information. To measure the effectiveness of our proposed method, experiments were carried out on the Canadian Institute for Advanced Research (CIFAR) dataset, and it was demonstrated that the proposed new architecture achieved a much better detection rate, particularly on vehicles and pedestrians at distance, making it highly suitable for smart cities applications that need to discover key objects or pedestrians at distance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Dinakaran, R., et al.: Image resolution impact analysis on pedestrian detection in smart cities surveillance. In: Proceedings of the 1st International Conference on Internet of Things & Ma-chine Learning (IML 2017) (2017)
Google Scholar
Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal net-works. In: NIPS (2015)
Google Scholar
Dosovitskiy, A., et al.; Discriminative unsupervised feature learning with exemplar convolutional neural networks. IEEE Trans. Pattern Anal. Mach. Intell. 99 (2015)
Google Scholar
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: SSD: single shot multibox detector. In: ECCV (2016)
Google Scholar
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., WardeFarley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: NIPS (2014)
Google Scholar
Nguyen, A., et al.: Plug & play generative networks: conditional iterative generation of images in latent space. In: CVPR (2017)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)
Google Scholar
Denton, E.L., et al.: Deep generative image models using a laplacian pyramid of adversarial networks. In: NIPS, vol. 2, pp. 1486–1494 (2015)
Google Scholar
Ledig, C., et al.: Photo-realistic single image super resolution using a generative adversarial network. In: CVPR open access version by computer vision foundation, CVPR (2017)
Google Scholar
Chen, X., et al.: InfoGAN: interpretable representation learning by information maximizing generative adversarial nets. In: Advances in Neural Information Processing Systems (NIPS 2016), vol. 29 (2019)
Google Scholar
Finn, C., et al.: Unsupervised learning for physical interaction through video prediction. In: Advances in Neural information Processing Systems, NIPS, vol. 29 (2016)
Google Scholar
Mathieu, M., et al.: Deep multi-scale video prediction beyond mean square error. In: ICLR (2016)
Google Scholar
Storey, G., Jiang, R., Bouridane, A.: Role for 2D image generated 3D face models in the rehabilitation of facial palsy. IET Healthcare Technology Letters (2017)
Google Scholar
Storey, G., Bouridane, A., Jiang, R.: Integrated deep model for face detection and landmark localisation from in the wild image, IEEE Access (in press)
Google Scholar
Jiang, R., Ho, A.T., Cheheb, I., Al-Maadeed, N., Al-Maadeed, S., Bouridane, A.: Emotion recognition from scrambled facial images via many graph embedding. Pattern Recogn. 67, 245–251 (2017)
Article Google Scholar
Jiang, R., Al-Maadeed, S., Bouridane, A., Crookes, D., Celebi, M.E.: Face recognition in the scrambled domain via salience-aware ensembles of many kernels. IEEE Trans. Inf. Forensics Secur. 11(8), 1807–1817 (2016)
Article Google Scholar
Jiang, R., Bouridane, A., Crookes, D., Celebi, M.E., Wei, H.L.: Privacy-protected facial biometric verification via fuzzy forest learning. IEEE Trans. Fuzzy Syst. 24(4), 779–790 (2016)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Computer and Information Sciences, Northumbria University, Newcastle upon Tyne, UK
Ranjith K Dinakaran, Philip Easom, Ahmed Bouridane & Li Zhang
Computer Science, Imam Mohammed ibn Saud Islamic University, Riyadh, Kingdom of Saudi Arabia
Fozia Mehboob & Abdul Rauf
The School of Computing and Communication, Lancaster, UK
Richard Jiang

Authors

Ranjith K Dinakaran
View author publications
You can also search for this author in PubMed Google Scholar
Philip Easom
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Bouridane
View author publications
You can also search for this author in PubMed Google Scholar
Li Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Richard Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Fozia Mehboob
View author publications
You can also search for this author in PubMed Google Scholar
Abdul Rauf
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ranjith K Dinakaran .

Editor information

Editors and Affiliations

School of Computing, Computer Science Research Institute, Ulster University, Newtownabbey, UK
Yaxin Bi
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Rahul Bhatia
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Supriya Kapoor

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dinakaran, R.K. et al. (2020). Deep Learning Based Pedestrian Detection at Distance in Smart Cities. In: Bi, Y., Bhatia, R., Kapoor, S. (eds) Intelligent Systems and Applications. IntelliSys 2019. Advances in Intelligent Systems and Computing, vol 1038. Springer, Cham. https://doi.org/10.1007/978-3-030-29513-4_43

Download citation

DOI: https://doi.org/10.1007/978-3-030-29513-4_43
Published: 24 August 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29512-7
Online ISBN: 978-3-030-29513-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics