Pedestrian Detection in Unmanned Aerial Vehicle Scene

  • Qianqian Guo
  • Yihao Li
  • Dong WangEmail author
Part of the Studies in Computational Intelligence book series (SCI, volume 810)


With the increasing adoption of unmanned aerial vehicles (UAVs), pedestrian detection with use of such vehicles has been attracting attention. Object detection algorithms based on deep learning have considerably progressed in recent years, but applying existing research results directly to the UAV perspective is difficult. Therefore, in this study, we present a new dataset called UAVs-Pedestrian, which contains various scenes and angles, for improving test results. To validate our dataset, we use the classical detection algorithms SSD, YOLO, and Faster-RCNN. Findings indicate that our dataset is challenging and conducive to the study of pedestrian detection using UAVs.


Pedestrian detection UAVs-Pedestrian dataset Deep learning 



This work was supported by the Fundamental Research Funds for the Central Universities (No. DUT18JC30) and Undergraduate Innovation and Entrepreneurship Training Program (No. 2018101410201011075).


  1. 1.
    Cai, Z., Fan, Q., Feris, R.S., Vasconcelos, N.: A unified multi-scale deep convolutional neural network for fast object detection. In: European Conference on Computer Vision, pp. 354–370 (2016)Google Scholar
  2. 2.
    Girshick, R.: Fast R-CNN. IEEE Int. Conf. Comput. Vis. 1440–1448 (2015)Google Scholar
  3. 3.
    Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137 (2017)CrossRefGoogle Scholar
  4. 4.
    Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.E., Fu, C., Berg, A.C.: SSD: single shot multibox detector. Eur. Conf. Comput. Vis. 21–37 (2016)Google Scholar
  5. 5.
    Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. IEEE Conf. Comput. Vis. Pattern Recognit. 779–788 (2016)Google Scholar
  6. 6.
    Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. IEEE Conf. Comput. Vis. Pattern Recognit. 6517–6525 (2017)Google Scholar
  7. 7.
    Everingham, M., Gool, L.J.V., Williams, C.K.I., Winn, J.M., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)Google Scholar
  8. 8.
    Lin, T., Maire, M., Belongie, S.J., Hays, J., Perona, P., Ramanan, D., Dollar, P., Zitnick, C.L.: Microsoft COCO: common objects in context. Eur. Conf. Comput. Vis. 740–755, (2014)Google Scholar
  9. 9.
    Redmon, J., FarhadiRedmon, A.: YOLOv3: an incremental improvement. IEEE Conf. Comput. Vis. Pattern Recognit. (2018)Google Scholar
  10. 10.
    Wang, L.: Places205-VGGNet models for scene recognition. IEEE Conf. Comput. Vis. Pattern Recognit. 1135–1155 (2015)Google Scholar
  11. 11.
    Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., Berg, A.C.: SSD: single shot multibox detector. Eur. Conf. Comput. Vis. 21–37 (2016)Google Scholar
  12. 12.
    Huang, J., Rathod, V., Sun, C., Zhu, M., Balan, A.K., Fathi, A., Fischer, I., Wojna, Z., Song, Y., Guadarrama, S., Murphy, K.: Speed/Accuracy Trade-offs for Modern Convolutional Object Detectors (2016). arXiv:1611.10012

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.School of Information and Communication EngineeringDalian University of TechnologyDalianChina

Personalised recommendations