Skip to main content

Person Detection in Thermal Videos Using YOLO

  • Conference paper
  • First Online:
Intelligent Systems and Applications (IntelliSys 2019)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1038))

Included in the following conference series:

Abstract

In this paper, the task of automatic person detection in thermal images using convolutional neural network-based models originally intended for detection in RGB images is investigated. The performance of the standard YOLOv3 model is compared with a custom trained model on a dataset of thermal images extracted from videos recorded at night in clear weather, rain and fog, at different ranges and with different types of movement – running, walking and sneaking. The experiments show excellent results in terms of average precision for all tested scenarios, and a significant improvement of performance for person detection in thermal imaging with a modest training set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, p. I (2001)

    Google Scholar 

  2. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 886–893 (2005)

    Google Scholar 

  3. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  4. Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)

    Google Scholar 

  5. Liu, W., et al.: SSD: single shot multi-box detector. At a European Conference on Computer Vision, pp. 21–37 (2016)

    Google Scholar 

  6. He, K., Gkioxari, G., Dollar, P.: Mask R-CNN. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2980–2988 (2017)

    Google Scholar 

  7. Dai, J., Li, Y., He, K., Sun, J.: R-FCN: object detection via region-based fully convolutional networks. In: Advances in Neural Information Processing Systems, pp. 379–387 (2016)

    Google Scholar 

  8. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)

    Google Scholar 

  9. Chang, S., Yang, F., Wu, W., Cho, Y., Chen, S.: Nighttime pedestrian detection using thermal imaging based on HOG feature. In: Proceedings 2011 International Conference on System Science and Engineering, Macao, pp. 694–698 (2011)

    Google Scholar 

  10. Ge, J., Luo, Y., Tei, G.: Real-time pedestrian detection and tracking at nighttime for driver-assistance systems. IEEE Trans. Intell. Transp. Syst. 10(2), 283–298 (2009)

    Article  Google Scholar 

  11. Davis, J.W., Keck, M.A.: A two-stage template approach to person detection in thermal imagery. In: 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION 2005), Breckenridge, CO, vol. 1, pp. 364–369 (2005)

    Google Scholar 

  12. Redmon, J., Farhadi, A.: Yolov3: an incremental improvement, arXiv preprint arXiv:1804.02767 (2018)

  13. Buric, M., Pobar, M., Ivasic-Kos, M.: Object detection in sports videos. In: 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) (2018)

    Google Scholar 

  14. Burić, M., Pobar, M., Ivašić-Kos, M.: Ball detection using YOLO and Mask R-CNN. In: 5th Annual Conference on Computational Science & Computational Intelligence (CSCI 2018), Las Vegas, USA (2018)

    Google Scholar 

  15. Kristo, M., Ivasic-Kos, M.: An overview of thermal face recognition methods. In: 2018 41st International Convention on Information and Communication Technology, Electronics, and Microelectronics (MIPRO) (2018)

    Google Scholar 

  16. Wu, Z., Fuller, N., Theriault, D., Betke, M.: A thermal infrared video benchmark for visual analysis. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 201–208 (2014)

    Google Scholar 

  17. Bhowmik, M.K., et al.: Thermal infrared face recognition–a biometric identification technique for robust security system. In: Reviews, Refinements and New Ideas in Face Recognition. InTech (2011)

    Google Scholar 

  18. Tanda, G.: The use of infrared thermography to detect the skin temperature response to physical activity. J. Phys: Conf. Ser. 655(1), 012062 (2015)

    Google Scholar 

  19. Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger, arXiv preprint (2017)

    Google Scholar 

  20. Lin, T.-Y., et al.: Feature pyramid networks for object detection, arXiv preprint arXiv:1804.02767 (2018)

  21. Lin, T.-Y., et al.: Microsoft coco: common objects in context. In: European Conference on Computer Vision, pp. 740–755 (2014)

    Chapter  Google Scholar 

  22. Buric, M., Pobar, M., Ivasic-Kos, M.: Adapting YOLO network for ball and player detection. In: ICPRAM (2019)

    Google Scholar 

  23. Krišto, M., Ivašić-Kos, M.: Thermal imaging dataset for person detection. In: 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) (2019, in press)

    Google Scholar 

  24. Ivasic-Kos, M., Kristo, M., Pobar, M.: Human detection in thermal imaging using YOLO. In: ICCTA (2019)

    Google Scholar 

  25. Dutta, A., Gupta, A., Zissermann, A.: VGG image annotator (VIA) (2016). http://www.robots.ox.ac.uk/~vgg/software/via

  26. Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)

    Article  Google Scholar 

Download references

Acknowledgment

This research was fully supported by the Croatian Science Foundation under the project IP-2016-06-8345 “Automatic recognition of actions and activities in multimedia content from the sports domain” (RAASS).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marina Ivasic-Kos .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ivasic-Kos, M., Kristo, M., Pobar, M. (2020). Person Detection in Thermal Videos Using YOLO. In: Bi, Y., Bhatia, R., Kapoor, S. (eds) Intelligent Systems and Applications. IntelliSys 2019. Advances in Intelligent Systems and Computing, vol 1038. Springer, Cham. https://doi.org/10.1007/978-3-030-29513-4_18

Download citation

Publish with us

Policies and ethics