Abstract
In this paper, the task of automatic person detection in thermal images using convolutional neural network-based models originally intended for detection in RGB images is investigated. The performance of the standard YOLOv3 model is compared with a custom trained model on a dataset of thermal images extracted from videos recorded at night in clear weather, rain and fog, at different ranges and with different types of movement – running, walking and sneaking. The experiments show excellent results in terms of average precision for all tested scenarios, and a significant improvement of performance for person detection in thermal imaging with a modest training set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, p. I (2001)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 886–893 (2005)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Liu, W., et al.: SSD: single shot multi-box detector. At a European Conference on Computer Vision, pp. 21–37 (2016)
He, K., Gkioxari, G., Dollar, P.: Mask R-CNN. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2980–2988 (2017)
Dai, J., Li, Y., He, K., Sun, J.: R-FCN: object detection via region-based fully convolutional networks. In: Advances in Neural Information Processing Systems, pp. 379–387 (2016)
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Chang, S., Yang, F., Wu, W., Cho, Y., Chen, S.: Nighttime pedestrian detection using thermal imaging based on HOG feature. In: Proceedings 2011 International Conference on System Science and Engineering, Macao, pp. 694–698 (2011)
Ge, J., Luo, Y., Tei, G.: Real-time pedestrian detection and tracking at nighttime for driver-assistance systems. IEEE Trans. Intell. Transp. Syst. 10(2), 283–298 (2009)
Davis, J.W., Keck, M.A.: A two-stage template approach to person detection in thermal imagery. In: 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION 2005), Breckenridge, CO, vol. 1, pp. 364–369 (2005)
Redmon, J., Farhadi, A.: Yolov3: an incremental improvement, arXiv preprint arXiv:1804.02767 (2018)
Buric, M., Pobar, M., Ivasic-Kos, M.: Object detection in sports videos. In: 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) (2018)
Burić, M., Pobar, M., Ivašić-Kos, M.: Ball detection using YOLO and Mask R-CNN. In: 5th Annual Conference on Computational Science & Computational Intelligence (CSCI 2018), Las Vegas, USA (2018)
Kristo, M., Ivasic-Kos, M.: An overview of thermal face recognition methods. In: 2018 41st International Convention on Information and Communication Technology, Electronics, and Microelectronics (MIPRO) (2018)
Wu, Z., Fuller, N., Theriault, D., Betke, M.: A thermal infrared video benchmark for visual analysis. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 201–208 (2014)
Bhowmik, M.K., et al.: Thermal infrared face recognition–a biometric identification technique for robust security system. In: Reviews, Refinements and New Ideas in Face Recognition. InTech (2011)
Tanda, G.: The use of infrared thermography to detect the skin temperature response to physical activity. J. Phys: Conf. Ser. 655(1), 012062 (2015)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger, arXiv preprint (2017)
Lin, T.-Y., et al.: Feature pyramid networks for object detection, arXiv preprint arXiv:1804.02767 (2018)
Lin, T.-Y., et al.: Microsoft coco: common objects in context. In: European Conference on Computer Vision, pp. 740–755 (2014)
Buric, M., Pobar, M., Ivasic-Kos, M.: Adapting YOLO network for ball and player detection. In: ICPRAM (2019)
Krišto, M., Ivašić-Kos, M.: Thermal imaging dataset for person detection. In: 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) (2019, in press)
Ivasic-Kos, M., Kristo, M., Pobar, M.: Human detection in thermal imaging using YOLO. In: ICCTA (2019)
Dutta, A., Gupta, A., Zissermann, A.: VGG image annotator (VIA) (2016). http://www.robots.ox.ac.uk/~vgg/software/via
Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)
Acknowledgment
This research was fully supported by the Croatian Science Foundation under the project IP-2016-06-8345 “Automatic recognition of actions and activities in multimedia content from the sports domain” (RAASS).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Ivasic-Kos, M., Kristo, M., Pobar, M. (2020). Person Detection in Thermal Videos Using YOLO. In: Bi, Y., Bhatia, R., Kapoor, S. (eds) Intelligent Systems and Applications. IntelliSys 2019. Advances in Intelligent Systems and Computing, vol 1038. Springer, Cham. https://doi.org/10.1007/978-3-030-29513-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-29513-4_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29512-7
Online ISBN: 978-3-030-29513-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)