Abstract
In order to accurately detect the number of birds around the transmission line, promptly drive the birds away to ensure the normal operation of the line, a DC-YOLO model is designed. This model is based on the deep learning target detection algorithm YOLO V3 and proposes two improvements: Replacing the convolutional layer in the original network with dilated convolution to maintain a larger receptive field and higher resolution, improving the model’s accuracy for small targets; The confidence score of the detection frame is updated by calculating the scale factor, and the detection frame with a score lower than the threshold is finally removed. The NMS algorithm is optimized to improve the model’s ability to detect occluded birds. Experimental results show that the DC-YOLO model detection accuracy can reach 86.31%, which can effectively detect birds around transmission lines.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Le Cun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning, pp. 1–33. MIT Press, Cambridge (2016)
Nordeng, I.E., Hasan, A., Olsen, D., Neubert, J.: DEBC detection with deep learning. In: Sharma, P., Bianchi, F.M. (eds.) SCIA 2017. LNCS, vol. 10269, pp. 248–259. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59126-1_21
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Tumas, P., Serackis, A.: Automated image annotation based on YOLOv3. In: 2018 IEEE 6th Workshop on Advances in Information, Electronic and Electrical Engineering (AIEEE), pp. 1–3. IEEE (2018)
Qu, H., Yuan, T., Sheng, Z., et al.: A pedestrian detection method based on YOLOv3 model and image enhanced by Retinex. In: 2018 11th International Congress on Image and Signal Processing, Bio Medical Engineering and Informatics (CISP-BMEI), pp. 1–5. IEEE (2018)
Yu, F., Koltun, V., Funkhouser, T.: Dilated residual networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 472–480 (2017)
Wang, Y., Wang, G., Chen, C., Pan, Z.: Multi-scale dilated convolution of convolutional neural network for image denoising. Multimed. Tools Appl. 78(14), 19945–19960 (2019)
Vo, D.M., Lee, S.W.: Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions. Multimed. Tools Appl. 77(14), 18689–18707 (2018)
Geng, L., Zhang, S., Tong, J., Xiao, Z.: Lung segmentation method with dilated convolution based on VGG-16 network. Comput. Assist. Surg. 24(sup2), 27–33 (2019)
Bodla, N., Singh, B., Chellappa, R., et al.: Soft-NMS–improving object detection with one line of code. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5561–5569 (2017)
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp. 770–778 (2016)
Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In:Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 89–95 (2018)
Rosenfeld, A., Thurston, M.: Edge and curve detection for visual scene analysis. IEEE Trans. Comput. 20(5), 562–569 (1971)
Hosang, J., Benenson, R., Schiele, B.: A convnet for non-maximum suppression. In: Rosenhahn, B., Andres, B. (eds.) GCPR 2016. LNCS, vol. 9796, pp. 192–204. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45886-1_16
Hosang, J., Benenson, R., Schiele, B.: Learning non-maximum suppression. In: Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 6469–6477. IEEE (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 IFIP International Federation for Information Processing
About this paper
Cite this paper
Zou, C., Liang, Yq. (2020). Bird Detection on Transmission Lines Based on DC-YOLO Model. In: Shi, Z., Vadera, S., Chang, E. (eds) Intelligent Information Processing X. IIP 2020. IFIP Advances in Information and Communication Technology, vol 581. Springer, Cham. https://doi.org/10.1007/978-3-030-46931-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-030-46931-3_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-46930-6
Online ISBN: 978-3-030-46931-3
eBook Packages: Computer ScienceComputer Science (R0)