Bird Detection on Transmission Lines Based on DC-YOLO Model

Zou, Cong; Liang, Yong-quan

doi:10.1007/978-3-030-46931-3_21

Cong Zou¹⁸ &
Yong-quan Liang^18,19

Part of the book series: IFIP Advances in Information and Communication Technology ((IFIPAICT,volume 581))

Included in the following conference series:

International Conference on Intelligent Information Processing

651 Accesses
4 Citations

Abstract

In order to accurately detect the number of birds around the transmission line, promptly drive the birds away to ensure the normal operation of the line, a DC-YOLO model is designed. This model is based on the deep learning target detection algorithm YOLO V3 and proposes two improvements: Replacing the convolutional layer in the original network with dilated convolution to maintain a larger receptive field and higher resolution, improving the model’s accuracy for small targets; The confidence score of the detection frame is updated by calculating the scale factor, and the detection frame with a score lower than the threshold is finally removed. The NMS algorithm is optimized to improve the model’s ability to detect occluded birds. Experimental results show that the DC-YOLO model detection accuracy can reach 86.31%, which can effectively detect birds around transmission lines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Le Cun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)
Article Google Scholar
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning, pp. 1–33. MIT Press, Cambridge (2016)
MATH Google Scholar
Nordeng, I.E., Hasan, A., Olsen, D., Neubert, J.: DEBC detection with deep learning. In: Sharma, P., Bianchi, F.M. (eds.) SCIA 2017. LNCS, vol. 10269, pp. 248–259. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59126-1_21
Chapter Google Scholar
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Google Scholar
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Article Google Scholar
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Tumas, P., Serackis, A.: Automated image annotation based on YOLOv3. In: 2018 IEEE 6th Workshop on Advances in Information, Electronic and Electrical Engineering (AIEEE), pp. 1–3. IEEE (2018)
Google Scholar
Qu, H., Yuan, T., Sheng, Z., et al.: A pedestrian detection method based on YOLOv3 model and image enhanced by Retinex. In: 2018 11th International Congress on Image and Signal Processing, Bio Medical Engineering and Informatics (CISP-BMEI), pp. 1–5. IEEE (2018)
Google Scholar
Yu, F., Koltun, V., Funkhouser, T.: Dilated residual networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 472–480 (2017)
Google Scholar
Wang, Y., Wang, G., Chen, C., Pan, Z.: Multi-scale dilated convolution of convolutional neural network for image denoising. Multimed. Tools Appl. 78(14), 19945–19960 (2019)
Article Google Scholar
Vo, D.M., Lee, S.W.: Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions. Multimed. Tools Appl. 77(14), 18689–18707 (2018)
Article Google Scholar
Geng, L., Zhang, S., Tong, J., Xiao, Z.: Lung segmentation method with dilated convolution based on VGG-16 network. Comput. Assist. Surg. 24(sup2), 27–33 (2019)
Article Google Scholar
Bodla, N., Singh, B., Chellappa, R., et al.: Soft-NMS–improving object detection with one line of code. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5561–5569 (2017)
Google Scholar
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp. 770–778 (2016)
Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In:Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 89–95 (2018)
Google Scholar
Rosenfeld, A., Thurston, M.: Edge and curve detection for visual scene analysis. IEEE Trans. Comput. 20(5), 562–569 (1971)
Article Google Scholar
Hosang, J., Benenson, R., Schiele, B.: A convnet for non-maximum suppression. In: Rosenhahn, B., Andres, B. (eds.) GCPR 2016. LNCS, vol. 9796, pp. 192–204. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45886-1_16
Chapter Google Scholar
Hosang, J., Benenson, R., Schiele, B.: Learning non-maximum suppression. In: Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 6469–6477. IEEE (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao, China
Cong Zou & Yong-quan Liang
Provincial Key Laboratory for Information Technology of Wisdom Mining of Shandong Province, Shandong University of Science and Technology, Qingdao, China
Yong-quan Liang

Authors

Cong Zou
View author publications
You can also search for this author in PubMed Google Scholar
Yong-quan Liang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yong-quan Liang .

Editor information

Editors and Affiliations

Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Zhongzhi Shi
University of Salford, Manchester, UK
Sunil Vadera
Australian Defence Force Academy, UNSW Canberra, Canberra, ACT, Australia
Elizabeth Chang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zou, C., Liang, Yq. (2020). Bird Detection on Transmission Lines Based on DC-YOLO Model. In: Shi, Z., Vadera, S., Chang, E. (eds) Intelligent Information Processing X. IIP 2020. IFIP Advances in Information and Communication Technology, vol 581. Springer, Cham. https://doi.org/10.1007/978-3-030-46931-3_21

Download citation

DOI: https://doi.org/10.1007/978-3-030-46931-3_21
Published: 26 June 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-46930-6
Online ISBN: 978-3-030-46931-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Federation for Information Processing (opens in a new tab)