Object Detection from Images Based on MFF-RPN and Multi-scale CNN

Zhou, Jiayi; Zheng, Hengyi; Yin, Hongpeng; Chai, Yi

doi:10.1007/978-981-10-6499-9_33

Jiayi Zhou⁵,
Hengyi Zheng,
Hongpeng Yin⁷ &
…
Yi Chai^5,6

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 460))

Included in the following conference series:

Chinese Intelligent Systems Conference

1208 Accesses

Abstract

In this paper, an object detection model based on MFF-RPN and Multi-scale CNN is proposed. Firstly, the region proposal network based on multi-level feature fusion (MFF-RPN) is presented to extract the candidate proposals. Secondly, a convolutional neural network (CNN) with different scale convolution kernels is conducted to extract features adaptively. Finally, multi-task loss is employed to establish a complex mapping between image object features and object detection mode. The experimental results prove that the proposed algorithm gets better classification performance and higher detection accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. In: Computer vision and pattern recognition. IEEE;2014. p. 580−87.
Google Scholar
He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell. 2015;37(9):1904.
Article Google Scholar
Girshick R, Fast R-CNN. In: IEEE international conference on computer vision (CVPR). IEEE;2015. p. 1440−48.
Google Scholar
Ren S, He K, Girshick R, et al. Faster R-CNN. Towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell. (PAMI) 2015;1−1.
Google Scholar
Ghodrati A, Diba A, Pedersoli M, et al. Deepproposal: Hunting objects by cascading deep convolutional layers. In: IEEE International conference on computer vision (CVPR);2015. p. 2578−86.
Google Scholar
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. In: IEEE conference on computer vision and pattern recognition (CVPR);2015. p. 3431−40.
Google Scholar
Hariharan B, Arbeláez P, Girshick R, et al. Hypercolumns for object segmentation and fine-grained localization. In: IEEE conference on computer vision and pattern recognition;2015. p. 447−56.
Google Scholar
Neubeck A, Van Gool L. Efficient non-maximum suppression. In: International conference on pattern recognition. IEEE computer society;2006. p. 850−55.
Google Scholar
Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector. In: European conference on computer vision (ECCV). Springer International Publishing;2016. p. 21−37.
Google Scholar
Erhan D, Szegedy C, Toshev A, et al. Scalable object detection using deep neural networks. In: IEEE conference on computer vision and pattern recognition (CVPR);2014. p. 2147−54.
Google Scholar
Zeiler MD, Fergus R. Visualizing and understanding convolutional networks. In: European conference on computer vision. Springer International Publishing;2014. p. 818−33.
Google Scholar
Everingham M, Eslami SMA, Van Gool L, et al. The pascal visual object classes challenge: a retrospective. Int J Comput Vis. 2015;111(1):98–136.
Article Google Scholar

Download references

Acknowledgements

We would like to thank the supports by Chongqing Nature Science Foundation for Fundamental science and frontier technologies (cstc2015jcyjB0569), China Central Universities Foundation (106112016CDJZR175511).

Author information

Authors and Affiliations

School of Automation, Chongqing University, Chongqing, 400044, China
Jiayi Zhou & Yi Chai
Key Laboratory of Power Transmission Equipment and System Security, Chongqing, 400044, China
Yi Chai
College of Mechanical Engineering, Chongqing University, Chongqing, 400044, China
Hongpeng Yin

Authors

Jiayi Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Hengyi Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Hongpeng Yin
View author publications
You can also search for this author in PubMed Google Scholar
Yi Chai
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Beihang University , Beijing, China
Yingmin Jia
Beijing University of Posts and Telecommunications, Beijing, China
Junping Du
University of Science and Technology Beijing, Beijing, China
Weicun Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhou, J., Zheng, H., Yin, H., Chai, Y. (2018). Object Detection from Images Based on MFF-RPN and Multi-scale CNN. In: Jia, Y., Du, J., Zhang, W. (eds) Proceedings of 2017 Chinese Intelligent Systems Conference. CISC 2017. Lecture Notes in Electrical Engineering, vol 460. Springer, Singapore. https://doi.org/10.1007/978-981-10-6499-9_33

Download citation

DOI: https://doi.org/10.1007/978-981-10-6499-9_33
Published: 30 September 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6498-2
Online ISBN: 978-981-10-6499-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics