Abstract
In this paper, an object detection model based on MFF-RPN and Multi-scale CNN is proposed. Firstly, the region proposal network based on multi-level feature fusion (MFF-RPN) is presented to extract the candidate proposals. Secondly, a convolutional neural network (CNN) with different scale convolution kernels is conducted to extract features adaptively. Finally, multi-task loss is employed to establish a complex mapping between image object features and object detection mode. The experimental results prove that the proposed algorithm gets better classification performance and higher detection accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. In: Computer vision and pattern recognition. IEEE;2014. p. 580−87.
He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell. 2015;37(9):1904.
Girshick R, Fast R-CNN. In: IEEE international conference on computer vision (CVPR). IEEE;2015. p. 1440−48.
Ren S, He K, Girshick R, et al. Faster R-CNN. Towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell. (PAMI) 2015;1−1.
Ghodrati A, Diba A, Pedersoli M, et al. Deepproposal: Hunting objects by cascading deep convolutional layers. In: IEEE International conference on computer vision (CVPR);2015. p. 2578−86.
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. In: IEEE conference on computer vision and pattern recognition (CVPR);2015. p. 3431−40.
Hariharan B, Arbeláez P, Girshick R, et al. Hypercolumns for object segmentation and fine-grained localization. In: IEEE conference on computer vision and pattern recognition;2015. p. 447−56.
Neubeck A, Van Gool L. Efficient non-maximum suppression. In: International conference on pattern recognition. IEEE computer society;2006. p. 850−55.
Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector. In: European conference on computer vision (ECCV). Springer International Publishing;2016. p. 21−37.
Erhan D, Szegedy C, Toshev A, et al. Scalable object detection using deep neural networks. In: IEEE conference on computer vision and pattern recognition (CVPR);2014. p. 2147−54.
Zeiler MD, Fergus R. Visualizing and understanding convolutional networks. In: European conference on computer vision. Springer International Publishing;2014. p. 818−33.
Everingham M, Eslami SMA, Van Gool L, et al. The pascal visual object classes challenge: a retrospective. Int J Comput Vis. 2015;111(1):98–136.
Acknowledgements
We would like to thank the supports by Chongqing Nature Science Foundation for Fundamental science and frontier technologies (cstc2015jcyjB0569), China Central Universities Foundation (106112016CDJZR175511).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhou, J., Zheng, H., Yin, H., Chai, Y. (2018). Object Detection from Images Based on MFF-RPN and Multi-scale CNN. In: Jia, Y., Du, J., Zhang, W. (eds) Proceedings of 2017 Chinese Intelligent Systems Conference. CISC 2017. Lecture Notes in Electrical Engineering, vol 460. Springer, Singapore. https://doi.org/10.1007/978-981-10-6499-9_33
Download citation
DOI: https://doi.org/10.1007/978-981-10-6499-9_33
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6498-2
Online ISBN: 978-981-10-6499-9
eBook Packages: EngineeringEngineering (R0)