Skip to main content

An Object Detection Algorithm for Deep Learning Based on Batch Normalization

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10699))

Abstract

Based on the advantage of deep learning in object extraction, in this paper we design a deep network that adds Batch-Normalization to the convolution layer. Batch-Normalization has three main advantages. Firstly, it normalizes the input data, which can speed up the fitting of parameters. Secondly, Batch-Normalization can reconstruct the distribution of the input data, so that the feature of input data will not be lost. Thirdly, Batch-Normalization is able to prevent over-fitting, so it can replace Dropout, Local Response Normalization to simplify the network. The network in this paper adopted region proposal to get region of interests. Training classification and position adjustment at the same time to improve accuracy. Comprehensive experimental results have demonstrated the efficacy of the proposed network for objects detection.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Zeng, B., Wang, G., Lin, X.: Real-time pedestrian detection based on color self-similarity. J. Tsinghua Univ. (Sci. Technol.) 52(04), 571–574 (2012)

    Google Scholar 

  2. Mu, Y., Yan, S., Liu, Y., et al.: Discriminative local binary patterns for human detection in personal album. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1–8. DBLP (2008)

    Google Scholar 

  3. Wu, J., Geyer, C., Rehg, J.M.: Real-time human detection using contour cues. In: IEEE International Conference on Robotics and Automation, pp. 860–867. IEEE (2011)

    Google Scholar 

  4. Zhou, Z., Yu, S., Zhang, R., Yang, X.: A method of face recognition based on SIFT operator. J. Image Graph. 13(10), 1882–1885 (2008)

    Google Scholar 

  5. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 1, pp. 886–893. IEEE Xplore (2005)

    Google Scholar 

  6. Lienhart, R., Maydt, J.: An extended set of Haar-like features for rapid object detection. In: 2002 Proceedings of International Conference on Image Processing, vol.1, pp. I-900-I-903. IEEE (2002)

    Google Scholar 

  7. Yang, X., Yang, Y.: A high efficiency vehicle detection method based on HOG-LBP. Comput. Eng. 09, 210–214 (2014)

    Google Scholar 

  8. Ke, Y., Sukthankar, R.: PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, vol. 2, pp. II-506-II-513. IEEE (2004)

    Google Scholar 

  9. Walk, S., Majer, N., Schindler, K., et al.: New features and insights for pedestrian detection. In: Computer Vision and Pattern Recognition, pp. 1030–1037. IEEE (2010)

    Google Scholar 

  10. http://www.dataguru.cn/thread-371987-1-1.html

  11. Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1–8. DBLP (2008)

    Google Scholar 

  12. Freund, Y., Schapire, R.E.: A desicion-theoretic generalization of on-line learning and an application to boosting. In: Vitányi, P. (ed.) EuroCOLT 1995. LNCS, vol. 904, pp. 23–37. Springer, Heidelberg (1995). https://doi.org/10.1007/3-540-59119-2_166

    Chapter  Google Scholar 

  13. Girshick, R., Donahue, J., Darrell, T., et al.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2016)

    Article  Google Scholar 

  14. He, K., Zhang, X., Ren, S., et al.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)

    Article  Google Scholar 

  15. Girshick R.: Fast R-CNN. In: IEEE International Conference on Computer Vision, pp. 1440–1448. IEEE Computer Society (2015)

    Google Scholar 

  16. Ren, S., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: nternational Conference on Neural Information Processing Systems, pp. 91–99. MIT Press (2015)

    Google Scholar 

  17. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International Conference on Machine Learning, pp. 448–456 (2015). JMLR.org

  18. He, K., Gkioxari, G., Dollár, P., et al.: Mask R-CNN (2017)

    Google Scholar 

  19. Arjovsky, M., Bottou, L.: Towards principled methods for training generative adversarial networks. In: ICLR (2017)

    Google Scholar 

  20. Zhao, J., et al.: Energy-based generative adversarial networks. In: ICLR (2017)

    Google Scholar 

  21. Zhou, Y., Zeng, F.: 2D compressive sensing and multi-feature fusion for effective 3D shape retrieval. Inf. Sci. 101–120 (2017)

    Google Scholar 

  22. Gai, K., Qiu, M., Ming, Z., Zhao, H., Qiu, L.: Spoofing-jamming attack strategy using optimal power distributions in wireless smart grid networks. IEEE Trans. Smart Grid 8(5), 2431–2439 (2017)

    Article  Google Scholar 

  23. Gai, K., Qiu, M., Tao, L., Zhu, Y.: Intrusion detection techniques for mobile cloud computing in heterogeneous 5G. Secur. Commun. Netw. 9(16), 3049–3058 (2016)

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank the editors and the anonymous reviewers for their constructive comments to further improve the quality of this paper. This work is partially supported by the following projects in china: National Natural Science Foundation of China (No. 61602116), Natural Science Foundation of Guangdong Province (No. 2015A030313635, No. 2017A030313388), Science and Technology Project of Guangdong Province (No. 2014A010103037), Special Found for Science and Technology Innovation of Foshan City (No. 2015AG10008, No. 2016GA10156, No. 2014AG10001), Education Department of Guangdong Province (No. 2015KTSCX153) and Outstanding Youth Teacher Training Program of Foshan University (No. FSYQ201411).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Changqing Yuan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhou, Y., Yuan, C., Zeng, F., Qian, J., Wu, C. (2018). An Object Detection Algorithm for Deep Learning Based on Batch Normalization. In: Qiu, M. (eds) Smart Computing and Communication. SmartCom 2017. Lecture Notes in Computer Science(), vol 10699. Springer, Cham. https://doi.org/10.1007/978-3-319-73830-7_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73830-7_43

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73829-1

  • Online ISBN: 978-3-319-73830-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics