Abstract
A survey is presented of some recent developments in the convergence of online gradient methods for feedforward neural networks such as BP neural networks. Unlike most existing convergence results, which are probabilistic and non-monotone in nature, the results presented here are deterministic and monotone. Also considered are cases where a momentum term or a penalty term is added to the error function to improve the performance of the training procedure.
Partly supported by the National Natural Science Foundation of China.
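To make the update rules concrete, the following minimal Python sketch (an illustration, not code from the paper) implements the online gradient method for a continuous perceptron, with an optional momentum term and an L2 penalty term added to the error function. The sigmoid activation, the parameter names eta, alpha, and lam, and the per-sample application of the penalty gradient are all illustrative assumptions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_online(X, y, eta=0.1, alpha=0.5, lam=1e-4, epochs=100):
    """Online (incremental) gradient training of a continuous perceptron.

    Error function with an L2 penalty term (illustrative form):
        E(w) = 0.5 * sum_j (y_j - sigmoid(w . x_j))^2 + lam * ||w||^2
    Update with a momentum term:
        w_{k+1} = w_k - eta * grad_j(w_k) + alpha * (w_k - w_{k-1})
    """
    w = np.zeros(X.shape[1])
    prev_delta = np.zeros_like(w)      # momentum buffer: w_k - w_{k-1}
    for _ in range(epochs):
        for x_j, y_j in zip(X, y):     # present samples one at a time (online mode)
            out = sigmoid(w @ x_j)
            # gradient of the per-sample squared error plus the penalty term
            grad = -(y_j - out) * out * (1.0 - out) * x_j + 2.0 * lam * w
            delta = -eta * grad + alpha * prev_delta
            w = w + delta
            prev_delta = delta
    return w

# toy usage: learn the OR function (bias encoded as a constant last input)
X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]], dtype=float)
y = np.array([0.0, 1.0, 1.0, 1.0])
w = train_online(X, y, epochs=2000)
print(np.round(sigmoid(X @ w)))      # outputs should approach [0, 1, 1, 1]
```

Setting alpha to zero recovers the plain online gradient method, and setting lam to zero removes the penalty term, so the same sketch covers the three training variants discussed in the abstract.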