
Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 99))

Abstract

Second order algorithms are very efficient for neural network training because of their fast convergence. In traditional implementations of second order algorithms [Hagan and Menhaj 1994], the Jacobian matrix is calculated and stored, which may cause memory limitation problems when training with large pattern sets. In this paper, an improved computation is introduced to solve the memory limitation problem in second order algorithms. The proposed method calculates the gradient vector and Hessian matrix directly, without Jacobian matrix storage and multiplication. Memory cost for training is significantly reduced by replacing matrix operations with vector operations. At the same time, training speed is also improved as a result of the memory reduction. The proposed implementation of second order algorithms can be applied to train an essentially unlimited number of patterns.
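The idea described in the abstract can be sketched as follows (a hypothetical minimal illustration under assumed names, not the authors' actual implementation): for P training patterns and N weights, instead of storing the full P × N Jacobian J and then forming JᵀJ and Jᵀe, each Jacobian row j_p is used immediately to update the quasi-Hessian Q = Σ j_pᵀ j_p and the gradient g = Σ j_p e_p, after which the row can be discarded.

```python
def accumulate_second_order_terms(jacobian_rows, errors, n):
    """Accumulate Q = sum_p j_p^T j_p (n x n) and g = sum_p j_p * e_p (n)
    one pattern at a time, so the full P x N Jacobian is never stored.
    `jacobian_rows` may be any iterable that yields one row per pattern,
    e.g. a generator computed on the fly during the forward/backward pass."""
    Q = [[0.0] * n for _ in range(n)]
    g = [0.0] * n
    for j_p, e_p in zip(jacobian_rows, errors):
        for a in range(n):
            g[a] += j_p[a] * e_p
            for b in range(n):
                Q[a][b] += j_p[a] * j_p[b]
        # j_p is no longer needed: memory stays O(n^2) regardless of
        # the number of patterns P.
    return Q, g
```

Q and g are exactly what a Levenberg–Marquardt step needs (solving (Q + μI) Δw = g), which is why the method can train with pattern counts that would make storing the Jacobian itself infeasible.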


References

  1. Cao, L.J., Keerthi, S.S., Ong, C.J., Zhang, J.Q., Periyathamby, U., Fu, X.J., Lee, H.P.: Parallel sequential minimal optimization for the training of support vector machines. IEEE Trans. on Neural Networks 17(4), 1039–1049 (2006)


  2. Hagan, M.T., Menhaj, M.B.: Training feedforward networks with the Marquardt algorithm. IEEE Trans. on Neural Networks 5(6), 989–993 (1994)


  3. Hohil, M.E., Liu, D., Smith, S.H.: Solving the N-bit parity problem using neural networks. Neural Networks 12, 1321–1323 (1999)


  4. Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323, 533–536 (1986)


  5. Wilamowski, B.M.: Neural network architectures and learning algorithms: How not to be frustrated with neural networks. IEEE Industrial Electronics Magazine 3(4), 56–63 (2009)


  6. Wilamowski, B.M., Cotton, N.J., Kaynak, O., Dundar, G.: Computing gradient vector and Jacobian matrix in arbitrarily connected neural networks. IEEE Trans. on Industrial Electronics 55(10), 3784–3790 (2008)


  7. Yu, H., Wilamowski, B.M.: Neural network learning without backpropagation. IEEE Trans. on Neural Networks 21(11) (2010)


  8. Yu, H., Wilamowski, B.M.: Improved Computation for Levenberg Marquardt Training. IEEE Trans. on Neural Networks 21(6), 930–937 (2010)


  9. Yu, H., Wilamowski, B.M.: Efficient and reliable training of neural networks. In: Proc. 2nd IEEE Human System Interaction Conf. HSI 2009, Catania, Italy, pp. 109–115 (2009)


  10. Yu, H., Wilamowski, B.M.: C++ implementation of neural networks trainer. In: Proc. of 13th Int. Conf. on Intelligent Engineering Systems, INES 2009, Barbados (2009)




Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Yu, H., Wilamowski, B.M. (2012). Neural Network Training with Second Order Algorithms. In: Hippe, Z.S., Kulikowski, J.L., Mroczek, T. (eds) Human – Computer Systems Interaction: Backgrounds and Applications 2. Advances in Intelligent and Soft Computing, vol 99. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23172-8_30


  • DOI: https://doi.org/10.1007/978-3-642-23172-8_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23171-1

  • Online ISBN: 978-3-642-23172-8

