Abstract
Natural gradient learning is known to resolve the plateau problem, which is the main cause of the slow learning speed of neural networks. Adaptive natural gradient learning, an adaptive method for realizing natural gradient learning in neural networks, has also been developed, and its practical advantage has been confirmed. In this paper, we consider the generalization property of the natural gradient method. Theoretically, the standard gradient method and the natural gradient method have the same minima on the error surface, so their generalization performance should also be the same. In practice, however, the natural gradient method can reach a smaller training error by the time the standard method has stopped learning on a plateau. In this case, the solutions obtained in practice differ from each other, and so do their generalization performances. Since such situations arise very often in practical problems, it is necessary to compare the generalization property of natural gradient learning with that of the standard method. In this paper, we show a case in which the practical generalization performance of natural gradient learning is poorer than that of the standard gradient method, and we try to solve the problem by including a regularization term in natural gradient learning.
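As a concrete illustration of the update discussed in the abstract, the sketch below implements natural gradient descent with an added L2 (weight-decay) regularization term on a toy one-hidden-layer network. It is a minimal sketch, not the paper's algorithm: it assumes a squared-error (Gaussian) output model, approximates the Fisher information matrix by the empirical outer product of per-sample gradients with a small damping term for invertibility, and all names (natural_gradient_step, lam, damping) are illustrative, not taken from the paper.

```python
import numpy as np

# Hypothetical sketch: natural gradient descent with an L2 regularization
# term on a one-hidden-layer tanh network. The Fisher matrix is replaced
# by the empirical outer product of per-sample gradients (an assumption;
# the paper's adaptive scheme estimates its inverse differently).

rng = np.random.default_rng(0)

def mlp_forward(w, X, H=3):
    """One-hidden-layer tanh MLP with scalar output; w is a flat vector."""
    d = X.shape[1]
    W1 = w[:H * d].reshape(H, d)
    b1 = w[H * d:H * d + H]
    W2 = w[H * d + H:H * d + 2 * H]
    b2 = w[-1]
    hidden = np.tanh(X @ W1.T + b1)
    return hidden @ W2 + b2, hidden

def per_sample_grads(w, X, y, H=3):
    """Gradients of 0.5*(f(x)-y)^2 for each sample, stacked row-wise."""
    d = X.shape[1]
    out, hidden = mlp_forward(w, X, H)
    err = out - y                                           # (n,)
    W2 = w[H * d + H:H * d + 2 * H]
    dh = (1.0 - hidden ** 2) * W2                           # (n, H), tanh'
    gW1 = (err[:, None] * dh)[:, :, None] * X[:, None, :]   # (n, H, d)
    gb1 = err[:, None] * dh                                 # (n, H)
    gW2 = err[:, None] * hidden                             # (n, H)
    gb2 = err[:, None]                                      # (n, 1)
    return np.concatenate(
        [gW1.reshape(len(X), -1), gb1, gW2, gb2], axis=1)

def natural_gradient_step(w, X, y, eta=0.1, lam=1e-3, damping=1e-2):
    G = per_sample_grads(w, X, y)
    grad = G.mean(axis=0) + lam * w              # regularized gradient
    fisher = G.T @ G / len(X)                    # empirical Fisher
    fisher += damping * np.eye(len(w))           # keep it invertible
    return w - eta * np.linalg.solve(fisher, grad)

# Toy regression problem: y = sin(2x) + noise.
X = rng.uniform(-2, 2, size=(200, 1))
y = np.sin(2 * X[:, 0]) + 0.1 * rng.standard_normal(200)
H = 3
w = 0.1 * rng.standard_normal(H * X.shape[1] + 2 * H + 1)
for step in range(300):
    w = natural_gradient_step(w, X, y)
out, _ = mlp_forward(w, X, H)
print("training MSE:", np.mean((out - y) ** 2))
```

The damping term plays the same practical role as in any quasi-Newton-style update: it bounds the step size where the empirical Fisher estimate is nearly singular, while lam controls the strength of the regularization term whose effect on generalization the paper investigates.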
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
Cite this paper
Park, H. (2001). Practical Consideration on Generalization Property of Natural Gradient Learning. In: Mira, J., Prieto, A. (eds) Connectionist Models of Neurons, Learning Processes, and Artificial Intelligence. IWANN 2001. Lecture Notes in Computer Science, vol 2084. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45720-8_47
Print ISBN: 978-3-540-42235-8
Online ISBN: 978-3-540-45720-6