An Immunological Approach to Initialize Feedforward Neural Network Weights

de Castro, Leandro Nunes; Von Zuben, Fernando J.

doi:10.1007/978-3-7091-6230-9_30

An Immunological Approach to Initialize Feedforward Neural Network Weights

Leandro Nunes de Castro &
Fernando J. Von Zuben⁴

Conference paper

288 Accesses
11 Citations

Abstract

The initial weight vector to be used in supervised learning for multilayer feedforward neural networks has a strong influence in the learning speed and in the quality of the solution obtained after convergence. An inadequate initial choice may cause the training process to get stuck in a poor local minimum, or to face abnormal numerical problems. In this paper, we propose a biologically inspired method based on artificial immune systems. This new strategy is applied to several benchmark and real-world problems, and its performance is compared to that produced by other approaches already suggested in the literature.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Kolen, J. F. & Pollack, J. B.: Back Propagation is Sensitive to Initial Conditions, Technical Report TR 90-JK-BPSIC, 1990.
Google Scholar
Shepherd, A. J.: Second-Order Methods for Neural Networks — Fast and Reliable Methods for Multi-Layer Perceptrons, Springer, 1997.
Google Scholar
Hertz, J., Krogh, A. & Palmer, R.G.: Introduction to the Theory of Neural Computation. Addison-Wesley Publishing Company, 1991.
Google Scholar
Janeway Jr., C. A & P. Travers: Immunobiology The Immune System in Health and Disease, garland Publishing Inc., N.Y., 2nd ed., 1994.
Google Scholar
Haykin S.: Neural Networks — A Comprehensive Foundation, Prentice Hall, 2nd ed., 1999.
Google Scholar
Kirkpatrick, S., Gelatt Jr., C. D. & Vecchi, M. P.: Optimization by Simulated Annealing, Science, 220(4598), 671–680, (1987).
Google Scholar
Perelson, A S. & Oster, G. F.: Theoretical Studies of Clonal Selection: Minimal Antibody Repertoire Size and Reliability of Self-Nonself Discrimination, J. theor. Biol., 81, 645–670, (1979).
Google Scholar
Smith, D. J., Forrest, S., Hightower, R. R. & Perelson, S. A.: Deriving Shape Space Parameters from Immunological Data, J. theor. Biol., 189, 141–150, (1997).
Google Scholar
Boers, E. G. W. & Kuiper, H.: Biological Metaphors and the Design of Modular Artificial Neural Networks Master Thesis, Leiden University, Netherlands, (1992).
Google Scholar
Nguyen, D. & Widrow, B.: Improving the Learning Speed of Two-Layer Neural Networks by Choosing Initial Values of the Adaptive Weights, Proc. IJCN’90, 3, 21–26, (1990). Master Thesis, Leiden University, Netherlands, (1992).
Google Scholar
Kim, Y. K. & Ra, J. B.: Weight Value Initialization for Improving Training Speed in the Backpropagation Network, Proc. ofIJCNN’91, 3, 2396–2401, (1991).
Google Scholar
Lehtokangas, M., Saarinen, J., Kaski, K. & Huuhtanen, P.: Initializing Weights of a Multilayer Perceptron by Using the Orthogonal Least Squares Algorithm, NEUROCOM, 7,982–999, (1995).
Google Scholar
De Castro, L. N. & Von Zuben F. J.: A Hybrid Paradigm for Weight Initialization in Supervised Feedforward Neural Network Learning, Proc. ICS’98, Workshop on Artificial Intelligence, 30–37, (1998).
Google Scholar
Barreiros, J. A. L., Ribeiro, R. R. P., Affonso, C. M. & Santos, E. P.:Estabilizador de Sistemas de Potenciě Adaptativo com Pré-Programação de Parâmetros e Rede Neural Artificial, LAC: EGT, 538–542, (1997).
Google Scholar
DeCastro, L. N., Von Zuben, F. J. & Martins, W.: Hybrid and Constructive Neural Networks Applied to a Prediction Problem in Agriculture. Proc. of IJCNN’98 3,1932–1936, (1998).
Google Scholar
ftp://ftp.ics.uci.edu/pub/machine-learning-data bases
Google Scholar
Fahlman, S. E.: An Empirical Study of Learning Speed in Back-Propagation Networks, Tech. Rep., CMU-CS-88-162, Carnegie Mellon University, Pittsburg, (1988).
Google Scholar
Moller, M. F.: A Scaled Conjugate Gradient Algorithm for Fast Supervised Learning, Neural Networks, 6, 525–533, (dy1993).
Google Scholar
Pearlmutter, B. A: Fast Exact Calculation by the Hessian, NEUROCOM, 6,147–160, (1994).
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electrical and Computer Engineering, State University of Campinas, SP, Brazil
Fernando J. Von Zuben

Authors

Leandro Nunes de Castro
View author publications
You can also search for this author in PubMed Google Scholar
Fernando J. Von Zuben
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Science, Czech Republic
Věra Kůrková & Roman Neruda &
Institute of Information Theory and Automation, Academy of Sciences of the Czech Republic, Prague, Czech Republic
Miroslav Kárný
Division of Mathematics, School of Mathematical and Information Sciences Coventry University, Coventry, UK
Nigel C. Steele

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

de Castro, L.N., Von Zuben, F.J. (2001). An Immunological Approach to Initialize Feedforward Neural Network Weights. In: Kůrková, V., Neruda, R., Kárný, M., Steele, N.C. (eds) Artificial Neural Nets and Genetic Algorithms. Springer, Vienna. https://doi.org/10.1007/978-3-7091-6230-9_30

Download citation

DOI: https://doi.org/10.1007/978-3-7091-6230-9_30
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-83651-4
Online ISBN: 978-3-7091-6230-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics