Sampling Hidden Parameters from Oracle Distribution

Sonoda, Sho; Murata, Noboru

doi:10.1007/978-3-319-11179-7_68

Sampling Hidden Parameters from Oracle Distribution

Sho Sonoda²¹ &
Noboru Murata²¹

Conference paper

4276 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8681))

Abstract

A new sampling learning method for neural networks is proposed. Derived from an integral representation of neural networks, an oracle probability distribution of hidden parameters is introduced. In general rigorous sampling from the oracle distribution holds numerical difficulty, a linear-time sampling algorithm is also developed. Numerical experiments showed that when hidden parameters were initialized by the oracle distribution, following backpropagation converged faster to better parameters than when parameters were initialized by a normal distribution.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bengio, Y., Courville, A., Vincent, P.: Representation learning: A review and new perspectives. IEEE Trans. on PAMI 35(8), 1798–1828 (2013)
Article Google Scholar
Denoeux, T., Lengellé, R.: Initializing back propagation networks with prototypes. Neural Networks 6(3), 351–363 (1993)
Article Google Scholar
De Freitas, J.F.G., Niranjan, M., Gee, A.H., Doucet, A.: Sequential Monte Carlo methods to train neural network models. Neural Computation 12(4), 955–993 (2000)
Article Google Scholar
Kůrková, V.: Complexity estimates based on integral transforms induced by computational units. Neural Networks 33, 160–167 (2012)
Article MATH Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proceedings of the IEEE 11, 2278–2324 (1998)
Article Google Scholar
LeCun, Y.A., Bottou, L., Orr, G.B., Müller, K.-R.: Efficient Backprop. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) NN: Tricks of the Trade, 2nd edn. LNCS, vol. 7700, pp. 9–48. Springer, Heidelberg (2012)
Google Scholar
LeCun, Y., Cortes, C.: The MNIST database of handwritten digits, http://yann.lecun.com/exdb/mnist/
Murata, N.: An integral representation of functions using three-layered networks and their approximation bounds. Neural Networks 9(6), 947–956 (1996)
Article Google Scholar
Sprecher, D.A.: A numerical implementation of Kolmogorov’s superpositions. Neural Networks 9(5), 765–772 (1996)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Schools of Advanced Science and Engineering, Waseda University, 3–4–1 Ohkubo, Shinjuku-ku, Tokyo, 169–8555, Japan
Sho Sonoda & Noboru Murata

Authors

Sho Sonoda
View author publications
You can also search for this author in PubMed Google Scholar
Noboru Murata
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, University of Hamburg, Vogt-Kölln-Straße 30, 22527, Hamburg, Germany
Stefan Wermter , Cornelius Weber & Sven Magg , &
Department of Informatics, Nicolaus Compernicus University, ul. Grudziądzka 5, 87-100, Torun, Poland
Włodzisław Duch
Department of Modern Languages, University of Helsinki, P.O. Box 24, 00014, Helsinki, Finland
Timo Honkela
Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, Acad. G. Bonchev str. bl. 25A, 1113, Sofia, Bulgaria
Petia Koprinkova-Hristova
Institute of Neural Information Processing, University of Ulm, 89069, Oberer Eselsberg, Ulm, Germany
Günther Palm
Department of Information Systems, Quartier UNIL-Dorigny, Bâtiment Internef, University of Lausanne, 1015, Lausanne, Switzerland
Alessandro E. P. Villa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sonoda, S., Murata, N. (2014). Sampling Hidden Parameters from Oracle Distribution. In: Wermter, S., et al. Artificial Neural Networks and Machine Learning – ICANN 2014. ICANN 2014. Lecture Notes in Computer Science, vol 8681. Springer, Cham. https://doi.org/10.1007/978-3-319-11179-7_68

Download citation

DOI: https://doi.org/10.1007/978-3-319-11179-7_68
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11178-0
Online ISBN: 978-3-319-11179-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics