Skip to main content

Abstract

Neural networks are commonly used for classification and regression. The Bayesian approach may be employed, but choosing a prior for the parameters presents challenges. This paper reviews several priors in the literature and introduces Jeffreys priors for neural network models. The effect on the posterior is demonstrated through an example.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berger, J. O., De Oliveira, V., and Sans ó, B.. (2001). “Objective Bayesian Analysis of Apatially Correlated Data,” Journal of the American Statistical Association, 96, 1361–1374.

    Article  MathSciNet  MATH  Google Scholar 

  2. Bishop, C. M. (1995). Neural Networks for Pattern Recognition. Clarendon, Oxford.

    Google Scholar 

  3. Breiman, L. and Friedman, J. H. (1985). “Estimating Optimal Transformations for Multiple Regression and Correlation,” Journal of the American Statistical Association, 80, 580–619.

    Article  MathSciNet  MATH  Google Scholar 

  4. Fine, T. L. (1999). Feedforward Neural Network Methodology. Springer, New-York.

    MATH  Google Scholar 

  5. Gelman, A., Carlin, J. B., Stern, H. S., and Rubin, D. B. (1995). Bayesian Data Analysis. Chapman and Hall, London.

    Google Scholar 

  6. Hartigan, J. A. (1964). “Invariant Prior Distributions,” Annals of Mathematical Statistics, 35, 836–845.

    Article  MathSciNet  MATH  Google Scholar 

  7. Jeffreys, H. (1961). Theory of Probability (third edition). Oxford University Press, New York.

    MATH  Google Scholar 

  8. Kass, R. E., and Wasserman, L. (1996). “The Selection of Prior Distributions by Formal Rules,” Journal of the American Statistical Association, 91, 1343–1370.

    Article  MATH  Google Scholar 

  9. Lee, H. K. H. (2001). “Model Selection for Neural Network Classification,” Journal of Classification, 18, 227–243.

    MathSciNet  MATH  Google Scholar 

  10. Lee, H. K. H. (2003). “A Noninformative Prior for Neural Networks,” Machine Learning, 50, 197–2152.

    Article  MATH  Google Scholar 

  11. MacKay, D. J. C. (1992). Bayesian Methods for Adaptive Methods. Ph.D. dissertation, California Institute of Technology.

    Google Scholar 

  12. Müller, P., and Rios Insua, D. (1998). “Issues in Bayesian Analysis of Neural Network Models,” Neural Computation, 10, 571–592.

    Article  Google Scholar 

  13. Neal, R. M. (1996). Bayesian Learning for Neural Networks. Springer, New York.

    Book  MATH  Google Scholar 

  14. Ripley, B. D. (1993). Pattern Recognition and Neural Networks. Cambridge University Press, Cambridge.

    Google Scholar 

  15. Robinson, M. (2001). Priors for Bayesian Neural Networks. Master’s thesis, Department of Statistics, University of British Columbia.

    Google Scholar 

  16. Robinson, M. (2001b). “Priors for Bayesian Neural Networks,” in Computing Science and Statistics, eds. E. Wegman, A. Braverman, A. Goodman, and P. Smythe, 33, 122–127.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lee, H.K.H. (2004). Priors for Neural Networks. In: Banks, D., McMorris, F.R., Arabie, P., Gaul, W. (eds) Classification, Clustering, and Data Mining Applications. Studies in Classification, Data Analysis, and Knowledge Organisation. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17103-1_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17103-1_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22014-5

  • Online ISBN: 978-3-642-17103-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics