Generalising the Discriminative Restricted Boltzmann Machines

Cherla, Srikanth; Tran, Son N.; d’Avila Garcez, Artur; Weyde, Tillman

doi:10.1007/978-3-319-68612-7_13

Conference paper
First Online: 25 October 2017

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 10614)

Abstract

We present a novel theoretical result that generalises the Discriminative Restricted Boltzmann Machine (DRBM). The DRBM was originally defined assuming a \(\{0, 1\}\)-Bernoulli distribution in each of its hidden units; this result makes it possible to derive cost functions for variants of the DRBM that use other distributions, including some that are often encountered in the literature. This paper shows that the DRBM cost function can be extended to Binomial and \(\{-1,+1\}\)-Bernoulli hidden units.
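As a rough illustration of what changing the hidden-unit distribution means for the cost function, the sketch below computes the class posterior \(p(y \mid x)\) of a DRBM for the three hidden-unit types named in the abstract. It is not the authors' implementation. It assumes the parameterisation of Larochelle and Bengio [5] (weights `W`, class weights `U`, hidden biases `c`, class biases `d`), and it assumes that each hidden unit contributes the sum of \(\exp(s \cdot a)\) over its admissible states \(s\), with no additional weighting of states. The function names, the `kind` labels, and the toy parameter values are all hypothetical.

```python
import math


def log_sum_hidden(a, kind="bernoulli01", n_trials=1):
    """Return log(sum over hidden states s of exp(s * a)).

    The state sets below are assumptions made for this sketch:
      bernoulli01   -> s in {0, 1}
      bernoulli_pm1 -> s in {-1, +1}
      binomial      -> s in {0, 1, ..., n_trials}
    """
    if kind == "bernoulli01":
        states = [0, 1]
    elif kind == "bernoulli_pm1":
        states = [-1, 1]
    elif kind == "binomial":
        states = list(range(n_trials + 1))
    else:
        raise ValueError("unknown hidden unit type: " + kind)
    terms = [s * a for s in states]
    m = max(terms)  # subtract the maximum for numerical stability
    return m + math.log(sum(math.exp(t - m) for t in terms))


def drbm_posterior(x, W, U, c, d, kind="bernoulli01", n_trials=1):
    """Return p(y | x) for every class y as a list of probabilities.

    W[j][i]: weight between hidden unit j and input i
    U[j][y]: weight between hidden unit j and class y
    c[j]:    bias of hidden unit j
    d[y]:    bias of class y
    """
    scores = []
    for y in range(len(d)):
        score = d[y]
        for j in range(len(c)):
            # Total input to hidden unit j given the input x and class y.
            a = c[j] + U[j][y] + sum(W[j][i] * x[i] for i in range(len(x)))
            score += log_sum_hidden(a, kind, n_trials)
        scores.append(score)
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


# Toy parameters: 3 inputs, 2 hidden units, 2 classes (made-up values).
x = [1.0, 0.0, 1.0]
W = [[0.2, -0.1, 0.4], [-0.3, 0.5, 0.1]]
U = [[0.3, -0.2], [-0.4, 0.6]]
c = [0.0, 0.1]
d = [0.05, -0.05]

for kind in ("bernoulli01", "bernoulli_pm1", "binomial"):
    print(kind, drbm_posterior(x, W, U, c, d, kind, n_trials=3))
```

Only `log_sum_hidden` changes between variants; the normalisation over classes is shared. Under the stated assumptions, the negative logarithm of the returned probability for the true class would be the per-example discriminative cost.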

Srikanth Cherla and Son N. Tran contributed equally.

Notes

1. Another version of this work is stored online at https://arxiv.org/abs/1604.01806.
2. We obtained a marginally lower average loss of \(1.78\%\) in our evaluation of this model than the \(1.81\%\) reported in [5].
3. Our evaluation resulted in a model with a classification accuracy of \(28.52\%\) in comparison with the \(27.6\%\) reported in [5].

References

1. Freund, Y., Haussler, D.: Unsupervised learning of distributions on binary vectors using two layer networks. In: Advances in Neural Information Processing Systems, pp. 912–919 (1992)
2. Hastie, T., Tibshirani, R., Friedman, J., Franklin, J.: The Elements of Statistical Learning: Data Mining, Inference and Prediction. Springer Series in Statistics. Springer, New York (2005). Chap. 1
3. Hinton, G.E., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006). doi:10.1162/neco.2006.18.7.1527
4. Lang, K.: NewsWeeder: learning to filter netnews. In: Proceedings of the 12th International Conference on Machine Learning, pp. 331–339 (1995)
5. Larochelle, H., Bengio, Y.: Classification using discriminative restricted Boltzmann machines. In: International Conference on Machine Learning, pp. 536–543. ACM Press (2008)
6. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
7. Mohamed, A.R., Dahl, G., Hinton, G.: Acoustic modeling using deep belief networks. IEEE Trans. Audio Speech Lang. Process. 20(1), 14–22 (2012)
8. Nair, V., Hinton, G.: Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning (ICML 2010), pp. 807–814 (2010)
9. Smolensky, P.: Information processing in dynamical systems: foundations of harmony theory. In: Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol. 1, pp. 194–281. MIT Press (1986)
10. Teh, Y.W., Hinton, G.: Rate-coded restricted Boltzmann machines for face recognition. In: Advances in Neural Information Processing Systems, pp. 908–914 (2001)
11. Welling, M., Rosen-Zvi, M., Hinton, G.: Exponential family harmoniums with an application to information retrieval. In: Advances in Neural Information Processing Systems, pp. 1481–1488 (2004)

Author information

Authors and Affiliations

School of Mathematics, Computer Science and Engineering, City University London, Northampton Square, London, EC1V 0HB, UK
Srikanth Cherla, Son N. Tran, Artur d’Avila Garcez & Tillman Weyde

Corresponding author

Correspondence to Son N. Tran.

Editor information

Editors and Affiliations

University of Lausanne, Lausanne, Switzerland
Alessandra Lintas
University of Genoa, Genoa, Italy
Stefano Rovetta
Universitat Pompeu Fabra, Barcelona, Spain
Paul F.M.J. Verschure
University of Lausanne, Lausanne, Switzerland
Alessandro E.P. Villa

About this paper

Cite this paper

Cherla, S., Tran, S.N., d’Avila Garcez, A., Weyde, T. (2017). Generalising the Discriminative Restricted Boltzmann Machines. In: Lintas, A., Rovetta, S., Verschure, P., Villa, A. (eds) Artificial Neural Networks and Machine Learning – ICANN 2017. ICANN 2017. Lecture Notes in Computer Science, vol 10614. Springer, Cham. https://doi.org/10.1007/978-3-319-68612-7_13

DOI: https://doi.org/10.1007/978-3-319-68612-7_13
Published: 25 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68611-0
Online ISBN: 978-3-319-68612-7
eBook Packages: Computer Science, Computer Science (R0)
