Deep Belief Network Using Reinforcement Learning and Its Applications to Time Series Forecasting

Hirata, Takaomi; Kuremoto, Takashi; Obayashi, Masanao; Mabu, Shingo; Kobayashi, Kunikazu

doi:10.1007/978-3-319-46675-0_4

Deep Belief Network Using Reinforcement Learning and Its Applications to Time Series Forecasting

Takaomi Hirata¹⁹,
Takashi Kuremoto¹⁹,
Masanao Obayashi¹⁹,
Shingo Mabu¹⁹ &
…
Kunikazu Kobayashi²⁰

Conference paper
First Online: 29 September 2016

3762 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9949))

Abstract

Artificial neural networks (ANNs) typified by deep learning (DL) is one of the artificial intelligence technology which is attracting the most attention of researchers recently. However, the learning algorithm used in DL is usually with the famous error-backpropagation (BP) method. In this paper, we adopt a reinforcement learning (RL) algorithm “Stochastic Gradient Ascent (SGA)” proposed by Kimura and Kobayashi into a Deep Belief Net (DBN) with multiple restricted Boltzmann machines (RBMs) instead of BP learning method. A long-term prediction experiment, which used a benchmark of time series forecasting competition, was performed to verify the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Box, G.E.P., Pierce, D.A.: Distribution of residual autocorrelations in autoregressive-integrated moving average time series models. J. Am. Stat. Ass. 65(332), 1509–1526 (1970)
Article MathSciNet MATH Google Scholar
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representation by back-propagating errors. Nature 232(9), 533–536 (1986)
Article Google Scholar
Casdagli, M.: Nonlinear prediction of chaotic time series. Phys. D 35, 335–356 (1981)
Article MathSciNet MATH Google Scholar
Lendasse, A., Oja, E., Simula, O., Verleysen, M.: Time series prediction competition: the CATS benchmark. In: Proceedings of International Joint Conference on Neural Networks (IJCNN 2004), pp. 1615–1620 (2004)
Google Scholar
Lendasse, A., Oja, E., Simula, O., Verleysen, M.: Time series prediction competition: the CATS benchmark. Neurocomputing 70, 2325–2329 (2007)
Article Google Scholar
Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006)
Article MathSciNet MATH Google Scholar
Kuremoto, T., Obayashi, M., Kobayashi, M.: Neural forecasting systems. In: Weber, C., Elshaw, M., Mayer, N.M. (eds.) Reinforcement Learning, Theory and Applications, Chap. 1, pp. 1–20. INTECH (2008)
Google Scholar
Kuremoto, T., Kimura, S., Kobayashi, K., Obayashi, M.: Time series forecasting using a deep belief network with restricted Boltzmann machines. Neurocomputing 137(5), 47–56 (2014)
Article Google Scholar
Kuremoto, T., Hirata, T., Obayashi, M., Mabu, S., Kobayashi, K.: Forecast chaotic time series data by DBNs. In: Proceedings of the 7th International Congress on Image and Signal Processing (CISP 2014), pp. 1304–1309, October 2014
Google Scholar
Hirata, T., Kuremoto, T., Obayashi, M., Mabu, S.: Time series prediction using DBN and ARIMA. In: International Conference on Computer Application Technologies (CCATS 2015). Matsue, Japan, pp. 24–29, September 2015
Google Scholar
Zhang, G.P.: Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 50, 159–175 (2003)
Article MATH Google Scholar
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012)
MathSciNet MATH Google Scholar
Kimura, H., Kobayashi, S.: Reinforcement learning for continuous action using stochastic gradient ascent. In: Proceedings of 5^th Intelligent Autonomous Systems, pp. 288–295 (1998)
Google Scholar

Download references

Acknowledgment

This work was supported by JSPS KAKENHI Grant No. 26330254 and No. 25330287.

Author information

Authors and Affiliations

Graduate School of Science and Engineering, Yamaguchi University, Tokiwadai 2-16-1, Ube, Yamaguchi, 755-8611, Japan
Takaomi Hirata, Takashi Kuremoto, Masanao Obayashi & Shingo Mabu
School of Information Science and Technology, Aichi Prefectural University, 1522-3 Ibaragabasama, Nagakute, Aichi, 480-1198, Japan
Kunikazu Kobayashi

Authors

Takaomi Hirata
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Kuremoto
View author publications
You can also search for this author in PubMed Google Scholar
Masanao Obayashi
View author publications
You can also search for this author in PubMed Google Scholar
Shingo Mabu
View author publications
You can also search for this author in PubMed Google Scholar
Kunikazu Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Takashi Kuremoto .

Editor information

Editors and Affiliations

The University of Tokyo , Tokyo, Japan
Akira Hirose
Kobe University , Kobe, Japan
Seiichi Ozawa
Okinawa Institute of Science and Technology Graduate University, Onna, Japan
Kenji Doya
Nara Institute of Science and Technology , Ikoma, Japan
Kazushi Ikeda
Kyungpook National University , Daegu, Korea (Republic of)
Minho Lee
Chinese Academy of Sciences , Beijing, China
Derong Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hirata, T., Kuremoto, T., Obayashi, M., Mabu, S., Kobayashi, K. (2016). Deep Belief Network Using Reinforcement Learning and Its Applications to Time Series Forecasting. In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds) Neural Information Processing. ICONIP 2016. Lecture Notes in Computer Science(), vol 9949. Springer, Cham. https://doi.org/10.1007/978-3-319-46675-0_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-46675-0_4
Published: 29 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46674-3
Online ISBN: 978-3-319-46675-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics