Lazy Fully Probabilistic Design of Decision Strategies

Kárný, Miroslav; Macek, Karel; Guy, Tatiana V.

doi:10.1007/978-3-319-12436-0_16

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8866)

Included in the following conference series:

International Symposium on Neural Networks

Abstract

Fully probabilistic design of decision strategies (FPD) extends Bayesian dynamic decision making. FPD specifies the decision aim via a so-called ideal: a probability density that assigns high values to desirable behaviours and low values to undesirable ones. The optimal decision strategy minimises the Kullback-Leibler divergence of the probability density describing the closed-loop behaviour to this ideal. Although explicit minimisers are available in the corresponding dynamic programming, the design suffers from the curse of dimensionality connected with the complexity of the value function. The recently proposed lazy FPD tailors lazy learning, which builds a local model around the current behaviour, to the estimation of the closed-loop model with the optimal strategy. This paper adds theoretical support to the lazy FPD and outlines its further improvement.
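
To make the minimisation described in the abstract concrete, the following is a minimal sketch of a one-step, single-state FPD with discrete actions and outcomes. It is not taken from the paper: the function names, the toy numbers, and the restriction to one decision step are all assumptions made for illustration. In this restricted case the Kullback-Leibler divergence of the closed-loop density to the ideal is minimised by a decision rule proportional to the ideal rule times exp(-omega), where omega is the divergence of the environment model to its ideal counterpart for the given action.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) of two discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def one_step_fpd(model, ideal_model, ideal_rule):
    """One-step FPD for a single fixed state (illustrative assumption).

    model[a]       : p(outcome | action a), the environment model
    ideal_model[a] : ideal counterpart of model[a]
    ideal_rule[a]  : ideal probability of choosing action a
    Returns the minimising randomised rule and the attained minimum.
    """
    # omega[a] measures how far the outcome model is from its ideal under action a.
    omega = [kl_divergence(m, mi) for m, mi in zip(model, ideal_model)]
    weights = [ri * math.exp(-w) for ri, w in zip(ideal_rule, omega)]
    gamma = sum(weights)  # normalising constant
    rule = [w / gamma for w in weights]
    return rule, -math.log(gamma)


def closed_loop_kl(rule, model, ideal_rule, ideal_model):
    """Divergence of the closed-loop joint density p(action, outcome) to the ideal."""
    joint = [r * m for r, ms in zip(rule, model) for m in ms]
    ideal = [r * m for r, ms in zip(ideal_rule, ideal_model) for m in ms]
    return kl_divergence(joint, ideal)


# Hypothetical toy problem: two actions, two outcomes; the ideal prefers outcome 1.
model = [[0.9, 0.1], [0.2, 0.8]]
ideal_model = [[0.1, 0.9], [0.1, 0.9]]
ideal_rule = [0.5, 0.5]

rule, minimum = one_step_fpd(model, ideal_model, ideal_rule)
print("optimal rule:", rule)
print("minimal divergence:", minimum)
print("divergence of the ideal rule itself:",
      closed_loop_kl(ideal_rule, model, ideal_rule, ideal_model))
```

The sketch only covers a single decision step. Over a longer horizon the normalising constant plays the role of a value function that must be propagated backwards through dynamic programming, which is where the curse of dimensionality mentioned in the abstract arises.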

Author information

Authors and Affiliations

Institute of Information Theory and Automation, Academy of Sciences of the Czech Republic, P.O.Box 18, 182 08, Prague 8, Czech Republic
Miroslav Kárný, Karel Macek & Tatiana V. Guy

Corresponding author

Correspondence to Miroslav Kárný.

Editor information

Editors and Affiliations

Wuhan, China
Zhigang Zeng
University of Macau, Macau, Macao
Yangmin Li
The Chinese University of Hong Kong, Hong Kong, Hong Kong SAR
Irwin King

About this paper

Cite this paper

Kárný, M., Macek, K., Guy, T.V. (2014). Lazy Fully Probabilistic Design of Decision Strategies. In: Zeng, Z., Li, Y., King, I. (eds) Advances in Neural Networks – ISNN 2014. ISNN 2014. Lecture Notes in Computer Science, vol 8866. Springer, Cham. https://doi.org/10.1007/978-3-319-12436-0_16

DOI: https://doi.org/10.1007/978-3-319-12436-0_16
Published: 19 November 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12435-3
Online ISBN: 978-3-319-12436-0
eBook Packages: Computer Science, Computer Science (R0)
