Abstract
Influence diagrams (IDs) offer a powerful framework for decision making under uncertainty, but their applicability has been hindered by the exponential growth of runtime and memory usage, largely due to the no-forgetting assumption. We present a novel way to maintain a limited amount of memory to inform each decision and still obtain near-optimal policies. The approach is based on augmenting the graphical model with memory states that represent key aspects of previous observations, a method that has proved useful in POMDP solvers. We also derive an efficient EM-based message-passing algorithm to compute the policy. Experimental results show that this approach produces high-quality approximate policies and offers better scalability than existing methods.
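A minimal sketch of the bounded-memory idea described in the abstract (not the paper's EM message-passing algorithm): the policy is a small finite-state controller whose memory node summarizes the observation history, so the policy tables stay fixed-size regardless of horizon. All numbers below are invented for illustration, and the improvement step is a simple policy-iteration-style heuristic, not the authors' method.

```python
import itertools
import numpy as np

# Toy partially observable problem (all numbers invented): 2 hidden
# states, 2 actions, 2 observations.  The bounded-memory policy is a
# finite-state controller with M memory states: an action rule
# psi[m, a] and a memory-update rule lam[m, o, m'].
S, A, O, M = 2, 2, 2, 2
gamma = 0.95

P = np.array([[[0.9, 0.1], [0.2, 0.8]],    # P[a, s, s']
              [[0.5, 0.5], [0.1, 0.9]]])
Z = np.array([[0.8, 0.2], [0.3, 0.7]])     # Z[s', o]
R = np.array([[1.0, 0.0], [0.0, 2.0]])     # R[s, a]

def evaluate(psi, lam):
    """Fixed-point evaluation of the controller: V[s, m]."""
    V = np.zeros((S, M))
    for _ in range(500):
        Vn = np.zeros_like(V)
        for s in range(S):
            for m in range(M):
                for a in range(A):
                    q = R[s, a]
                    for s2 in range(S):
                        for o in range(O):
                            for m2 in range(M):
                                q += gamma * P[a, s, s2] * Z[s2, o] \
                                     * lam[m, o, m2] * V[s2, m2]
                    Vn[s, m] += psi[m, a] * q
        V = Vn
    return V

def improve(V):
    """Heuristic improvement: each memory state is anchored to a witness
    belief (here the corners of the belief simplex, an assumption of this
    sketch that requires M == S) and picks the deterministic action and
    memory-update pair maximising a one-step backup of V."""
    beliefs = np.eye(S)
    psi = np.zeros((M, A))
    lam = np.zeros((M, O, M))
    for m in range(M):
        best, best_val = None, -np.inf
        for a in range(A):
            for f in itertools.product(range(M), repeat=O):
                val = 0.0
                for s in range(S):
                    q = R[s, a]
                    for s2 in range(S):
                        for o in range(O):
                            q += gamma * P[a, s, s2] * Z[s2, o] * V[s2, f[o]]
                    val += beliefs[m, s] * q
                if val > best_val:
                    best_val, best = val, (a, f)
        a, f = best
        psi[m, a] = 1.0
        for o in range(O):
            lam[m, o, f[o]] = 1.0
    return psi, lam

psi = np.full((M, A), 1.0 / A)             # start from the uniform controller
lam = np.full((M, O, M), 1.0 / M)
for _ in range(5):
    psi, lam = improve(evaluate(psi, lam))
V = evaluate(psi, lam)
print("controller value per (state, memory):")
print(V)
```

Whatever the improvement rule, the key property the abstract describes is visible here: the policy tables `psi` and `lam` have fixed sizes |M|·|A| and |M|·|O|·|M|, independent of the horizon, whereas a no-forgetting policy grows exponentially with the number of past observations.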
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, X., Kumar, A., Zilberstein, S. (2011). Influence Diagrams with Memory States: Representation and Algorithms. In: Brafman, R.I., Roberts, F.S., Tsoukiàs, A. (eds) Algorithmic Decision Theory. ADT 2011. Lecture Notes in Computer Science, vol. 6992. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24873-3_23
DOI: https://doi.org/10.1007/978-3-642-24873-3_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24872-6
Online ISBN: 978-3-642-24873-3
eBook Packages: Computer Science (R0)