Abstract
In recent years, there has been significant interest in developing techniques for finding policies for Partially Observable Markov Decision Processes (POMDPs). This paper introduces a new POMDP filtering technique that is based on Incremental Pruning [1] but relies on the geometry of hyperplane arrangements to compute the optimal policy. The approach applies notions of linear algebra to transform hyperplanes and treats their intersections as witness points [5]. The main idea behind the technique is that a vector attaining the highest value at any of the intersection points must be part of the policy. IPBS is an alternative to linear programming (LP), which requires powerful and expensive libraries and is subject to numerical instability.
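The two ingredients of the abstract can be sketched in a few lines of NumPy: a Kaczmarz iteration that finds a point on the intersection of hyperplanes by cyclic projection, and a witness-point pruning step that keeps only those value vectors that are maximal at some intersection point. This is an illustrative sketch for the two-state case only, under our own simplifying assumptions; it is not the paper's IPBS algorithm, and the function names are ours.

```python
import numpy as np

def kaczmarz(A, b, iters=200):
    """Solve a consistent system A x = b by the Kaczmarz method:
    cyclically project the iterate onto each hyperplane a_i . x = b_i."""
    A, b = np.asarray(A, float), np.asarray(b, float)
    x = np.zeros(A.shape[1])
    for k in range(iters):
        i = k % len(b)
        a = A[i]
        x = x + (b[i] - a @ x) / (a @ a) * a  # orthogonal projection onto row i
    return x

def prune_alpha_vectors(alphas):
    """Witness-point pruning over a 2-state belief simplex.

    Each alpha-vector (a0, a1) defines the line V(b) = a0*(1-b) + a1*b for
    belief b in [0, 1].  Candidate witness points are the simplex corners
    and every pairwise intersection of two lines; a vector survives if it
    attains the maximum value at some witness point.  (In the paper the
    intersections live in higher dimensions and are found by projection
    methods; here a direct 1-D solve suffices.)
    """
    A = np.asarray(alphas, dtype=float)
    witnesses = [0.0, 1.0]
    n = len(A)
    for i in range(n):
        for j in range(i + 1, n):
            # Intersection of value lines i and j.
            denom = (A[i, 1] - A[i, 0]) - (A[j, 1] - A[j, 0])
            if abs(denom) > 1e-12:
                b = (A[j, 0] - A[i, 0]) / denom
                if 0.0 <= b <= 1.0:
                    witnesses.append(b)
    keep = set()
    for b in witnesses:
        values = A[:, 0] * (1 - b) + A[:, 1] * b
        keep.add(int(np.argmax(values)))
    return sorted(keep)

# The middle vector is dominated everywhere and is pruned.
print(prune_alpha_vectors([(0.0, 1.0), (0.4, 0.4), (1.0, 0.0)]))  # -> [0, 2]
# Kaczmarz recovers the intersection of x+y=1 and x-y=0.
print(kaczmarz([[1, 1], [1, -1]], [1, 0]))  # -> [0.5 0.5]
```

Because a pruned vector never achieves the maximum at any corner or crossing point, dropping it cannot change the upper envelope of the value function, which is the property the abstract's pruning argument relies on.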
References
Cassandra, A., Littman, M.L., Zhang, N.L.: Incremental pruning: A simple, fast, exact algorithm for partially observable Markov decision processes. In: Proceedings of the Thirteenth Annual Conference on Uncertainty in Artificial Intelligence (1997)
Cassandra, A.R., Kaelbling, L.P., Littman, M.L.: Acting optimally in partially observable stochastic domains. In: Proceedings of the Twelfth National Conference on Artificial Intelligence, Seattle, WA (1994)
Galántai, A.: Projectors and Projection Methods. Kluwer Academic Publishers, Dordrecht (2004)
Goldsmith, J., Mundhenk, M.: Complexity issues in Markov decision processes. In: Proceedings of the IEEE Conference on Computational Complexity. IEEE, Los Alamitos (1998)
Littman, M.L.: The witness algorithm: Solving partially observable Markov decision processes. Technical Report CS-94-40, Brown University, Department of Computer Science, Providence, RI (December 1994)
Pineau, J.: Tractable Planning Under Uncertainty: Exploiting Structure. Ph.D. thesis, Carnegie Mellon University (August 2004)
Poupart, P., Boutilier, C.: VDCBPI: an approximate scalable algorithm for large scale POMDPs. In: Proceedings of NIPS, Vancouver (2004)
Smith, T., Simmons, R.: Heuristic search value iteration for POMDPs. In: Uncertainty in Artificial Intelligence (2004)
Sondik, E.: The optimal control of partially observable Markov processes. Ph.D. thesis, Stanford University (1971)
Spaan, M.T.J., Vlassis, N.: Perseus: Randomized point-based value iteration for POMDPs. JAIR 24, 195–220 (2005)
Spaan, M.T.J.: Cooperative active perception using POMDPs. In: AAAI 2008 Workshop on Advancements in POMDP Solvers (July 2008)
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
Cite this paper
Borera, E.C., Pyeatt, L.D., Randrianasolo, A.S., Naser-Moghadasi, M. (2010). POMDP Filter: Pruning POMDP Value Functions with the Kaczmarz Iterative Method. In: Sidorov, G., Hernández Aguirre, A., Reyes García, C.A. (eds) Advances in Artificial Intelligence. MICAI 2010. Lecture Notes in Computer Science, vol 6437. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16761-4_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16760-7
Online ISBN: 978-3-642-16761-4