Skip to main content

POMDP Filter: Pruning POMDP Value Functions with the Kaczmarz Iterative Method

  • Conference paper
Advances in Artificial Intelligence (MICAI 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6437))

Included in the following conference series:

Abstract

In recent years, there has been significant interest in developing techniques for finding policies for Partially Observable Markov Decision Problems (POMDPs). This paper introduces a new POMDP filtering technique that is based on Incremental Pruning [1], but relies on geometries of hyperplane arrangements to compute for optimal policy. This new approach applies notions of linear algebra to transform hyperplanes and treat their intersections as witness points [5]. The main idea behind this technique is that a vector that has the highest value at any of the intersection points must be part of the policy. IPBS is an alternative of using linear programming (LP), which requires powerful and expensive libraries, and which is subjected to numerical instability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cassandra, A., Littman, M.L., Zhang, N.L.: Incremental pruning: A simple, fast, exact algorithm for partially observable Markov decision processes. In: Proceedings of the Thirteenth Annual Conference on Uncertainty in Artificial Intelligence (1997)

    Google Scholar 

  2. Cassandra, A.R., Kaelbling, L.P., Littman, M.L.: Acting optimally in partially observable stochastic domains. In: Proceedings of the Twelfth National Conference on Artificial Intelligence, Seattle, WA (1994)

    Google Scholar 

  3. Galántai, A.: Projectors and Projection Methods. Kluwer Academic Pub., 3300 AH Dordrecht (2004)

    Book  MATH  Google Scholar 

  4. Goldsmith, J., Mundhenk, M.: Complexity issues in Markov decision processes. In: Proceedings of the IEEE Conference on Computational Complexity. IEEE, Los Alamitos (1998)

    Google Scholar 

  5. Littman, M.L.: The witness algorithm: Solving partially observable Markov decision processes. Technical Report CS-94-40, Brown University, Department of Computer Science, Providence, RI (December 1994)

    Google Scholar 

  6. Pineau, J.: Tractable Planning Under Uncertainty: Exploiting Structure. Ph.D. thesis, Carnegie Mellon University (August 2004)

    Google Scholar 

  7. Poupart, P., Boutilier, C.: VDCBPI: an approximate scalable algorithm for large scale POMDPs. In: Proceedings of NIPS, Vancouver (2004)

    Google Scholar 

  8. Smith, T., Simmons, R.: Heuristic search value iteration for pomdps. In: Uncertainty in Artificial Intelligence (2004)

    Google Scholar 

  9. Sondik, E.: The optimal control of partially observable Markov processes. Ph.D. thesis, Standford University (1971)

    Google Scholar 

  10. Spaan, M.T.J., Vlassis, N.: Perseus: Randomized point-based value iteration for pomdps. JAIR 24, 195–220 (2005)

    MATH  Google Scholar 

  11. Spaan, M.T.J.: Cooperative active perception using POMDPs. In: AAAI 2008 Workshop on Advancements in POMDP Solvers (July 2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Borera, E.C., Pyeatt, L.D., Randrianasolo, A.S., Naser-Moghadasi, M. (2010). POMDP Filter: Pruning POMDP Value Functions with the Kaczmarz Iterative Method. In: Sidorov, G., Hernández Aguirre, A., Reyes García, C.A. (eds) Advances in Artificial Intelligence. MICAI 2010. Lecture Notes in Computer Science(), vol 6437. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16761-4_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16761-4_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16760-7

  • Online ISBN: 978-3-642-16761-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics