Abstract
My main research centers around sequential decision making under uncertainty. In a complex dynamical system useful abstractions of knowledge can be essential to an autonomous agent for e.cient decision making. Predictive State Representation, PSR, has been developed to provide a maintainable, self-verifiable and learnable representation of the knowledge of the world. I was very much intrigued by the PSR work, and started working on incorporating PSRs into POMDP control algorithms. Since the representational power of PSRs is equivalent to the belief state representation in POMDPs, one can imagine PSR planning algorithms, working in the context of controlling dynamical systems. In prior work [1] I developed an exact planning algorithm based on known PSR parameters. However, like all other exact algorithms, this approach has exponential complexity in the worst case. In preliminary experiments on a variety of standard domains, the empirical performance seems similar to belief-based planning.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Izadi, M.T., Precup, D.: A planning algorithm for Predicetieve State Representation. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence, IJCAI 2003 (2003)
Izadi, M.T., Rajwade, A.V., Precup, D.: Using core beliefs for point-based value iteration. To appear in the proceedings of the 19th International Joint Conference on Artificial Intelligence, IJCAI 2005 (2005)
Izadi, M.T., Precup, D.: Model reduction by linear PSRs. To appear in the proceedings of the 19th International Joint Conference on Artificial Intelligence, IJCAI 2005 (2005)
Pineau, J., Gordon, G., Thrun, S.: Point-based value iteration: An anytime algorithms for POMDPs. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence, IJCAI 2003 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Izadi, M.T. (2005). Sequential Decision Making Under Uncertainty. In: Zucker, JD., Saitta, L. (eds) Abstraction, Reformulation and Approximation. SARA 2005. Lecture Notes in Computer Science(), vol 3607. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11527862_33
Download citation
DOI: https://doi.org/10.1007/11527862_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27872-6
Online ISBN: 978-3-540-31882-8
eBook Packages: Computer ScienceComputer Science (R0)