Abstract
We modify the Q-MDP value method and observe the behavior of a robot using the modified method in an environment where the robot's state information is essentially indefinite. In the Q-MDP value method, the action at each time step is chosen by computing expectation values over a probability distribution, which is the output of a probabilistic state estimator. The modified method applies a weighting function to this probability distribution in the calculation so as to give precedence to states near the goal of the task. We applied our method to a simple robot navigation problem in an incomplete sensor environment. As a result, the method makes the robot exhibit a kind of searching behavior without any explicit implementation of search.
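The selection rule described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes a discrete state space with a Q-value table `Q[s][a]`, a belief vector `belief[s]` from the state estimator, and a hypothetical `weights[s]` array standing in for the paper's goal-precedence weighting function.

```python
import numpy as np

def qmdp_action(belief, Q, weights=None):
    """Choose an action by the Q-MDP rule.

    belief : (n_states,) probability distribution from the state estimator
    Q      : (n_states, n_actions) state-action values
    weights: optional (n_states,) weighting function; in the modified
             method it would emphasize states near the goal.
    """
    b = np.asarray(belief, dtype=float)
    if weights is not None:
        b = b * np.asarray(weights, dtype=float)
        b = b / b.sum()  # renormalize the weighted belief
    # expected Q-value of each action under the (weighted) belief
    return int(np.argmax(b @ np.asarray(Q, dtype=float)))
```

With a uniform belief over two states, an action favored only by a low-weight state loses precedence once the weighting is applied, which is how the weighted rule can steer the robot toward goal-relevant states.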
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Ueda, R. (2016). Generation of Search Behavior by a Modification of Q-MDP Value Method. In: Menegatti, E., Michael, N., Berns, K., Yamaguchi, H. (eds) Intelligent Autonomous Systems 13. Advances in Intelligent Systems and Computing, vol 302. Springer, Cham. https://doi.org/10.1007/978-3-319-08338-4_1
Print ISBN: 978-3-319-08337-7
Online ISBN: 978-3-319-08338-4