Skip to main content

Fast-Maneuvering Target Seeking Based on Double-Action Q-Learning

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4571))

Abstract

In this paper, a reinforcement learning method called DAQL is proposed to solve the problem of seeking and homing onto a fast maneuvering target, within the context of mobile robots. This Q-learning based method considers both target and obstacle actions when determining its own action decisions, which enables the agent to learn more effectively in a dynamically changing environment. It particularly suits fast-maneuvering target cases, in which maneuvers of the target are unknown a priori. Simulation result depicts that the proposed method is able to choose a less convoluted path to reach the target when compared to the ideal proportional navigation (IPN) method in handling fast maneuvering and randomly moving target. Furthermore, it can learn to adapt to the physical limitation of the system and do not require specific initial conditions to be satisfied for successful navigation towards the moving target.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ge, S.S., Cui, Y.J.: New potential functions for mobile robot path planning. IEEE Transactions on Robotics and Automation 16, 615–620 (2000)

    Article  Google Scholar 

  2. Conte, G., Zulli, R.: Hierarchical Path Planning in a Multirobot Environment with a Simple Navigation Function. IEEE Transactions on Systems Man and Cybernetics 25, 651–654 (1995)

    Article  Google Scholar 

  3. Li, T.H.S., Chang, S.J., Tong, W.: Fuzzy target tracking control of autonomous mobile robots by using infrared sensors. IEEE Transactions on Fuzzy Systems 12, 491–501 (2004)

    Article  Google Scholar 

  4. Luo, R.C., Chen, T.M., Su, K.L.: Target tracking using hierarchical grey-fuzzy motion decision-making method. IEEE Transactions on Systems Man and Cybernetics Part a-Systems and Humans 31, 179–186 (2001)

    Article  Google Scholar 

  5. Dias, J., Paredes, C., Fonseca, I., Araujo, H., Batista, J., Almeida, A.T.: Simulating pursuit with machine experiments with robots and artificial vision. IEEE Transactions on Robotics and Automation 14, 1–18 (1998)

    Article  Google Scholar 

  6. Adams, M.D.: High speed target pursuit and asymptotic stability in mobile robotics. IEEE Transactions on Robotics and Automation 15, 230–237 (1999)

    Article  Google Scholar 

  7. Ge, S.S., Cui, Y.J.: Dynamic motion planning for mobile robots using potential field method. Autonomous Robots 13, 207–222 (2002)

    Article  MATH  Google Scholar 

  8. Belkhouche, F., Belkhouche, B., Rastgoufard, P.: Line of sight robot navigation toward a moving goal. IEEE Transactions on System Man and Cybernetic Part b-Cybernetic 36, 255–267 (2006)

    Article  Google Scholar 

  9. Yang, C.D., Yang, C.C.: A unified approach to proportional navigation. IEEE Transactions on Aerospace and Electronic Systems 33, 557–567 (1997)

    Article  Google Scholar 

  10. Shukla, U.S., Mahapatra, P.R.: The Proportional Navigation Dilemma - Pure or True. IEEE Transactions on Aerospace and Electronic Systems 26, 382–392 (1990)

    Article  Google Scholar 

  11. Yuan, P.J., Chern, J.S.: Ideal Proportional Navigation. Journal of Guidance Control and Dynamics 15, 1161–1165 (1992)

    Article  Google Scholar 

  12. Borg, J.M., Mehrandezh, M., Fenton, R.G., Benhabib, B.: An Ideal Proportional Navigation Guidance system for moving object interception-robotic experiments. In: Systems, Man, and Cybernetics, 2000 IEEE International Conference, vol. 5, pp. 3247–3252 (2000)

    Google Scholar 

  13. Mehrandezh, M., Sela, N.M., Fenton, R.G., Benhabib, B.: Robotic interception of moving objects using an augmented ideal proportional navigation guidance technique. IEEE Transactions on Systems Man and Cybernetics Part a-Systems and Humans 30, 238–250 (2000)

    Article  Google Scholar 

  14. Sutton, R.S., Barto, A.G.: Reinforcement learning: an introduction. MIT Press, Cambridge (1998)

    Google Scholar 

  15. Kaelbling, L.P.: Learning in embedded systems. MIT Press, Cambridge (1993)

    Google Scholar 

  16. Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)

    Google Scholar 

  17. Sutton, R.S.: Reinforcement Learning. The International Series in Engineering and Computer Science, vol. 173. Kluwer Academic Publishers, Dordrecht (1992)

    Google Scholar 

  18. Ngai, D.C.K., Yung, N.H.C.: Double action Q-learning for obstacle avoidance in a dynamically changing environment. In: Proceedings of the 2005 IEEE Intelligent Vehicles Symposium, Las Vegas, pp. 211–216 (2005)

    Google Scholar 

  19. Ngai, D.C.K., Yung, N.H.C.: Performance Evaluation of Double Action Q-Learning in Moving Obstacle Avoidance Problem. In: Proceedings of the 2005 IEEE International Conference on Systems, Man, and Cybernetics, Hawaii, October 2005, pp. 865–870 (2005)

    Google Scholar 

  20. Watkins, C.J.C.H., Dayan, P.: Technical Note: Q-learning. Machine Learning 8, 279–292 (1992)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Petra Perner

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ngai, D.C.K., Yung, N.H.C. (2007). Fast-Maneuvering Target Seeking Based on Double-Action Q-Learning. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2007. Lecture Notes in Computer Science(), vol 4571. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73499-4_49

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73499-4_49

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73498-7

  • Online ISBN: 978-3-540-73499-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics