Improved Q-Learning Algorithm for AGV Path Optimization

Huang, Yuchun; Wang, Chen

doi:10.1007/978-981-97-0665-5_8

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 1154)

Included in the following conference series:

International Workshop of Advanced Manufacturing and Automation

Abstract

With the rapid development of intelligent manufacturing technology, automated guided vehicles (AGVs) have been widely adopted in industry, agriculture, and scientific research. Planning an optimal path in the AGV navigation system has become an important research topic in recent years, and researchers have proposed a range of optimization algorithms for it. Among them, Q-learning has shown good results in AGV path planning, but it still converges slowly and tends to fall into local optima. To address these problems, a Q-learning algorithm based on the beetle antennae search algorithm (BAS-QL) is proposed. To improve convergence speed, the Q-table is initialized using the beetle antennae search algorithm. To keep the algorithm from falling into a local optimum, a decaying epsilon value is used. Finally, the optimal path for the AGV is solved, and the BAS-QL algorithm is verified experimentally. In grid-map experiments with n = 8 and n = 10, compared with the Q-learning algorithm, BAS-QL reduces the average time by 15.21% and 3.98%, the average path length by 22.40% and −30.4% (as reported), and the average number of iterations needed to find the optimal path by 77.45% and 43.33%, which shows that the method can effectively improve the efficiency of AGV route planning.
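To make the two ingredients named in the abstract concrete, the following is a minimal sketch of tabular Q-learning on an n × n grid with a decaying epsilon. It is not the authors' implementation: the reward values, the multiplicative decay schedule, the fixed start and goal corners, and the names `train_q_learning`, `step`, and `greedy_path` are all assumptions made for illustration. The Q-table is initialized to zeros here, because the abstract does not give enough detail to reproduce the beetle antennae search initialization.

```python
import random

# Four grid moves: up, down, left, right (an assumed action set).
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def step(state, move, n, obstacles, goal):
    """Apply one move; return (next_state, reward, done). Rewards are assumed."""
    r, c = state[0] + move[0], state[1] + move[1]
    if not (0 <= r < n and 0 <= c < n) or (r, c) in obstacles:
        return state, -1.0, False      # blocked: stay in place, pay a penalty
    if (r, c) == goal:
        return (r, c), 10.0, True      # reached the goal cell
    return (r, c), -0.1, False         # ordinary move: small step cost


def train_q_learning(n=8, obstacles=frozenset(), episodes=2000, alpha=0.5,
                     gamma=0.9, eps_start=1.0, eps_min=0.01, eps_decay=0.99,
                     seed=0):
    """Tabular Q-learning with an epsilon that decays once per episode."""
    rng = random.Random(seed)
    start, goal = (0, 0), (n - 1, n - 1)
    # Zero initialization; BAS-QL would replace this with a BAS-derived table.
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(n) for c in range(n)}
    eps = eps_start
    for _episode in range(episodes):
        state = start
        for _t in range(4 * n * n):    # cap on episode length
            if rng.random() < eps:
                a = rng.randrange(len(ACTIONS))                         # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[state][i])  # exploit
            nxt, reward, done = step(state, ACTIONS[a], n, obstacles, goal)
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
        eps = max(eps_min, eps * eps_decay)  # assumed multiplicative decay
    return q


def greedy_path(q, n, obstacles=frozenset()):
    """Follow the highest-valued action from the start for at most n*n moves."""
    state, goal = (0, 0), (n - 1, n - 1)
    path = [state]
    for _t in range(n * n):
        a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
        state, _reward, done = step(state, ACTIONS[a], n, obstacles, goal)
        path.append(state)
        if done:
            break
    return path
```

A call such as `greedy_path(train_q_learning(n=8), n=8)` returns the list of cells visited by the learned greedy policy. Starting with a high epsilon and lowering it over episodes favors exploration early and exploitation later, which is the general motivation for a decaying epsilon.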

Author information

Authors and Affiliations

Department of Mechanical Engineering, Hubei University of Automotive Technology, Shiyan, 442002, China
Yuchun Huang & Chen Wang

Corresponding author

Correspondence to Chen Wang.

Editor information

Editors and Affiliations

University of Bedfordshire, Luton, UK
Yi Wang
Shanghai University of Engineering Science, Shanghai, China
Tao Yu
Department of Mechanical and Industrial Engineering, Norwegian University of Science and Technology, Trondheim, Norway
Kesheng Wang

About this paper

Cite this paper

Huang, Y., Wang, C. (2024). Improved Q-Learning Algorithm for AGV Path Optimization. In: Wang, Y., Yu, T., Wang, K. (eds) Advanced Manufacturing and Automation XIII. IWAMA 2023. Lecture Notes in Electrical Engineering, vol 1154. Springer, Singapore. https://doi.org/10.1007/978-981-97-0665-5_8

DOI: https://doi.org/10.1007/978-981-97-0665-5_8
Published: 25 February 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0664-8
Online ISBN: 978-981-97-0665-5
eBook Packages: Intelligent Technologies and Robotics (R0)
