Skip to main content

Improved Q-Learning Algorithm for AGV Path Optimization

  • Conference paper
  • First Online:
Advanced Manufacturing and Automation XIII (IWAMA 2023)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1154))

Included in the following conference series:

Abstract

With the rapid development of intelligent manufacturing technology, AGV has developed vigorously in the fields of industry, agriculture and scientific research. In recent years, how to plan an optimal path in the navigation system of the automatic guided transport vehicle has become an important topic, which has attracted the attention of scholars. Scholars from different countries have proposed different optimization algorithms for path planning problems. Among them, Q-Learning has made good progress in AGV path planning. Although Q-learning performs well in this aspect, it still has the problem of slow convergence speed and easy to fall into local optimization. To solve the above problems, a Q-learning algorithm based on the beetle antennae search algorithm (BAS-QL) is proposed. In order to improve the convergence speed, the Q table is initialized by using beetle antennae search algorithm. In order to avoid the algorithm falling into local optimum, the attenuated Epsilon value is used. Finally, the optimal path for AGV trolley walking is solved, and the BAS-QL algorithm is verified by experiments. In the n = 8 and n = 10 raster graph experiments, BAS-QL reduces 15.21% and 3.98% in average time, 22.40% and −30.4% in average path length and 77.45% and 43.33% in average iteration times of optimal path compared with Q-Learning algorithm, which shows that this method can effectively improve the efficiency of route planning for AGV.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dantzig, G.B., Ramser, J.H.: The truck dispatching problem. Manage. Sci. 6(1) (1959)

    Google Scholar 

  2. Li, Q., Li, B., Zhang, R., Jiang, T.: Research on AGV path planning based on improved Dijkstra algorithm. Mech. Eng. Autom. (01), 23–25+28 (2021)

    Google Scholar 

  3. Zhou, Y., Huang, N.: Airport AGV path optimization model based on ant colony algorithm to optimize Dijkstra algorithm in urban systems. Sustainable Computing: Inform. Syst. (2022). (prepublish)

    Google Scholar 

  4. Niu, Q., Li, B.: Omnidirectional AGV path planning based on simulated annealing genetic algorithm. Comput. Integr. Manuf. Syst. 1–21 (2022)

    Google Scholar 

  5. Soong, L.E., Pauline, O., Chun, C.K.: Solving the optimal path planning of a mobile robot using improved Q-learning. Robot. Auton. Syst. 115, 143–161 (2019)

    Article  Google Scholar 

  6. Wang, F., Zhang, K., Xie, H., Chen, M.: Path optimization of mobile robot based on improved Q-learning algorithm. Syst. Eng. 40(04), 100–109 (2022)

    Google Scholar 

  7. Low, E.S., Ong, P., Low, C.Y., Omar, R.: Modified Q-learning with distance metric and virtual target on path planning of mobile robot. Expert Syst. Appl. 199 (2022)

    Google Scholar 

  8. Song, Y., Li, Y., Li, C.: Initialization of path planning reinforcement learning for mobile robots. Control Theory Appl. 29(12), 1623–1628 (2012)

    Google Scholar 

  9. Xu, S., Gu, Y., Li, X., Chen, C., Hu, Y., Sang, Y., Jiang, W.: Indoor emergency path planning based on the Q-learning optimization algorithm. ISPRS Int. J. Geo-Inform. 11(1) (2022)

    Google Scholar 

  10. Sadhu, A.K., Konar, A., Bhattacharjee, T., Das, S.: Synergism of firefly algorithm and Q-learning for robot arm path planning. Swarm Evol. Comput. 43 (2018)

    Google Scholar 

  11. Zhang, F., Wang, C., Cheng, C., Yang, D., Pan, G.: Reinforcement learning path planning method with error estimation. Energies 15(1) (2021)

    Google Scholar 

  12. Zhou, Z., Wang, C., Zhang, X., Liu, C., Tang, Y., Zhang, W.: Research on manipulator sorting method based on machine vision and improved genetic algorithm. Manuf. Technol. Mach. Tool (02), 25–29 (2022). https://doi.org/10.19287/j.cnki.1005-2402.2022.02.004

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chen Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Huang, Y., Wang, C. (2024). Improved Q-Learning Algorithm for AGV Path Optimization. In: Wang, Y., Yu, T., Wang, K. (eds) Advanced Manufacturing and Automation XIII. IWAMA 2023. Lecture Notes in Electrical Engineering, vol 1154. Springer, Singapore. https://doi.org/10.1007/978-981-97-0665-5_8

Download citation

Publish with us

Policies and ethics