Research on intelligent collision avoidance decision-making of unmanned ship in unknown environments

Wang, Chengbo; Zhang, Xinyu; Cong, Longze; Li, Junjie; Zhang, Jiawei

doi:10.1007/s12530-018-9253-9

Research on intelligent collision avoidance decision-making of unmanned ship in unknown environments

Original Paper
Published: 20 September 2018

Volume 10, pages 649–658, (2019)
Cite this article

Evolving Systems Aims and scope Submit manuscript

Chengbo Wang^1,2,
Xinyu Zhang^1,2,
Longze Cong³,
Junjie Li^1,2 &
…
Jiawei Zhang^1,2

2153 Accesses
41 Citations
Explore all metrics

Abstract

To solve the problem of intelligent collision avoidance by unmanned ships in unknown environments, a deep reinforcement learning obstacle avoidance decision-making (DRLOAD) algorithm is proposed. The problems encountered in unmanned ships’ intelligent avoidance decisions are analyzed, and the design criteria for a proposed decision algorithm are put forward. Based on the Markov decision process, an intelligent collision avoidance model is established for unmanned ships. The optimal strategy for an intelligent decision system is determined through the value function which maximizes the return for the mapping of the in unmanned ship’s state to behavior. A reward function is specifically designed for obstacle avoidance, approaching a target and safety. Finally, simulation experiments are carried out in multi-state obstacle environments, demonstrate the effectiveness of the proposed DRLOAD algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 2

COLREGs-Compliant Collision Avoidance Method for Autonomous Ships via Deep Reinforcement Learning

Automatic Collision Avoidance Using Deep Reinforcement Learning with Grid Sensor

Path Planning of Maritime Autonomous Surface Ships in Unknown Environment with Reinforcement Learning

References

Akakpo GS, Ngankam TM (2015) A mathematical model for analysis on ships collision avoidance. Reg Marit Univ J 4:61–70
Google Scholar
Chen XM, Tian G, Miao YS, Gong JW (2017) Driving rule acquisition and decision algorithm to unmanned vehicle in urban traffic. J Trans Beijing Inst Technol 37(5):491–496
Google Scholar
Cheng Y, Zhang W (2017) Concise deep reinforcement learning obstacle avoidance for underactuated unmanned marine vessels. Neurocomputing 272:63–73
Article Google Scholar
Du M (2016) Research on behavior decision and movement planning of unmanned vehicles based on human driving behavior. Ph.D. Dissertation, University of Science and Technology of China, Hefei
Furda A, Vlacic L (2011) Enabling safe autonomous driving in real-world city traffic using multiple criteria decision making. IEEE Intell Transp Syst Mag 3(1):4–17
Article Google Scholar
Gao H (2016) Mobile robot path planning based on reinforcement learning. M.A. Thesis, Southwest Jiaotong University of China, Chengdu
Hasselt HV, Guez A, Silver D (2015) Deep reinforcement learning with double Q-learning. Comput Sci 3:1–12
Google Scholar
Li H (2004) Research on application of ship collision avoidance expert system based on AIS. Dalian Maritime University of China, Dalian
Google Scholar
Li X, Xu X, Zuo L (2016) Reinforcement learning based overtaking decision-making for highway autonomous driving. In: Sixth international conference on intelligent control and information processing, Wuhan, China, pp 336–342
Liu H, Zhang Z (2015) Sailing ship safety encounter distance. J Shipp Manag 37(10):12–13
Google Scholar
Liu Y, Song R, Bucknall R (2015) A practical path planning and navigation algorithm for an unmanned surface vehicle using the fast marching algorithm. In: OCEANS, Genova, Italy, pp 1–7
Liu Q, Zhai J-W, Zhang Z-Z, Zhong S, Zhou Q, Zhang P et al (2017) A survey on deep reinforcement learning. Chin J Comput 1–28 [2017-12-24]. http://kns.cnki.net/kcms/detail/11.1826.TP.20170119.1030.002.html
Peng G, Huang X, Yang T, Gao J, Wu Y, Xiong Y (2004) Mobile robot behavior decision and control based on neural net work and fuzzy inference. J Huazhong Univ Sci Technol 32:129–132
Google Scholar
Perera LP, Carvalho JP, Soares CG (2010) Autonomous guidance and navigation based on the COLREGs rules and regulations of collision avoidance. Advanced ship design for pollution prevention, UK
Smierzchalski R, Michalewicz Z (2000) Modeling of ship trajectory in collision situations by an evolutionary algorithm. IEEE Trans Evol Comput 4(3):227–241
Article Google Scholar
Sun L (2000) The research on mathematical models of decision making in ship collision avoidance. Ph.D. Dissertation, Dalian Maritime University of China, Dalian
Szepesvari C (2011) Algorithms for reinforcement learning. Wiley encyclopedia of operations research and management science. Wiley, Hoboken, pp 632–636
Google Scholar
Tan F, Yan P, Guan X (2017) Deep reinforcement learning: from Q-learning to deep Q-learning. In: International conference on neural information processing. Springer, Cham, pp 475–483
Temizer S, Kochenderfer M, Kaelbling L, Lozanoperez T, J Kuchar (2010) Collision avoidance for unmanned aircraft using markov decision processes. In: AIAA guidance, navigation, and control conference, Toronto, Ontario Canada, pp 1–22
Tian G (2016) Research on bionic lane change decision model of unmanned vehicles in complex dynamic urban environment. Beijing Institute of Technology, Beijing
Google Scholar
Wang B (2006) Research on obstacle avoidance of mobile robot based on multisensor information fusion. M.A. Thesis, Nanjing University of Science and Technology of China, Nanjing
Xue H, Dong HE (2017) Research on mathematical model and computer simulation of ship automatic intelligent collision avoidance. Ship Sci Technol 16:31–33
Google Scholar
Yasuno T, Kamano T, Suzuki T, Uemura K, Harada H, Kataoka Y (2015) Autonomous mobile robot based on behavior decision skill and control skill of the operator. Electr Eng Jpn 131(2):30–39
Article Google Scholar
Zhao Y-W, Tan D-L (2001) Multi-behavior integrated-decision method based on feasibility of velocity vectors. J Inf Control 30(1):72–75
Google Scholar
Zhao D-B, Shao K, Zhu Y-H, Li D, Chen Y-R, Wang H-T et al (2016) Review of deep reinforcement learning and discussions on the development of computer Go. J Control Theory Appl 33(6):701–717
MATH Google Scholar
Zhen R (2013) Research on intelligent decision method of unmanned vehicle based on augmented learning. M.A. Thesis, National University of Defense Technology, Changsha
Zheng R, Liu C, Guo Q (2014) A decision-making method for autonomous vehicles based on simulation and reinforcement learning. In: International conference on machine learning and cybernetics, Tianjin, China, pp 362–369

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grant 51309043), the Nature Science of Foundation of Liaoning Province (Grant 2015020626), the Outstanding Young Scholars Growth Plan of Liaoning Province (Grant LJQ201405), a basic research project from the Key Laboratory of Liaoning Provincial Education Department (Grant LZ2015009), and the Fundamental Research Funds for the Central Universities (Grant 3132016315).

Author information

Authors and Affiliations

Navigation College, Dalian Maritime University, Dalian, China
Chengbo Wang, Xinyu Zhang, Junjie Li & Jiawei Zhang
Key Laboratory of Marine Simulation and Control for Ministry of Communications, Dalian Maritime University, Dalian, China
Chengbo Wang, Xinyu Zhang, Junjie Li & Jiawei Zhang
School of Maritime Economics and Management, Dalian Maritime University, Dalian, China
Longze Cong

Authors

Chengbo Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xinyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Longze Cong
View author publications
You can also search for this author in PubMed Google Scholar
Junjie Li
View author publications
You can also search for this author in PubMed Google Scholar
Jiawei Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinyu Zhang.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, C., Zhang, X., Cong, L. et al. Research on intelligent collision avoidance decision-making of unmanned ship in unknown environments. Evolving Systems 10, 649–658 (2019). https://doi.org/10.1007/s12530-018-9253-9

Download citation

Received: 18 January 2018
Accepted: 06 September 2018
Published: 20 September 2018
Issue Date: December 2019
DOI: https://doi.org/10.1007/s12530-018-9253-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Research on intelligent collision avoidance decision-making of unmanned ship in unknown environments

Abstract

Access this article

Similar content being viewed by others

COLREGs-Compliant Collision Avoidance Method for Autonomous Ships via Deep Reinforcement Learning

Automatic Collision Avoidance Using Deep Reinforcement Learning with Grid Sensor

Path Planning of Maritime Autonomous Surface Ships in Unknown Environment with Reinforcement Learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation