Consideration of State Representation for Semi-autonomous Reinforcement Learning of Sailing Within a Navigable Area

Manabe, Hideaki; Tachibana, Kanta

doi:10.1007/978-3-319-23335-2_7

Hideaki Manabe³ &
Kanta Tachibana⁴

Included in the following conference series:

World Robotic Sailing championship and International Robotic Sailing Conference

870 Accesses

Abstract

To sail quickly to a goal within a navigable area, complex control of the rudder and sail is required. Sailors must determine the current action with consideration of the time series of states; i.e., both current and future states. Reinforcement learning is an appropriate method for learning a complex problem, such as sailing. In this paper, we apply the navigable area such that a robotic sailor must avoid touching a boundary. To realise a higher layer of sailing architecture, the action space is simplified and discretised to the degree of the sailboat direction change. Moreover, we utilize semi-autonomous reinforcement learning, also known as imitation learning, in which a human selects an action and a robot updates its Q-values to evaluate pairs of states and actions until the robot’s action selection is equivalent to the human’s. For semi-autonomous learning, as well as for normal reinforcement learning, a representation of the state space is important. The state representation should be defined so that the state space is discretised to specify a desirable action, thereby removing any redundancy if possible. In this paper, we verify and investigate the possibility of state representation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Konidaris G, Barto A (2008) Skill discovery in continuous reinforcement learning domains using skill chaining, advances in neural information processing systems 22 (NIPS 2009), Computer Science Department, University of Massachusetts Amherst
Google Scholar
Kuhl FP, Giardina CR (1982) Elliptic fourier features of a closed contour, Fairleigh Dickinson University, U.S. Army Armament Research and Development Command. In: Computer graphics and image processing, vol 18, pp 236–258
Google Scholar
Ross S, Gordon GJ, Bagnell JA (2011) A reduction of imitation learning and structured prediction to no-regret online learning. In: Appearing in proceedings of the 14th international conference on artificial intelligence and statistics (AISTATS), JMLR: W&CP 15, vol 15. Fort Lauderdale, FL, USA
Google Scholar
Shon AP, Verma D, Rao RPN (2007) Active imitation learning, AAAI conference on artificial intelligence, Department of Computer Science and Engineering, pp 756–762
Google Scholar
Sterne PJ (2004) Reinforcement sailing [master’s thesis], Master of science. School of Informatics, University of Edinburgh
Google Scholar
Sutton RS, Barto AG (1998) Reinforcement learning, a bradford book. The MIT Press, Cambridge
Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory for Intelligence, Kogakuin University, Nishi-Shinjuku, Shinjuku, Tokyo, Japan
Hideaki Manabe
Laboratory for Intelligence, Faculty of Informatics, Kogakuin University, Nishi-Shinjuku, Shinjuku, Tokyo, Japan
Kanta Tachibana

Authors

Hideaki Manabe
View author publications
You can also search for this author in PubMed Google Scholar
Kanta Tachibana
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Hideaki Manabe or Kanta Tachibana .

Editor information

Editors and Affiliations

Åland University of Applied Sciences, Mariehamn, Finland
Anna Friebe
Åland University of Applied Sciences, Mariehamn, Finland
Florian Haug

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Manabe, H., Tachibana, K. (2016). Consideration of State Representation for Semi-autonomous Reinforcement Learning of Sailing Within a Navigable Area. In: Friebe, A., Haug, F. (eds) Robotic Sailing 2015. WRSC/IRSC 2015. Springer, Cham. https://doi.org/10.1007/978-3-319-23335-2_7

Download citation

DOI: https://doi.org/10.1007/978-3-319-23335-2_7
Published: 12 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23334-5
Online ISBN: 978-3-319-23335-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics