Abstract
To sail quickly to a goal within a navigable area, complex control of the rudder and sail is required. Sailors must determine the current action with consideration of the time series of states; i.e., both current and future states. Reinforcement learning is an appropriate method for learning a complex problem, such as sailing. In this paper, we apply the navigable area such that a robotic sailor must avoid touching a boundary. To realise a higher layer of sailing architecture, the action space is simplified and discretised to the degree of the sailboat direction change. Moreover, we utilize semi-autonomous reinforcement learning, also known as imitation learning, in which a human selects an action and a robot updates its Q-values to evaluate pairs of states and actions until the robot’s action selection is equivalent to the human’s. For semi-autonomous learning, as well as for normal reinforcement learning, a representation of the state space is important. The state representation should be defined so that the state space is discretised to specify a desirable action, thereby removing any redundancy if possible. In this paper, we verify and investigate the possibility of state representation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Konidaris G, Barto A (2008) Skill discovery in continuous reinforcement learning domains using skill chaining, advances in neural information processing systems 22 (NIPS 2009), Computer Science Department, University of Massachusetts Amherst
Kuhl FP, Giardina CR (1982) Elliptic fourier features of a closed contour, Fairleigh Dickinson University, U.S. Army Armament Research and Development Command. In: Computer graphics and image processing, vol 18, pp 236–258
Ross S, Gordon GJ, Bagnell JA (2011) A reduction of imitation learning and structured prediction to no-regret online learning. In: Appearing in proceedings of the 14th international conference on artificial intelligence and statistics (AISTATS), JMLR: W&CP 15, vol 15. Fort Lauderdale, FL, USA
Shon AP, Verma D, Rao RPN (2007) Active imitation learning, AAAI conference on artificial intelligence, Department of Computer Science and Engineering, pp 756–762
Sterne PJ (2004) Reinforcement sailing [master’s thesis], Master of science. School of Informatics, University of Edinburgh
Sutton RS, Barto AG (1998) Reinforcement learning, a bradford book. The MIT Press, Cambridge
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Manabe, H., Tachibana, K. (2016). Consideration of State Representation for Semi-autonomous Reinforcement Learning of Sailing Within a Navigable Area. In: Friebe, A., Haug, F. (eds) Robotic Sailing 2015. WRSC/IRSC 2015. Springer, Cham. https://doi.org/10.1007/978-3-319-23335-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-23335-2_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23334-5
Online ISBN: 978-3-319-23335-2
eBook Packages: EngineeringEngineering (R0)