Skip to main content

Consideration of State Representation for Semi-autonomous Reinforcement Learning of Sailing Within a Navigable Area

  • Conference paper
  • First Online:
Robotic Sailing 2015 (WRSC/IRSC 2015)

Abstract

To sail quickly to a goal within a navigable area, complex control of the rudder and sail is required. Sailors must determine the current action with consideration of the time series of states; i.e., both current and future states. Reinforcement learning is an appropriate method for learning a complex problem, such as sailing. In this paper, we apply the navigable area such that a robotic sailor must avoid touching a boundary. To realise a higher layer of sailing architecture, the action space is simplified and discretised to the degree of the sailboat direction change. Moreover, we utilize semi-autonomous reinforcement learning, also known as imitation learning, in which a human selects an action and a robot updates its Q-values to evaluate pairs of states and actions until the robot’s action selection is equivalent to the human’s. For semi-autonomous learning, as well as for normal reinforcement learning, a representation of the state space is important. The state representation should be defined so that the state space is discretised to specify a desirable action, thereby removing any redundancy if possible. In this paper, we verify and investigate the possibility of state representation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Konidaris G, Barto A (2008) Skill discovery in continuous reinforcement learning domains using skill chaining, advances in neural information processing systems 22 (NIPS 2009), Computer Science Department, University of Massachusetts Amherst

    Google Scholar 

  2. Kuhl FP, Giardina CR (1982) Elliptic fourier features of a closed contour, Fairleigh Dickinson University, U.S. Army Armament Research and Development Command. In: Computer graphics and image processing, vol 18, pp 236–258

    Google Scholar 

  3. Ross S, Gordon GJ, Bagnell JA (2011) A reduction of imitation learning and structured prediction to no-regret online learning. In: Appearing in proceedings of the 14th international conference on artificial intelligence and statistics (AISTATS), JMLR: W&CP 15, vol 15. Fort Lauderdale, FL, USA

    Google Scholar 

  4. Shon AP, Verma D, Rao RPN (2007) Active imitation learning, AAAI conference on artificial intelligence, Department of Computer Science and Engineering, pp 756–762

    Google Scholar 

  5. Sterne PJ (2004) Reinforcement sailing [master’s thesis], Master of science. School of Informatics, University of Edinburgh

    Google Scholar 

  6. Sutton RS, Barto AG (1998) Reinforcement learning, a bradford book. The MIT Press, Cambridge

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Hideaki Manabe or Kanta Tachibana .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Manabe, H., Tachibana, K. (2016). Consideration of State Representation for Semi-autonomous Reinforcement Learning of Sailing Within a Navigable Area. In: Friebe, A., Haug, F. (eds) Robotic Sailing 2015. WRSC/IRSC 2015. Springer, Cham. https://doi.org/10.1007/978-3-319-23335-2_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23335-2_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23334-5

  • Online ISBN: 978-3-319-23335-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics