Sensorimotor self-learning model based on operant conditioning for two-wheeled robot

Zhang, Xiaoping; Ruan, Xiaogang; Xiao, Yao; Huang, Jing

doi:10.1007/s12204-017-1814-8

Sensorimotor self-learning model based on operant conditioning for two-wheeled robot

Published: 31 March 2017

Volume 22, pages 148–155, (2017)
Cite this article

Journal of Shanghai Jiaotong University (Science) Aims and scope Submit manuscript

Xiaoping Zhang (张晓平)^1,2,
Xiaogang Ruan (阮晓钢)¹,
Yao Xiao (肖尧)¹ &
…
Jing Huang (黄静)¹

167 Accesses
5 Citations
14 Altmetric
2 Mentions
Explore all metrics

Abstract

Traditional control methods of two-wheeled robot are usually model-based and require the robot’s precise mathematic model which is hard to get. A sensorimotor self-learning model named SMM TWR is presented in this paper to handle these problems. The model consists of seven elements: the discrete learning time set, the sensory state set, the motion set, the sensorimotor mapping, the state orientation unit, the learning mechanism and the model’s entropy. The learning mechanism for SMM TWR is designed based on the theory of operant conditioning (OC), and it adjusts the sensorimotor mapping at every learning step. This helps the robot to choose motions. The leaning direction of the mechanism is decided by the state orientation unit. Simulation results show that with the sensorimotor model designed, the robot is endowed the abilities of self-learning and self-organizing, and it can learn the skills to keep itself balance through interacting with the environment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Perception-Driven Learning of High-Dynamic Jumping Motions for Single-Legged Robots

Article 29 May 2024

A Survey of Wheeled-Legged Robots

Quadrupedal Locomotion in an Energy-efficient Way Based on Reinforcement Learning

Article 08 April 2024

References

CHAN R P M, STOL K A, HALKYARD C R. Review of modelling and control of two-wheeled robots [J]. Annual Reviews in Control, 2013, 37: 89–103.
Article Google Scholar
SUPRAPTO B Y, AMRI D, DWIJAYANTI S. Comparison of control methods PD, PI, and PID on two wheeled self balancing robot [C]//Proceeding of International Conference on Electrical Engineering, Computer Science and Informatics. Yogyakarta, Indonesia: IEEE, 2014: 67–71.
Google Scholar
BATURE A A, BUYAMIN S, AHMAD M N, et al. A comparison of controllers for balancing two wheeled inverted pendulum robot [J]. International Journal of Mechanical & Mechatronics Engineering, 2014, 14(3): 62–68.
Google Scholar
ALARFAJ M, KANTOR G. Centrifugal force compensation of a two-wheeled balancing robot [C]//Proceeding of International Conference on Control, Automation, Robotics and Vision. Singapore: IEEE, 2010: 2333–2338.
Google Scholar
ZHOU Y S, WANG Z H. Motion controller design of wheeled inverted pendulum with an input delay via optimal control theory[J]. Journal of Optimization Theory and Application, 2016, 168(2): 625–645.
Article MathSciNet MATH Google Scholar
LI C Q, GAO X S, HUANG Q, et al. A coaxial couple wheeled robot with T-S fuzzy equilibrium control [J]. Industrial Robot: An International Journal, 2011, 38(3): 292–300.
Article Google Scholar
NASIR A N K, AHMAD M A, GHAZALI R, et al. Performance comparison between fuzzy logic controller (FLC) and PID controller for a highly nonlinear twowheels balancing robot [C]//2011 First International Conference on Informatics and Computational Intelligence. Bandung, Indonesia: IEEE, 2011: 176–181.
Chapter Google Scholar
YUE M, WANG S, SUN J Z. Simultaneous balancing and trajectory tracking control for two-wheeled inverted pendulum vehicles: A composite control approach [J]. Neurocomputing, 2016, 191: 44–54.
Article Google Scholar
RUAN X G, WU X. The skinner automaton: A psychological model formalizing the theory of operant conditioning [J]. Science China Technological Sciences, 2013, 56(11): 2745–2761.
Article Google Scholar
RUAN X G, CHEN J, YU N G. Thalamic cooperation between the cerebellum and basal ganglia with a new tropism-based action-dependent heuristic dynamic programming method [J]. Neurocomputing, 2012, 93: 27–40.
Article Google Scholar
SKINNER B F. The behavior of organisms: An experimental analysis [M]. New York: D Appleton-Century Company, 1938.
Google Scholar
ROSEN B E, GOODWIN J M, VIDAL J J. Machine operant conditioning [C]//Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Piscataway, USA: IEEE, 1988: 1500–1501.
Chapter Google Scholar
ZALAMA E, GóMEZ J, PAUL M, et al. Adaptive behavior navigation of a mobile robot [J]. IEEE Transactions on Systems, Man, and Cybernetics. Part A: Systems and Humans, 2002, 32(1): 160–169.
Article Google Scholar
ITOH K, MIWA H, MATSUMOTO M, et al. Behavior model of humanoid robots based on operant conditioning [C]//Proceedings of 2005 5th IEEE-RAS International Conference on Humanoid Robots. Tsukuba: IEEE, 2005: 220–225.
Chapter Google Scholar
TANIGUGHI T, SAWARAGI T. Incremental acquisition of behaviors and signs based on a reinforcement learning schemata model and a spike timingdependent plasticity network [J]. Advanced Robotics, 2007, 21(10): 1177–1199.
Article Google Scholar
CHEU E Y, QUEK C, NG S K. ARPOP: An appetitive reward-based pseudo-outer-product neural fuzzy inference system inspired from the operant conditioning of feeding behavior in aplysia [J]. IEEE Transactions on Neural Networks and Learning Systems, 2012, 23(2): 317–329.
Article Google Scholar
PIAGET J. The origins of intelligence in children [M]. New York: International Universities Press, 1952.
Book Google Scholar
LEE D D, SEUNG H S. Learning in intelligent embedded systems [C]//Proceedings of the Embedded Systems Workshop. Cambridge, USA: IEEE, 1999: 133–139.
Google Scholar
NATALE L, ORABONA F, BERTON F, et al. From sensorimotor development to object perception [C]//Proceedings of 2005 5th IEEE-RAS International Conference on Humanoid Robots. Tsukuba: IEEE, 2005: 226–231.
Chapter Google Scholar
HOFFMANN H. Perception through visual motor anticipation in a mobile robot [J]. Neural Networks, 2007, 20(1): 22–33.
Article MATH Google Scholar
REN H G, SHI T, ZHANG R C. Foundation of the sensorimotor system cognitive model with operant conditioning mechanism [J]. Robot, 2012, 34(3): 292–298 (in Chinese).
Article Google Scholar

Download references

Acknowledgments

Part of this research was done at the Department of Psychology, Michigan State University. The authors would like to express their thanks to Professor LIU Taosheng and his lab for help.

Author information

Authors and Affiliations

College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, 100124, China
Xiaoping Zhang (张晓平), Xiaogang Ruan (阮晓钢), Yao Xiao (肖尧) & Jing Huang (黄静)
Department of Psychology, Michigan State University, Michigan, MI, 48824, USA
Xiaoping Zhang (张晓平)

Authors

Xiaoping Zhang (张晓平)
View author publications
You can also search for this author in PubMed Google Scholar
Xiaogang Ruan (阮晓钢)
View author publications
You can also search for this author in PubMed Google Scholar
Yao Xiao (肖尧)
View author publications
You can also search for this author in PubMed Google Scholar
Jing Huang (黄静)
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaoping Zhang (张晓平).

Additional information

Foundation item: the National Natural Science Foundation of China (No. 61375086), the Key Project of Science and Technique Plan of Beijing Municipal Commission of Education (No. KZ201210005001), the National Basic Research Program (973) of China (No. 2012CB720000), and the China Scholarship Council Program (No. 201406540017)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, X., Ruan, X., Xiao, Y. et al. Sensorimotor self-learning model based on operant conditioning for two-wheeled robot. J. Shanghai Jiaotong Univ. (Sci.) 22, 148–155 (2017). https://doi.org/10.1007/s12204-017-1814-8

Download citation

Received: 01 July 2015
Published: 31 March 2017
Issue Date: April 2017
DOI: https://doi.org/10.1007/s12204-017-1814-8

Key words

CLC number

TP 181

Document code

A

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Sensorimotor self-learning model based on operant conditioning for two-wheeled robot

Abstract

Access this article

Similar content being viewed by others

Perception-Driven Learning of High-Dynamic Jumping Motions for Single-Legged Robots

A Survey of Wheeled-Legged Robots

Quadrupedal Locomotion in an Energy-efficient Way Based on Reinforcement Learning

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Key words

CLC number

Document code

Navigation

Sensorimotor self-learning model based on operant conditioning for two-wheeled robot

Abstract

Access this article

Similar content being viewed by others

Perception-Driven Learning of High-Dynamic Jumping Motions for Single-Legged Robots

A Survey of Wheeled-Legged Robots

Quadrupedal Locomotion in an Energy-efficient Way Based on Reinforcement Learning

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key words

CLC number

Document code

Search

Navigation