Coordination of Communication in Robot Teams by Reinforcement Learning
In Multi-agent systems, the study of language and communication is an active field of research. In this paper we present the application of Reinforcement Learning (RL) to the self-emergence of a common lexicon in robot teams. By modeling the vocabulary or lexicon of each agent as an association matrix or look-up table that maps the meanings (i.e. the objects encountered by the robots or the states of the environment itself) into symbols or signals we check whether it is possible for the robot team to converge in an autonomous, decentralized way to a common lexicon by means of RL, so that the communication efficiency of the entire robot team is optimal. We have conducted several experiments aimed at testing whether it is possible to converge with RL to an optimal Saussurean Communication System. We have organized our experiments alongside two main lines: first, we have investigated the effect of the team size centered on teams of moderated size in the order of 5 and 10 individuals, typical of multi-robot systems. Second, and foremost, we have also investigated the effect of the lexicon size on the convergence results. To analyze the convergence of the robot team we have defined the team’s consensus when all the robots (i.e. 100% of the population) share the same association matrix or lexicon. As a general conclusion we have shown that RL allows the convergence to lexicon consensus in a population of autonomous agents.
KeywordsMulti-agent systems Multi-robot systems Dynamics of artificial languages Computational semiotics Reinforcement learning Self-collective coordination Language games Signaling games
Unable to display preview. Download preview PDF.
- 3.Lewis, D.K.: Convention. Harvard University Press, Cambridge (1969)Google Scholar
- 5.Maravall, D., de Lope, J., Domínguez, R.: Self-emergence of lexicon consensus in a population of autonomous agents by means of evolutionary strategies. In: Corchado, E., Graña Romay, M., Manhaes Savio, A. (eds.) HAIS 2010. LNCS, vol. 6077, pp. 77–84. Springer, Heidelberg (2010)CrossRefGoogle Scholar
- 6.Maravall, D., de Lope, J., Domínguez, R.: Self-emergence of a common lexicon by evolution in teams of autonomous agents. Neurocomputing (in press)Google Scholar
- 8.Peirce, C.S.: Selected Writings. Dover, New York (1966)Google Scholar
- 9.de Saussure, F.: Cours de Linguistic Général. Payot, Paris (1916); Ibidem Course on General Linguistics. English Edition. McGraw-Hill, New York (1969)Google Scholar