An agent-based model of the emergence and evolution of a language system for boolean coordination

Sierra-Santibáñez, Josefina

doi:10.1007/s10458-018-9384-1

An agent-based model of the emergence and evolution of a language system for boolean coordination

Published: 19 March 2018

Volume 32, pages 417–458, (2018)
Cite this article

Autonomous Agents and Multi-Agent Systems Aims and scope Submit manuscript

Josefina Sierra-Santibáñez ORCID: orcid.org/0000-0003-3529-6713¹

434 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

This paper presents an agent-based model of the emergence and evolution of a language system for Boolean coordination. The model assumes the agents have cognitive capacities for invention, adoption, abstraction, repair and adaptation, a common lexicon for basic concepts, and the ability to construct complex concepts using recursive combinations of basic concepts and logical operations such as negation, conjunction or disjunction. It also supposes the agents initially have neither a lexicon for logical operations nor the ability to express logical combinations of basic concepts through language. The results of the experiments we have performed show that a language system for Boolean coordination emerges as a result of a process of self-organisation of the agents’ linguistic interactions when these agents adapt their preferences for vocabulary, syntactic categories and word order to those they observe are used more often by other agents. Such a language system allows the unambiguous communication of higher-order logic terms representing logical combinations of basic properties with non-trivial recursive structure, and it can be reliably transmitted across generations according to the results of our experiments. Furthermore, the conceptual and linguistic systems, and simplification and repair operations of the agent-based model proposed are more general than those defined in previous works, because they not only allow the simulation of the emergence and evolution of a language system for the Boolean coordination of basic properties, but also for the Boolean coordination of higher-order logic terms of any Boolean type which can represent the meaning of nouns, sentences, verbs, adjectives, adverbs, prepositions, prepositional phrases and subexpressions not traditionally analysed as forming constituents, using linguistic devices such as syntactic categories, word order and function words.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Near-term advances in quantum natural language processing

Article 11 April 2024

Embodied human language models vs. Large Language Models, or why Artificial Intelligence cannot explain the modal be able to

Article 07 February 2024

Semantic memory: A review of methods, models, and current challenges

Article 03 September 2020

Notes

\(\text{ Bool }\) is the type of Boolean values and \(\text{ Ind }\) the type of individuals. See Sect. 2.1.1 for a definition of symple types in the \(\lambda \)-Calculus.
\(\text{ up }\) and \(\text{ le }\) are constants of type \(\text{ Ind }\rightarrow \text{ Bool }\) denoting the properties of being in an upper and left position respectively. If the agents used First Order Logic instead of Higher Order Logic to represent their meanings, they would approximate the meaning of this noun phrase by using formula \((\text{ up }(x) \vee \text{ le }(x)) \wedge \lnot (\text{ up }(x) \wedge \text{ le }(x))\).
When we omit parentheses from functional types, we assume associativity is applied to the right, so that \(\text{ Ind }\rightarrow \text{ Ind }\rightarrow \text{ Bool }\) is equivalent to \(\text{ Ind }\rightarrow (\text{ Ind }\rightarrow \text{ Bool })\).
Assuming square and upper have the standard natural language interpretations, \(square(o_1)\) will be true if \(o_1\) denotes an object that is a square, upper(square) will denote the property of being a square situated in an upper position, and \(upper(square(o_1))\) will be true if \(o_1\) is a square which is in an upper position.
Treating move as a binary relation means interpreting \(move(x,r_1)\) as ‘\(r_1\) moves x’. Currying move implies that move(x) means ‘moves x’, and \(move(x)(r_1)\) means ‘\(r_1\) moves x’.
Generalized quantifiers, e.g. \(\mathtt{every}_{\tau }\) or \(\mathtt{some}_{\tau }\), quantify over objects of type \(\tau \).
See definition 6 on page 9 for formal definitions of lexicon and lexical entry.
See Sect. 3.1 for a formal definition of the set of syntactic categories used in this paper.
We will omit parentheses within categories, taking the forward slash to be left associative, the backward slash to be right associative, and the backward slash bind more tightly than the forward slash. Thus \((\text{ np }\backslash \text{ s })/ \text{ np }\) is equivalent to \(\text{ np }\backslash \text{ s }/ \text{ np }\).
Lambek extended pure applicative categorial grammar according to a simple algebraic interpretation of the slashes. In the Lambek Calculus an expression is assigned to category \(A\slash B\) (respectively, to category \(B\backslash A\)) if and only if when it is followed (respectively, preceded) by an expression of category B, it produces an expression of category A. However, applicative categorial grammar only respects one half of the biconditional.
Syntactic categories pr / pr or pr\(\backslash \)pr are used to place the word associated with not before or after the expression associated with its argument. Categories pr / pr / pr, pr\(\backslash \)pr / pr and pr\(\backslash \)pr\(\backslash \)pr place the words associated with and,or before, between or after the expressions associated with their arguments.
Affixes are groups of letters attached to words.
A conceptualisation of a subset of objects (i.e. a meaning) is a higher-order logic term that is true for all the objects in the subset and false for the rest of the objects in the speaker’s and hearer’s context.
If F is a constant of type \(\mathtt{\text{ Ind }\rightarrow \text{ Bool }}\), i.e. a basic property, invention is not necessary, because an entry for F already exists in the common lexicon.
The reason for this is that the lexical entries for basic properties are part of the common lexicon of basic concepts initially shared by all the agents in the population.
It should be noted that although the choice of words for expressing different Boolean operators is independent of each other, the choice of a lexical entry for expressing a particular operator in a given sentence is always the same for all the occurrences of such an operator in the higher-order term expressed by such a sentence. Likewise, the choice of a lexical entry for parsing a particular word in a given sentence is always consistent for all the occurrences of such word in the sentence.
In particular, the Ciao Prolog System [10], available from www.clip.dia.fi.upm.es, has been used.
The same arguments as those used in Sect. 4.1 can be used to justify the method used to compute the score of a particular meaning obtained by parsing a given expression.
In the experiments, the syntactic category C of the expressions invented by the agents is pr.
In the experiments, the agents do not try to simplify associations between expressions and more complex meanings, except in the case of learning the expressions used by other agents to refer to the grouping operator id, which we discuss in the next subsection in the context of repair operations.
On the other hand, if a repair operation had been applied to a sentence that contained the logical operator ’or’ before any repair operation had been applied to a sentence that contained the logical operator ’and’, lexical entry 10 would have been replaced with lexical entry 14 in step 1, and lexical entry 13 would have been added to the agent’s lexicon in step 2. Later on, if the agent applies a repair operation to a sentence containing ’and’, it would replace lexical entry 9 with lexical entry 12 and step 2 would not be applied.
We describe repair operations for arbitrary syntactic categories c, because they are applicable to lexical entries for coordinators of expressions of any Boolean type. However, in the experiments, c is always pr.
They could have been triggered by the generation of ambiguous expressions for other types of meanings, e.g. \(\mathtt{and(\alpha )(not(\beta ))}\) or \(\mathtt{and(\alpha )(or(\beta )(\gamma ))}\). But in the present experiments this is not the case.
We use again an arbitrary syntactic category c to describe the adoption of lexical entries constructed during the application of repair operations, because the mechanisms proposed are applicable to lexical entries for coordinators of expressions of any Boolean type. However, in the experiments, c is always pr.
\(\mu \) is the initial score agents assign to the lexical entries they create during simplification or repair.
Adaptation captures the speed at which an agent can adapt to a new situation, which determines the time it takes for a population to agree on a common language. Amplification relates to the extent to which an agent is able to escape from a behaviour which represents a suboptimal fixed point of its response function. This is achieved by amplifying small deviations from suboptimal equilibrium states, thus making them unstable.
Basic properties are propositional constants representing properties such as being light, dark, in an upper or lower position, or on the right or left side of the whiteboard. [8, 20, 22] also use simulations with software agents and initialise the agents with a common vocabulary for basic concepts.
The length of the sliding window used in the experiments described in [41, 42] was of ten games, and that used in the experiments described in [8] of fifty games.
The results described in [41, 42] are the average of ten simulation runs, whereas those reported in [8] are the average of fifty simulation runs.
Note that null lexical variability means that all the agents in the population prefer the same expression (both word and position) for naming the Boolean operators (and,or,not) and the grouping operator (id).
In [18], the number of words invented per object in the naming game is estimated to be in O(n), where n is the population size, and the time for n / 2 words to spread in the population in \(O(n^2 \cdot \ln (n))\). The asymptotic notation O(g(n)) is defined as follows. For a given function g(n), we denote by O(g(n)) the set of functions \(O(g(n)) = \{f(n) \; : \; \text{ there } \text{ exist } \text{ positive } \text{ constants } \; c \; \text{ and } \; n_0 \; \text{ such } \text{ that } \; 0 \le f(n) \le c \cdot g(n) \; \text{ for } \text{ all } \; n \ge n_0\}\).
The experiments described in [8, 56] also use a population of ten agents.
The number of games required to reach full communicative success (or full coherence) reported in [41, 42] is the maximum number of games required in the ten simulation runs carried out in those experiments.
The number of games during which the agents invent or adopt words reported in [41, 42] is the maximum for the ten simulation runs carried out in those experiments.
The initial groups of elder, adult and young agents contain \(\frac{n}{3}, \frac{n}{3}+n\;{ mod}(3)\) and \(\frac{n}{3}\) agents respectively, where n is the population size, \(\frac{n}{3}\) is the integer division of n by 3 and \(n\;\text{ mod }(3)\) the remainder.
Note that in Figs. 6 and 9, the length of the first turnover interval is different from 1000 games, but the length of the rest of the turnover intervals is of 1000 games.
See appendix A of [42] for a discussion of the expressiveness of the language systems constructed in the experiments reported in that paper, and a proof of the ability of such language systems to unambiguously express every propositional logic formula, which are represented in [42] using Lisp-like notation [34, 35].

References

Ajdukiewicz, K. (1935). Die syntaktische konnexitat. Studia Philosophica, 1(1), 1–27.
MATH Google Scholar
Alfonseca, M., & Soler-Gil, F. J. (2013). Evolving an ecology of mathematical expressions with grammatical evolution. Biosystems, 111(2), 111–119.
Article Google Scholar
Alfonseca, M., & Soler-Gil, F. J. (2015). Evolving a predator-prey ecosystem of mathematical expressions with grammatical evolution. Complexity, 20(3), 66–83. https://doi.org/10.1002/cplx.21507. Published online.
Article Google Scholar
Allen, J. (1995). Natural language understanding (2nd ed.). San Francisco: The Benjamin/Cummings Publishing Company.
MATH Google Scholar
Bar-Hillel, Y. (1950). On syntactical categories. Journal of Symbolic Logic, 15, 1–16.
Article MathSciNet MATH Google Scholar
Baroncheli, A., Felici, M., Caglioti, E., Loreto, V., & Steels, L. (2006). Sharp transition towards shared vocabularies in multi-agent systems. Journal of Statistical Mechanics, 2006, P06014.
Article MATH Google Scholar
Batali, J. (1998). Computational simulations of the emergence of grammar. In: J. R. Hurforf, M. Studdert-Kennedy, & C. Knight (Eds.), Approaches to the evolution of language: Social and cognitive bases (pp. 405–426). Cambridge University Press.
Beuls, K., & Steels, L. (2013). Agent-based models of strategies for the emergence and evolution of grammatical agreement. PLoS ONE, 8(3), e58960.
Article Google Scholar
Briscoe, T. (Ed.). (2002). Linguistic Evolution through Language Acquisition: Formal and Computational Models. Cambridge: Cambridge University Press.
Google Scholar
Bueno, F. Cabeza, D. Carro, M. Hermenegildo, M. López-García P. & Puebla, G. (August 1997) The Ciao Prolog system. reference manual. Technical Report CLIP3/97.1, School of Computer Science, Technical University of Madrid (UPM), Available from http://www.clip.dia.fi.upm.es/.
Carpenter, B. (1997). Type-logical semantics. Cambridge: MIT Press.
MATH Google Scholar
Cartmill, E. A., Roberts, S., Lyn, H., & Cornish, H. (Eds.). (2014). Evolution of Language, Proceedings of the Tenth International Conference EVOLANG. Singapore: World Scientific.
Google Scholar
Church, A. (1940). A formulation of a simple theory of types. Journal of Symbolic Logic, 5, 56–68.
Article MathSciNet MATH Google Scholar
Clocksin, W. F., & Mellish, C. S. (Eds.). (1996). Programming in prolog (4th ed.). Berlin: Springer.
MATH Google Scholar
Colmerauer, A. Kanoui, H. Pasero, R. & Roussel, P. (1973). Un système de Communication Homme-machine en Francais, Research Report. Technical report, Groupe Intelligence Artificielle, Université Aix-Marseille II, France.
Dall’Asta, L., Baronchelli, A., Barrat, A., & Loreto, V. (2006). Non-equilibrium dynamics of language games on complex networks. Physical Review, 74(3), 036105.
Google Scholar
de Boer, B. (2001). The origins of vowels systems. Oxford: Oxford University Press.
Google Scholar
de Vylder, B. (2008). The Evolution of conventions in multi-agent systems. Ph.D. thesis, Artificial Intelligence Lab, Free University of Brussels.
Di Sciullo, M., & Boeckx, C. (Eds.). (2011). The biolinguistic enterprise. New perspectives on the evolution and nature of the human language faculty. Oxford: Oxford University Press.
Google Scholar
Garcia-Casademont, E., & Steels, L. (2016). Insight grammar learning. Journal of Cognitive Science, 17(1), 27–62.
Article Google Scholar
Gazdar, G. (1980). A cross-categorial semantics for coordination. Linguistics and Philosophy, 3(3), 407–409.
Article MathSciNet Google Scholar
Gerasymova, K. Spranger, M. & Beuls, K. (2012). A language strategy for aspect: Encoding aktionsarten through morphology. In Experiments in cultural language evolution (pp. 257–276). Springer.
Grosz, B., Jones, K., & Webber, B. (Eds.). (1986). Readings in natural language processing. Burlington: Morgan Kaufmann.
Google Scholar
Hurford, J., Studdert-Kennedy, M., & Kight, C. (Eds.). (1998). Approaches to the evolution of language: Social and cognitive bases. Cambridge University Press.
Kay, M. (1973). The MIND system. In Natural language processing, pp. 155–188. Algorithmics Press.
Kay, M. (1980). Algorithm schemata and data structures in syntactic processing. Technical Report CSL-80-12, Xerox Corporation, Reprinted in [22].
Keenan, E. L., & Faltz, L. M. (1985). Boolean semantics for natural language. In L. McNally, Y. Sharvit, & Z. Szabo (Eds.), Studies in Linguistics and Philosophy (vol. 23). Netherlands: Springer.
Kirby, S. (2002). Learning, bottlenecks and the evolution of recursive syntax. In Linguistic evolution through language acquisition: Formal and computational models, pp. 96–109. Cambridge University Press.
Lambek, J. (1958). The mathematics of sentence structure. American Mathematical Monthly, 65, 154–169.
Article MathSciNet MATH Google Scholar
Lambek, J. (1961). On the calculus of syntactic types. In R. Jakobson (Ed.) Structure of language and its mathematical aspects. Proceedings of symposia in applied mathematics (pp. 166–178). American Mathematical Society.
Lara, J. & Alfonseca, M. (2000). Some strategies for the simulation of vocabulary agreement in multi-agents communities. Journal of Artificial Societies and Social Simulation, 3(4). Retrieved January 1, 2013 from http://jasss.soc.surrey.ac.uk/3/4/2.html.
Lara, J. & Alfonseca, M. (2002). The role of oblivion, memory size and spatial separation in dynamic language games. Journal of Artificial Societies and Social Simulation, 5(2). Retrieved January 1, 2013 from http://jasss.soc.surrey.ac.uk/5/2/1.html.
Lyon, C., Nehaniv, C., & Cangelosi, A. (Eds.). (2007). Emergence of language and communication., Lecture notes in computer science Berlin: Springer.
MATH Google Scholar
McCarthy, J. (1960). Recursive functions of symbolic expressions and their computation by machine, part I. Communications of the ACM, 3(4), 184–195.
Article MATH Google Scholar
McCarthy, J. (1990). Formalizing Common Sense. Papers by John McCarthy. Ablex. Edited by Vladimir Lifschitz.
Minett, J. W., & Wang, W. S. Y. (Eds.). (2005). Language acquisition, change and emergence: Essays in evolutionary linguistics. Hong Kong: City University of Hong Kong Press.
Google Scholar
Montague, R. (1973). The proper treatment of quantification in ordinary English. In Approaches to Natural Language: Proceedings of the 1970 Stanford Workshop on Grammar and Semantics. Dordrech: Reidel, Reprinted in Thomanson. R. editor, Formal Philosophy, pp. 247–270. New Haven: Yale University Press.
Piaget, J. (1985). The Equilibration of Cognitive Structures: The Central Problem of Intellectual Development. Chicago: University of Chicago Press.
Google Scholar
Sierra-Santibáñez, J. (2001). Grounded models as a basis for intuitive reasoning. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, IJCAI-2001, pp. 401–406.
Sierra-Santibáñez, J. (2002) Grounded models as a basis for intuitive and deductive reasoning: The acquisition of logical categories. In Proceedings of the European Conference on Artificial Intelligence, ECAI-2002, pp. 93–97.
Sierra-Santibáñez, J. (2014). An agent-based model studying the acquisition of a language system of logical constructions. In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, AAAI-2014, pp. 350–357. AAAI Press.
Sierra-Santibáñez, J. (2015). An agent-based model of the emergence and transmission of a language system for the expression of logical combinations. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI-2015, pp. 492–499. AAAI Press.
Steedman, M. (1985). Dependency and coordination in the grammar of Dutch and English. Language, 61, 523–568.
Article Google Scholar
Steels, L. (1995). A self-organizing spatial vocabulary. Artificial Life, 2(3), 319–332.
Article Google Scholar
Steels, L. (1998). The origins of ontologies and communication conventions in multi-agent systems. Autonomous Agents and Multi-Agent Systems, 1(2), 169–194.
Article Google Scholar
Steels, L. (1998). The origins of syntax in visually grounded robotic agents. Artificial Intelligence, 103(1–2), 133–156.
Article MATH Google Scholar
Steels, L. (2011). Modeling the cultural evolution of language. Physics of Life Reviews, 8, 339–356.
Article Google Scholar
Steels, L. (Ed.). (2012). Experiments in cultural language evolution. Amsterdam: John Benjamins.
Google Scholar
Steels, L. (2015). The Talking Heads experiment: Origins of words and meanings (Computational Models of Language Evolution) (Volume 1). Berlin: Language Science Press.
Book Google Scholar
Steels, L. (2016). Agent-based models for the emergence and evolution of grammar. Phil. Trans. R. Soc. B, 371(20150447), 1–9.
Google Scholar
Steels, L., & Belpaeme, T. (2005). Coordinating perceptually grounded categories through language: a case study for colour. Behavioral and Brain Sciences, 28, 469–529.
Google Scholar
Steels, L. & Garcia-Casademont, E. (2015) How to play the syntax game. In Proceedings of the European Conference on Artificial Life, pp. 479–486. MIT Press.
Steels, L. & Vogt, P. (1997) Grounding adaptive language games in robotic agents. In Proceedings of the European Conference on Artificial Life, MIT Press.
Tomasello, M. (2003). Constructing a Language: A Usage-Based Theory of Language Acquisition. Cambridge: Harvard Univ Press.
Google Scholar
Tomasello, M. (2006). Acquiring linguistic constructions. In Handbook of child psychology. Wiley Online Library.
Vogt, P. (2005). The emergence of compositional structures in perceptually grounded language games. Artificial Intelligence, 167(1–2), 206–242.
Article Google Scholar
Williams, M. A., McCarthy, J., Gardenfors, P., Stanton, C., & Karol, A. (2009). A grouding framework. Autonomous Agents and Multi-Agent Systems, 19, 272–296.
Article Google Scholar
Wittgenstein, L. (1953). Philosophical investigations. New York: Macmillan.
MATH Google Scholar
Zuidema, W., & de Boer, B. (2010). Multi-agent simulations of the evolution of combinatorial phonology. Adaptive Behavior, 18(2), 141–154.
Article Google Scholar

Download references

Acknowledgements

I should like to express my gratitude to Manuel Alfonseca for reading several versions of this paper and providing valuable comments.

Author information

Authors and Affiliations

Department of Computer Science, Technical University of Catalonia, Barcelona, Catalonia, Spain
Josefina Sierra-Santibáñez

Authors

Josefina Sierra-Santibáñez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Josefina Sierra-Santibáñez.

Additional information

This work has been partially supported by funds from the Spanish Ministry for Economy and Competitiveness (MINECO) and the European Union (FEDER funds) under grant GRAMM (TIN2017-86727-C2-1-R), grant APCOM (TIN2014-57226-P), and Generalitat de Catalunya (AGAUR) under projects ALBCOM (2017 SGR 786) and LARCA (2014 SGR 890).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sierra-Santibáñez, J. An agent-based model of the emergence and evolution of a language system for boolean coordination. Auton Agent Multi-Agent Syst 32, 417–458 (2018). https://doi.org/10.1007/s10458-018-9384-1

Download citation

Published: 19 March 2018
Issue Date: July 2018
DOI: https://doi.org/10.1007/s10458-018-9384-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An agent-based model of the emergence and evolution of a language system for boolean coordination

Abstract

Access this article

Similar content being viewed by others

Near-term advances in quantum natural language processing

Embodied human language models vs. Large Language Models, or why Artificial Intelligence cannot explain the modal be able to

Semantic memory: A review of methods, models, and current challenges

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An agent-based model of the emergence and evolution of a language system for boolean coordination

Abstract

Access this article

Similar content being viewed by others

Near-term advances in quantum natural language processing

Embodied human language models vs. Large Language Models, or why Artificial Intelligence cannot explain the modal be able to

Semantic memory: A review of methods, models, and current challenges

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation