Self domestication and the evolution of language

Thomas, James; Kirby, Simon

doi:10.1007/s10539-018-9612-8

Open access
Published: 27 March 2018

Volume 33, article number 9, (2018)

Biology & Philosophy


Abstract

We set out an account of how self-domestication plays a crucial role in the evolution of language. In doing so, we focus on the growing body of work that treats language structure as emerging from the process of cultural transmission. We argue that a full recognition of the importance of cultural transmission fundamentally changes the kind of questions we should be asking regarding the biological basis of language structure. If we think of language structure as reflecting an accumulated set of changes in our genome, then we might ask something like, “What are the genetic bases of language structure and why were they selected?” However, if cultural evolution can account for language structure, then this question no longer applies. Instead, we face the task of accounting for the origin of the traits that enabled that process of structure-creating cultural evolution to get started in the first place. In light of work on cultural evolution, then, the new question for biological evolution becomes, “How did those precursor traits evolve?” We identify two key precursor traits: (1) the transmission of the communication system through learning; and (2) the ability to infer the communicative intent associated with a signal or action. We then describe two comparative case studies—the Bengalese finch and the domestic dog—in which parallel traits can be seen emerging following domestication. Finally, we turn to the role of domestication in human evolution. We argue that the cultural evolution of language structure has its origin in an earlier process of self-domestication.


Introduction

The last two decades have seen a resurgence of interest in the evolution of language. In its initial phase (e.g. Pinker and Bloom 1990), much of this work treated language structure as a genetically encoded biological trait. On this view, any structure seen in language simply reflected what was encoded in the genome, as part of a complex adaptation for communication (Pinker and Jackendoff 2005). In recent years, however, a growing body of work has begun to show that many aspects of language structure are the result of language itself adapting to constraints imposed by the way it is transmitted (see Kirby 2017; Tamariz and Kirby 2015; Kirby et al. 2014; Dediu et al. 2013 for recent reviews).

This basic finding is now supported by experimental studies and computational modelling of a range of linguistic phenomena. These include the emergence of compositionality (Kirby 2002; Brighton et al. 2005; Kirby et al. 2008); morphosyntactic regularity (Kirby 2002); recursive syntax (Kirby 2002); subjacency (Christiansen et al. 2002); regularisation of variation (Smith and Wonnacott 2010); the emergence of arbitrary signals (Theisen et al. 2010) and discrete phonological units (Oudeyer 2005, 2006); and duality of patterning (Roberts and Galantucci 2012; Verhoef et al. 2014). However, the implications of these findings (what they mean for our wider understanding of the evolution of language) have received relatively little attention. It is these implications, the new questions they raise, and the role of self-domestication in answering those questions that form our focus here.

We start by considering the kind of questions we might ask of biological evolution. Simply put, we see the process uncovered by work on the cultural transmission of language as a kind of ‘informational regularity’, akin to the regularities afforded the evolutionary process by the laws of physics and mathematics (e.g. Kauffman 1993; Goodwin 1994; Stewart 1998). Such regularities have the common feature of providing structure ‘for free’. Given this, it makes little sense, and is indeed unnecessary, to seek a biological explanation for language structure itself. Instead, the core question for biological evolution should be the origin of the key precursors that make the cultural evolution of language possible in the first place.

As a first approximation, we suggest that the emergence of structured language through cultural evolution required two key precursors. The first is that communicative signals need to be learned from others, rather than being present from birth. Cultural evolution can only occur if something is learned, transmitted between generations, and changes in response to that transmission. The second is that this learning needs to be guided by a sensitivity to the communicative intent of others. Guided, that is, by the ability to recognise when another individual’s movement, gesture or sound was made in order to communicate, and what it was intended to mean.

In addressing the origin of these precursors, we turn to the resources of comparative biology. In particular, we look for evolutionary analogies (Gould 1976): instances of similar traits emerging in other, distantly related species, through a process of parallel evolution. We identify two species, each of which exhibits aspects of one of the two precursors. In both cases, these instances of parallel evolution seem to be linked to the domestication of the species. The first of these species is the Bengalese finch (e.g. Okanoya 2012), which, following domestication, has come to rely much more on learning to transmit its song between generations, thus serving as a parallel for the first precursor. The second is the domestic dog (e.g. Hare et al. 2002), which exhibits a particularly acute awareness of when actions or gestures are meant communicatively, thus paralleling the second precursor.

This leads to the question of whether domestication might also explain the emergence of these precursors in humans. The idea that humans are a domesticated species has deep intellectual roots, tracing back at least to classical antiquity (Leach 2007). In the second half of this paper, we present an overview of the concepts, evolutionary processes, and outcomes associated with domestication, together with their applicability to the human case. Central to this discussion is the notion of the domestic phenotype: a suite of skeletal, dental, soft tissue, behavioural, and reproductive changes that are common to a wide range of domesticated species. There is now much evidence that humans also exhibit the domestic phenotype.

In summary, we argue that recent work on the cultural evolution of language renders a biological account of language structure unnecessary. Rather than seek a biological account of the emergence of language structure itself, we think the focus should be on the biological underpinnings of the cultural process. A survey of some relevant comparative studies suggests that the conditions typical of domestication may play a key role in accounting for how such a cultural process may have managed to get started. Linking this with the growing interest in the role of domestication in human evolution, we suggest that the biological precursors of structure-creating cultural evolution lie in an earlier process of self-domestication.

The cultural evolution of language

In the late 1990s, a number of researchers began to model the ways in which languages evolve culturally in response to being transmitted through multiple generations of individuals, each of which learned the system through the observation of a subset of other individuals’ signalling behaviour (see Kirby et al. 2014; Smith 2014 for reviews). That the learners only observe a subset of the signalling behaviour of the previous generation is key, as this creates a bottleneck on the transmission process. In a typical computational simulation of this process, the initial generation of learners were trained on a set of non-compositional, random signal-meaning associations. Those same individuals then went on to produce signals themselves, which then served as the input to the next generation of learners, and so on. This is the core of what is meant by the term ‘iterated learning’. The central question, then, concerned the kinds of language that can survive in a world in which learners are only ever exposed to a subset of a language’s constituent signal-meaning pairs. The core finding of this work was that random languages become compositional over the course of the simulations. Compositional languages survive transmission through the bottleneck because they are simpler, or more compressible (Brighton 2003), and thus easier to learn. Random languages do not survive, because their unseen signal-meaning pairs cannot be recovered from the pairs a learner happens to observe.
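The survival claim can be made concrete with a toy check. The sketch below is not a reimplementation of any of the cited simulations. It assumes a hypothetical 3 × 3 meaning space of coloured shapes, four-letter signals, and a learner that simply tries to split each signal into a shape half and a colour half; the syllables, function names and the particular bottleneck are all illustrative assumptions.

```python
from itertools import product

SHAPES = ["square", "circle", "triangle"]
COLOURS = ["red", "blue", "green"]
MEANINGS = list(product(SHAPES, COLOURS))  # 9 meanings


def compositional_language():
    """Signal = shape syllable + colour syllable (syllables are made up)."""
    shape_syl = dict(zip(SHAPES, ["ki", "mo", "pa"]))
    colour_syl = dict(zip(COLOURS, ["ne", "lu", "ta"]))
    return {(s, c): shape_syl[s] + colour_syl[c] for s, c in MEANINGS}


def holistic_language():
    """Same signals, reassigned by a fixed permutation so that the parts
    of a signal no longer line up with the parts of its meaning."""
    comp = [compositional_language()[m] for m in MEANINGS]
    return {m: comp[(2 * i + 1) % len(comp)] for i, m in enumerate(MEANINGS)}


def learn(observed):
    """Assumed learner: look for a consistent shape-half / colour-half
    mapping; if one exists, generalise, otherwise memorise by rote."""
    shape_map, colour_map = {}, {}
    for (shape, colour), signal in observed.items():
        head, tail = signal[:2], signal[2:]
        if (shape_map.setdefault(shape, head) != head
                or colour_map.setdefault(colour, tail) != tail):
            return dict(observed)  # no consistent rule: rote memory only
    learned = dict(observed)
    for shape, colour in MEANINGS:
        if shape in shape_map and colour in colour_map:
            learned[(shape, colour)] = shape_map[shape] + colour_map[colour]
    return learned


def fraction_recovered(language, seen):
    """Share of the full language the learner reproduces correctly after
    observing only the pairs in `seen` (the bottleneck)."""
    learned = learn({m: language[m] for m in seen})
    return sum(learned.get(m) == language[m] for m in MEANINGS) / len(MEANINGS)


# A bottleneck of 5 of the 9 pairs that still covers every shape and colour.
BOTTLENECK = [MEANINGS[i] for i in (0, 1, 4, 5, 8)]

print(fraction_recovered(compositional_language(), BOTTLENECK))
print(fraction_recovered(holistic_language(), BOTTLENECK))
```

Under these assumptions the compositional language is recovered in full from five of its nine pairs, whereas the holistic language is recovered only for the five pairs the learner actually observed.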

However, this modelling work was vulnerable to two main criticisms. The first was that the apparently emergent structure simply reflected the learning algorithms built into the agents, rather than anything about the transmission process itself. To what extent, then, is there an effect of cultural transmission, over and above that of the influence of the learning biases or algorithms of the learners? The second concerns the applicability of these findings to the real world. To what extent would the results of these simulations be mirrored in work done with human beings?

The first criticism was addressed through the application of Bayesian techniques (Griffiths and Kalish 2007; Kirby et al. 2007). The key contribution of the Bayesian approach lies in the concept of the prior, and its ability to make learning biases explicit. This allows us to see what contribution iterated transmission is making, if any, over and above that of individual biases. A crucial finding of this Bayesian work is that there are a range of conditions under which cultural transmission has the effect of amplifying learning biases. More specifically, as long as learners possess some kind of bias for structure, however weak that bias might be, cultural transmission can serve to amplify the effect of that bias, such that the resulting language is highly structured (Smith and Kirby 2008). Another way to think about this is in terms of the strength of the bias being masked (Deacon 2009) by the presence of cultural transmission. As a result, a highly structured language will emerge over the course of repeated transmission, regardless of whether the individual agents have a weak or strong bias for structure. In turn, this sets up an evolutionary process whereby the weakest possible bias in favour of structure is likely to be favoured (Thompson et al. 2016).
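A toy calculation may help to show what amplification means here. The sketch below is a deliberately minimal two-language chain, not the model used in the cited papers. It assumes one ‘structured’ language (h0) and one alternative (h1), a learner who hears n utterances each of which reflects the teacher’s language with probability 1 − eps, and two kinds of learner: one that samples a language from its posterior and one that picks the most probable language (MAP). All parameter values and names are illustrative assumptions.

```python
from math import comb


def likelihood(k, n, hyp, eps):
    """Probability of hearing k h0-type utterances out of n from a teacher
    who speaks language `hyp` (0 or 1), with noise rate eps."""
    q = 1 - eps if hyp == 0 else eps
    return comb(n, k) * q**k * (1 - q) ** (n - k)


def posterior_h0(k, n, prior_h0, eps):
    """Learner's posterior belief in h0 after hearing k h0-type utterances."""
    a = prior_h0 * likelihood(k, n, 0, eps)
    b = (1 - prior_h0) * likelihood(k, n, 1, eps)
    return a / (a + b)


def stationary_h0(n, prior_h0, eps, learner):
    """Long-run proportion of generations speaking h0.

    learner="sample": the learner samples a language from its posterior.
    learner="map":    the learner adopts the most probable language.
    """
    def prob_learner_picks_h0(teacher):
        total = 0.0
        for k in range(n + 1):
            post = posterior_h0(k, n, prior_h0, eps)
            choice = post if learner == "sample" else float(post > 0.5)
            total += likelihood(k, n, teacher, eps) * choice
        return total

    t01 = 1 - prob_learner_picks_h0(0)  # h0 teacher, learner ends up with h1
    t10 = prob_learner_picks_h0(1)      # h1 teacher, learner ends up with h0
    return t10 / (t10 + t01)            # stationary state of a 2-state chain


for prior in (0.55, 0.8):
    print(prior,
          round(stationary_h0(2, prior, 0.3, "sample"), 3),
          round(stationary_h0(2, prior, 0.3, "map"), 3))
```

With these assumed settings (n = 2, eps = 0.3), sampling learners leave the long-run share of the structured language equal to the prior, whereas MAP learners push it to 0.85 whether the prior is 0.55 or 0.8. In this toy case, then, the outcome of transmission is the same for a weak bias as for a strong one.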

The second criticism was addressed through the expansion of iterated learning to experimental studies in the lab (Scott-Phillips and Kirby 2010; Tamariz 2017). The aim of these studies is to replicate the logic of the simulations as closely as possible with real participants. They can be seen, then, as a combination of the kind of artificial language learning experiments seen in psycholinguistics (e.g. Reber 1967) with the diffusion chain paradigm from experimental cultural evolution (e.g. Mesoudi and Whiten 2008). In one of the earlier of these studies, Kirby et al. (2008) trained participants on an artificial language for naming coloured shapes. The initial language consisted of an entirely random set of signal-meaning pairs. Having been trained on half the language—an effective ‘bottleneck’ on transmission—these first participants were then asked to label the full set of shapes, forcing them to recall what they had been trained on and to generalise to the whole set. The output from this first set of participants was then used as the training language for the next set of participants, with this process being repeated for each generation.

Intriguingly, the key insight from this early experimental work was that its results did not match those of the simulation studies. As the language was transmitted between generations of participants it did indeed become simpler and easier to learn. However, it did so by becoming a degenerate, or systematically underspecified, language, in which a single signal was associated with multiple meanings. Becoming easier to learn through the simple shedding of distinct signals is clearly an adaptation to passing through the bottleneck. However, it is not a realistic account of the emergence of the kind of structured language we see in the world. This discrepancy between the experimental and simulation studies is resolved, however, if participants are placed in an interactive context (Kirby et al. 2015; Winters et al. 2015). The need to be used for communication introduces a second pressure into the environment of the language, with the result that languages again became compositional over the course of transmission.

The collective findings of the last two decades’ work on the cultural evolution of language lead us, then, to identify two key pressures in the environment of the language (Kirby et al. 2015; see also Kemp and Regier 2012). The first is that language must be learnable. If a language is too complex or difficult to learn, then it will simply not get passed on with any fidelity. This is a pressure, then, for ever greater simplicity (Brighton 2003; Brighton et al. 2005). Against this pressure to simplify, however, lies the fact that language is used to communicate. Language must be expressive enough to be useful for communication. There is, then, an inevitable trade-off in the form a culturally transmitted language comes to take. The simplest possible language would be one in which a single signal was associated with every meaning; however, this would be of little use in communication. Conversely, a language with a unique and unrelated signal for every meaning would permit totally unambiguous communication but be near impossible to learn. Cultural evolution shapes language structure in response to just this trade-off. The process of cultural transmission, with its interplay between the pressure to simplify and the pressure to have communicative utility, generates a compositional language, which is structured to meet both pressures.
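The trade-off can be illustrated with some simple counting. The sketch below assumes a hypothetical meaning space built from two features (shapes and colours) and uses two crude proxies that are our own simplification rather than measures from the cited work: the number of items a learner must memorise, standing in for learnability, and the number of meanings the language keeps distinct, standing in for expressivity.

```python
def language_costs(n_shapes, n_colours):
    """Return (items to memorise, meanings kept distinct) for three
    idealised language types over an n_shapes x n_colours meaning space."""
    n_meanings = n_shapes * n_colours
    return {
        # one signal for everything: trivially learnable, useless for communication
        "degenerate": (1, 1),
        # one unrelated signal per meaning: fully expressive, costly to learn
        "holistic": (n_meanings, n_meanings),
        # one sub-signal per feature value: fully expressive, cheap to learn
        "compositional": (n_shapes + n_colours, n_meanings),
    }


for size in (3, 10):
    print(f"{size} x {size} meaning space")
    for name, (to_learn, distinct) in language_costs(size, size).items():
        print(f"  {name:14s} items to learn: {to_learn:3d}  "
              f"meanings distinguished: {distinct}")
```

On these proxies the compositional language matches the holistic one in expressivity while its learning cost grows additively rather than multiplicatively with the size of the meaning space.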

This, then, is what we mean when we say that cultural evolution presents as a kind of ‘informational regularity’. The very process of transmission, whether implemented in simulations or in human participants, promotes the structuring of the transmitted system and serves to amplify any biases for structure that may be present in learners. Initially random systems of signals, then, become structured simply by virtue of being culturally transmitted, without any need for a concomitant change in the learners who use the system. It is in this sense that structure is provided ‘for free’ to biological evolution. In short, structured systems survive because they are easier to learn. However, as experimental work has shown, the kind of structure that results from cultural transmission is not necessarily the kind of structure we see in language. Recall, for example, how under certain circumstances the transmitted system can become systematically underspecified. The compositional structure we see in language is, then, a result of this process of cultural transmission occurring in a context where the learners use the system to communicate. Compositional structure is what results when a pressure for communicative utility is added to a process, cultural transmission, that is itself already structure-creating in nature. This renders an account of language structure rooted in biological evolution unnecessary. Instead, we argue that we should look to biological evolution to provide an account of how this cultural process became possible in the first place.

The biological precursors of a culturally evolving language

The learning of new signals

For structure to emerge through cultural evolution, it is necessary that the system be learned from others. However, the communication systems of most species are not transmitted in this way. The pattern across mammals, at least as far as vocal communication is concerned, is that most species have a limited repertoire of signals which are present in their adult form from birth (Seyfarth and Cheney 2010). We should be clear here, however, about what we mean by ‘learning’. When we talk about learning we are specifically talking about production learning (Seyfarth and Cheney 2010), where an existing signal is modified or a new signal is acquired. This stands in contrast to comprehension learning, which refers to the ability to extract a new meaning or inference from a signal; and usage learning, where the usage of a signal is modified based on the current situation or context (Janik and Slater 2000; Seyfarth and Cheney 2010).

There are, of course, examples of production learning found in nature. Among mammals, known vocal learners include some species of whales and dolphins (Reiss and McCowan 1993; Rendell and Whitehead 2001), bats (Boughman 1998), seals (Ralls et al. 1985), and elephants (Poole et al. 2005). We have no doubt that many further examples of mammalian vocal learning will be discovered in the future. Among birds, production learning is found in both parrots (Pepperberg 2010) and hummingbirds (Baptista and Schuchmann 1990). Of course, the most unequivocal evidence of vocal production learning is found in songbirds (Nottebohm and Liu 2010), many species of which require exposure to other singers during development in order to develop species-typical song (Beecher et al. 2010). The importance and widespread nature of learning in songbirds makes them a particularly good ‘natural laboratory’ for the question of how and why a central role for learning might have emerged in relation to language.

While communication through vocal signals is widespread in nature, communication through gesture (that is, through “manual communication without touching another individual or a substrate”) is found almost exclusively in apes and humans (Pollick and de Waal 2007: 8184). The gestural communication of apes is significantly more flexible and less tied to emotional reactions or specific contexts than either their vocal or facial expressions (Pollick and de Waal 2007). The comprehension and usage of ape gestures in the wild is known to shift between contexts (Hobaiter and Byrne 2011), and the emergence of new, non-species-typical gestures has been observed in captivity (Leavens et al. 2005). Of course, gesture, although learned, is not the predominant modality of language as we know it today. This, amongst other things, has led some to suggest that language may have originated in the gestural modality, only later becoming primarily vocal (e.g. Corballis 2002). In contrast, others have suggested that it is not so much that language itself switched modality, but that the same underlying cognitive capabilities that permit the flexibility of learned gesture in apes may have been extended to the vocal domain (e.g. Tomasello 2008).

Communicative inference: linking signals to meanings

However, vocal production learning is not itself enough. What is required for language is the production learning of new signal-meaning associations. There seems little evidence that any of the vocal production learners discussed above are learning new signal-meaning associations, or even that their signals have any semantic content at all (Fitch 2005). Even in one of the clearest examples of vocal production learning, that of the songbirds, there appears to be no evidence that there is any semanticity to the learned song, or that song elements can be rearranged to yield changes in meaning (Berwick et al. 2011). There are, however, some instances of signal-meaning associations being learned in apes. Learning of this kind can be seen in the process of ontogenetic ritualisation (Tomasello 1996), in which signal-meaning associations are constructed through repeated interactions. It can also be seen in ape language research (Savage-Rumbaugh et al. 1986, 1998, 2005; Lyn 2007), in the form of learned lexigrams and gestures.

Language is unusual, however, because it is both learned and symbolic (Deacon 1997). As such, the link between signals and their meanings is neither innately specified nor inherent in the form of the signal (Oliphant 2002). This greatly complicates the task of acquiring new signal-meaning pairs, because it requires not just associative learning between items, but also some way of figuring out what words actually mean. To learn a new signal-meaning pair in a language-like system, then, requires the capacity to infer what a communicator intended the signal to mean.

In language, the inferential acquisition of new signal-meaning pairs is most clearly exemplified by word learning. Many different processes are likely involved in word learning (Markman 1994; Samuelson and Smith 1998; Saffran 2003; Smith et al. 2011). However, it is the social-pragmatic account (e.g. Tomasello 2000) that has the most to say about the problem of meaning inference. This account is rooted in our awareness of others as intentional agents (Tomasello 1999), and our capacity to engage in joint-attentional activities (Tomasello et al. 2005), against a background of mutually shared knowledge, expectations and goals. This background, often referred to as ‘common ground’ (Clark 1996) or in terms of a ‘mutual cognitive environment’ (Sperber and Wilson 1995), creates a situation in which the range of potential referents for a given utterance is drastically reduced. In summary, then, our second precursor is not simply the production learning of new signal-meaning associations, but the ability to acquire these associations through an inference of communicative intent.

However, there is an even more basic form of this precursor, which stands as a requirement for any account of learned symbols to be possible in the first place. This concerns the recognition that an action or behaviour was meant communicatively at all (Scott-Phillips et al. 2009). In contrast to inferring the meaning of a particular signal, we might call this a general sensitivity to communicative intent: an awareness, that is, that a particular signal or action was made in order to communicate. Given that the full suite of capacities underpinning joint-attentional situations and the inference of communicative intent is likely unique to humans, we think it more promising to focus on this more basic form of the precursor.

The origin of the precursors in domestication

In the following sections, we discuss two comparative studies, each of which presents an evolutionary analogy for one of the two preconditions for a structure-creating process of cultural evolution. In each of the two examples, the parallel evolution of these key precursor traits occurred in the context of domestication. We explore what it is about domestication that likely led to this outcome.

The Bengalese finch and the learning of signals

The Bengalese finch is a domesticated strain of the white-rumped munia (Okanoya 2002), a bird native to tropical continental Asia and some of the surrounding islands. For the last 250 years the Bengalese finch has been bred in Japan for its white plumage (Okanoya 2004; Svanberg 2008). Importantly, the Bengalese finch has not been bred for its song. Despite this, the song of the Bengalese finch has changed remarkably over the course of its domestication. It is the nature of these changes, together with the reasons why domestication had this kind of effect on its song, that makes this bird significant for those interested in the cultural evolution of language.

The role played by learning in songbirds differs along a number of dimensions (Beecher and Burt 2004; Beecher and Brenowitz 2005; Beecher et al. 2010; Soma 2011). As such, the changes to the Bengalese song brought about through domestication are best appreciated against a backdrop of the similarities between the Bengalese finch and its wild ancestor. Firstly, both the wild and domesticated strains are closed learners (Okanoya and Yamaguchi 1997; Soma et al. 2006), meaning they can only acquire their species-typical song during a developmental ‘sensitive period’. Secondly, both strains require exposure to conspecific song during development (Bao et al. 2003; Peng et al. 2012). Species-typical song will not develop if the birds are reared in isolation, although it can in some other species (e.g. Kroodsma et al. 1997; Leitner et al. 2002). Finally, both the wild and domesticated strains are ‘social learners’, who learn better from conspecifics than from prerecorded ‘tape tutors’ (Eales 1989; Soma 2011).

Both species, then, are vocal learners. Domestication has not turned a non-learner into a vocal learner. What has changed, however, is the role and importance of learning—specifically, learning from others—to the transmission of the song between generations. This can be seen in three further dimensions along which the wild and domesticated strains differ. Firstly, the domesticated Bengalese now sings a much more complex and syntactically rich song, with greater levels of unpredictability in the patterns of transition between notes and note groups than is seen in the wild munia (Okanoya 2002, 2012). Secondly, cross-fostering experiments (Takahasi and Okanoya 2010) have shown that Bengalese chicks exhibit much lower copying fidelity in what they learn from tutor birds. Whereas munia chicks copy tutors with a high level of fidelity, Bengalese chicks combine the tutor’s song with their own improvisations and variations. Finally, and most importantly, Bengalese finches are much less constrained in what they are able to learn. Song learning in the white-rumped munia is highly canalized, such that munia chicks are only able to acquire a narrow range of species-specific song. In contrast, Bengalese chicks are much less constrained in what they are able to learn (Takahasi and Okanoya 2010).
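One way to make ‘unpredictability in the patterns of transition between notes’ concrete is to measure how uncertain the next note is given the current one. The sketch below computes a first-order transition entropy over a sequence of note labels. This is an illustrative choice of measure, not necessarily the one used in the cited studies, and the two example songs are invented strings rather than recorded data.

```python
from collections import Counter, defaultdict
from math import log2


def transition_entropy(song):
    """Average uncertainty (in bits) about the next note given the current
    note, estimated from the note-to-note transitions in `song`."""
    counts = defaultdict(Counter)
    for current, following in zip(song, song[1:]):
        counts[current][following] += 1

    total = len(song) - 1  # number of transitions observed
    entropy = 0.0
    for followers in counts.values():
        n_current = sum(followers.values())
        for count in followers.values():
            p = count / n_current
            # weight each note's entropy by how often that note occurs
            entropy -= (n_current / total) * p * log2(p)
    return entropy


linear_song = "abcabcabc"    # each note has exactly one possible successor
branching_song = "abacabac"  # note 'a' is followed by 'b' or 'c'

print(transition_entropy(linear_song))
print(transition_entropy(branching_song))
```

A strictly linear song scores zero on this measure, since every transition is fully predictable, whereas a song with branching transitions scores above zero.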

Three important points follow from these differences. The first is that the reduction in learning constraints seen in the Bengalese finch means that the specifics of experience during development (e.g. the particular tutor used as a model) have a much greater influence on the structure of the resulting song. The second is that the reduction in high-fidelity copying, combined with the broader range of what Bengalese chicks will copy, has resulted in much greater variation in song between different finches than is seen in their munia ancestors. Finally, all three of these differences combined have meant that many Bengalese finches have come to sing songs of much greater complexity than those seen in white-rumped munias.

In the wild-living white-rumped munia, we have an example of a stereotypic, highly canalized communication system in which learning plays a minimal role. In its domesticated descendant, the Bengalese finch, song learning is less canalized, the songs themselves are less stereotypic and the influence of traditional transmission on song structure has increased. We see in this example, then, a parallel with the first of the preconditions identified above: an increase in the role of learning and cultural transmission. This change occurred in the context of domestication. Recall, however, that despite this context it cannot be attributed to artificial selection for more complex song. Why, then, might domestication have changed this bird’s song in this way?

One of the major characteristics of domestication is the buffered nature of the environment (Zohary et al. 1998; Price 1999, 2002; Deacon 2010), in which organisms are no longer subject to many of the selective pressures typically found in the wild, such as predation, unpredictable variation in food supply, and climatic variation. Deacon (2003, 2009, 2010) has proposed that domestication operated to relax various selection pressures on munia song that had kept it simple and stereotypical in the wild, allowing the song to become more complex under domestication (see also Ritchie and Kirby 2007). This relaxation of selective pressure, argues Deacon, resulted in a breakdown of the learning biases and other factors that had kept the song simple in the wild and served to restrict the potential role for learning in shaping song characteristics. In turn, this opened up the possibility for learning and other aspects of early experience to influence song structure to a much greater degree under domestication. It is important to note that this is not the relaxation of selective pressure per se, such that no selection occurs, but of specific pressures that served to restrict the potential contribution of learning and individual experience to the resulting song.

One such pressure concerns the need for accurate species recognition. Kagawa et al. (2012) compared the songs of three wild populations of white-rumped munia on the island of Taiwan. The syntactic complexity of munia song was found to vary in relation to the number of sympatric, closely related species. One of the key functions of song is species recognition, which is important in order to avoid the infertile hybrids that often result from cross-species matings. This is best achieved through the use of simple, stereotypic songs that exhibit little variation. In locations with fewer sympatric close relations, however, the selective pressure on species recognition is relaxed. The greater song complexity found in areas with fewer sympatric species could well be another example of song complexification following a relaxation of selective pressure. Kagawa et al. have, then, identified a key selection pressure that is both relaxed under domestication and found to be related to song complexity in the wild.

The second strand of evidence relates to the differing levels of stress hormones found in white-rumped munia and Bengalese finches. Suzuki et al. (2012) report measurements of fecal corticosterone, a hormone known to be directly involved in the development of the song system (Suzuki et al. 2011). Bengalese finches were found to have lower levels of corticosterone than white-rumped munia, regardless of whether the munia had been wild-caught or captive raised, indicating that it is domestication of the lineage that matters and not simply the conditions in which an individual bird was raised. Indeed, changes in hormonal regulation are known to commonly follow from domestication more generally (Price 2002; Trut et al. 2009). A range of work shows that higher levels of corticosterone negatively affect the development of the song system and can reduce the complexity of the resulting song (Spencer et al. 2003; Buchanan et al. 2004). If this is the case, then the finding that domestication can reduce levels of corticosterone in finches—perhaps through consistently reduced levels of stress in a buffered environment—might well provide a physical mechanism whereby the relaxation of selection following domestication could induce song complexification.

Finally, it is also clear that both female Bengalese finches and female munias have a preference for more complex song (Okanoya 2002). The potential role of sexual selection is somewhat attenuated by the fact that Bengalese breeding has long been under human control, although there is still scope for sexual selection to influence song structure through the higher ‘breeding efficiency’ of bird pairs in which the male sings a more complex song (Okanoya 2004). The precise nature of the interplay between relaxed selection and female preference remains unclear. It may be as simple as the two factors acting to reinforce one another. Of course, we can ask why such female preference for complexity should be satisfied through complexity that is learned, rather than, say, the impressive improvisation seen in some other species (e.g. Leitner et al. 2002), either of which would fit equally well with the major selective theory of song complexity in birds, the developmental stress theory (Nowicki et al. 1998; Buchanan et al. 2004; Ritchie et al. 2008). One possibility is that the environment of domestication, having already relaxed selection on song simplicity, and thus facilitated a greater role for learning in song transmission, set up just the conditions for a demonstration of fitness through learning rather than through improvisation or other means (Thomas 2013).

The domestic dog and communicative inference

Starting in the late 1990s a number of studies appeared describing how domestic dogs were particularly adept at using human communicative cues, such as pointing (Hare et al. 1998; Soproni et al. 2001), gaze (Hare et al. 2002), location markers (Agnetta et al. 2000), and even 3D replicas and photographs (Kaminski et al. 2009). Of particular interest was the fact that dogs seemed to outperform chimpanzees and other apes (Hare et al. 2002; Hare and Tomasello 2005; Gómez 2005; Miklósi 2007), and indeed seemed more similar to human children in this respect, although the true capacity of apes in this is a matter of debate (see Mulcahy and Call 2009; Mulcahy and Hedge 2012; Kirchhofer et al. 2012). Furthermore, these abilities are found across a wide range of different breeds (Wobber et al. 2009), including breeds that had been bred as working dogs like retrievers, companion dogs like toy poodles, and even once-domestic but now-feral breeds like the Australian dingo (Smith and Litchfield 2010).

These studies all utilised a variant of the object choice task (see Miklósi and Soproni 2006). In this procedure, a piece of food or other desirable item is placed in one of two or more locations. The location of the food is then indicated to the subject through pointing or some other cue, and the subject is then allowed to choose between the locations. The question of interest is whether the subject can use the cue to select the correct location. More specifically, however, what matters is not the ability to respond to the cues per se, but the extent to which a comprehension of the communicative nature of the cues is necessary for success on the task. It is quite possible, for example, to be successful with some cues, such as location tapping or sustained, close-in pointing, purely as a result of stimulus enhancement. Other cues, however, such as iconic representations and brief points from more distant locations, are much less salient in this regard. Finally, such comprehension is even more strongly confirmed if responses are modified based on the ostensive content of those cues, for example by responding differently to intentionally given communications than to very similar physical actions produced ‘by accident’.
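To make the logic of the object choice task concrete, the following is a minimal sketch of how performance might be compared against chance. It is an illustration only, not the analysis used in any of the studies cited: the function names, the one-sided binomial test, the 0.05 threshold, and the trial counts in the usage note are all assumptions introduced here.

```python
from math import comb


def binomial_tail(successes: int, trials: int, chance: float = 0.5) -> float:
    """Probability of at least `successes` correct choices in `trials`
    if the subject were choosing among the locations at random."""
    return sum(
        comb(trials, k) * chance**k * (1 - chance) ** (trials - k)
        for k in range(successes, trials + 1)
    )


def above_chance(successes: int, trials: int,
                 chance: float = 0.5, alpha: float = 0.05) -> bool:
    """True if random choice is an unlikely explanation of the score
    (one-sided test; the alpha level is an assumed convention)."""
    return binomial_tail(successes, trials, chance) < alpha
```

With two locations, chance is 0.5; with three, it would be 1/3. Under these assumptions a hypothetical subject choosing correctly on 15 of 18 trials would count as above chance, whereas 10 of 18 would not. Note that such a test addresses only whether the cue was used, not whether its communicative nature was understood, which is the distinction the paragraph above turns on.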

Studies with wolves and young puppies suggest that this ability in dogs is neither a simple inheritance from the canid line more generally, nor dependent on exposure to humans during development. Miklósi et al. (2003) compared dogs and wolves that had been socialised with humans to a comparable level. They found that dogs significantly outperformed wolves on the object choice task. Virányi et al. (2008) conducted a longitudinal study with sets of hand-reared wolves and dogs. When tested at a young age, the dogs significantly outperformed the wolves despite similar levels of exposure to humans. They then went on to re-test the wolves at regular intervals. The wolves’ performance steadily increased with each re-testing, such that eventually the best subset of these highly trained wolves reached a level of performance comparable with that of naive dogs, who had not previously been tested. Echoing Virányi et al.’s findings, Riedel et al. (2008) found a similar level of performance across dogs of all ages, including puppies as young as 6 weeks old.

We should note, however, that others have disputed the claims of dogs’ superiority to wolves and the presence of the capacity in young dogs (Udell et al. 2008; Wynne et al. 2008). This has led to the so-called ‘two-stage hypothesis’, which holds that dogs’ abilities stem from a combination of an initial exposure to humans during their early socialisation period, followed by extended reinforcement learning over the course of life (Udell et al. 2010). This, then, is something of a domain-general account of how these abilities emerged, in contrast to the more domain-specific account rooted in domestication. However, a range of further evidence suggests that the abilities found in dogs go well beyond what could reasonably be accounted for through a domain-general effect of reinforcement learning.

The most significant of these further findings concerns a number of parallels between dogs and human infants in their response to communicative cues. Firstly, like human infants, but unlike other apes (Hare and Tomasello 2004; Herrmann et al. 2006), dogs appear to show a particular sensitivity to cues in co-operative contexts (Pettersson et al. 2011), rather than in competitive situations. Secondly, dogs, again like human infants, show a particular sensitivity to the ostensive content of signals and cues (Kaminski et al. 2012). Dogs respond differently to intentionally given cues than to similar actions produced ‘accidentally’, and show sensitivity to a range of ostensive cues, such as establishing eye contact and calling their name. Finally, dogs even exhibit some similar errors to those seen in human infants in interpreting communicative cues (Topál et al. 2008, 2009), including the so-called A-not-B error related to object permanence.

We should pause here to note that these abilities have been investigated in a number of species other than dogs, including dolphins (Pack and Herman 2004, 2006, 2007), seals (Shapiro et al. 2003; Scheumann and Call 2004), horses (Proops et al. 2010), and goats (Kaminski et al. 2005). Studies have also been conducted with a number of bird species including parrots (Giret et al. 2009), and numerous kinds of corvids (e.g. Schloegl et al. 2008; Tornick et al. 2011). In many cases the results can be explained in terms of stimulus enhancement, with levels of correct response correlating to the saliency of the cue used. In some cases, however, particularly dolphins and seals, there does indeed seem to be some genuine understanding of the communicative nature of the cues. Much like with socialised wolves, though, these more impressive cases typically involve individuals who have had intensive, long-term contact with humans, often participating in research programs, demonstrations or shows for many years. In addition, there have been a number of studies of other domesticated species, including cats, horses and goats (see Miklósi and Soproni 2006; Thomas 2013 for reviews), which have returned somewhat inconclusive results.

Having an evolutionary history of domestication is not, then, a necessary condition for the sophisticated utilisation of human communicative cues. However, there may be multiple routes, each comprised of different proportions of phylogenetic and ontogenetic contributions, that can lead to similar phenotypic outcomes (Miklósi and Topál 2011). Broadly speaking, the ontogenetic route, taken by dolphins, seals and intensively socialised wolves, consists of long-term exposure to humans. In contrast, the phylogenetic route, seemingly taken by the dog over the course of domestication, means it requires little or no exposure to humans for comparable capacities to become manifest (Miklósi and Topál 2011). We are left, then, with much the same question as followed from the case of the Bengalese finch: what is it about the process of domestication that caused this change in dogs? Fortunately, however, there is a long-running experiment, expressly designed to investigate the domestication of the dog.

The farm fox experiment (Belyaev 1979; Trut 1999; Trut et al. 2009) was started in 1959 by the Russian geneticist Dmitry K. Belyaev. The experiment took the Siberian silver fox (a regional variant of the more familiar red fox) as its model animal, and began a selective breeding program, still running today, to recreate the domestication of the dog, and to investigate the origins of the physical and behavioural characteristics typical of domesticated species. At the core of the experiment is the breeding of three lines of foxes: a tame line, an aggressive line, and a control line. For reasons of clarity and space we will focus on the tame-line foxes.

Selection in the tame-line foxes was solely based on their temperament, as assessed through their reactions to humans (Kukekova et al. 2006, 2008, 2012). Foxes were then classified into groups based on their overall aggressive behaviour, with the tamest, least aggressive foxes known as the ‘domesticated elite’. The selective pressure applied to the tame line of foxes was very strong, with only the tamest 10% of individuals being allowed to breed (Trut et al. 2009). Unsurprisingly, this rapidly increased the percentage of foxes classified as ‘domesticated elite’, from 1 to 2% at the beginning of the experiment to almost the entire population after fifty or so generations (Trut et al. 2009).
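The force of this kind of selective regime can be illustrated with a toy simulation of truncation selection, in which only the highest-scoring tenth of each generation breeds. This is a deliberately simplified sketch and not a model of the farm fox experiment itself: the single numerical ‘tameness’ score, the midparent inheritance rule, the noise term, the population size, and all function names are assumptions made for illustration, and the sketch says nothing about the correlated changes discussed below.

```python
import random
from statistics import mean


def next_generation(scores, rng, breeding_fraction=0.10, noise_sd=1.0):
    """Breed a new generation from the top `breeding_fraction` of scores.

    Assumed inheritance rule: offspring score = mean of two distinct
    breeders plus Gaussian noise (a hypothetical simplification).
    """
    n_breeders = max(2, int(len(scores) * breeding_fraction))
    breeders = sorted(scores, reverse=True)[:n_breeders]
    offspring = []
    for _ in range(len(scores)):
        mother, father = rng.sample(breeders, 2)
        offspring.append((mother + father) / 2 + rng.gauss(0.0, noise_sd))
    return offspring


def simulate_tame_line(generations=10, population=200, seed=0):
    """Return the mean score of each generation, starting with generation 0."""
    rng = random.Random(seed)
    scores = [rng.gauss(0.0, 1.0) for _ in range(population)]
    history = [mean(scores)]
    for _ in range(generations):
        scores = next_generation(scores, rng)
        history.append(mean(scores))
    return history
```

Calling `simulate_tame_line()` returns the mean score for the founding population and each subsequent generation. Because the rule treats the trait as strongly heritable, the mean climbs quickly, which is the qualitative point: selection this intense shifts a population within a handful of generations.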

What is perhaps more surprising, however, is the range of other changes that also occurred in the tame line of foxes, as listed in Table 1 (after Trut 1999; Kukekova et al. 2006; Trut et al. 2009; Bidau 2009).

Table 1 Correlated phenotypic changes following selection on temperament

The most striking thing about this list is how many of these changes are typically found in domesticated species (Price 1999), forming part of the domestic phenotype. One remarkable finding of the farm fox experiment, then, is that many of these typical outcomes of domestication can be produced simply as a by-product of selection against aggression. For present purposes, however, the most important change that occurred in the tame line of foxes was that, like domestic dogs, they also came to exhibit a sensitivity to communicative intent.

Hare et al. (2005) conducted an object-choice task, similar to those described above, comparing the abilities of dog pups, tame-line domesticated fox kits and control fox kits. The three groups were tested on their ability to use a point-and-gaze cue to select the correct location of some hidden food. The two major findings were that tame-line fox kits performed as well as dog puppies, and that the tame-line kits outperformed kits of the control population. There was also no evidence of learning during the experiment, as the tame-line kits performed as well in the initial trials as in later ones.

Temperament is the only criterion on which these foxes were selected. The fact that the sensitivity to communicative intent has emerged in the tame fox line lends support, therefore, to the emotional reactivity hypothesis (Hare et al. 2005; Hare and Tomasello 2005; Melis et al. 2006). This is the view that cognitive changes, particularly those involving co-operative behaviour, may not always require direct selection, but can appear as a by-product of selection acting on systems of emotion or aggression that had previously prevented the use of preexisting skills in these kinds of co-operative contexts. This speaks directly to the question of why and how domestication might have resulted in this ability emerging in dogs. The answer arising from the farm-fox experiment is that such capacities are likely to have emerged as a by-product of selection targeting defensive and aggressive behaviours.

Bridging the ‘gap’ to humans

In the Bengalese finch, relaxed selection, changes in the regulation of stress hormones, and female preferences have combined to expand the role played by learning. This provides a parallel to the first of our precursor traits, regarding the importance of learning in the transmission of a communication system. Recall that learning plays little role in the transmission of most species’ communication systems. The Bengalese finch provides us with a documented case study of how learning might take on a greater role. In the domestic dog, selection on temperament has enabled the emergence of a particularly acute sensitivity to communicative cues. This serves as a parallel to our second precursor trait, that the kind of learning required for a system like language is one that is fundamentally rooted in communicative inference. Of course, neither the Bengalese finch nor the domestic dog provide a full analog to their respective traits in humans. It is, after all, no surprise that the full depth and complexity of language learning and human social cognition would not be present in other species. However, in both instances we see the parallel evolution of the core elements of the two precursor traits which we identify as underpinning the cultural evolution of language structure. We think the fact that both these instances of parallel evolution occurred in the context of domestication provides an important clue as to how these key precursor traits might have evolved in humans.

However, we also acknowledge that there remains a significant explanatory “gap” between humans and language on the one hand, and the two case studies of domestication on the other. If we were to be critical of our argument so far, we might put it somewhat like this. What we have is two “pieces” that appear to fit together: the preconditions required for a structure-creating process of cultural transmission, and the two case studies of domestication in which parallels to those preconditions can be seen emerging. What remains to be demonstrated is whether, and even how, these two pieces might be part of the same “puzzle”. Several questions naturally arise here. What is ‘domestication’? What has domestication got to do with human evolution? How could domestication-like changes have occurred in humans?

What is domestication and why is it relevant to humans?

Why should we even consider the possibility of domestication having played a role in human evolution? After all, does not domestication require that there be a domesticator, that is, an outside agency selectively breeding the species? In this section we contrast two conceptions of domestication (see Thomas 2013): (1) the conditions view, in which domestication is characterised in terms of being under the control of another species; and (2) the outcomes view, in which domestication is characterised by the typical traits that are shared by many domesticated species, known as the domestic phenotype.

The conditions view of domestication

The view of domestication held by many people is probably well captured by the following quote:

[a domestic animal is] bred in captivity for the purposes of subsistence or profit, in a human community that controls its breeding, its organisation of territory and its food supply.

(Clutton-Brock 1992: 41, our emphasis)

As the emphasis makes clear, this view focuses on domestication as the human ‘mastery’ of nature, through the control of other species, by humans, for our own conscious purposes. To an extent, of course, this description of domestication is accurate. However, it also brings with it a number of problems.

For one, while it is an accurate description of the current-day living conditions of many domesticated species, it is an entirely insufficient account of how those species came to take on their present-day characteristics. This is because many aspects of the domestic phenotype can be traced not to selective breeding but to continuing natural selection under domestication (Price and King 1968; Price 1999). The environment of domestication is characterised by reduced living space, increased predictability of food and water supply, dietary changes, an altered social structure, and greater availability of shelter from the elements, resulting in profound changes to an organism’s microclimate (Price and King 1968; Carlstead 1996; Price 1999). Against this backdrop, major evolutionary changes should be expected even in the total absence of any artificial selection. A range of domestication-typical changes in mammals, birds, and fish have been associated to some degree with natural selection under domestication. These include reductions in body size (Tchernov and Horwitz 1991); reductions in cranial and skeletal robusticity (Zohary et al. 1998; Houde et al. 2010); reduced sexual dimorphism (Polák and Frynta 2009, 2010); reduced brain size (Kruska 2005); the breakdown of seasonal breeding patterns (Price 1999; Tchernov and Horwitz 1991); and changes in temperament, environmental reactivity, and predator vigilance (Håkansson and Jensen 2008; Campler et al. 2009).

In addition, the conditions view of domestication has the tendency to make us view it as a unitary process. Historically, however, there have been a number of ‘pathways’ to domestication (Zeder 2012). These are as varied as the prey pathway, where a previously hunted animal comes under direct human control, as was the case with sheep, goats, and cattle; and the commensal pathway, where the process of domestication is initiated by the domesticated species itself in coming to live among humans, as was the case for dogs (Morey 1994). Finally, the systematic application of selective breeding is a recent development in the long history of domestication (Leach 2007), which is measured in tens of millennia. All of this is not to say that artificial selection and selective breeding are unimportant. Rather, the point is that the domestic phenotype cannot be reduced to the product of selective breeding. It is the outcome of a range of evolutionary processes taking place against a particular environmental backdrop, much of which has long been shared by humans themselves.

The outcomes view of domestication

In contrast to the view described above, it is also possible to view domestication in terms of its typical evolutionary outcomes. It has long been known that many phenotypic similarities can be seen across a wide range of domesticated species (Darwin 1868; Price and King 1968; Price 2002). This suite of phenotypic changes has come to be known as the domestic phenotype. The following tables list some of its main characteristics, and should be read in terms of how domesticated species typically differ from their wild equivalents (Tables 2, 3). The tables are based on overviews by Leach (2003), Price (1984, 1999, 2002), Clutton-Brock (1999) and Trut et al. (2009).

Table 2 Hard tissue changes in the domestic phenotype

Table 3 Soft tissue and behavioural changes in the domestic phenotype

This view of domestication is, of course, not incompatible with the conditions view; however, a focus on the evolutionary outcomes of domestication has a number of advantages as a general ‘organising framework’ for thinking about domestication in general, and about the possibility of human self-domestication in particular. Firstly, by focusing on the outcomes of domestication it remains agnostic about the processes and pathways that lead to those outcomes. Secondly, it provides an objective set of criteria for assessing whether a given species is indeed ‘domesticated’. Indeed, the domestic phenotype is used by archaeologists as diagnostic of domestication having occurred in the past (Zeder et al. 2006). Finally, it allows us to re-frame the question of human self-domestication in very concrete terms, and away from potentially unhelpful metaphorical formulations. Humans can be considered domesticated to the extent that: (1) they share in the domestic phenotype; and (2) those phenotypic similarities have arisen in response to similar evolutionary circumstances and selective pressures, and are underpinned by similar biological mechanisms.

The domestic phenotype in humans

The idea that humans are a ‘self-domesticated’ species has deep intellectual roots, tracing back at least to classical antiquity (Leach 2007). Over the centuries this view has picked up a number of unpleasant political associations (Brüne 2007). However, from a scientific perspective the main driver of the idea has been the observation that humans, too, share many aspects of the domestic phenotype. This observation can be seen in the writings of Charles Darwin (1871), the anthropologist Franz Boas (1938), and any number of more recent scholars who have compared aspects of human evolution to the outcomes of domestication (e.g. Ashley Montagu 1955; Gould 1977; Leach 2003, 2007; Hare and Tomasello 2005; Deacon 2009, 2010; Bednarik 2012). Unlike most domesticated species, modern humans have no living ‘wild’ ancestor against which their phenotypic traits can be compared. As such, most of these observations compare the modern human phenotype with trends over the course of human evolution, as seen in the fossilised remains of human ancestors, or, where this is not possible, with their closest living relatives, the great apes.

Modern humans have shown a marked decrease in skeletal and cranial robusticity over the last 100,000 years (Ruff et al. 1993; Lahr and Wright 1996; Leach 2003; Bednarik 2012). They have also seen a significant reduction in tooth size (Brace et al. 1987), and in the occurrence of tooth-crowding and malocclusion (Larsson et al. 2005; Leach 2003). Compared both to extant great apes and to ancestral human species, modern humans exhibit a significant retention of juvenile characteristics into adulthood (Gould 1977; Shea 1989; Zollikofer and Ponce de León 2010). In recent years the evidence of neoteny in modern humans has expanded to include aspects of gene expression in the brain (Somel et al. 2009, 2012; Liu et al. 2012), and the timing of synaptogenesis (Bufill et al. 2011). Modern humans also exhibit very low levels of sexual dimorphism compared both to other apes (Plavcan 2012) and ancestral species of hominids (Harmon 2006; Gordon et al. 2008; Kimbel and Delezene 2009). Unlike other great ape species, human females do not have distinct ‘breeding seasons’, and thus exhibit a form of ‘extended sexuality’ (Rodríguez-Gironés and Enquist 2001), notwithstanding differences in fertility and preferences across the oestrus cycle (Gangestad and Thornhill 2008). There are also early signs that humans may differ in temperament from the other great apes, in ways similar to domesticated species (Herrmann et al. 2011). Finally, it seems that this suite of changes is linked, representing the systemic impact of an underlying mechanism (Trut et al. 2009; Bidau 2009; Wilkins et al. 2014). Evidence is now emerging for similar links between features such as cranial robusticity, temperament, and neoteny in humans (e.g. Cieri et al. 2014).

Documenting the full range of these parallels, together with the nuances of the arguments over the validity of each one, is beyond the scope of what we can manage here, and the interested reader is referred to the references cited above, particularly Leach (2003, 2007), together with the much fuller version of this discussion in Thomas (2013). We should also mention the one aspect of the domestic phenotype which humans certainly do not parallel: a reduction in brain size. Rather than see brain size reducing, the direction of human evolution has been towards an increase in brain size (Rightmire 2004), with any trends in the opposite direction linked to a concomitant reduction in body size (Ruff et al. 1993). It may be that this is one trait where the difference between domestication and self-domestication is actually important, with humans, as both constructors and inhabitants of their environment, not subject to the same reduction in stimulation and opportunities for sensory exploration (Price 2002) experienced by other species living in that environment (Leach 2003).

How might humans have come to share in the domestic phenotype?

The fact that humans exhibit many aspects of the domestic phenotype is the primary reason why the idea of human self-domestication should be taken seriously. However, this still leaves open the question of how these parallels might have occurred. In this last section we provide a brief tour of several areas of research aiming to address this question. We first consider aspects of the selective environment that might account for these parallels, focusing on the role of adaptation to the human-made environment and selection against aggression. We then review some evidence regarding the biological mechanisms underpinning the domestic phenotype.

The selective environment of domestication

As discussed above, many aspects of the domestic phenotype are linked to ongoing natural selection in the human-made environment, with the dramatic changes in living space, food availability and type, microclimate, elemental shelter, etc. that such an environment introduces. What is less commonly recognised, however, is that it is humans themselves, as nature’s quintessential niche constructors (Odling-Smee et al. 2003), who have likely been affected most by this environment, given that they have lived in it longest of all. Indeed, as Leach (2003) notes, many of the explanations for the domestication-typical changes seen in human beings, particularly in the last 50,000 years or so, point to aspects of this human-made environment, such as increasing sedentism and associated reductions in activity (Ruff et al. 1993), changes in climate and microclimate (Pearson 2000), and dietary shifts (Cohen and Armelagos 1984; Lieberman 1996). Similar changes in response to the human-made environment have also been observed in commensals—species who live with us but are not controlled by us—such as the house mouse (Tchernov 1984), and in the ‘inadvertent domestication’ observed in captive breeding programs for endangered species (O’Regan and Kitchener 2005). Once it is recognised that many aspects of the domestic phenotype are associated with the adaptation to a human-made environment, the idea that humans might share those ‘domesticated’ traits comes to be much easier to understand.

The second key factor is the role played by selection against aggression. One of the most important contributions of the farm fox experiment to our understanding of domestication is the extent to which the domestic phenotype can emerge through a ‘correlated cascade’ of changes following selection on temperament. One question that arises from this is whether there are any examples of a similar set of changes following natural selection in the wild. Hare et al. (2012) present a range of evidence suggesting that the bonobo is just such a case. Bonobos differ from chimpanzees along a number of physical (Cramer 1977; Zihlman and Cramer 1978; Pilbrow 2006), behavioural and temperamental (Hare et al. 2007; Hare and Kwetuenda 2010) dimensions that closely parallel the differences between wild and domesticated species. Hare et al. (2012) argue that these differences are ultimately rooted in aspects of the bonobo’s feeding ecology, which have had profound implications for the structuring of bonobo society, especially the favouring of greater co-operation and reduced levels of aggression. The bonobo, then, may be a wild analogue to the proof-of-concept findings of the farm-fox experiment. Furthermore, it may also serve as something of a template for how selection against aggression could be linked to the domestic phenotype in humans. In particular, a growing body of work is now citing changes in human feeding ecology, primarily our shifting to a cooked and processed diet (Wrangham et al. 1999; Wrangham and Conklin-Brittain 2003; Wrangham 2009), as a potential source of similar selective pressure in favour of co-operation and reduced aggression. This possibility is clearly more speculative than the impact of the human-made environment. However, in the farm-fox experiment we have confirmation that this kind of selective regime can result in the domestic phenotype, and in the bonobo we have a close relative, for which there is good evidence that a similar process, this time of natural selection, has had a similar phenotypic outcome.

The physical mechanisms underpinning domestication

We now turn to the mechanisms underpinning the domestic phenotype. However, before our brief review of work in this area, it is worth saying something about the criteria such a mechanism has to meet. The domestic phenotype has two key features: the range of species in which it has been observed, and the seemingly disparate set of traits of which it is comprised. To account for the domestic phenotype, therefore, any proposed mechanism must be both highly conserved across species and capable of explaining how such an apparently unconnected set of traits so frequently occur together. Follow-up studies on the mechanisms at work in the farm fox experiment have identified changes in the domesticated foxes’ neuroendocrine system as being of fundamental importance (Trut et al. 2009). In particular, these changes include a reduction in the production of glucocorticoids and other stress hormones, together with changes in the levels of neurotransmitters such as serotonin. The importance of the role played by the neuroendocrine system is also supported by work with the Bengalese finch (Suzuki et al. 2011, 2012), bonobos (Surbeck et al. 2012a, b), and domesticated species more broadly (Price 2002).

This neuroendocrinal mechanism meets one of the two criteria: the systems involved are highly conserved across species (Bidau 2009). However, as others have noted (e.g. Wilkins et al. 2014), it does less well against the second criterion: it is unclear how such neuroendocrinal changes account for the diverse range of traits that comprise the domestic phenotype. Wilkins et al. (2014) argue that this diverse set of traits, including the neuroendocrinal changes, is linked by shifts in the development, migration, and interaction of Neural Crest Cells (NCC), a vertebrate-specific class of developmentally important stem cells. They review a wide range of clinical and experimental work which shows similarities between aspects of the domestic phenotype and the effects of genetic disorders, so-called neurocristopathies, that affect the generation and function of NCC. Importantly, Wilkins et al. distinguish between NCC as the shared developmental basis linking the various traits of the domestic phenotype and their emergence over ontogeny, and the polygenic nature of the underlying genetic explanation. This allows them to present a unified account of the diverse traits of the domestic phenotype without needing to talk in overly simplistic terms of ‘domestication genes’.

More recently it has been suggested that changes in the development and regulation of NCC are linked not just to the domestic phenotype but also to the structural ‘language readiness’ of the human brain (Benítez-Burraco et al. 2016). In particular, Benítez-Burraco et al. argue for a link between changes to the NCC and the development of the human-typical ‘globular’ brain shape. This builds on previous work in which they have argued that the distinctive globular shape of the human brain is linked to key features of its modern-day patterns of neural connectivity (Boeckx and Benítez-Burraco 2014a, b), which in turn facilitate what they term ‘cross-modular’ thinking. In linguistic terms, this is exemplified by something like the syntax-semantics interface. In more general terms, it relates to the capacity to make links across cognitive domains, something which may be core to the uniqueness of modern human cognition (e.g. Mithen 1996; Hauser 2009). This work is obviously in its very early stages, but is particularly intriguing regarding the parallels it offers with our work on the mechanism and necessary biological foundations for the cultural evolution of language.

Why has domestication had this effect on humans and not other species?

If domestication set the stage for the cultural evolution of language, it is quite reasonable to ask why language itself is not part of the domestic phenotype. Why is something ‘language like’ not seen in other domesticated species? Focusing just on our two central case studies, why do we only see one of the two precursor traits in each instance, and yet humans exhibit both together? These are difficult questions, for which we do not pretend to have definitive answers. However, we think the following points are worth taking into consideration.

Much as we have focused on their similarities, there is also a need to acknowledge the differences between domesticated species. One important way in which they differ is the ‘pathway’ they take towards domestication. Some species, like cattle and sheep, are former prey animals that have been slowly corralled into our system of agriculture. Others, like dogs, began the process as freely associating commensals. In the human case, the process of domestication was one of self-domestication. We have already discussed the potential consequence of this fact in terms of human brain size increasing, rather than the typical domesticated pattern of reducing brain size. We are highly sceptical of any attempt to draw direct links between brain size and particular capabilities. However, it is at least plausible that this increase in brain size is one contributing factor to the emergence of language (see MacWhinney 2005).

Another way in which domesticated species differ is in terms of their evolutionary history prior to domestication. The evolutionary histories of many lineages have rendered them unamenable to domestication at all (Diamond 1997). Furthermore, if there is a key commonality between the Bengalese finch and the domestic dog, it is that domestication has acted to unleash ‘potentials’ that were already there in the ancestral population. The white-rumped munia is a vocal learner, but freed from selection to keep songs simple and canalized, the role of vocal learning expanded. The grey wolf exhibits sophisticated social cognition, and can reach dog-like levels of performance given extensive contact with humans and repeated exposure to the object-choice task, but does not seem to learn new signals. In the human case, might the combination of primate social cognition with, at least in the gestural realm, the capacity of primates to learn new signals, explain why both precursors emerged together?

We recognise that these brief thoughts can barely begin to address this question. However, we think there are ways in which experimental work could be done in this area. As noted above, many of the more particular impacts of domestication take the form of unleashed potentials: more precisely, potentials that have thus far been limited by aspects of temperament. It should be possible to identify what these might be in particular instances. For example, Melis et al. (2006) found that chimpanzees who were seemingly unable to solve a co-operative dyadic task could do so if dyad-pairing was manipulated such that individuals with mutually high tolerance were paired together. The poor performance of chimpanzees, relative to dogs, on tasks of co-operative communication stands, then, as something that might be remedied through a change in chimpanzee temperament.

Summary

We have not attempted to present a comprehensive overview of human self-domestication. Instead, we have focused on the more modest task of trying to close the perceived ‘gap’ between the two sets of data that form the core of this paper: the preconditions required for a structure-creating process of cultural transmission, and the two case studies of domestication in which parallels to those preconditions can be seen emerging. We hope we have helped close it somewhat in the following three ways. First, in focusing on the domestic phenotype we aim to root the idea of humans as domesticates in a concrete, coherent, and falsifiable framework. The focus on a particular set of traits, the domestic phenotype, and the evolutionary explanations for those traits allows us to move beyond metaphorical formulations of what it means to be ‘self-domesticated’. Second, we have identified two evolutionary circumstances—adaptation to the human-made environment and selection on temperament—that are known to contribute to the emergence of the domestic phenotype in other species. The first of these has definitely been a major factor in human evolution; the role of the second, while more speculative, is supported by a range of comparative and archaeological evidence. Finally, we have reviewed a range of work on the biological mechanisms underpinning domestication. These mechanisms are highly conserved—and thus present in a wide range of species, including humans—and can account for the diverse traits of the domestic phenotype. We also touched on some recent work suggestive of a link between the mechanisms mediating the domestic phenotype and language itself.

Conclusion

There is now a wealth of evidence showing how language structure emerges through a process of cultural evolution. However, the wider implications of this work have received insufficient attention. In particular, our growing knowledge of the role played by cultural evolution has significant implications for what we should expect biological evolution to account for in the emergence of language. Rather than accounting for language structure itself, the key task for biological evolution lies in accounting for the foundational traits that make a process of structure-creating cultural evolution possible. We identified two key traits: the central role of learning in the transmission of the communication system; and the ability to recognise the communicative intent of a signal or action.

In the Bengalese finch and the domestic dog we have two comparative case studies, each of which shows one of these traits emerging in the context of domestication. Two key features of the domestication process stand out as particularly important in accounting for these instances of parallel evolution. The first concerns the relaxation of various selection pressures that had been important in the wild. The second concerns the systemic impact of selection acting on the biological systems underpinning temperament and aggression.

Humans share many of the hallmarks of a domesticated species. Much of human evolution has taken place in just the kind of human-made, selection-buffering environment shared by domesticated species. There is also good evidence that humans may have undergone a similar kind of selection on temperament. Given these parallels, we think the two case studies speak directly to the origin of these precursor traits in humans. The cultural evolution of language structure is rooted in an earlier process of self-domestication.

References

Agnetta B, Hare B, Tomasello M (2000) Cues to food location that domestic dogs (Canis familiaris) of different ages do and do not use. Anim Cogn 3:107–112
Ashley Montagu MF (1955) Time, morphology, and neoteny in the evolution of man. Am Anthropol 57:13–27
Bao C, Zeng L, Zuo M (2003) The impact of deafness to the survival of the newborn cells in the brain of juvenile white-rumped munia, Lonchura striata. Zool Sci 20:1079–1085
Baptista LF, Schuchmann K-L (1990) Song learning in the Anna hummingbird (Calypte anna). Ethology 84:15–26
Bednarik RG (2012) The origins of human modernity. Humanities 1:1–53
Beecher MD, Brenowitz EA (2005) Functional aspects of song learning in songbirds. Trends Ecol Evol 20:143–149
Beecher MD, Burt JM (2004) The role of social interaction in bird song learning. Curr Dir Psychol Sci 13:224–228
Beecher MD, George FK, Le Michel M, Richard FT (2010) Birdsong and vocal learning during development. In: Koob GF, Le Moal M, Thompson RF (eds) Encyclopedia of behavioral neuroscience, vol 1. Academic Press, Cambridge, pp 164–168
Belyaev DK (1979) Destabilizing selection as a factor in domestication. J Hered 70:301–308
Benítez-Burraco A, Theofanopoulou C, Boeckx C (2016) Globularization and domestication. Topoi. https://doi.org/10.1007/s11245-016-9399-7
Berwick RC, Okanoya K, Beckers GJL, Bolhuis JJ (2011) Songs to syntax: the linguistics of birdsong. Trends Cogn Sci 15:113–121
Bidau CJ (2009) Domestication through the centuries: Darwin’s ideas and Dmitry Belyaev’s long-term experiment in silver foxes. Gayana 73:55–72
Boas F (1938) The mind of primitive man. Macmillan, New York, pp 122–144
Boeckx CA, Benítez-Burraco A (2014a) Globularity and language-readiness: generating new predictions by expanding the set of genes of interest. Front Psychol 5:1324
Boeckx CA, Benítez-Burraco A (2014b) The shape of the human language-ready brain. Front Psychol 5:282
Boughman JW (1998) Vocal learning by greater spear-nosed bats. Proc R Soc Lond B Biol Sci 265:227–233
Brace CL, Rosenberg KR, Hunt KD (1987) Gradual change in human tooth size in the late Pleistocene and post-Pleistocene. Evolution 41:705–720
Brighton H (2003) Simplicity as a driving force in linguistic evolution. The University of Edinburgh, Edinburgh
Brighton H, Smith K, Kirby S (2005) Language as an evolutionary system. Phys Life Rev 2:177–226
Brüne M (2007) On human self-domestication, psychiatry, and eugenics. Philos Ethics Humanit Med 2:21
Buchanan KL, Leitner S, Spencer KA et al (2004) Developmental stress selectively affects the song control nucleus HVC in the zebra finch. Proc R Soc Lond B Biol Sci 271:2381–2386
Bufill E, Agustí J, Blesa R (2011) Human neoteny revisited: the case of synaptic plasticity. Am J Hum Biol 23:729–739
Campler M, Jöngren M, Jensen P (2009) Fearfulness in red junglefowl and domesticated White Leghorn chickens. Behav Proc 81:39–43
Carlstead K (1996) Effects of captivity on the behavior of wild mammals. In: Kleiman DG, Allen ME, Thompson KV, Lumpkin S (eds) Wild animals in captivity: principles and techniques. University of Chicago Press, Chicago, pp 317–333
Christiansen MH, Dale RAC, Ellefson MR, Conway CM (2002) The role of sequential learning in language evolution: computational and experimental studies. In: Cangelosi A, Parisi D (eds) Simulating the evolution of language. Springer, London, pp 165–187
Cieri RL, Churchill SE, Franciscus RG et al (2014) Craniofacial feminization, social tolerance, and the origins of behavioral modernity. Curr Anthropol 55:419–443
Clark HH (1996) Using language. Cambridge University Press, Cambridge
Clutton-Brock J (1992) How the wild beasts were tamed. New Sci 133:41
Clutton-Brock J (1999) A natural history of domesticated mammals. Cambridge University Press, Cambridge
Cohen MN, Armelagos GJ (1984) Paleopathology at the origins of agriculture. Academic Press, Cambridge
Corballis MC (2002) From hand to mouth: the origins of language. Princeton University Press, Princeton
Cramer DL (1977) Craniofacial morphology of Pan paniscus. A morphometric and evolutionary appraisal. Contrib Primatol 10:1–64
Darwin C (1868) Variation of plants and animals under domestication. John Murray, London
Darwin C (1871) The descent of man, and selection in relation to sex. John Murray, London
Deacon TW (1997) The symbolic species: the co-evolution of language and the brain. WW Norton & Company, New York
Deacon TW (2003) Multilevel selection in a complex adaptive system: the problem of language origins. In: Weber BH, Depew DJ (eds) Evolution and learning: the Baldwin effect reconsidered. MIT Press, Cambridge, pp 81–106
Deacon TW (2009) Relaxed selection and the role of epigenesis in the evolution of language. In: Blumberg MS, Freeman JH, Robinson SR (eds) Oxford handbook of developmental behavioral neuroscience. Oxford University Press, Oxford, pp 730–752
Deacon TW (2010) A role for relaxed selection in the evolution of the language capacity. Proc Natl Acad Sci 107:9000–9006
Dediu D, Cysouw M, Levinson SC et al (2013) Cultural evolution of language. In: Richerson PJ, Christiansen MH (eds) Cultural evolution: society, technology, language, and religion. MIT Press, Cambridge, pp 303–332
Diamond JM (1997) Guns, germs, and steel: a short history of everybody for the last 13,000 years. Jonathan Cape, London
Eales LA (1989) The influences of visual and vocal interaction on song learning in zebra finches. Anim Behav 37:507–508
Fitch WT (2005) The evolution of language: a comparative review. Biol Philos 20:193–203
Gangestad SW, Thornhill R (2008) Human oestrus. Proc R Soc Lond B Biol Sci 275:991–1000
Giret N, Miklósi Á, Kreutzer M, Bovet D (2009) Use of experimenter-given cues by African gray parrots (Psittacus erithacus). Anim Cogn 12:1–10
Gómez J-C (2005) Species comparative studies and cognitive development. Trends Cogn Sci 9:118–125
Goodwin BC (1994) How the leopard changed its spots: the evolution of complexity. Princeton University Press, Princeton
Gordon AD, Green DJ, Richmond BG (2008) Strong postcranial size dimorphism in Australopithecus afarensis: results from two new resampling methods for multivariate data sets with missing data. Am J Phys Anthropol 135:311–328
Gould SJ (1976) In defense of the analog: a commentary to N. Hotton. In: Masterson RB, Hodos W, Jerison H (eds) Evolution, brain and behavior: persistent problems. Lawrence Erlbaum Associates, New Jersey, pp 175–179
Gould SJ (1977) Ontogeny and phylogeny. Belknap Press, Cambridge
Griffiths TL, Kalish ML (2007) Language evolution by iterated learning with Bayesian agents. Cogn Sci 31:441–480
Håkansson J, Jensen P (2008) A longitudinal study of antipredator behaviour in four successive generations of two populations of captive red junglefowl. Appl Anim Behav Sci 114:409–418
Hare B, Kwetuenda S (2010) Bonobos voluntarily share their own food with others. Curr Biol 20:R230–R231
Hare B, Tomasello M (2004) Chimpanzees are more skilful in competitive than in cooperative cognitive tasks. Anim Behav 68:571–581
Hare B, Tomasello M (2005) Human-like social skills in dogs? Trends Cogn Sci 9:439–444
Hare B, Call J, Tomasello M (1998) Communication of food location between human and dog (Canis familiaris). Evol Commun 2:137–159
Hare B, Brown M, Williamson C, Tomasello M (2002) The domestication of social cognition in dogs. Science 298(5598):1634–1636
Hare B, Plyusnina I, Ignacio N et al (2005) Social cognitive evolution in captive foxes is a correlated by-product of experimental domestication. Curr Biol 15:226–230
Hare B, Melis AP, Woods V et al (2007) Tolerance allows bonobos to outperform chimpanzees on a cooperative task. Curr Biol 17:619–623
Hare B, Wobber V, Wrangham R (2012) The self-domestication hypothesis: evolution of bonobo psychology is due to selection against aggression. Anim Behav 83:573–585
Harmon EH (2006) Size and shape variation in Australopithecus afarensis proximal femora. J Hum Evol 51:217–227
Hauser MD (2009) The possibility of impossible cultures. Nature 460:190–196
Herrmann E, Melis AP, Tomasello M (2006) Apes’ use of iconic cues in the object-choice task. Anim Cogn 9:118–130
Herrmann E, Hare B, Cissewski J, Tomasello M (2011) A comparison of temperament in nonhuman apes and human infants. Dev Sci 14:1393–1405
Hobaiter C, Byrne RW (2011) The gestural repertoire of the wild chimpanzee. Anim Cogn 14:745–767
Houde ALS, Fraser DJ, Hutchings JA (2010) Reduced anti-predator responses in multi-generational hybrids of farmed and wild Atlantic salmon (Salmo salar L.). Conserv Genet 11:785–794
Janik VM, Slater PJB (2000) The different roles of social learning in vocal communication. Anim Behav 60:1–11
Kagawa H, Yamada H, Lin R et al (2012) Ecological correlates of song complexity in white-rumped munias: the implication of relaxation of selection as a cause for signal variation in birdsong. Interact Stud 13:263–284
Kaminski J, Riedel J, Call J, Tomasello M (2005) Domestic goats, Capra hircus, follow gaze direction and use social cues in an object choice task. Anim Behav 69:11–18
Kaminski J, Tempelmann S, Call J, Tomasello M (2009) Domestic dogs comprehend human communication with iconic signs. Dev Sci 12:831–837
Kaminski J, Schulz L, Tomasello M (2012) How dogs know when communication is intended for them. Dev Sci 15:222–232
Kauffman SA (1993) The origins of order: self organization and selection in evolution. Oxford University Press, Oxford
Kemp C, Regier T (2012) Kinship categories across languages reflect general communicative principles. Science 336:1049–1054
Kimbel WH, Delezene LK (2009) “Lucy” redux: a review of research on Australopithecus afarensis. Am J Phys Anthropol 140:2–48
Kirby S (2002) Learning, bottlenecks and the evolution of recursive syntax. In: Briscoe EJ (ed) Linguistic evolution through language acquisition: formal and computational models. Cambridge University Press, Cambridge, pp 173–204
Kirby S (2017) Culture and biology in the origins of linguistic structure. Psychon Bull Rev 24(1):118–137
Kirby S, Dowman M, Griffiths TL (2007) Innateness and culture in the evolution of language. Proc Natl Acad Sci 104:5241–5245
Kirby S, Cornish H, Smith K (2008) Cumulative cultural evolution in the laboratory: an experimental approach to the origins of structure in human language. Proc Natl Acad Sci 105:10681–10686
Kirby S, Griffiths T, Smith K (2014) Iterated learning and the evolution of language. Curr Opin Neurobiol 28:108–114
Kirby S, Tamariz M, Cornish H, Smith K (2015) Compression and communication in the cultural evolution of linguistic structure. Cognition 141:87–102
Kirchhofer KC, Zimmermann F, Kaminski J, Tomasello M (2012) Dogs (Canis familiaris), but not chimpanzees (Pan troglodytes), understand imperative pointing. PLoS One 7:e30913
Kroodsma DE, Houlihan PW, Fallon PA, Wells JA (1997) Song development by grey catbirds. Anim Behav 54:457
Kruska DCT (2005) On the evolutionary significance of encephalization in some eutherian mammals: effects of adaptive radiation, domestication, and feralization. Brain Behav Evol 65:73–108
Kukekova AV, Acland GM, Oskina IN et al (2006) The genetics of domesticated behavior in canids: what can dogs and silver foxes tell us about each other? In: Ostrander EA, Giger U, Lindblad-Toh K (eds) The dog and its genome. Cold Spring Harbor Laboratory Press, New York, pp 515–537
Kukekova AV, Trut LN, Chase K et al (2008) Measurement of segregating behaviors in experimental silver fox pedigrees. Behav Genet 38:185–194
Kukekova AV, Temnykh SV, Johnson JL et al (2012) Genetics of behavior in the silver fox. Mamm Genome 23:164–177
Lahr MM, Wright RVS (1996) The question of robusticity and the relationship between cranial size and shape in Homo sapiens. J Hum Evol 31:157–191
Larsson E, Øgaard B, Lindsten R et al (2005) Craniofacial and dentofacial development in pigs fed soft and hard diets. Am J Orthod Dentofac Orthop 128:731–739
Leach HM (2003) Human domestication reconsidered. Curr Anthropol 44:349–368
Leach HM (2007) Selection and the unforeseen consequences of domestication. In: Cassidy R, Mullin M (eds) Where the wild things are now: domestication reconsidered. Berg, Oxford, pp 71–100
Leavens DA, Hopkins WD, Bard KA (2005) Understanding the point of chimpanzee pointing: epigenesis and ecological validity. Curr Dir Psychol Sci 14:185–189
Leitner S, Nicholson J, Leisler B et al (2002) Song and the song control pathway in the brain can develop independently of exposure to song in the sedge warbler. Proc R Soc Lond B Biol Sci 269:2519–2524
Lieberman DE (1996) How and why humans grow thin skulls: experimental evidence for systemic cortical robusticity. Am J Phys Anthropol 101:217–236
Liu X, Somel M, Tang L et al (2012) Extension of cortical synaptic development distinguishes humans from chimpanzees and macaques. Genome Res 22:611–622
Lyn H (2007) Mental representation of symbols as revealed by vocabulary errors in two bonobos (Pan paniscus). Anim Cogn 10:461–475
MacWhinney B (2005) Language evolution and human development. In: Bjorklund D, Pellegrini A (eds) Origins of the social mind: evolutionary psychology and child development. Guilford Press, New York, pp 383–410
Markman EM (1994) Constraints on word meaning in early language acquisition. Lingua 92:199–227
Melis AP, Hare B, Tomasello M (2006) Engineering cooperation in chimpanzees: tolerance constraints on cooperation. Anim Behav 72:275–286
Mesoudi A, Whiten A (2008) The multiple roles of cultural transmission experiments in understanding human cultural evolution. Philos Trans R Soc Lond B Biol Sci 363:3489–3501
Miklósi Á (2007) Dog behaviour, evolution, and cognition. Oxford University Press, Oxford
Miklósi Á, Soproni K (2006) A comparative analysis of animals’ understanding of the human pointing gesture. Anim Cogn 9:81–93
Miklósi Á, Topál J (2011) On the hunt for the gene of perspective taking: pitfalls in methodology. Learn Behav 39:310–313
Miklósi Á, Kubinyi E, Topál J et al (2003) A simple reason for a big difference: wolves do not look back at humans, but dogs do. Curr Biol 13:763–766
Mithen S (1996) The prehistory of the mind: a search for the origins of art, science and religion. Thames and Hudson, London
Morey DF (1994) The early evolution of the domestic dog. Am Sci 82:336
Mulcahy NJ, Call J (2009) The performance of bonobos (Pan paniscus), chimpanzees (Pan troglodytes), and orangutans (Pongo pygmaeus) in two versions of an object-choice task. J Comp Psychol 123:304–309
Mulcahy NJ, Hedge V (2012) Are great apes tested with an abject object-choice task? Anim Behav 83:313–321
Nottebohm F, Liu W-C (2010) The origins of vocal learning: new sounds, new circuits, new cells. Brain Lang 115:3–17
Nowicki S, Peters S, Podos J (1998) Song learning, early nutrition and sexual selection in songbirds. Am Zool 38:179–190
O’Regan HJ, Kitchener AC (2005) The effects of captivity on the morphology of captive, domesticated and feral mammals. Mamm Rev 35:215–230
Odling-Smee FJ, Laland KN, Feldman MW (2003) Niche construction: the neglected process in evolution. Princeton University Press, Princeton
Okanoya K (2002) Sexual display as a syntactical vehicle: the evolution of syntax in birdsong and human language through sexual selection. In: Wray A (ed) The transition to language. Oxford University Press, Oxford, pp 46–63
Okanoya K (2004) The Bengalese finch: a window on the behavioral neurobiology of birdsong syntax. Ann N Y Acad Sci 1016:724–735
Okanoya K (2012) Behavioural factors governing song complexity in Bengalese finches. Int J Comp Psychol 25:44–59
Okanoya K, Yamaguchi A (1997) Adult Bengalese finches (Lonchura striata var. domestica) require real-time auditory feedback to produce normal song syntax. J Neurobiol 33:343–356
Oliphant M (2002) Learned systems of arbitrary reference: the foundation of human linguistic uniqueness. In: Briscoe EJ (ed) Linguistic evolution through language acquisition: formal and computational models. Cambridge University Press, Cambridge, pp 23–52
Oudeyer P-Y (2005) The self-organization of combinatoriality and phonotactics in vocalization systems. Connect Sci 17:325–341
Oudeyer P-Y (2006) Self-organization in the evolution of speech. Oxford University Press, Oxford
Pack AA, Herman LM (2004) Bottlenosed dolphins (Tursiops truncatus) comprehend the referent of both static and dynamic human gazing and pointing in an object-choice task. J Comp Psychol 118:160–171
Pack AA, Herman LM (2006) Dolphin social cognition and joint attention: our current understanding. Aquat Mamm 32:443–460
Pack AA, Herman LM (2007) The dolphin’s (Tursiops truncatus) understanding of human gazing and pointing: knowing what and where. J Comp Psychol 121:34–45
Pearson OM (2000) Activity, climate, and postcranial robusticity. Curr Anthropol 41:569–607
Peng Z, Zhang X, Xi C et al (2012) Changes in ultra-structures and electrophysiological properties in HVC of untutored and deafened Bengalese finches relation to normally reared birds: implications for song learning. Brain Res Bull 89:211–222
Pepperberg IM (2010) Vocal learning in Grey parrots: a brief review of perception, production, and cross-species comparisons. Brain Lang 115:81–91
Pettersson H, Kaminski J, Herrmann E, Tomasello M (2011) Understanding of human communicative motives in domestic dogs. Appl Anim Behav Sci 133:235–245
Pilbrow V (2006) Population systematics of chimpanzees using molar morphometrics. J Hum Evol 51:646–662
Pinker S, Bloom P (1990) Natural language and natural selection. Behav Brain Sci 13:707–727
Pinker S, Jackendoff R (2005) The faculty of language: what’s special about it? Cognition 95:201–236
Plavcan JM (2012) Sexual size dimorphism, canine dimorphism, and male-male competition in primates. Hum Nat 23:45–67
Polák J, Frynta D (2009) Sexual size dimorphism in domestic goats, sheep, and their wild relatives. Biol J Linn Soc 98:872–883
Polák J, Frynta D (2010) Patterns of sexual size dimorphism in cattle breeds support Rensch’s rule. Evol Ecol 24:1255–1266
Pollick AS, De Waal FBM (2007) Ape gestures and language evolution. Proc Natl Acad Sci 104:8184–8189
Poole JH, Tyack PL, Stoeger-Horwath AS, Watwood S (2005) Animal behaviour: elephants are capable of vocal learning. Nature 434:455–456
Price EO (1984) Behavioral aspects of animal domestication. Q Rev Biol 59:1–32
Price EO (1999) Behavioral development in animals undergoing domestication. Appl Anim Behav Sci 65:245–271
Price EO (2002) Animal domestication and behavior. Cabi, Wallingford
Price EO, King JA (1968) Domestication and adaptation. In: Hafez ESE (ed) Adaptation of domestic animals. Lea & Febiger, Philadelphia, pp 34–45
Proops L, Walton M, McComb K (2010) The use of human-given cues by domestic horses, Equus caballus, during an object choice task. Anim Behav 79:1205–1209
Ralls K, Fiorelli P, Gish S (1985) Vocalizations and vocal mimicry in captive harbor seals, Phoca vitulina. Can J Zool 63:1050–1056
Reber AS (1967) Implicit learning of artificial grammars. J Verbal Learn Verbal Behav 6:855–863
Reiss D, McCowan B (1993) Spontaneous vocal mimicry and production by bottlenose dolphins (Tursiops truncatus): evidence for vocal learning. J Comp Psychol 107:301–312
Rendell L, Whitehead H (2001) Culture in whales and dolphins. Behav Brain Sci 24:309–324
Riedel J, Schumann K, Kaminski J et al (2008) The early ontogeny of human–dog communication. Anim Behav 75:1003–1014
Rightmire GP (2004) Brain size and encephalization in early to mid-Pleistocene Homo. Am J Phys Anthropol 124:109–123
Ritchie GRS, Kirby S (2007) A possible role for selective masking in the evolution of complex, learned communication systems. In: Lyon C, Nehaniv C, Cangelosi A (eds) Emergence of communication and language. Springer, Berlin, pp 387–401
Ritchie GRS, Kirby S, Hawkey DJC (2008) Song learning as an indicator mechanism: modelling the developmental stress hypothesis. J Theor Biol 251:570–583
Roberts G, Galantucci B (2012) The emergence of duality of patterning: insights from the laboratory. Lang Cogn 4:297–318
Rodríguez-Gironés MA, Enquist M (2001) The evolution of female sexuality. Anim Behav 61:695–704
Ruff CB, Trinkaus E, Walker A, Larsen CS (1993) Postcranial robusticity in Homo. I: temporal trends and mechanical interpretation. Am J Phys Anthropol 91:21–53
Saffran JR (2003) Statistical language learning mechanisms and constraints. Curr Dir Psychol Sci 12:110–114
Samuelson LK, Smith LB (1998) Memory and attention make smart word learning: an alternative account of Akhtar, Carpenter, and Tomasello. Child Dev 69:94–104
Savage-Rumbaugh S, McDonald K, Sevcik RA et al (1986) Spontaneous symbol acquisition and communicative use by pygmy chimpanzees (Pan paniscus). J Exp Psychol Gen 115:211
Savage-Rumbaugh S, Shanker SG, Taylor TJ (1998) Apes, language, and the human mind. Oxford University Press, Oxford
Savage-Rumbaugh S, Fields WM, Segerdahl P, Rumbaugh D (2005) Culture prefigures cognition in Pan/Homo bonobos. Theoria 20:311–328
Scheumann M, Call J (2004) The use of experimenter-given cues by South African fur seals (Arctocephalus pusillus). Anim Cogn 7:224–230
Schloegl C, Kotrschal K, Bugnyar T (2008) Do common ravens (Corvus corax) rely on human or conspecific gaze cues to detect hidden food? Anim Cogn 11:231–241
Scott-Phillips TC, Kirby S (2010) Language evolution in the laboratory. Trends Cogn Sci 14:411–417
Scott-Phillips TC, Kirby S, Ritchie GRS (2009) Signalling signalhood and the emergence of communication. Cognition 113:226–233
Seyfarth RM, Cheney DL (2010) Production, usage, and comprehension in animal vocalizations. Brain Lang 115:92–100
Shapiro AD, Janik VM, Slater PJB (2003) A gray seal’s (Halichoerus grypus) responses to experimenter-given pointing and directional cues. J Comp Psychol 117:355–362
Shea BT (1989) Heterochrony in human evolution: the case for neoteny reconsidered. Am J Phys Anthropol 32:69–101
Smith AD (2014) Models of language evolution and change. Wiley Interdiscip Rev Cogn Sci 5:281–293
Smith K, Kirby S (2008) Cultural evolution: implications for understanding the human language faculty and its evolution. Philos Trans R Soc B Biol Sci 363:3591–3603
Smith BP, Litchfield CA (2010) Dingoes (Canis dingo) can use human social cues to locate hidden food. Anim Cogn 13:367–376
Smith K, Wonnacott E (2010) Eliminating unpredictable variation through iterated learning. Cognition 116:444–449
Smith K, Smith ADM, Blythe RA (2011) Cross-situational learning: an experimental study of word-learning mechanisms. Cogn Sci 35:480–498
Soma MF (2011) Social factors in song learning: a review of Estrildid finch research. Ornithol Sci 10:89–100
Soma M, Takahasi M, Hasegawa T, Okanoya K (2006) Trade-offs and correlations among multiple song features in the Bengalese Finch. Ornithol Sci 5:77–84
Somel M, Franz H, Yan Z et al (2009) Transcriptional neoteny in the human brain. Proc Natl Acad Sci 106:5743–5748
Somel M, Tang L, Khaitovich P (2012) The role of neoteny in human evolution: from genes to the phenotype. In: Hirai H, Imai H, Go Y (eds) Post-genome biology of primates. Springer, Berlin, pp 23–41
Soproni K, Miklósi Á, Topál J, Csányi V (2001) Comprehension of human communicative signs in pet dogs (Canis familiaris). J Comp Psychol 115:122–126
Spencer KA, Buchanan KL, Goldsmith AR, Catchpole CK (2003) Song as an honest signal of developmental stress in the zebra finch (Taeniopygia guttata). Horm Behav 44:132–139
Sperber D, Wilson D (1995) Relevance: Communication and cognition. Wiley-Blackwell
Stewart I (1998) Life’s other secret: the new mathematics of the living world. Penguin/Wiley, New York
Surbeck M, Deschner T, Schubert G et al (2012a) Mate competition, testosterone and intersexual relationships in bonobos, Pan paniscus. Anim Behav 83:659–669
Surbeck M, Deschner T, Weltring A, Hohmann G (2012b) Social correlates of variation in urinary cortisol in wild male bonobos (Pan paniscus). Horm Behav 62:27–35
Suzuki K, Matsunaga E, Kobayashi T, Okanoya K (2011) Expression patterns of mineralocorticoid and glucocorticoid receptors in Bengalese finch (Lonchura striata var. domestica) brain suggest a relationship between stress hormones and song-system development. Neuroscience 194:72–83
Suzuki K, Yamada H, Kobayashi T, Okanoya K (2012) Decreased fecal corticosterone levels due to domestication: a comparison between the white-backed Munia (Lonchura striata) and its domesticated strain, the Bengalese finch (Lonchura striata var. domestica) with a suggestion for complex song evolution. J Exp Zool Part A Ecol Genet Physiol 317:561–570
Svanberg I (2008) Towards a cultural history of the Bengalese Finch (Lonchura domestica). Der Zool Gart 77:334–344
Takahasi M, Okanoya K (2010) Song learning in wild and domesticated strains of white-rumped munia, Lonchura striata, compared by cross-fostering procedures: domestication increases song variability by decreasing strain-specific bias. Ethology 116:396–405
Tamariz M (2017) Experimental studies on the cultural evolution of language. Annu Rev Linguist 3:389–407
Tamariz M, Kirby S (2015) Culture: copying, compression, and conventionality. Cogn Sci 39:171–183
Tchernov E (1984) Commensal animals and human sedentism in the Middle East. Anim Archaeol 3:91–115
Tchernov E, Horwitz LK (1991) Body size diminution under domestication: unconscious selection in primeval domesticates. J Anthropol Archaeol 10:54–75
Theisen CA, Oberlander J, Kirby S (2010) Systematicity and arbitrariness in novel communication systems. Interact Stud 11(1):14–32
Thomas J (2013) Self-domestication and language evolution. Ph.D. thesis. The University of Edinburgh
Thompson B, Kirby S, Smith K (2016) Culture shapes the evolution of cognition. Proc Natl Acad Sci 113:4530–4535
Tomasello M (1996) Do apes ape? In: Heyes C, Galef B (eds) Social learning in animals: the roots of culture. Academic Press, Cambridge, pp 319–346
Tomasello M (1999) The cultural origins of human cognition. Harvard University Press, Cambridge
Tomasello M (2000) The social-pragmatic theory of word learning. Pragmatics 10:401–414
Tomasello M (2008) Origins of human communication. MIT Press, Cambridge
Tomasello M, Carpenter M, Call J et al (2005) Understanding and sharing intentions: the origins of cultural cognition. Behav Brain Sci 28:675–691
Topál J, Gergely G, Miklósi Á et al (2008) Infants’ perseverative search errors are induced by pragmatic misinterpretation. Science 321:1831–1834
Topál J, Gergely G, Erdőhegyi Á et al (2009) Differential sensitivity to human communication in dogs, wolves, and human infants. Science 325:1269–1272
Tornick JK, Gibson BM, Kispert D, Wilkinson M (2011) Clark’s nutcrackers (Nucifraga columbiana) use gestures to identify the location of hidden food. Anim Cogn 14:117–125
Trut LN (1999) Early canid domestication: the farm-fox experiment. Am Sci 87:160–169
Trut LN, Oskina I, Kharlamova A (2009) Animal evolution during domestication: the domesticated fox as a model. BioEssays 31:349–360
Udell MAR, Dorey NR, Wynne CDL (2008) Wolves outperform dogs in following human social cues. Anim Behav 76:1767–1773
Udell MAR, Dorey NR, Wynne CDL (2010) What did domestication do to dogs? A new account of dogs’ sensitivity to human actions. Biol Rev 85:327–345
Verhoef T, Kirby S, de Boer B (2014) Emergence of combinatorial structure and economy through iterated learning with continuous acoustic signals. J Phon 43:57–68
Virányi Z, Gácsi M, Kubinyi E et al (2008) Comprehension of human pointing gestures in young human-reared wolves (Canis lupus) and dogs (Canis familiaris). Anim Cogn 11:373–387
Wilkins AS, Wrangham RW, Fitch WT (2014) The “domestication syndrome” in mammals: a unified explanation based on neural crest cell behavior and genetics. Genetics 197:795–808
Winters J, Kirby S, Smith K (2015) Languages adapt to their contextual niche. Lang Cogn 7:415–449
Wobber V, Hare B, Koler-Matznick J et al (2009) Breed differences in domestic dogs’ (Canis familiaris) comprehension of human communicative signals. Interact Stud 10:206–224
Wrangham RW (2009) Catching fire: how cooking made us human. Basic Books, New York
Wrangham R, Conklin-Brittain N (2003) Cooking as a biological trait. Comp Biochem Physiol A Mol Integr Physiol 136:35–46
Wrangham RW, Jones JH, Laden G et al (1999) The raw and the stolen. Curr Anthropol 40:567–594
Wynne CDL, Udell MAR, Lord KA (2008) Ontogeny’s impacts on human–dog communication. Anim Behav 76:e1–e4
Zeder MA (2012) Pathways to animal domestication. In: Gepts P, Famula TR, Bettinger RL, Brush SB, Damania AB, McGuire PE, Qualset CO (eds) Biodiversity in agriculture: domestication, evolution, and sustainability. Cambridge University Press, pp 227–259
Zeder MA, Emshwiller E, Smith BD, Bradley DG (2006) Documenting domestication: the intersection of genetics and archaeology. Trends Genet 22:139–155
Zihlman AL, Cramer DL (1978) Skeletal differences between pygmy (Pan paniscus) and common chimpanzees (Pan troglodytes). Folia Primatol 29:86–94
Zohary D, Tchernov E, Horwitz LK (1998) The role of unconscious selection in the domestication of sheep and goats. J Zool 245:129–135
Zollikofer CPE, Ponce de León MS (2010) The evolution of hominin ontogenies. Semin Cell Dev Biol 21:441–452


Author information

Authors and Affiliations

Centre for Language Evolution, University of Edinburgh, 3 Charles Street, Edinburgh, EH8 9AD, UK
James Thomas & Simon Kirby

Authors

James Thomas
Simon Kirby

Corresponding author

Correspondence to James Thomas.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.


About this article

Cite this article

Thomas, J., Kirby, S. Self domestication and the evolution of language. Biol Philos 33, 9 (2018). https://doi.org/10.1007/s10539-018-9612-8


Received: 10 May 2017
Accepted: 13 March 2018
Published: 27 March 2018
DOI: https://doi.org/10.1007/s10539-018-9612-8
