Introduction

Life in stable groups unfailingly entails repeated interactions between different individuals, creating particular costs and benefits for individual group members and the group as a whole. The “social intelligence hypothesis” suggests that increased socio-cognitive skills are necessary to effectively navigate this social network in order to reap the benefits of sociality (Jolly 1966; Humphrey 1976; Whiten and Byrne 1988; Dunbar 1993). Social competence, which describes the ability to reliably assess the behavior of others and to respond flexibly and adaptively to it, for example by remembering prior interactions, anticipating others’ behaviors, and by coordinating one’s own behavior with that of other group mates, summarizes the key cognitive abilities that are favored in this context (Cheney and Seyfarth 2005; Dunbar 2009; Taborsky and Oliveira 2012; Sewall 2015). Increased social competence has been suggested to trigger the need for more complex communicative systems, being the key behavioral mechanism mediating these interactions (McComb and Semple 2005; Freeberg 2006; Freeberg et al. 2012a, b; Sewall 2015).

The traditional approach to studying communication is based on the sender-receiver paradigm, i.e., a dyad connected by the transmission of a signal in a particular channel (Shannon and Weaver 1951). More recently, however, it has been recognized that signals may connect several senders and receivers within a group (McGregor and Peake 2000; Cheney and Seyfarth 2005; McComb and Reby 2005; Fichtel and Manser 2010; Pollard and Blumstein 2012). In particular, multiple individuals may communicate simultaneously, and conspicuous signals may draw the attention of unwanted receivers (McGregor and Peake 2000; Peake 2005; Fichtel and Manser 2010). Hence, specific mechanisms are required to direct a signal of a given sender towards a specific receiver (McGregor and Peake 2000; Freeberg et al. 2012b).

Moreover, group-living individuals are usually involved in a wider range of interactions with diverse social partners, ultimately triggering needs to transmit a broader diversity of information and to express a wider range of emotional and motivational states (Morton 1977; Freeberg et al. 2012b; Ord and Garcia-Porta 2012). Accordingly, the “social complexity hypothesis for communicative complexity” (SCHCC) posits that animals living in groups with comparatively greater social complexity will exhibit greater complexity in their signaling systems (Freeberg 2006; Freeberg et al. 2012a, b). This hypothesis formalizes ideas that have been acknowledged ever since Darwin (1872), but that had never been explicitly tested (see Freeberg et al. 2012b for a historical perspective). Accordingly, the complexity of social groups ought to drive communicative complexity in both a proximate (context for signal development) and ultimate (social selection pressure) sense towards increased signal diversity among species, populations, and groups (Freeberg et al. 2012a; Krams et al. 2012). Empirical evidence from diverse taxa and for three different communicative modalities (auditory, visual, and olfactory) has supported this hypothesis (Table S1). However, there are also several studies not reporting the predicted positive relationships (Table S2) and the work of Freeberg (2006) seems still to be unique in adopting an experimental approach to test predictions of this hypothesis. Building on influential previous work by Freeberg et al. (2012a, b), in the present paper, we point out new perspectives that significantly expand the existing framework.

Specifically, three key issues have emerged from the recent literature that may offer interesting new perspectives on the SCHCC (Fig. 1). The first issue concerns the operational definition of the main variables (1 and 2 in Fig. 1). Because the unspecified usage of the term complexity can lead to a lack of agreement on the relevant variables, there is a need for better quantitative tools to estimate social and communicative complexity more systematically across taxa (Freeberg et al. 2012b; Bergman and Beehner 2015; Fischer et al. 2016, 2017; Pika 2017; Kappeler 2019, topical collection on Social complexity). Secondly, while numerous studies have highlighted a link between social and communicative complexity, their correlative nature does not permit conclusions about the direction of causality (3 in Fig. 1). Hence, a systematic control for alternative hypotheses that invoke important effects of ecology, morphology, or phylogenetic history on communicative complexity appears essential when testing the link between social and communicative complexity. Finally, we note that researchers rarely address the actual ways in which variation in signaling is directly affected by social factors (4 in Fig. 1). Indeed, the underlying mechanisms of this link are usually not explored, failing to uncover the specific attribute of communication that would be co-evolving with specific aspects of sociality (Gustison et al. 2012, 2019, topical collection on Social complexity; Pollard and Blumstein 2012). In this review, we will address these three crucial issues, taking stock of the current state of the art and point out what we believe are the main associated perspectives. In doing so, we hope to provide guidelines and inspiration for future tests of the SCHCC.

Fig. 1
figure 1

Schematic representation of three key issues associated with tests of the “social complexity hypothesis for communicative complexity” (SCHCC)

What are we looking at? Need for good operational definitions

Whereas the term “complexity” is omnipresent in scientific publications, there is no general agreement on its definition, neither within nor among disciplines (Adami 2002; Bennett 2003). The two Latin roots com (together) and plectere (woven, plaited, twined, entwined) (Harper 2001) indicate that multiple parts are connected to each other in flexible ways. Pollard and Blumstein (2012) accordingly suggested that “a system is typically considered more complex if it contains more parts, more variability or types of parts, more connections or types of connections between parts or more layers of embedded meaning”. This verbal definition, though helpful, does not offer any operational criteria to evaluate social and communicative complexity, but highlights the importance of a holistic approach (Freeberg et al. 2012b). Hence, there is currently a need for better tools and concepts to estimate both social and communicative complexity more reliably across taxa (Freeberg et al. 2012b; Bergman and Beehner 2015; Fischer et al. 2016, 2017; Pika 2017; Kappeler 2019, topical collection on Social complexity).

Another issue when trying to define and quantify social and communicative complexity in the context of the SCHCC concerns the non-independence between communication and sociality that may even amount to circularity. If social complexity is measured through communicative interactions or communicative complexity through its social outcomes, then we should not be surprised to find correlations between these variables. An analogous problem exists between sociality and cognition, leading Bergman and Beehner (2015) to suggest to measure social cognition in a different context than the chosen measure of social complexity. In addition, communication, together with social interactions, constitutes the core of a species’ social structure (Kappeler 2019, topical collection on Social complexity). Hence, measures thereof that are correlated with measures of social organization, such as group size, may therefore reduce the risk of redundancy and non-independence, and this is what several studies effectively did (e.g., Blumstein and Armitage 1997; McComb and Semple 2005). However, this approach can and has been criticized on the grounds that measures of social organization (notably group size) alone do not fully capture all relevant dimensions of social complexity (Dunbar and Shultz 2010; Kappeler 2019, topical collection on Social complexity). Thus, comprehensive measures of social complexity, in particular, need to be developed (Kappeler et al. 2019, topical collection on Social complexity).

Social complexity: developing broadly applicable quantitative measures

The main variable used to assess social complexity in the context of the SCHCC has been group size (Table S1 and S2). Some studies have used various derived or more specific measures for sub-units (e.g., foraging group, grooming clique, or group density), but all of them are based on the notion that individuals in larger units are expected to interact with more partners, triggering a need to convey more diverse and flexible messages (Freeberg et al. 2012b). Group size is a convenient proxy of social complexity as it is relatively easy to assess for large numbers of species, but it is commonly recognized as only a crude measure of social complexity (McComb and Semple 2005; Shultz and Dunbar 2007; Freeberg et al. 2012b; Bergman and Beehner 2015). Indeed, even if group size is undeniably a key determinant of social complexity (Kappeler 2019, topical collection on Social complexity) and has been shown to be a good proxy for social complexity in some taxa (Bourke 1999; Jeanson et al. 2007), it may fail to capture differences in social complexity, especially across taxa. For example, a herd of ungulates, a troop of baboons and a colony of ants may be constituted of the same number of individuals, but no one in the field would consider them as equally socially complex.

Because the notion of complexity is not only based on the number of units but also on the connections between these parts (Freeberg et al. 2012b; Pollard and Blumstein 2012), some studies considered other social features, such as mating system, dominance style, or breeding strategies, as better or alternative proxies of social complexity. However, some of these alternative classifications have themselves been questioned, and their ranking can be controversial and largely dependent on the taxa considered (Kappeler 2019, topical collection on Social complexity). For example, pair-living species may be considered as the simplest form of social complexity because they have the smallest group size, but this type of social system typically also includes long-term bonds that require familiarity and a subtle coordination of each other’s behavior. In some taxa, they have therefore been considered as a very complex form of sociality (ungulates: Shultz and Dunbar 2006; bats: Pitnick et al. 2006), whereas in others certain pair-living species have been characterized as the baseline of social complexity (primates: Dröscher and Kappeler 2013).

Similarly, regarding the patterns of group-level dominance relationships, there is no agreement on which pattern is more socially complex. Some authors argued that linear dominance hierarchies are socially more complex than egalitarian ones because they require particular cognitive abilities, such as transitive inference (MacLean et al. 2008). However, non-linear hierarchies could equally be considered as more complex, especially from the animal’s perspective, as they represent more uncertainty and necessitate more social competence (Taborsky and Oliveira 2012). Hence, it is important to clarify the level of analysis; whereas some studies address complexity at the group level, others focus on the individual level, resulting in divergent interpretations (Aureli and Schino 2019, topical collection on Social complexity).

The ranking of social features in terms of social complexity may also be prone to circularity because social complexity is usually evaluated with regard to the number of cognitive skills or the type and frequency of communicative interactions. Hence, the usage of particular social features may not solve the definitional problem and may even enhance subjectivity. Against this background, the verbal definition by Freeberg et al. (2012b), who defined “complex social systems are those in which individuals frequently interact in many different contexts with many different individuals, and often repeatedly interact with many of the same individuals over time,” represents important progress. More recently, Bergman and Beehner (2015) suggested to measure social complexity as the “number of differentiated relationships that individuals have.” These verbal definitions, although non-operational, are essential pre-requisites for the development of meaningful quantitative measures, at least of variation in aspects of social structure, such as the one proposed by Fischer et al. (2017) based on several common social indices and cluster analysis, or the one offered by Weiss et al. (2019, topical collection on Social complexity) applying binomial mixture models to association indices.

Facing this diversity of social complexity measures, it is evident that this concept encompasses different aspects and that a single measure may not be sufficient to assess it globally. Hence, a better appreciation of the complexity of a social system may rather be provided by a set of operational measures describing all of the four fundamental aspects proposed by Kappeler (2019, topical collection on Social complexity), i.e., social organization, social structure, mating system, and care system (Table 1). This systematic approach may facilitate progress with future comparative studies in this domain.

Table 1 Summary of the different social and communicative variables used to test the social complexity hypothesis for communicative complexity (SCHCC) in different taxa and modalities

Hence, we believe that in the context of the SCHCC, more effort needs to be invested into the choice and definition of the variables used to assess social complexity. In particular, favoring broadly applicable quantitative measures rather than subjective categories should facilitate comparative work and clarify the level at which complexity is evaluated (Weiss et al. 2019, topical collection on Social complexity). Moreover, measures implicitly based on communicative variables should be avoided in order to stave off circularity. Rather than a single measure, we propose that a set of operational measures matching the criterion mentioned above and accounting for all four main components of social complexity (Kappeler 2019, topical collection on Social complexity) will help to move the field forward. Better identifying the social complexity aspects included in the test should also allow for better predictions and understanding of the mechanisms underlying the observed relationship (see below).

Communicative complexity: from partitioning signals towards a systematic multimodal approach

One of the main issues associated with communicative complexity results from the partitioning of the distinct modalities in which signals are produced. Each signaling modality, defined as the sensory system with which a signal is produced and perceived by the receiver (Smith and Evans 2013), has historically been highlighted in different taxa. This bias has led to the development of different research questions, approaches, and methodologies across these different modalities (Liebal et al. 2013). Our own sensory bias, together with differences in the ease of data collection and unequal development of technological tools, have resulted in large biases in the number of studies between the different modalities.

In the context of the SCHCC, but also more generally in communication research, mainly the auditory, olfactory, and visual modalities have been explored, with a strong bias towards acoustic communication (Baptista and Gaunt 1994; Slocombe et al. 2011). Its universality and the fact that most species produce acoustic signals easily perceptible by humans may explain this bias. Indeed, because many olfactory signals are not easily perceived by humans, more sophisticated analytical methods are required to study them in more detail. Only a few studies have addressed the complexity of these signals and even fewer from a comparative perspective (Kather and Martin 2015). Visual signals are highly diverse because they can be produced by completely different processes, such as movements of the whole body or body parts (e.g., foot-flagging of some frogs) as well as changes in coloration or shape (e.g., feather erection display of birds), making it difficult to define broadly applicable measures of visual signal complexity (Endler 1992; Cuthill et al. 2017). Moreover, visual signals vary widely in persistence, ranging from state signals, such as static features of coloration patterns, to dynamic signals with limited duration that require an action by the signaler to be initiated (Smith and Evans 2013).

However, despite this imbalance in previous studies of different modalities and the diversity of methods used to assess communicative complexity across these modalities, some commonalities exist (Table 1). According to the most common definitions, communicative complexity can be quantified via two main approaches: the number of distinct signals or signaling units or the number of bits of information included in signals (Griebel and Oller 2008; Freeberg et al. 2012b; Pollard and Blumstein 2012). In the following, we summarize how these two approaches have been applied across the three main modalities studied in the context of the SCHCC and identify their current limitations. At the end of this section, we argue in favor of enhanced acknowledgement of the multimodal aspect of communication within the framework of the SCHCC.

Number of distinct signals or signaling units

In all communicative modalities, the number of distinct signals or signaling units has been used as a proxy for communicative complexity. To assess vocal communicative complexity across species, vocal repertoire size, defined as the number of call type individuals of a population or species produce, has been mainly used. To this end, call types have been classified by visual inspections of spectrograms, a rather subjective approach. Although the more recently used quantitative approaches to estimate repertoire size are favored, subjectivity still persists in the way algorithms are implemented to classify call types (Fischer et al. 2016). This bias, together with the diversity of quantitative methods applied to identify call types, currently results in substantial inhomogeneity among studies. For example, the vocal repertoire of the Mongoose lemur (Eulemur mongoz) has been characterized by visual inspection as composed of either 8 (Petter and Charles-Dominique 1979), 10 (Curtis and Zaramody 1999), or 14 (Gamba et al. 2015) call types, whereas quantitative analyses suggested 9 (Gamba et al. 2015) or 15 (Nadhurou et al. 2015) different call types. Thus, there is currently no general agreement on how to identify different call types and, hence, how to assess vocal repertoire size objectively.

Similarly, the main variable used to access complexity in olfactory signaling is the number of distinct chemical compounds contained in the different excretions (guinea pigs: Beruter et al. 1973; primates: delBarco-Trillo et al. 2011, 2012; hymenoptera: Kather and Martin 2015). While some semiochemicals are composed of only one molecule (e.g., the female sex pheromone (Z)-7-dodecen-I-yl-acetate common to the Asian elephant (Elephas maximus) and several species of moths; Wyatt 2003), others are composed of a combination of several molecules. Moreover, the diversity of scent sources, as urine, feces, saliva, skin secretions, or specialized sent glands, has also been considered as a proxy to assess olfactory complexity (delBarco-Trillo et al. 2011; delBarco-Trillo and Drea 2014). In addition, olfactory communication is characterized by its temporal duality: while scent deposition is a one-off event, the scent itself usually persists for some time and can be perceived by receivers even after the departure of the sender. Because scent deposition is usually accompanied by a typical behavior, we propose that the number of scent-marking behaviors can also be used to characterize the complexity of an olfactory system, even though these behaviors often have a strong visual component. Although repertoires of scent-marking behaviors have been reported for some species, they have rarely been interpreted in the context of the SCHCC (Drea and Scordato 2008; Colquhoun 2011; delBarco-Trillo et al. 2011).

Regarding visual signals, the numbers of both static and visual signals have been used as a proxy for complexity, although only a few studies actually tested the SCHCC. For example, among 25 wasp species, flexibly nesting species (with either one or several foundresses) exhibited more intraspecific variation in body markings than species having a unique nesting strategy (Tibbetts 2004; Table S1). In addition, dynamic signals, such as the number of visually distinct facial movements correlate with group size in Old World primates (Dobson 2009; Table S1).

To summarize, the number of signals or signaling units is a widespread method of accounting for complexity across modalities (Table 1). However, this section also emphasizes the diversity of levels that may be evaluated through this approach. While some of these approaches may be applied at the system level across modalities (e.g., number of signaling behaviors), some others are more modality-specific and are usually applicable at the signal level (e.g., number of compounds, number of notes).

Number of bits of information or amount of uncertainty

This approach is based on information theory and represents a measure of the entropy of a system, which is an estimate of the amount of uncertainty (Shannon and Weaver 1951; Fischer et al. 2016). Thus, Shannon’s entropy is not a measure of what one communicates, but rather a measure of what one could communicate (Shannon and Weaver 1951; Robinson 2008). This is an essential conceptual difference that may help to avoid circularity. Indeed, considering the social information contained in signals, rather than the possibilities offered by the structure of these signals to convey flexible and diverse information as a measure of communicative complexity, may lead to non-informative tests of the SCHCC. From a fixed set of signals, more information may first emerge by combinations, either through assemblage (simultaneous expression of different signals) or sequence (succession of different signals). Hence, acoustic units, i.e., uninterrupted traces in a spectrographic representation, and discrete calls can be combined into sequences (Bouchet et al. 2013; Manser et al. 2014).

Vocal sequences have been an intense area of research in studies of birdsong (Kroodsma 2004; Catchpole and Slater 2008b), but the existence of non-random vocal structures has also been shown in several mammalian orders, including cetaceans (Riesch et al. 2008; Shapiro et al. 2011; Cholewiak et al. 2013; Zwamborn and Whitehead 2017), primates (Clarke et al. 2006; Arnold and Zuberbühler 2008; Zuberbühler and Lemasson 2014; Gustison et al. 2016), bats (Kanwal et al. 1994; Bohn et al. 2009), and hyraxes (Kershenbaum et al. 2012). The combination of signals is an alternative way to increase the amount of information transmitted, and several methods have been proposed to analyze the structure and rhythmicity of these sequences (reviewed in Kershenbaum et al. 2016; Ravignani and Norton 2017). Indeed, the number of call/note combinations correlates with group size in several species (Freeberg 2006; Bouchet et al. 2013; Manser et al. 2014), suggesting that this is a promising area for future research on communicative complexity.

Similarly to vocalizations, olfactory signals can also be produced in sequences and the combinational deposition of scents appears to be widespread in mammals (Mills et al. 1980; Clapham et al. 2014; Vogt et al. 2014). For instance, male ring-tailed lemurs (Lemur catta) flexibly use two specific scent glands, by either depositing only one type of scent or a mixture of the two secretions. The mixture elicited a higher responsiveness, suggesting that they contain a larger number of messages and persist for longer (Greene et al. 2016). The superposition or juxtaposition of different scent marks may also be studied following the same analytical methods as the ones mentioned above for acoustic sequences (Kappeler 1998). Similarly, numerous courtship displays have been described as assemblages or sequences of different visual signals (Lorenz 1941), but these combinations have rarely been related to sociality in a comparative perspective (Crook 1964).

Variability in the structure of signals presents another potential way to increase the amount of transmitted information. For example, vocal systems have been suggested to be either discrete, with acoustically distinct call types that are easily discriminated from each other, or graded, with the acoustic structure of the vocalizations forming a continuum without clear boundaries between different call types (Marler 1977; Keenan et al. 2013). The gradation level is a measure of the overlap in temporal and spectral features existing between two calls, and might be one of the main reasons for the inconsistency in measures of vocal repertoire sizes. The higher the level of gradation in a system, the more difficult it is to cluster its components, but the higher is the potential for information transmission. Gradation may therefore represent a measure of bits of information that may be contained in signals. Because vocal repertoires usually contain a combination of discrete and graded calls (Fichtel et al. 2001; Bouchet et al. 2013; Manser et al. 2014; Fischer et al. 2016), the level of gradation can also be used to operationalize the complexity of a vocal system (Fischer et al. 2016).

Different selective pressures seem to shape signals as a function of the quantity of information required from a signal. While stereotypical signals with high context specificity reduce ambiguity, more flexible signals can convey more subtle and complex information about the sender’s motivational state or a particular context (Fischer 1998; Fichtel et al. 2001; Griebel and Oller 2008; Manser et al. 2014). Accordingly, the “call social function hypothesis” proposes that the level of overlap in temporal and spectral features of a call type will be influenced by its social function (Snowdon et al. 1997; Griebel and Oller 2008; Lemasson and Hausberger 2011; Bouchet et al. 2013; Keenan et al. 2013). In female Campbell’s monkeys (Cercopithecus campbelli), calls associated with the highest affiliative social value exhibited the greatest gradation level, whereas calls associated with agonism were the most stereotyped, and less social alarm calls were of intermediate variability (Lemasson and Hausberger 2011; Table S1).

Furthermore, inter-individual variation in call structure is another source of gradation that might be impacted by group size. The more group members interact, the more they may benefit from accurately recognizing the sender of a call, but this recognition task becomes increasingly difficult with increasing group size (Freeberg 2006; Pollard and Blumstein 2011). Hence, if individual recognition is important, more individualistic signatures should evolve with increasing group size as shown, for example, in chick begging calls of swallows (cliff swallows Hirundo pyrrhonota and barn swallows Hirundo rustica; Medvin et al. 1993), alarm calls of ground-dwelling sciurids (Pollard and Blumstein 2011), and infant isolation calls of bats (Wilkinson 2003; Table S1).

Hence, the gradation level of the vocal system (1 in Fig. 2), the gradation level of different call categories (2 in Fig. 2), and the stereotypy of different call types (3 in Fig. 2), as well as the level of individuality in different call types (4 in Fig. 2), are all sources of gradation that may be impacted by social variables. However, a practical quantitative measure is still lacking to evaluate gradation at the system level. The distribution of call stereotypy coefficients has recently been suggested as a potential measure of gradation for comparative analyses, with right-skewed distributions (closer to 1) indicating higher differentiation and left-skewed distributions (closer to 0) indicating a higher degree of gradation (Wadewitz et al. 2015; Fischer et al. 2016). By applying a fuzzy-c-means cluster algorithm, which allows for imperfect membership, calls can be assigned to different clusters. For each call a membership value is attributed for each cluster, ranging from 1 (the call fully displays the properties of the cluster in question) to 0 (the call does not display any of the properties of the cluster). The call stereotypy coefficient (referred to as “typicality coefficient” in Wadewitz et al. 2015 and Fischer et al. 2016) can then be obtained by subtracting the two highest membership values associated with this call. Hence, this measure allows for quantifying the gradation level between two call types but does not provide a generic measure of the gradation level of the vocal repertoire. Therefore we suggest that the distribution of the standard deviations across membership values for each call might be a more appropriate measure of gradation at the system level.

Fig. 2
figure 2

Schematic representation of (i) the different levels of gradation: gradation level of the repertoire (for a given species), gradation level of the call category (for a given species and call category, e.g., affiliative calls), call stereotypy (for a given species and call type), and individuality (for a given call and a given individual) and (ii) the associated potential comparative levels (1 to 4). Each gray capital letter represents a different call type, e.g., A is a call type of category 1 produced by the individuals of species I

Although this approach to assess the gradation level between signals has almost exclusively been adopted in the acoustic modality, the level of overlap in signal features may theoretically also be applied to other modalities. Hence, the relative proportions of different chemical components may also be considered as a form of gradation, with possible variation among species, groups, and individuals (Symonds and Elgar 2008; delBarco-Trillo et al. 2011). This gradation approach has also been suggested to be applicable to dynamic visual signals, such as facial expressions and gestures (Fischer et al. 2016), but this remains to be explored in practice.

A final approach to assess communicative complexity is to evaluate the perceptive adaptations of a system. This approach is highly complementary with the precedent one as in all species, sensory capabilities co-evolved with the range of signals they produce. Moreover, it should provide information on the ability of the receiver to discriminate between different stimuli. Hence, from an evolutionary point of view, this approach is significant, but has only rarely been addressed (Ramsier et al. 2012; Elgar 2015). For example, foraging group size co-varied with the overall hearing sensitivity and high-frequency limits across 11 species of strepsirrhine primates, suggesting that social complexity favored enhanced hearing sensitivities, especially at higher frequencies (Ramsier et al. 2012; Table S1). Similarly, in the olfactory domain, social Hymenoptera species have higher densities of hair-like sensillae, with which they perceive olfactory signals, compared to species that became solitary (Wittwer et al. 2017; Table S1). Comparative studies on visual adaptations may also be informative (Endler et al. 2005), but studies of variation in visual communication have so far mostly considered effects of ecological factors, such as background vegetation (Fleishman 1992), habitat type (Malacarne et al. 1991), or diurnality and predatory habits (Veilleux and Kirk 2014).

To summarize, evaluating the amount of uncertainty in a signal or signaling system represents a common approach to evaluate communicative complexity across modalities (Table 1). Although the most common approach is to study of combinational use of signals, the concept of gradation, currently mainly used for the vocal modality, is a really promising one that may allow for a systematic evaluation of communicative of complexity at both the signal and system level. This approach may benefit by also considering the complexity of the associated perceptive adaptations.

Thus, across all three modalities discussed here, several complementary approaches of complexity appear particularly relevant. These approaches are based on the number of signals or signaling units and the uncertainty (combinations and gradation of signals). These approaches are applicable across modalities and address both signal and system levels and may therefore also be used for broad comparative studies. As for measures of social complexity, we encourage the development of complementary quantitative operational measures of communicative complexity that incorporate the insights addressed above.

Integrating the multimodal nature of communication

While the field of animal communication has moved in the early 2000s from a unimodal approach to a multimodal one, acknowledging the fact that most animals produce signals across different modalities (McGurk and Macdonald 1976; Partan and Marler 1999; Rowe 1999; Candolin 2003; Hebets and Papaj 2005), much research in the context of the SCHCC has remained focused on a single modality. However, there are several good reasons for acknowledging the multimodal nature of signals and communicative systems in this framework. Two fundamental aspects of communication effectively deserve some attention, first, at the signal level, the fact that numerous signals are by themselves multimodal and, second, at the system level, the fact that individuals, even when not using multimodal signals, usually communicate through different modalities (Liebal et al. 2013; Pika 2017).

Multimodal signals are by definition complex, because they involve more than one signaling and perceptive system (Rowe 1999; Smith and Evans 2013). Moreover, multimodal signals have been suggested to be widespread in group-living animals (Partan and Marler 1999), but this assumption has never been formally tested. Isolating one component of a multimodal signal may lead to false conclusions because some signals have a different function when they are expressed independently or together with another signal component (“multiple signals/messages hypothesis” and “disambiguation hypothesis”; Hebets and Papaj 2005; Liebal et al. 2013). For example, female red-winged blackbirds (Agelaius phoeniceus) use the same visual display for courtship and aggression, and only the addition of a vocalization allows discrimination between contexts (Beletsky 1983).

At the system level, focusing on one modality only may lead to over- or underestimation of the relationship between social and communicative complexity, respectively. For example, submission in primates can be expressed either by visual or acoustic signals (Jolly 1966; de Waal and Luttrell 1985), showing that the same social selective pressure may result in different signaling adaptations that are equally complex but expressed through different modalities. Second, the flexible use of different modalities in multimodal signals permits another level of complexity embedded in multimodal signaling (Liebal et al. 2013; Wilke et al. 2017). For example, captive chimpanzees (Pan troglodytes) used more tactile and vocal signals when the experimenter was facing away, but more visual-gestural signals when the experimenter was facing the chimpanzee (Leavens et al. 2010). Thus, depending on the species in question, it may also be highly relevant to include some modalities (e.g., vibratory, thermal, electrical) that are not addressed here because they have not yet been considered in the framework of the SCHCC.

Hence, we suggest that establishing a cross-modal signal repertoire would not only fill the gap in our current understanding of the multimodal nature of most signals (Partan and Marler 2005) and contribute to a more comprehensive assessment of communicative complexity, but would also allow more meaningful tests of the SCHCC (Liebal et al. 2013; Waller et al. 2013). Moreover, applying a multimodal approach may also help to uncover the different selective pressures acting on the communicative system and to better understand adaptive functions that might not be clear from the study of its components independently (Liebal et al. 2013).

Integration of expanded operational definitions

We argue that integrating appropriate quantitative measures of social complexity (as described above) together with cross-modal measures of communicative complexity (number of signaling units and uncertainty) in a holistic fashion should facilitate comparative research on the SCHCC and may reveal important avenues for future research. Although the development of appropriate analytical tools to study multimodal communication has been debated for a while (Partan 2013), a recent network approach has offered great promise. This method has been used to uncover the contributions of female mate choice and male-male competition in the North American barn swallows (Hirundo rusticaerythrogaster) via examining a comprehensive array of phenotypic variables by a correlation-based phenotype network (Wilkins et al. 2015; Hebets et al. 2016). A similar approach could be used for a systematic analysis of communication systems and their links with social parameters. For example, each node in a signal phenotype network may represent a different signal/signaling unit or a different communicative complexity measure, with different node shapes representing different modalities or types of signals. The edge between two signals may then represent the correlation between these two signals across individuals in terms of either frequency of occurrence (Example 1 in Fig. 3) or value (Example 2 in Fig. 3) or their temporal association (Example 3 in Fig. 3), depending on the chosen measures. The comparison of these communicative system representations across different social context (e.g., activities or audiences) may, for instance, reveal new interesting relationships between communicative signals and social parameters. Integrating relevant social parameters into these networks would also allow exploring the specific links between different social and communication variables at the system level (Wilkins et al. 2015). The application of such a system-inspired framework, for example, revealed dynamic changes in signal structure and function across environmental and social contexts in courtship displays of wolf spiders (Schizocosa floridana) (Rosenthal et al. 2018).

Fig. 3
figure 3

Three heuristic examples of possible network construction presented on a conceptual diagram showing different levels of system degeneracy and modularity. The meaning of the edge and nodes for each example are explained in the bottom-right quarter. This figure is adapted from Wilkins et al. 2015

Hence, we believe that this method would not only offer instructive visual representations of multimodal communicative systems, but also an opportunity to extract new quantitative and broadly applicable measures that could be interpreted in terms of communicative complexity. This method indeed offers the possibility to characterize a given system in terms of degeneracy, modularity and pluripotentiality (Wilkins et al. 2015; Hebets et al. 2016). Degeneracy is a measure of network density and represents the number of significant correlative relationships; a highly degenerated system will be composed of signals potentially carrying the same information or having the same function (see “redundancy” in Wilkins et al. 2015; Examples 1 and 2 in Fig. 3). Modularity represents the degree to which relationships exist across versus within clusters, here representing signal types or modalities; a highly modular system will be composed of more modality-specialized individuals (Wilkins et al. 2015; Example 2 in Fig. 3). Pluripotentiality is a measure of the degree to which identical display components function across contexts; a highly pluripotent system will be composed of more functionally flexible signals that are used across contexts (Hebets et al. 2016).

Hence, in the context of the SCHCC, we make a plea for more comprehensive study of communicative systems, integrating a set of quantitative and complementary measures of social and communicative complexity and accounting for the multimodal aspect of communication. We believe that his approach would bring considerable insights in the current debate, particularly allowing for the emergence of more specific predictions on the nature of the link between social and communicative factors.

The links between social and communicative complexity

Although several studies have supported the social complexity hypothesis by revealing correlative relationships between social and communicative variables, only a few studies have examined and discussed the nature of these links in detail. Hence, the direction of causality and the mechanisms underlying these links remain currently unclear (Freeberg et al. 2012b; Ord and Garcia-Porta 2012; Gustison et al. 2016, 2019, topical collection on Social complexity), and these questions would benefit from additional research.

Control for alternative hypotheses: directionality and strength of the links

When studying the SCHCC, it is essential to also consider potential alternative hypotheses driving the evolution of signals. This is fundamental in both situations when a correlational link is found or when no such link is found, even though a link would be predicted (Freeberg et al. 2012b). For both outcomes, several alternative explanations are possible (Fig. 4), revealing two fundamental problems. First, the relative role of social complexity in shaping communicative complexity needs to be assessed. Social complexity may shape the complexity of a particular set of signals synergistically with other selective pressures also selecting for more complexity (Fig. 4, scenarios A and B) or despite other additional selective pressures (selective forces act in opposite directions; Fig. 4, scenarios C and D), making it hard to extract the actual effect of social complexity itself (Fig. 4, scenarios B and D).

Fig. 4
figure 4

Schematic representation of different potential evolutionary scenarios depicting the link between social and communicative complexity. A: scenario in which a link is observed, social complexity is the main selective pressure driving communicative complexity, in parallel other factors also contribute selecting for communicative complexity; B: scenario in which no link is observed with social complexity because other selective pressures have a stronger effect on the evolution of communicative complexity; C: scenario in which a link is observed, social complexity is the main selective pressure driving communicative complexity even if other factors constrain the evolution of communicative complexity; D: scenario in which no link is observed with social complexity because the effect is counterbalanced by other selective pressures constraining the evolution of communicative complexity; E: scenario in which a link is observed but communicative complexity is under other selective pressures and drive itself the evolution of more complex social systems

Second, correlative studies do not permit conclusions about the direction of causality. Complex communicative systems may evolve in response to pressures related to social complexity, but complex communicative systems might also be driven by other selective pressures independent of changes in sociality that could in turn facilitate the evolution of greater social complexity (Fig. 4, scenario E; McComb and Reby 2005; Freeberg et al. 2012b; Ord and Garcia-Porta 2012). For example, a comparison across 22 species of lizards revealed that the number of head-bobbings was better explained by habitat use, i.e., the level of arboreality, than by the intensity of competition among males (Ord and Garcia-Porta 2012). Similarly, a comparison across 23 species of birds revealed that syllable repertoire size was better explained by body size than by the level of extra-pair paternity, and a comparison across 32 species of frogs revealed that the level of call amplitude modulation was better explained by phylogeny than by the intensity of competition among males (Ord and Garcia-Porta 2012).

Hence, non-social selective pressures such as ecology, anatomy or a phylogenetic null model explained changes in signaling complexity better than the specific aspects of social complexity considered in these analyses (Ord and Garcia-Porta 2012; Fig. 4, scenario B). The mechanisms underlying these selective pressures have been explicated by Freeberg et al. (2012b) and are summarized in Table 2. In general, the evolution of complex signaling systems is presumably not the result of a single specific selective pressure but rather the result of a combination of several ones (Freeberg et al. 2012b; Ord and Garcia-Porta 2012; Ramsier et al. 2012; Manser et al. 2014). Thus, if two species have similar social systems but only one experiences a specific ecological challenge, different types of signals may evolve, but these signals will not necessarily differ in complexity. And, as argued above, the same selective pressure may lead to different adaptations that may be equivalent in terms of complexity but involve different modalities.

Table 2 Summary of the mechanisms underlying alternative hypotheses for the evolution of communicative complexity. This table is inspired by Freeberg et al. (2012b)

To deal with these complications, appropriate statistical tools controlling for alternative hypotheses are required. For example, phylogenetic comparative studies can reveal the order in which different traits evolved in a given lineage (e.g., Podos 2001; Wittwer et al. 2017). Model fitting procedures may also allow for assessing the relative role of the different factors in selecting for communicative complexity and test the SCHCC against alternative hypotheses (Ord and Garcia-Porta 2012). Control of the alternative hypotheses may also be achieved by a quasi-experimental approach that includes the study of closely related species sharing similar habitats, climatic conditions, food resources, parasites, and predators, but exhibiting differences in their social systems (Krams et al. 2012; Ord and Garcia-Porta 2012).

Identification of the underlying mechanisms: exploring the relationship(s) between signal variation and social factors

How variation in signaling is directly affected by social factors is usually not explored, failing to uncover both the specific attributes of communication that would evolve in response to specific aspects of sociality and the actual way these social factors may influence signaling behavior (Pollard and Blumstein 2012; Gustison et al. 2012, 2019, topical collection on Social complexity). In the next sections, we offer suggestions on how to explore the nature of these relationship with both, ultimate (e.g., function of derived calls or complex signals) and proximate approaches (e.g., audience effect and social learning).

Studying the social function of derived signals

One approach to begin closing this gap is to study the social function of derived signals to obtain a better understanding of the functional meaning of the observed differences (e.g., why a larger vocal repertoire evolved in the more socially complex species). Identifying the relevant differences (e.g., which are the derived calls?), their social functions (e.g., what are the social functions of these derived calls?) and assessing potential fitness consequences of these derived signals in detail will help to identify the potential social selective factors driving the evolution of these signals (Gustison et al. 2012; Liebal et al. 2013).

Regarding the vocal modality, it has been shown that rather than an overall increase in repertoire size, an increased number of different calls are generally found only in one or two call categories, with species exhibiting greater social complexity having vocal repertoires including a greater proportion of vocalizations used in affiliative contexts (Le Roux et al. 2009; Taylor and Reby 2010; Fedurek and Slocombe 2011; Briefer 2012; Gustison et al. 2012). A descriptive identification of derived call categories has been conducted in a relatively small set of closely related species (Kroodsma 1977; Stirling and Thomas 2003; Manser et al. 2014), but recently also in a comparative study across 253 bird species (Leighton 2017; Table S1). Birds breeding cooperatively had a significantly larger vocal repertoire size than other species. This increase in repertoire size was due to specific increases in the number of alarm and contact calls but not aggression, flight, territorial, or mating calls. Although this contextual information is undeniably valuable for discussing the SCHCC, these broad categories nevertheless fail to uncover the specific social function at stake.

In contrast, a detailed comparison of the vocal repertoire of two closely related primate species (chacma baboons, Papio ursinus, and geladas, Theropithecus gelada) with different social systems, identified eight homologous calls, but also six derived calls present only in the geladas, suggesting that the socially more complex geladas have larger vocal repertoires (Gustison et al. 2012; Table S1). Gelada males use these calls to maintain social relationships with females in their units and specifically direct these calls towards females after conflicts. Interestingly, this difference parallels a social divergence between the two species because gelada males form long-term bonds with several females in a harem-like reproductive unit, whereas chacma males only form temporary associations with females. Hence, comparing the nature of the difference between signal repertoire sizes between species that differ in aspects of their sociality can help to uncover the role of the respective social factors in explaining the observed variation between the communicative systems.

Audience effects and signal complexity

The audience effect is defined as a change in signaling behavior caused by the presence of the audience; the audience being defined as any individual that is within the signal range (Marler et al. 1986; McGregor and Dabelsteen 1996). Therefore, signaling behaviors may be influenced not only by the characteristics of a targeted receiver (Fröhlich et al. 2016) or its attentional state (Leavens et al. 2010; Smith et al. 2011) but also by the nature of the audience (Slocombe and Zuberbühler 2007; Kalan and Boesch 2015; Crockford et al. 2017) including unwanted receivers (Matos and McGregor 2002; Smith et al. 2011). Hence, the SCHCC could provide a theoretical framework for testing audience effects by considering variation in the audience as an aspect of social complexity. Studying audience effects on complex signals may therefore help to uncover the particular social selective pressures that may have led to the evolution of flexible signals. Multimodal or multicomponent signals appear to be particularly good candidates to examine this audience effect.

First, signals including functionally redundant components may serve to increase signal detection and memorization (“backup hypothesis”: Johnstone 1997; “receiver psychology hypothesis”: Rowe 1999). This type of signal may optimize signal transmission but may also attract the attention of unwanted receivers. Hence, to balance benefits and costs, multicomponent signals should be associated with some behavioral flexibility, allowing variation in usage and signal structure (e.g., only one component is expressed) between contexts (e.g., more discrete signal when unwanted receivers are in proximity). Male jungle fowl (Gallus gallus), for example, exhibit variability in their tidbitting food-call recruitment behavior used not only to attract females but also to trigger aggression of male competitors (Smith et al. 2011). Tidbitting is a visual signal consisting of repetitive movements of the head and neck and can be combined with repetitive pulsatile sounds. Subordinate males adapt this display to the attentional state of the alpha male of the group by producing more frequently bimodal tidbitting when the alpha male is distracted (Smith et al. 2011).

Second, signals including non-functionally redundant components may carry additional information (“multiple messages hypothesis”: Møller and Pomiankowski 1993) that might be addressed at different audiences and may also be flexibly adapted between contexts. Even if it remains to be tested, this may be particularly true for mammal scent-marking behaviors (“demonstrative marking hypothesis”; Estes 1967; Johnston 2005) because of its temporal duality. Scent deposition behaviors often include obvious visual signals that may reach individuals present in the vicinity of the sender, while the scent itself may be directed towards both the present and future audiences.

Hence, studying the flexibility of complex signal usage (e.g., occurrence or structural modifications) across social contexts (audiences) will permit to identify the different individual social characteristics that may elicit or constrain complex signal expression. These social characteristics may later constitute social pressures acting for or against the evolution of these complex signaling behaviors.

Ontogeny of flexible signaling behavior: social contexts associated with vocal production learning flexibility in vertebrates

Despite fundamental similarities in the structure and mechanisms of vocal production systems across vertebrates, important differences exist between species at the level of flexibility in vocal production. Accordingly, a distinction is usually made between vocal learners and non-vocal learners. Non-vocal learner species produce adult-like vocalizations from birth onward and usually possess a relatively small vocal repertoire, which is genetically controlled and evolutionarily conserved. In contrast, vocal learner species learn their vocalizations from another conspecific (social tutor). Early vocalizations are usually structurally simple and highly variable, becoming more complex and stereotyped with age. In some species, vocal learning occurs only during a specific early sensitive period, whereas others exhibit open-ended vocal learning (Egnor and Hauser 2004; Catchpole and Slater 2008a).

In contrast to vocal contextual learning (in which “a pre-existing signal comes to be associated with a new context”, Janik and Slater 2000), vocal production learning refers more specifically to the modification in the structure of vocal signals through experience with another conspecific (Janik and Slater 2000; Ruch et al. 2018). These changes in vocalizations can be classified as either vocal accommodation, when existing vocalizations are modified, or lexical learning, when a new vocalization is acquired (Ruch et al. 2018). The latter form of vocal learning is particularly interesting in the context of the SCHCC as it may lead to an increasing number of signals used by an individual across its lifespan. So far, evidence for lexical learning was only found in three distantly related groups of birds (parrots, hummingbirds, and songbirds) and four distantly related groups of mammals, i.e., humans (but not in other primates), bats, cetaceans, and elephants (Nowicki and Searcy 2014; Ruch et al. 2018).

In most of these species, individuals exhibit babbling-like behavior in early life (bats: Knörnschild et al. 2006; cetaceans: Vergara and Barrett-Lennard 2008; parrots: Masin et al. 2004). Babbling not only is primarily considered as an essential practice for vocal learning but also seems to play a key role in social exchanges with caregivers (Vergara and Barrett-Lennard 2008; Tallerman and Gibson 2012). In humans, babbling triggers positive responses from kin as well as non-kin caretakers (Tallerman and Gibson 2012). Hence, babbling should be particularly adaptive in species displaying allomaternal care, as for example in cooperative breeders. In this social environment, young have to compete with each other for the attention of non-kin helpers. Interestingly, allomaternal care is present in all the families mentioned above exhibiting lexical learning (cetaceans: Hill and Campbell 2014; humans: Tallerman and Gibson 2012; elephants: Rapaport and Haight 1987; birds: Hatchwell 2009). For example, humans and callitrichids exhibit the greatest degree of allomaternal care among primates, and they are the only primate taxa with babbling infants (Tecot et al. 2012; Burkart et al. 2017). This potential link between the presence of allomaternal care and lexical learning is also consistent with the relatively recent idea of a possible co-evolution between vocal communication complexity and cooperation (Tallerman and Gibson 2012; Freeberg and Krams 2015). Hence, more elaborate vocal behavior and babbling-like behavior might be more common in species with allomaternal care. The examples illustrate the point that studying the social environments in which flexible vocal learning takes place may also illuminate the causal relationships between social variables and communicative complexity.

Conclusions

Many previous studies of communicative complexity have focused on specific links between single social and communicative variables, oversimplifying the complexity of these interrelations and ignoring the specific underlying mechanisms. We therefore make a plea for more specific predictions and a more comprehensive study of communicative systems. We particularly recommend:

  1. (1.)

    To apply broadly applicable quantitative measures of social complexity in order to avoid the pitfalls of subjectivity and circularity and to clarify the level at which complexity is evaluated.

  2. (2.)

    To assess communicative complexity at the system (repertoire) level, accounting for the multimodal nature of communication at both the signal and system levels.

  3. (3.)

    To control for alternative hypotheses to the SCHCC through the application of appropriate statistical methods or careful selection of study species.

  4. (4.)

    To develop more specific predictions about particular social factors that may impact a specific communication variable and the underlying mechanisms at play.

Moreover, we argue that integrating appropriate social and communicative complexity measures and studying the social contexts promoting complex signal expression in more detail may advance our current understanding of the links between social and communicative complexity. Integrating quantitative measures of social complexity with cross-modal measures of communicative complexity should help to uncover the selective pressures acting on the communication system and may additionally provide an opportunity to extract new quantitative and broadly applicable measures of communicative complexity at the system level. Studies of the social contexts associated with complex signal expression at both the ultimate and proximate level may identify the specific attributes of communication that evolve in response to specific aspects of sociality.

In summary, we propose that expanding tests of the SCHCC in scope (systematic approach across modalities) and depth (characterization of the observed relationships) will significantly advance our understanding of the intricate links between animal sociality and communication. Studying the SCHCC along these lines may also inform current debates in the study of social cognition, where communication is rarely discussed, as well as in studies of language origins and evolution.