Chimpanzee (Pan troglodytes) agonistic screams are graded vocal signals that are produced in a context-specific manner. Screams given by aggressors and victims can be discriminated based on their acoustic structure but the mechanisms of listener comprehension of these calls are currently unknown. In this study, we show that chimpanzees extract social information from these vocal signals that, combined with their more general social knowledge, enables them to understand the nature of out-of-sight social interactions. In playback experiments, we broadcast congruent and incongruent sequences of agonistic calls and monitored the response of bystanders. Congruent sequences were in accordance with existing social dominance relations; incongruent ones violated them. Subjects looked significantly longer at incongruent sequences, despite them being acoustically less salient (fewer call types from fewer individuals) than congruent ones. We concluded that chimpanzees categorised an apparently simple acoustic signal into victim and aggressor screams and used pragmatics to form inferences about third-party interactions they could not see.
Citation: Slocombe KE, Kaller T, Call J, Zuberbühler K (2010) Chimpanzees Extract Social Information from Agonistic Screams. PLoS ONE 5(7): e11473. https://doi.org/10.1371/journal.pone.0011473
Editor: David Reby, University of Sussex, United Kingdom
Received: May 6, 2010; Accepted: June 4, 2010; Published: July 14, 2010
Copyright: © 2010 Slocombe et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by a grant from the Biotechnology and Biological Sciences Research Council to KZ. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Theories of the origins of human language draw heavily on comparative evidence of extant primates [e.g. 1], . To date, vocal research has focussed almost exclusively on monkey species whereas gestural research relies almost entirely on apes . It is imperative that research investigating vocal competencies in apes is forthcoming to provide a valid comparison and also to test whether the monkey evidence is a case of convergent evolution. So far, only a small number of studies have tested call comprehension in great apes ,  and thus we know very little about the abilities of great apes to extract and integrate information from calls; a vital part of human communicative abilities.
We address this issue with a study on how chimpanzees process vocal utterances produced during agonistic interactions. Many species of animal produce vocalisations during agonistic interactions, in the roles of both the aggressor and victim. Specific calls are produced to defend a territory (sand gobies ; Lusitanian toadfish ; crickets ) or a food source (pipistrelle bats ; dogs ) and these signals tend to be honest signals of body size and therefore fighting ability of the caller (dogs ; sand gobies ; crickets ; red deer ). Many species also produce specific vocalizations when experiencing aggression, including female rats , rhesus monkeys  and chimpanzees .
Detailed investigations of signalling during agonistic contexts have revealed advanced social and communication skills in some monkey species, with rhesus monkeys (Macaca mulatta) being capable of making inferences about an ongoing conflict based on the victim screams of their offspring . In chimpanzees, observational studies of agonistic encounters have already revealed context-specific vocal behaviour in a number of ways. Firstly, chimpanzee victims vary the acoustic structure of their screams as a function of the severity of aggression they are facing , and field experiments have shown that receivers can discriminate between these differences . Secondly, chimpanzees also vary their screams depending on their social role in an agonistic interaction with victims and aggressors producing acoustically distinct screams , but it is currently not known whether others attend to these acoustic differences.
Here, we tested whether, as bystanders of social conflicts, chimpanzees are able to extract information about the social role (victim or aggressor) of the two protagonists from their screams and integrate this with more general social knowledge to make inferences about the nature of a fight they cannot see. This is an ecologically valid problem for chimpanzees who typically dwell in dense forested habitat . Low visibility and a fission-fusion social system have the effect that chimpanzees typically hear many more fights than they see (Slocombe, unpublished data), potentially favouring the evolution of advanced comprehension abilities.
To test whether chimpanzee listeners were able to extract social information from agonistic screams, we presented adult individuals with congruent and incongruent sequences of screams, that is, stimuli that did or did not violate the existing social hierarchy. We adapted the basic experimental design, originally used by Cheney and colleagues  to examine causal reasoning in baboons, to address a different question of call comprehension in chimpanzees. We expected individuals to show greater orienting responses to incongruent than congruent stimulus sequences, in line with other animal and human infant research [e.g. 19]. Incongruent sequences consisted of low-ranking aggressor screams followed by high-ranking victim screams. These sequences represented an interaction that was at odds with the existing social hierarchy. Congruent stimuli consisted of exactly the same scream sequences, but with the addition of a pant hoot vocalisation from a displaying male who outranked both screaming individuals. Congruent call sequences thus simulated a plausible situation where it was most likely that the high-ranking's victim scream was elicited by the dominant male's intimidation display, rather than the low-ranking individual's social machinations. Individual recognition was necessary to infer the congruency of the sequences and there is good evidence that chimpanzee calls are individually distinctive vocalisations that can also be discriminated by others [e.g. 20], .
If chimpanzees are able to discriminate and understand the meaning of the different calls, as well as the social constraints under which the two callers operate, they should look longer at the speaker after hearing incongruent sequences than congruent ones. In contrast, if chimpanzees are unable to do so, their response pattern should be random, or in the other direction since congruent sequences are acoustically more salient than the incongruent ones.
We compared the looking duration of subjects in response to the two playback sequences. As a group, subjects looked significantly longer at the speaker in the incongruent compared to the congruent condition (incongruent: median 7.13s, inter-quartile range 10.29; congruent: median 4.13 s, inter-quartile range 6.35; Wilcoxon test Z = -2.31, p = 0.02; table 1).
At the individual level, one subject showed equal responses to both trial types. The nine remaining individuals discriminated the two sequences with 8 out of 9 looking longer at the incongruent than the congruent stimuli, with only one individual showing the reverse pattern, a level significantly above that expected by chance (Binomial (0.5), exact p = 0.039).
To control for potential variation in the amount of natural orientation subjects displayed towards the sleeping room, we examined the looking duration to the speaker in the minute after the victim scream, once the baseline looking time at the speaker in the minute before playback had been subtracted. Subjects still looked longer at the speaker in response to the incongruent than the congruent stimuli (median 5.80 s, inter-quartile range 8.84; median 0.56 s, inter-quartile range 7.94); Wilcoxon test Z = -2.80, p = 0.002.
No differences were found in terms of locomotor behaviour. The incongruent playbacks elicited four of nine individuals to approach the doors to the sleeping room, the congruent playbacks three of nine.
We have demonstrated with this study that chimpanzees showed greater orientation responses to incongruent than congruent scream sequences, suggesting that they extracted information about the social role of the two callers from their agonistic screams and made sense of the simulated interaction by interpreting their vocal behaviour within a wider social context. Specifically, subjects responded stronger to sequences that simulated a social interaction that violated the dominance hierarchy, compared to sequences that did not. To perceive the underlying social anomaly subjects could not simply rely on the acoustic surface features of the stimuli, but must have been able to make some inferences about the direction of aggression by assigning two distinct social roles to the callers (victim and aggressor). Results further demonstrate that chimpanzees can identify individual group members from their agonistic screams and that they can integrate invisible events simulated by the calls with their existing social knowledge about the expected social interactions of the call producers with regards to their social standing in the group.
The observed differences in looking responses cannot be explained by the acoustic features of the playback stimuli. Subjects responded comparatively weakly to congruent stimuli despite these sequences being more salient and attention-grabbing in a number of ways. Congruent stimuli were acoustically more salient than the incongruent ones, as they contained more call types from more individuals. In addition, the congruent sequence simulated interactions involving more individuals, including a top-ranking male, which generally evokes much interest in naturally occurring fights (Slocombe, unpublished data). Despite these factors subjects responded relatively weakly to these sequences. It was also not the case that chimpanzees' behaviour could be explained in terms of a stronger response to novel sequences of calls, because the pattern of screaming in both conditions was identical, therefore equally novel.
This study has shown that chimpanzees are capable of extracting a range of social information from the calls of familiar group members. Given the wealth of studies showing that monkeys are very adept at extracting social information from calls [e.g.14], [ 18], [ 22] the current study indicates that such comprehension skills are also present in the great apes and as such phylogenetically old abilities. Further research into chimpanzee vocal behaviour may provide more evidence for continuity between elements of monkey and human vocal communication with considerable relevance for theories of language evolution.
In conclusion, in this study listeners meaningfully distinguished between victim and aggressor screams and integrated this information with pragmatic social knowledge in order to draw inferences about the nature of an interaction they could not see. Victim and aggressor screams are two distinct scream types within the largely graded vocal system of chimpanzees, which are discriminated by receivers and appear to be meaningful to them. This result further demonstrates that even in graded vocal systems meaningful acoustic variation can be present within the major call types, which, combined with the ability to integrate pragmatic knowledge, enables individuals to communicate in complex ways.
Materials and Methods
3 male and 7 female chimpanzees (10-31 years old), housed at the Wolfgang Köhler Primate Research Centre (WKPRC), Leipzig, Germany, participated in this study. The subjects were housed socially in a group consisting of 18 individuals in spacious, naturally designed indoor (430 m2) and outdoor (4000 m2) enclosures, and a sleeping room which consisted of five interlinked cages (each 5.1-7.3 m2). During the period of testing, all apes received their complete daily diet consisting of various fresh fruits, vegetables, leaves, cereals, eggs and meat, and were never deprived of food or water at any time. In addition to 4-5 feeds each day the apes had access to several enrichment devices from which they could extract nuts and fruits with the help of tools.
We determined the dominance relations between stimulus providers by analysing the patterns of pant-grunting behaviour, the standard indicator of rank relations in chimpanzees . Pant-grunt data were collected on an all-occurrence basis during regular 2-hour observation sessions (4-5/week) at WKPRC from April-September 2007, which revealed the dominance relationships shown in table 2.
The School of Psychology Ethics committee, University of St Andrews gave ethical clearance for this non invasive, behavioural study and, in accordance with ethical guidelines, we terminated any trial in which the subject became acutely distressed, for instance if producing a screaming tantrum. This only occurred once and the subject was immediately released to rejoin her group, before the playback occurred.
The incongruent stimuli consisted of an aggressor scream bout of a lower-ranking individual followed directly by a victim scream bout of a higher-ranking individual. This sequence simulated an unusual event, because chimpanzees are rarely injured or made fearful by lower-ranking group members.
Congruent stimuli could have consisted of an inverted sequence (high-ranking aggressor scream then low-ranking victim scream). However, this approach would have introduced the confound of novelty when comparing the congruent and incongruent stimuli, as it is comparatively rare for a high-ranking individual to produce victim screams. In this case meaningful interpretation of a longer orientation to the incongruent stimuli (the only sequence to contain this call) would have been impossible as it could have reflected interest in (1) an unusual agonistic interaction that was only possible to understand with extraction of social information from the calls or (2) a relatively novel sound. In order to exclude this confound it was therefore imperative that the sequences of aggressor and victim screams remained identical in the two conditions.
In order to construct the congruent sequence we thus kept the scream sequences identical and, in line with the original study by Cheney and colleagues , introduced calls from a third individual to make the sequence congruent. In this study we overlaid a pant-hoot build up given by a top-ranking male in the middle of the playback stimulus so it overlapped with parts of the aggressor (mean 1.48 sec) and the victim (mean 1.41 sec) calls (Fig. 1).
Illustrated are (A) Channel 1, (B) Channel 2, (C) The point where the aggressor scream ends and the victim scream starts, (D) Aggressor scream given by low ranking individual, (E) Victim scream given by high ranking individual, (F) Pant hoot build up given by dominant male. The incongruent stimuli consisted of Channel 1 only.
This way, the incongruent sequence became congruent with the dominance hierarchy in the group, due to the simulated presence of a third individual dominant to both the victim and aggressor. This sequence thus simulated two independent events, none of which violated any social expectations. Firstly, a low ranking individual attacking an unknown (silent) individual, and second a high ranking individual responding with victim screams to the male's display pant hoots, a common response to such intimidation displays .
Screams and pant hoots were recorded opportunistically during natural agonistic encounters in the group (June-August 2006 and June 2007). Only recordings made during unambiguous contexts were included (aggressor screams: physical contact, charging or lunging at victim; victim screams: physical contact or being chased by aggressor; pant hoots: display charges towards other group members). Acoustic analyses confirmed that the structure of these screams mirrored the acoustic structure of victim and aggressor calls reported for wild chimpanzees [15; see supplementary methods in file S1].
All calls were recorded using a SENNHEISER K6/ME67 directional microphone and a MARANTZ PMD660 solid state recorder (sampling rate of 44.1 kHz, 16 bits accuracy). Calls were edited using RAVEN Pro 1.3. Stimuli were stored as .wav files on a Toshiba Tecra-M3 laptop and broadcast through a NAGRA-DSM speaker.
Stimulus scream bouts contained 5 or 6 calls, had a mean duration of 4.87 sec, (SD = 0.64), and were clear from other chimpanzees calling and excessive background noise. The screams were approximately equalized in amplitude (Mean RMS volume = 5212, SD = 986). An average of 3.1 sec (SD = 0.1) of the build up of the pant hoot was used. The playback volume for each stimulus set was modified so it sounded natural to the experimenter from the location the subject would hear it from. The incongruent and congruent stimulus pairs were identical in total duration and were played at identical amplitudes. Four different stimulus sets were made, each played to between two and four subjects [see supplementary methods in file S1]. Although, ideally, each individual would have heard a unique stimulus sequence, this was not possible due to the difficulty of obtaining high quality recordings of single individuals screaming in the two narrowly defined contexts. We thus tested individuals with a smaller set of high quality, realistic recordings. Using small stimulus sets can create problems with data interpretation, especially if subjects attend to peculiarities in the acoustic structure of stimuli instead of the conveyed meaning . Field studies with primates have shown that individuals attend to the meaning of calls, not just their acoustic surface features , indicating that it is acceptable to use the same stimulus sequence on several individuals.
This study followed a counterbalanced within-subjects design, with six subjects receiving the incongruent condition first. Each subject had at least 7 days between trials (mean = 24, SD = 10).
Each trial involved 6-7 different individuals: one subject, two or three call providers and two bystanders. The rest of the group were in the outdoor enclosure. We assumed that chimpanzees keep track of each other's whereabouts so we created spatially realistic constellations before broadcasting any stimuli. The subject was first separated from the others in the sleeping room, but he/she could still see 4 or 5 other individuals, including the call providers (fig. 2).
A. Initial position of all chimpanzees in sleeping room. B. Subject enters indoor enclosure. C. Call providers and bystanders enter outdoor enclosure. D. Subject positioned 5 m from sleeping room wall. Experimenter 1 films subject. Experimenter 2 broadcasts calls from sleeping room.
The subject was released into the indoor enclosure and usually approached the food (scattered muesli) placed 5 m from the sleeping room wall. The keeper then released all other individuals, including the call providers, into the outdoor enclosure, where they would not hear their own calls being broadcast. It was unavoidable that the subject would hear the hydraulic doors associated with releasing individuals outside, but due to the presence of the bystanders they could not know which of the initial 4 or 5 other individuals had been released. In order to create a realistic situation and give the impression that some unknown individuals remained in the sleeping room, the keeper then moved the internal doors and shouted in manner consistent with moving chimpanzees within the sleeping room. Experimenter 1, who filmed the subject, waited until the subject was sitting in the desired location for one minute. Experimenter 2 then broadcast the stimulus from the sleeping room. The subject's response to the playback was filmed, then after 5 minutes the keepers simulated the release of the call providers into the outdoor enclosure, by shouting and operating doors. The subject then rejoined the group in the outdoor enclosure.
We coded the video footage frame-by-frame (25 frames/second) using Adobe Premiere Pro CS3. We measured (1) duration of looking towards the sleeping room in the minute before the playback; (2) duration of looking towards the sleeping room in the minute after the onset of the victim screams. As the congruent or incongruent nature of the stimulus sequences only emerged at the onset of the victim screams, we measured from this point (3) whether the subject approached the sleeping room doors in the minute after playback.
In order to ensure the videos had been accurately coded, 5/20 randomly chosen trials (25%) were recoded on the same three measures by an independent individual blind to the hypotheses and the trial type. We gained 100% agreement on whether the subject approached the sleeping room and a Pearson's correlation showed our duration measurements were also highly similar (Duration looking at sleeping room pre-playback, r = 0.992; Duration looking at sleeping room post-playback, r = 0.971), thus we had confidence the videos had been accurately coded.
Due to small sample sizes, we conducted two-tailed non parametric statistical tests, specifically Wilcoxon signed-rank and Binomial tests. In line with recommendations given by Mundry and Fischer  exact rather than asymptotic p-values are reported.
Details of acoustic analysis of stimuli and descriptive comparison to published acoustic structure of chimpanzee screams Details on the selection of stimulus sets for each subject.
(0.04 MB DOC)
We thank WKPRC staff and Yvonne Rekers for help with conducting trials. We are grateful to Johannes Grossmann, Katja Grosse, Josefine Kalbitz, Patricia Kanngiesser and Katrin Riedl for long-term observational data and Scott Franklin for blind coding of the videos.
Conceived and designed the experiments: KS JC KZ. Performed the experiments: KS TK. Analyzed the data: KS TK. Contributed reagents/materials/analysis tools: JC. Wrote the paper: KS KZ.
Tomasello M (2008) Origins of Human Communication. Cambridge, , MA: MIT Press.
Corballis MC (2002) From hand to mouth, the origins of language. Princeton, , New Jersey: Princeton University Press.
Slocombe KE, Waller B, Liebal KThe language void: the need for multimodality. Animal Behaviour: in review.
- 4. Slocombe KE , Zuberbühler K (2005) Functionally referential communication in a chimpanzee. Current Biology 15: 1779–1784.
- 5. Slocombe KE, Townsend SW, Zuberbühler K (2009) Wild chimpanzees distinguish between different scream types: evidence from a playback study. Animal Cognition 12: 441–449.
- 6. Amorim MCP , Neves ASM (2008) Male painted gobies (Pomatoschistus pictus) vocalise to defend territories. Behaviour 145: 1065–1083.
- 7. Vasconcelos RO, Simoes JM, Almada VC, Fonseca PJ, Amorim MCP (2010) Vocal Behavior During Territorial Intrusions in the Lusitanian Toadfish: Boatwhistles Also Function as Territorial ‘Keep-Out’ Signals. Ethology 116: 155–165.
- 8. Brown WD, Smith AT, Moskalik B, Gabriel J (2006) Aggressive contests in house crickets: size, motivation and the information content of aggressive songs. Animal Behaviour 72: 225–233.
- 9. Barlow KE, Jones G (1997) Function of pipistrelle social calls: Field data and a playback experiment. Animal Behaviour 53: 991–999.
- 10. Farago T, Pongracz P, Range F, Viranyi Z, Miklosi A (2010) ‘The bone is mine’: affective and referential aspects of dog growls. Animal Behaviour 79: 917–925.
- 11. Taylor AM, Reby D, McComb K (2010) Size communication in domestic dog, Canis familiaris, growls. Animal Behaviour 79: 205–210.
- 12. Reby D, McComb K, Cargnelutti B, Darwin C, Fitch WT, et al. (2005) Red deer stags use formants as assessment cues during intrasexual agonistic interactions. Proceedings of the Royal Society B-Biological Sciences 272: 941–947.
- 13. Haney M , Miczek KA (1993) Ultrasounds During Agonistic Interactions Between Female Rats (Rattus norvegicus). Journal of Comparative Psychology 107: 373–379.
- 14. Gouzoules S, Gouzoules H, Marler P (1984) Rhesus monkey (Macaca mulatta) screams: Representational signalling in the recruitment of agonistic aid. Animal Behaviour 32: 182–193.
- 15. Slocombe KE , Zuberbühler K (2005b) Agonistic screams in wild chimpanzees vary as a function of social role. Journal of Comparative Psychology 119: 67–77.
- 16. Slocombe KE , Zuberbühler K (2007) Chimpanzees modify recruitment screams as a function of audience composition. Proc Natl Acad Sci U S A 104: 17228–17233.
Goodall J (1986) The chimpanzees of Gombe. Patterns of behavior. Cambridge, , MA: Belknap Press of Harvard University Press.
- 18. Cheney D, Seyfarth R, Silk J (1995) The responses of female baboons (Papio cynocephalus ursinus) to anomalous social interactions: Evidence for causal reasoning? Journal of Comparative Psychology 109: 134–141.
- 19. Onishi K, Baillargeon R, Leslie A (2007) 15-month-old infants detect violations in pretend scenarios. Acta Psychologica 124: 106–128.
- 20. Kojima S, Izumi A, Ceugniet M (2003) Identification of vocalisers by pant hoots, pant grunts and screams in a chimpanzee. Primates 44: 225–230.
- 21. Herbinger I, Papworth S, Boesch C, Zuberbühler K (2009) Vocal, gestural and locomotor responses of wild chimpanzees to familiar and unfamiliar intruders: a playback study. Animal Behaviour 78: 1389–1396.
- 22. Seyfarth RM, Cheney DL, Bergman TJ (2005) Primate social cognition and the origins of language. Trends in Cognitive Science 9: 264–266.
- 23. Newton-Fisher NE (2004) Hierarchy and status in Budongo chimpanzees. Primates 45: 81–87.
- 24. McGregor PK, Catchpole CK, Dabelsteen T, Falls JB, Fusani L, et al. (1992) Design of Playback Experiments - the Thornbridge Hall Nato Arw Consensus. Playback and Studies of Animals Communication 228: 1–9.
- 25. Zuberbühler K, Cheney DL, Seyfarth RM (1999) Conceptual semantics in a nonhuman primate. Journal of Comparative Psychology 113: 33–42.
- 26. Mundry R , Fischer J (1998) Use of Statistical Programs for Nonparametric Tests of Small Samples Often Leads to Incorrect P Values: Examples From Animal Behaviour. Animal Behaviour 56: 256–259.