Figures
Abstract
In human languages, words that are used more frequently tend to be shorter than words that are used less frequently. This pattern is known as Zipf’s law of abbreviation. It has been attributed to the principle of least effort – communication is more efficient when words that are used more frequently are easier to produce. Zipf’s law of abbreviation appears to hold in all human languages, and recently attention has turned to whether it also holds in animal communication. In birdsong, which has been used as a model for human language learning and development, researchers have focused on whether more frequently used notes or phrases are shorter than those that are used less frequently. Because birdsong can be highly stereotyped, have high interindividual variation, and have phrase repertoires that are small relative to human language lexicons, studying Zipf’s law of abbreviation in birdsong presents challenges that do not arise when studying human languages. In this paper, we describe a new method for assessing evidence for Zipf’s law of abbreviation in birdsong, and we introduce the R package ZLAvian to implement this method. We used ZLAvian to study Zipf’s law of abbreviation in the songs of 11 bird populations archived in the open-access repository Bird-DB. We did not find strong evidence for Zipf’s law of abbreviation in any population when studied alone, but we found evidence for Zipf’s law in a synthetic analysis across all populations. Overall, the negative concordance between phrase length and frequency of use in birdsong was several times weaker than the negative concordance between word length and frequency of use in written human languages. The method and the results we present here offer a new foundation for researchers studying if or how the principle of least effort shapes animal communication.
Author summary
Since ancient times, people have been fascinated by birdsong, and imagined it to be the “language of birds.” This analogy has become more exciting as researchers have discovered that many genes and parts of the brain involved in birdsong learning and development are also involved in human speech. But, there is still much we do not know about how birdsong and human language are similar or different. Recently, researchers have been interested in whether Zipf’s Law of Abbreviation (ZLA) holds in birdsong. In human language, ZLA says that words that are used more frequently tend to be shorter, because communication is more efficient if we have short words for the ideas we use most often. In birdsong, researchers have asked whether more frequently used notes are shorter, but results so far have been inconclusive. We developed a new computational tool for studying ZLA in birdsong and applied it to songs from 11 bird populations. We found evidence for ZLA in the set of populations we studied, but the pattern is weaker than in written human languages. More bird populations will need to be studied to confirm our results, and our computational tool will help researchers do that work.
Citation: Gilman RT, Durrant C, Malpas L, Lewis RN (2025) Does Zipf’s law of abbreviation shape birdsong? PLoS Comput Biol 21(8): e1013228. https://doi.org/10.1371/journal.pcbi.1013228
Editor: Tobias Bollenbach, Universitat zu Koln, GERMANY
Received: December 17, 2023; Accepted: June 13, 2025; Published: August 13, 2025
Copyright: © 2025 Gilman et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data and code used in this manuscript are available from https://doi.org/10.48420/24586791.v2.
Funding: RNL was partly funded by UK Natural Environment Research Council grant NE/L002469/1 "Training the Next Generation of Environmental Scientists." CDD was partly funded by the University of Manchester Merged Endowment Fund. The funders played no role in the study design, data collection, analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Over the past three decades, birdsong has gained currency as a tractable model for studying how language develops and is transmitted in humans [1–6]. This has been due in part to the discovery of biological similarities between birdsong and human speech, including analogies in learning patterns [1,4,6], brain mechanisms [7], and regulatory genetics [1,8]. Birdsong is also amenable to experimentation that might be impractical or unethical in humans [1]. These attributes have made birdsong particularly appealing as a model system for studying human speech pathologies [9–11]. The growing importance of birdsong as a model of human language necessitates a clear understanding of how birdsong and human language are similar and how they differ, so we can better understand both the potential applications and the limitations of the model [12].
Zipf’s law of abbreviation (ZLA) is a universal pattern in human languages [13]. ZLA states that words that are used more frequently tend to be shorter than words that are used less frequently [14]. This has been attributed to the principle of least effort. If an idea must be conveyed frequently, users will find or create shorter words to convey that idea, thus making communication more efficient [14–17]. If users must convey an idea only infrequently, then they can invest effort in longer words to ensure that the idea is communicated clearly [16]. Evidence supporting ZLA has been found in each of the nearly 1,000 human languages where it has been studied [13], and the law applies to both spoken language [18–22] and written characters [23,24]. Researchers have reported mixed support for ZLA in the vocal communication of other animals [25], including primates [19,26–30], cetaceans [31–33], bats [34], and hyraxes [35].
Relatively few studies have looked for patterns consistent with ZLA in birds. More than 30 years ago, Hailman and colleagues [36] reported that in black-capped chickadees (Parus atricapillus) shorter bouts of calls were more frequent than longer bouts of calls, but they found no evidence that shorter call types were more frequent than longer call types. This has been cited as an example of ZLA in birds [17,27,29,37], but it is not clear that the pattern Hailman and colleagues [36] reported should emerge due to the mechanism Zipf [15] proposed. Birdsong or calls can be segmented into notes (continuous sounds separated by periods of silence), phrases (short series of notes that frequently or always appear together), calls or songs (series of notes or phrases separated by longer periods of silence), and bouts (series of often similar calls or songs separated by even longer periods of silence) [6,12,36]. If notes or phrases are analogous to words (which is debated [38]), then calls and bouts may be analogous to sentences and orations, respectively. ZLA posits a relationship between the frequency and length of words, but it is not clear that the same relationship should emerge at these higher levels. Indeed, a simple process in which birds begin bouts of calls and then decide independently after each call whether to stop or continue could produce patterns similar to those Hailman and colleagues reported [36]. In 2013, Ferrer-i-Cancho and Hernández-Fernández [19] found no evidence for ZLA in the calls of common ravens (Corvus corax) in data collected by Connor [39]. In 2020, Favaro and colleagues [40] reported that shorter note types appear more frequently than longer note types in the calls of captive African penguins (Spheniscus demersus). However, the study population used only three note types, so the perfect negative concordance between note duration and frequency of use that the authors observed could easily have arisen by chance. More recently, Lewis and colleagues [41] found no evidence for ZLA in the songs of a domesticated population of Java sparrows (Padda oryzivora), but Youngblood [42] reported evidence for ZLA in wild house finches (Haemorhous mexicanus). Thus, whether there are patterns consistent with ZLA in bird vocalizations remains an open question.
Given the mixed evidence for ZLA in bird vocalizations, one might reasonably ask whether we should expect to see ZLA in birdsong at all. In human languages, words have lexical meanings, and those meanings can be independent of the length of the word. For example, we can shorten “television” to “TV” or “telly” and the meaning does not change. In birdsong, the sound of a note may determine its value to the listener [12]. For example, in some species, females appear to interpret specific note types as indicators of male quality, perhaps because those note types are difficult to produce [43–46]. If a male produced shorter or longer versions of those note types, then the information conveyed to females about his quality might change. Thus, replacing long note types with short ones might not make communication more efficient but rather impede accurate communication among birds. This could prevent ZLA from emerging in birdsong. Thus, both empirically and theoretically, the questions of whether birdsong adheres to ZLA or should even be expected to adhere to ZLA are unanswered.
Challenges to studying Zipf’s law of abbreviation in birdsong
Assessing the evidence for ZLA in birdsong presents several challenges that we do not encounter when studying ZLA in humans. First, relative to the number of words in human languages, the number of note types used by most bird populations is small. A small number of note types makes it more difficult to detect a significant concordance between note type frequency and duration [13,19]. If the number of note types is very small, as in the calls of African penguins [40], then even a perfect concordance between the frequency and duration of note types may provide only weak evidence for ZLA. No amount of additional study can resolve this problem. Thus, we may never be able to say with confidence that ZLA exists in some populations. Instead, researchers interested in ZLA in birdsong may need to assess large numbers of populations and draw conclusions based on the full body of evidence.
A second challenge stems from the fact that different birds in the same population can have very different note type repertoires [47]. In humans, individuals in a population that shares a language are likely to use similar sets of words with similar frequencies. Thus, it may be reasonable to study ZLA at the population level. Researchers can select representative multiauthor texts and assess the concordance between the length and frequency of use of each word using simple rank correlations [13]. In contrast, in many bird species, individuals in the same population use different and sometimes non-overlapping sets of note types [47]. This makes it difficult to adequately sample the use of note types in those populations. The problem is compounded by the fact that, in at least some species, song durations themselves appear to be constrained: birds that use longer note types sing fewer notes in each song [48,49]. In such species, if birds that sing shorter note types are at least as common as birds that sing longer note types, then we might see patterns consistent with ZLA at the population level even if no individual bird uses short note types more frequently than it uses long ones. However, such a pattern would not provide evidence for the principle of least effort proposed to underlie ZLA.
Because the principle of least effort suggests that individuals should use shorter types (i.e., words or notes) more frequently than longer ones, we might wish to look for ZLA at the level of individuals rather than populations. That is, if we choose a random individual from a population, are we likely to find that this individual uses shorter note types more frequently than longer ones? However, this question is made difficult by the fact that songs produced by individual birds in the same population may not be independent. In many species, songs are highly stereotyped and birds learn their songs from others [47,50,51]. If we find that two birds have note use consistent with ZLA, then the pattern may have arisen independently in each bird, or it may have arisen only once and both birds may have learned it from the same source. The second case is weaker evidence for ZLA. Thus, any attempt to study ZLA at the level of individuals must adequately account for the potential non-independence of individuals’ songs.
Finally, perhaps the biggest challenge to studying ZLA in birdsong arises from the inherent difficulty of classifying notes. In human languages, especially in the written form, we can usually agree on whether two units represent the same word or different words [13,15,52]. In birdsong, determining whether two notes belong to the same note type is less straightforward. Notes are usually assigned to types by expert inspection of spectrograms [47,53] or sometimes by computational clustering [42,54–56]. Both methods are highly repeatable [47,56]. However, high repeatability does not ensure that the assigned note types match the intent of the birds that produced those notes. Different birds may produce notes that are very similar but are nonetheless objectively distinguishable among individuals (e.g., because they have slightly different peak frequencies or durations [47]). Should we assign these notes to the same or different types? Similarly, individual birds may produce objectively distinguishable versions of similar notes at different points in their song. In general, we cannot know whether the bird intends to produce slightly different notes, or whether it is attempting to produce the same note each time but its performance is constrained by the position of the note in the song. It is not clear that we can resolve this problem empirically. We could ask whether listening birds can distinguish between notes, but the ability of other birds to distinguish between notes does not necessarily indicate the intent of the producer. By analogy, I might attempt to imitate a word pronounced by my colleague, but listeners may still be able to distinguish my attempt from theirs. In some study populations (e.g., [47]), we may know which birds learned their songs from which others, and we may be able to use analogies in song structure to infer that similar-sounding notes are attempts to produce the same note type. However, for most populations, we do not know which birds learned from which others, and inferring the intent behind individual notes may be fundamentally beyond our grasp. In cases where it is difficult to decide whether notes should be assigned to the same or different types, it is likely that the ambiguous notes will have similar characteristics, and therefore the decision to split or merge types may have little effect on the durations of the note types. However, the decision to split or merge types will necessarily affect the frequencies with which those types appear, and so may affect our inferences about ZLA.
An especially difficult problem arises if classification errors are not independent of note durations. For example, we might be more likely to split longer note types, or to merge shorter note types, simply because longer note types give us more opportunity to identify potential differences among notes. If this is true, then our classification system will overestimate the frequencies of short note types and underestimate the frequencies of long note types in the population repertoire. This would produce a pattern consistent with ZLA not because of how birds use note types but rather because of how we perceive the note types they use.
Overview of the current study
With these challenges in mind, Lewis and colleagues [41] developed a novel method for assessing ZLA in bird populations. This method studies ZLA at the individual level and is appropriate for systems where the repertoires of individual birds may be different and where frequencies of use may be learned and thus non-independent among birds. Here, we introduce the R [57] package ZLAvian (https://CRAN.R-project.org/package=ZLAvian) to implement Lewis and colleagues’ [41] method. We apply ZLAvian to assess evidence for ZLA in 11 populations from 7 species of songbirds with songs archived in Bird-DB, an open access repository of annotated birdsong [58]. We synthesise results from the 11 populations as a first pass at quantifying ZLA in birdsong. Our work offers both a computational tool and an empirical foundation for studying ZLA in bird communication.
Methods
In this section, we describe Lewis and colleagues’ [41] method for studying ZLA in birdsong, as implemented in the R package ZLAvian. We formalize the method in Box 1. Then, we describe how we applied this analysis to study ZLA in the bird populations with songs archived in Bird-DB [58].
Box 1. Formalisation of the method proposed by Lewis and colleagues
Let be the number of note types and b be the number of birds in a dataset. Let
be the log-transformed duration of instance
of note type
produced by bird
. Let
be the number of times bird
produced note type
, and let
be the total number of instances of note type
across all birds in the dataset. Then, the mean log-transformed duration of note type
as produced by bird
is
. Let
be the concordance (i.e., Kendall’s
) between the
s and the
s in bird
. Then,
is a random variable with variance given by
where is the number of note types in the repertoire of bird
[59]. The inverse-variance weighted mean concordance in the population is
. This is our test statistic for ZLA in the population.
Each differs from the expected log-transformed duration for note type
in the population due to variation within and among birds. We obtain
, the expected log-transformed duration for note type
in the population, by fitting the random effects model
for each note type using the package lme4 in R [60]. Here, the vector holds the log-transformed durations of all instances of note type
in the data, and
is an
indicator matrix where entry
if the
th instance of note type
was produced by bird
and
otherwise. The vector
has
entries drawn from
andthe vector
has
entries drawn from
. Thus,
and
are the variances in the mean log-transformed durations of note type
among and within birds, respectively. Let D be the
deviation matrix in which entry
describes the deviation of the mean log-transformed duration of note type
as produced by bird
from the expectation for the population.
We can obtain one value of under the null by permuting the log-transformed note durations among the note types without changing the set of note types that each bird produces. To do this, let
be an
matrix with each column equal to
, where
is some permutation of the
s. Then, the matrix
represents a resampling of the mean log-transformed note type durations under the assumption that the note type duration is independent of the frequency of use, and where each bird’s note type durations differ from the expected value for the population by the same magnitude as in the original data. Let
be the concordance between the
s and the
s in bird
. Then,
is one possible value of
under the null. We obtain a null distribution for
by repeated permutations.
Statistical approach
ZLAvian works by computing the concordances between note type duration and frequency of use in individual birds sampled from a population, computing the weighted mean concordance across all birds in the sample as a test statistic, and comparing that test statistic to a null distribution obtained by permuting durations among note types while maintaining the song structure within and among birds. The analysis requires birdsong data in which notes have been assigned to types, the duration of each note has been measured, and each note can be attributed to an individual bird. In its default mode, ZLAvian computes and studies log-transformed note durations rather than working with raw note durations. This is important for the computation of the null distribution. We explain the decision to study log-transformed note durations after we explain how the null distribution is computed.
To obtain a test statistic for ZLA in a sampled population, ZLAvian computes the mean log-transformed duration of each note type as produced by each bird in the sample, and counts the number of times that each bird produced each note type. Then, it computes the concordance (i.e., Kendall’s ) between the mean log-transformed duration and the frequency of use of note types within birds. This results in one value of
for each bird. ZLAvian uses these
s to compute
, the weighted mean value of
in the population. In the default mode, ZLAvian weights each value of
by its inverse variance [59]. Because
is a random variable, weighting by the inverse variance accounts for the fact that the
s in birds with larger note type repertoires provide more accurate estimates of the population mean [61]. However, ZLAvian also offers users the option of computing
with each bird weighted equally. In either case, the computed
serves as a test statistic but also has a useful biological interpretation. Kendall’s
, and therefore
, ranges from −1 to 1 and is linearly related to the probability of concordance among observations. That is, if we were to randomly select a bird from the study population and then randomly select two note types as produced by that bird, then
is the probability that the longer note type would appear more frequently. This makes
an intuitive metric for comparing the observed strength of ZLA across populations.
Next, ZLAvian computes a null distribution for . To do this, it first computes the expected log-transformed duration for each note type in the population. It estimates each expected log-transformed duration as the intercept in an intercept-only random effects model of the observed log-transformed durations for that note type, with a random effect for each bird that produced the note type. This method accords more weight to birds that produced the note type more frequently, because we can more accurately estimate the mean note type duration in those birds. ZLAvian then permutes the expected log-transformed durations among the note types at the population level. Thus, if a note type is assigned a particular duration by permutation, it is assigned that same duration in all birds that produced that note type. This permutation results in a set of population mean log-transformed durations for note types that we might see under the null hypothesis that note type durations and frequencies of use are independent, but it maintains the observed distribution of note type frequencies within and among birds in the population. This accounts for the fact that birds may learn note types and frequencies of use from other birds.
In nature, individual birds may produce the same note types in slightly different ways, and therefore the mean duration of each note type as produced by each bird will differ from the mean duration of that note type in the population. As a result, the same note types may have different rank order durations in different birds. How each bird produces each note type may be learned from other birds, and durations may not be independent among the note types a bird produces. For example, a bird that produces a longer version of one note type may be more likely to produce longer versions of some other note types. We want to account for differences in note type durations among birds under maximally conservative assumptions about how note type versions are learned. To do this, ZLAvian computes the deviation of each bird’s mean log-transformed note type duration from the population mean for each note type that the bird produced. This results in a deviation matrix with one entry per note type per bird. With equal probability, ZLAvian either adds the deviation matrix to or subtracts the deviation matrix from the permuted population mean log-transformed durations. This allows the rank order of note type durations to differ among birds, but maintains the structure of deviations in note type durations within and among birds. The result is a matrix of permuted and adjusted note type durations that might have been produced by birds in the sample if note type durations and frequencies of use were independent.
Finally, ZLAvian computes , the analog of
in the permuted and adjusted data. The null distribution for
is the set of
s computed for every possible permutation of the mean log-transformed note type durations with the positive or negative deviation structures added. In most cases, the set of possible permutations is too large to compute all possible values of
. Therefore, ZLAvian estimates the null distribution from a randomly chosen subset of the possible permutations. The p-value for the hypothesis that ZLA manifests in the study population is the proportion of the null distribution in which
is equal to or smaller than
Lewis and colleagues [41] developed this method to study the concordance between note type durations and frequencies of use in birdsong, but the method is transferable to other taxa and to other measures of production effort (e.g., bandwidth, concavity, excursion [42]).
The log transformation of note durations is important when the observed variability among birds in note type durations is reassigned to the note types after the population mean durations have been permuted. In many bird populations, the variability among individuals in the durations of particular note types scales with the population mean durations of those note types [48] (see also S1 Appendix). This is what we would expect if, for example, there are more opportunities for errors in duration to accumulate in longer note types. Furthermore, the differences among the durations of shorter note types tend to be smaller than the differences among the durations of longer note types – that is, the distributions of population mean note type durations are often right-skewed. If we were to permute the population mean raw durations among the note types and then reassign the original raw deviances to the permuted note type durations, we would often assign large deviances from long note types to note types that received short durations by permutation. As a result, the rank orders of note types with longer durations in the observed data would vary more among individuals in the permuted data than they did in the observed data, and the rank orders of note types with shorter durations in the observed data would vary less among individuals in the permuted data than they did in the observed data. This would change the relative importance of long and short note types in generating concordances at the population level. Log transformation reduces or removes the relationship between the mean and the interindividual variability of note type durations, and eliminates the right skew in the distribution of note type durations. Thus, we can permute log-transformed durations among note types without biasing interindividual variability in the rank orders of note types towards note types with longer observed durations. Some authors have advocated studying median rather than mean durations to assess ZLA in animal communication [20], but log transformation makes the distributions of durations approximately symmetrical, so means and medians are similar. Nonetheless, ZLAvian offers users the option to study ZLA using medians rather than means, and in the supplementary information we show that the qualitative results are similar in the populations we studied (S1 Table). ZLAvian also offers users the option to study raw rather than log-transformed measures of production effort. We do not advocate this when using durations to study ZLA, but it may be appropriate for users studying measures of production effort for which interindividual variability does not scale with the mean.
The method proposed by Lewis and colleagues [41] assesses evidence for ZLA within individuals while accounting for non-independence among individuals due to song learning or stereotypy. However, it does not correct for flaws in the classification of notes into types. Such flaws result in errors in the data, and in general statistical methods cannot correct such errors. However, we can assess how different kinds of note type misclassifications will affect our inferences (S2 Appendix). If we incorrectly merge note types (i.e., assign notes to the same type when they should belong to different types), then we will overestimate the variance of the null distribution and our inferences will be conservative. If we incorrectly split note types, then we will underestimate the variance of the null distribution and our inferences will be anticonservative. Attempts to assess ZLA in birdsong must be interpreted in light of these potential biases.
Application of ZLAvian to birdsong archived in Bird-DB
We downloaded 660 annotations of birdsong representing seven bird species (California thrasher, Toxostoma redivivum; redthroat, Pyrrholaemus brunneus; black-headed grosbeak, Pheucticus melanocephalus; sage thrasher, Oreoscoptes montanus; Cassin’s vireo, Vireo cassinii; western tanager, Piranga ludoviciana; gray shrikethrush, Colluricincla harmonica) from the open access repository Bird-DB [58] on 18 April 2022. Annotations on Bird-DB can include one or more songs, but all songs in the same annotation are from the same bird. The phrases in each annotation are classified to types, and the starting and ending times for each phrase are reported. We used the reported starting and ending times to compute the phrase durations. The recordings represented in Bird-DB were collected on different days and at different locations, and we assumed that each annotation represents a different bird. Most phrases in Bird-DB are monosyllabic, and thus correspond to individual notes. A small number of phrases consist of short sequences of notes that typically appear together. Such polysyllabic phrases are often analysed as single units, and we follow this convention in the body of this paper. Our results are qualitatively similar if we divide the polysyllabic phrases into individual notes and use notes rather than phrases as the primary unit of analysis (S3 Appendix).
We cleaned the annotations downloaded from Bird-DB prior to analysis. Phrase types in Bird-DB are identified by two- or three-letter strings. We excluded any phrase with a type identifier that includes non-alphabetic characters, or that comprises fewer than two or more than three characters. These are likely to be data entry errors, and we cannot confidently assign these phrases to types. We also excluded annotations that include only one repeated monosyllabic phrase type. These annotations may represent alarm calls, and alarm calls may adhere to different rules than other calls or songs. Assessing concordance within annotations requires at least two phrase types, so no information about concordance was lost by excluding annotations that consisted of only one phrase type.
In some cases, songs from the same species on Bird-DB were annotated under different classification systems. We cannot analyse these songs together, because we do not know which phrases in one classification system correspond to which phrases in the other. If we treat phrases from different classification systems as different when they are in fact the same, we will overestimate the number of phrases in the species’ repertoire and underestimate the variances of null distributions, and the p-values we obtain when testing for ZLA in that species will be anticonservative (S2 Appendix). Therefore, when songs from the same species were annotated using different classification systems, we treated annotations classified by each system as different populations. Tests conducted on multiple populations from the same species cannot be regarded as independent, because populations may share phrases and song structures. Nonetheless, tests of different populations that produce similar results can provide corroborating evidence for or against the presence of ZLA in that species.
For each population represented in Bird-DB, we report the number of annotations studied, the total number of phrase types across all annotations, the mean number of phrases and phrase types that appear in each annotation, the mean Shannon diversity of phrase types in annotations, the concordance between phrase type duration and frequency of use at the population level, and the mean concordance between phrase type duration and frequency of use by individuals in the population (i.e., ). For population-level and individual-level concordances, we report the one-tailed p-values testing whether each concordance is more negative than we would expect under the null hypothesis that the note type duration and the frequency of use are unrelated. Thus, small p-values indicate patterns strongly consistent with ZLA and large p-values (i.e., close to 1) indicate patterns strongly contrary to ZLA. Finally, for each population we report the threshold value of
that would be necessary to infer statistically significant (
) support for ZLA at the individual level. We call this the detection threshold. All reported measures were computed by ZLAvian.
Because phrase type repertoires in birdsong can be small, biologically plausible negative concordances between phrase durations and frequencies of use may not be statistically significant in individual bird populations, and researchers may need to synthesise across many populations to understand ZLA in birdsong. To do this, we fit an intercept-only model of in the populations we studied, with a random effect of species to account for correlation among the concordances observed in populations of the same species. This analysis asks whether we would expect the concordance between phrase duration and frequency of use in a randomly selected bird species to be negative, even if concordances are not significantly negative when bird populations are studied individually.
If birdsong adheres to ZLA, we would like to know whether the strength of ZLA in birdsong is different from that in human languages. Following [13], we measured the length in characters and the frequency of use of words in 462 translations of the Universal Declaration of Human Rights downloaded from https://unicode.org/udhr/index.html on 21 May 2023. We computed the concordance between the length and the frequency of use of words in each translation. We compared the mean concordances we observed in our seven bird species to those we found in written human languages using a t-test with a Welch correction for unequal variance. This approach discards information because it averages among populations within each bird species. Therefore, we also fit a Bayesian hierarchical model of
in birdsong and written human language, with a random effect of species in the birdsong data, and allowing unequal variance in the birdsong and human language data. We computed the probability of direction for the difference in the mean
in birdsong and written human language to confirm the inference from our t-test [62].
To understand how large phrase or note type repertoires in birdsong need to be for researchers to detect statistically significant evidence for ZLA in a population, we regressed the repertoire sizes in the populations we studied on the detection thresholds we achieved for each population, allowing random intercepts for each species. If researchers know the strength of ZLA they hope to detect in a population, this analysis allows them to estimate how large a phrase repertoire they would need for that strength of ZLA to be statistically significant.
Results
We assessed the evidence for ZLA in 11 populations from 7 bird species (Table 1). Each population was represented by 2–296 annotations (mean 51.0, median 13, sd 87.2). The number of phrase types per population ranged from 9 to 748 (mean 188, median 114, sd 219) and the number of phrase types per annotation in the populations ranged from 2.8 to 89.5 (mean 30.0, median 24.9, sd 23.8).
At the population-level, the concordance between phrase type duration and frequency of use was negative (i.e., consistent with ZLA) in 8 of the 11 populations we studied (Table 1 and Fig 1). In two of these, western tanagers and one population of California thrashers, the p-values were less than 0.05 (i.e., strongly consistent with ZLA). Among the 3 positive population-level concordances, one in black-headed grosbeaks and one in Cassin’s vireos had p-values greater than 0.95 (i.e., strongly contrary to ZLA). At the individual level, 10 of 11 concordances were negative, but only one (in western tanagers) had a p-value less than 0.05.
Each open circle represents the mean frequency of use and the mean duration of one phrase type used in the population. Horizontal (vertical) grey lines show the range of relative frequencies of use (durations) among birds that used each phrase type. Phrase types that appear together in the repertoire of at least one bird are connected by colored lines. Heavier lines indicate that the pair of phrases was used by more birds. Blue (red) lines indicate that the concordance between frequency of use and duration was positive (negative) for that phrase pair. Intermediate colors indicate that the concordance was positive in some birds and negative in others. For example, this can occur when some birds use the phrase type more frequently than other birds. ZLA is present when the concordance between frequency of use and duration is negative. Thus, we should expect to see more red lines in figures for populations that adhere to ZLA. For comparison, the last panel shows the concordance between word length and frequency of use in English based on the first 10 psalms (American Standard Version), with each psalm treated as if it were produced by a different author. There is strong evidence for ZLA in English based on this sample ( = -0.253, p < 0.001).
Across all populations, the expected mean individual concordance between phrase duration and frequency of use in birdsong was significantly negative ( = -0.071 ± 0.031, p = 0.028). The mean concordance we observed in human languages was -0.212 ± 0.002. Thus, concordances were more negative in written human languages than in birdsong (p = 0.001, probability of direction = 0.001; Fig 2).
Error bars show standard errors. Concordances are more negative in human languages than in birdsong (p = 0.001).
As expected, the repertoire size required to infer statistically significant ZLA in a population is larger when ZLA in the population is weaker (p < 0.001; Fig 3). If birdsong in a population exhibited the expected strength of ZLA that we observed in our study ( = -0.071), we would expect to need a repertoire of approximately 194 phrases (95% confidence interval for the expectation, 143–260 phrases) to infer statistical significance.
Open circles show the detection thresholds and repertoire sizes for the populations we studied. The black dotted line shows the expected concordance across all populations. The red dashed line shows the predicted repertoire size needed to infer statistically significant evidence of ZLA in a birdsong sample with a given concordance between phrase duration and frequency of use. The shaded grey area shows the 95% confidence interval for the predicted repertoire size.
Discussion
Across the bird species we studied, the expected concordance between phrase type duration and frequency of use was significantly negative, consistent with Zipf’s law of abbreviation [14,15]. Within the individual populations, only 1 of 11 concordances was significantly negative, and this was without correcting for multiple testing. Thus, considering the populations individually, we cannot be confident that any particular population exhibits ZLA. Nonetheless, in 10 of 11 populations, the best estimates for the mean individual concordances were negative, and similar nonsignificant trends have been reported in the songs of Java sparrows [41] and in call repertoires at the population level in common ravens [19] and African penguins [40]. Taken together, this evidence is consistent with weak ZLA in bird vocalisations that is difficult to detect when phrase or note repertoires are small. It may be necessary to assess ZLA in many different bird species before we can draw clear conclusions about its existence or strength in birdsong generally. The results and tools we present here are an important step towards this goal.
Some of the populations we studied were represented by very small numbers of birds. When sample sizes are small, we cannot be confident that patterns we detect in the samples are general to the populations as a whole. For example, in western tanagers, we found significant evidence for ZLA in a sample of only three birds. We are confident that the repertoires of these birds conform to ZLA, but we cannot be confident that the repertoires of all birds in the population do. Nonetheless, the fact that we found negative concordances in samples from many different populations suggests that the pattern is unlikely to be due to stochasticity in sampling. Larger samples from more populations will be needed to confirm this result.
The birdsong phrases we analysed in our study were assigned to types by humans. We cannot know whether humans perceive phrases in the same way that birds do. When birds produce longer phrases, there may be more opportunities for error than when they produce shorter phrases. If a bird attempts to produce the same phrase type several times, if some of those attempts include errors, and if researchers interpret phrases with and without errors as different types, then we may systematically overestimate the number of long phrase types and underestimate the frequency with which each long phrase type is used in that bird’s repertoire. If researchers are less able to distinguish differences among phrase types when those phrase types are short, and if this leads researchers to merge phrase types that are different for birds, then we may systematically underestimate the number of short phrase types and overestimate the frequency of each short phrase type in birds’ repertoires. In either case, our tests for ZLA will be anticonservative. Thus, misclassification of phrase types by humans could contribute to the negative concordances between phrase duration and frequency of use that we observed in bird populations.
We assessed the concordances between phrase duration and frequency of use in populations and also within individuals. Concordances at the population level sometimes differed qualitatively from those at the individual level. Patterns at the population level might arise if birds have different repertoires, and if some repertoires are more common or if birds with some repertoires are recorded more frequently than others. If individuals develop shorter versions of phrases they use more frequently, as proposed by Zipf [14,15], then we should expect concordances between phrase duration and frequency of use within individuals. Many researchers have studied ZLA in animals at the population level [19,25–31,34,35,40,42], but our results underscore the importance of verifying these patterns at the individual level. Grzybek and Stadlober [63] have made a similar argument for the study of some laws that describe human languages.
The negative concordances between phrase duration and frequency of use that we observed in birdsong were several times weaker than those we observed between word length and frequency of use in written human languages (mean in birdsong vs mean
in written human languages). Token length (i.e., phrase duration in birdsong or the number of characters in written words) was measured differently in the two systems, but we do not believe that differences in the how token lengths were measured explain the greater strength of ZLA in human languages. In studies of human languages, ZLA has been as strong or stronger when word length is measured by spoken duration than when it is measured in letters [21,22]. On the other hand, if token length is a less accurate measure of effort in birdsong than in written human language, then selection for a negative concordance between token length and frequency of use in birdsong might be weaker. We know of no study that has attempted to quantify the effort associated with token length in different communication systems. Alternatively or additionally, negative concordances might be weaker in birdsong than in human language because the function of the tokens differs in the two systems. In human languages, words have lexical meanings. By developing shorter versions of words they use more frequently, users of human language can communicate more efficiently [15]. In birdsong, notes or phrases may not have meanings independent of the notes or phrases themselves [12]. In the context of courtship or territory defence, the primary function of birdsong may be to advertise the quality of the singer, and notes or phrases that are more difficult to produce may indicate higher quality [46,64,65]. If this is true, then it may be impossible to shorten the duration of note types without also changing the message conveyed to listeners. This would disable the mechanism thought to promote ZLA in human languages. In some animals where patterns consistent with ZLA have been identified, the tokens studied are thought to have semantic meanings [26,66], making these communication systems more similar than birdsong to human language.
We used mean individual concordance (i.e., ) as a test statistic in part because it is an intuitive measure of the strength of ZLA in a population. The p-values associated with
for each population measure the strength of evidence, but p-values by themselves are not evidence of strength. Even a weak negative concordance can be strongly significant if the population repertoire is large, and even a strong concordance can be nonsignificant if the population repertoire is small (Fig 3). Researchers who have studied ZLA in animals or in human languages have sometimes not reported strengths of concordance, and this may inhibit comparisons among systems and synthesis across studies. We urge authors of future work to report
, or simply
if they study ZLA at the population level, along with the p-values that evaluate its significance.
Detecting ZLA is difficult when note or phrase repertoires are small, and researchers may wonder how large a repertoire is needed to provide statistically significant evidence for a plausible strength of ZLA in a bird population. Fig 3 provides a first pass at answering this question. The precise repertoire size needed to provide significant evidence for ZLA in a population will also depend on the number of birds studied, the number of note or phrase types used by each bird, and how those notes or phrases are shared in the population, and will therefore be species- or even population-specific. However, inferences about ZLA in birdsong in general can be made using the strengths of concordance (i.e., the s) measured in populations, even if those
s are not individually statistically significant. Therefore, we encourage researchers to compute and report
for bird populations they study even when population repertoire sizes are small.
In birds, alarm calls may offer an appealing context for studying ZLA. In some bird species, the phrases that make up alarm calls appear to have lexical meanings. For example, different phrases can indicate different predator types [67–69]. The phrases used in alarm calls can differ among populations and change within populations over time [70]. If birds can create shorter versions of alarm calls for predators they encounter frequently, then ZLA in alarm calls may become strong. Thus, studies of the concordance between phrase duration and frequency of use in alarm calls in bird populations under different predation pressures may reward effort. This work will require data on the durations of multiple phrase types used in alarm calls, and on the frequency with which those phrase types are used in multiple populations. Obtaining frequencies of use for phrase types at the population level will require random sampling of population repertoires across all conditions that those populations encounter (e.g., soundscape recordings), and not just focal sampling under favorable recording conditions as in Bird-DB. We know of no system for which appropriate and sufficient data currently exists.
Identifying patterns consistent with ZLA in birdsong, and quantifying those patterns if they exist, may require studying the songs of many different bird populations and species. This requires songs that can be attributed to individual animals, where notes or phrases have been classified to type, and where the durations of notes have been measured. Annotated birdsong data already exists for many species, and automated note or phrase classification (e.g., [54,55]) may make such data easier to collect in the future. The ZLAvian package we introduce here will allow researchers who collect or maintain these data sets to test quickly and easily for evidence of ZLA. In this way, our work offers the opportunity to expand our understanding of ZLA and of the similarities and differences between birdsong and human language.
Supporting information
S1 Table. Evidence for ZLA in 11 populations of 7 bird species with songs archived on Bird-DB, computed with durations represented by medians rather than means.
https://doi.org/10.1371/journal.pcbi.1013228.s001
(PDF)
S1 Appendix. Text, figure, and table illustrating how the standard deviation of phrase type durations scales with the mean.
https://doi.org/10.1371/journal.pcbi.1013228.s002
(PDF)
S2 Appendix. Robustness analysis showing how the inferred relationship between phrase type duration and frequency of use depends on plausible types of phrase classification errors.
https://doi.org/10.1371/journal.pcbi.1013228.s003
(PDF)
S3 Appendix. Robustness analysis showing qualitatively similar results when we use phrases as catalogued on Bird-DB or individual notes as tokens when assessing ZLA.
https://doi.org/10.1371/journal.pcbi.1013228.s004
(PDF)
References
- 1. Hyland Bruno J, Jarvis ED, Liberman M, Tchernichovski O. Birdsong learning and culture: analogies with human spoken language. Annu Rev Linguist. 2021;7(1):449–72.
- 2. Mori C, Wada K. Songbird: a unique animal model for studying the molecular basis of disorders of vocal development and communication. Exp Anim. 2015;64(3):221–30. pmid:25912323
- 3. Saito N, Maekawa M. Birdsong: the interface with human language. Brain Dev. 1993;15(1):31–9. pmid:8338209
- 4. Doupe AJ, Kuhl PK. Birdsong and human speech: common themes and mechanisms. Annu Rev Neurosci. 1999;22:567–631. pmid:10202549
- 5. Jarvis ED. Evolution of vocal learning and spoken language. Science. 2019;366(6461):50–4. pmid:31604300
- 6. Lipkind D, Geambasu A, Levelt CC. The development of structured vocalizations in songbirds and humans: a comparative analysis. Top Cogn Sci. 2020;12(3):894–909. pmid:30761767
- 7. Bolhuis JJ, Okanoya K, Scharff C. Twitter evolution: converging mechanisms in birdsong and human speech. Nat Rev Neurosci. 2010;11(11):747–59. pmid:20959859
- 8. Fisher SE, Scharff C. FOXP2 as a molecular window into speech and language. Trends Genet. 2009;25(4):166–77. pmid:19304338
- 9. Miller JE, Hafzalla GW, Burkett ZD, Fox CM, White SA. Reduced vocal variability in a zebra finch model of dopamine depletion: implications for Parkinson disease. Physiol Rep. 2015;3(11):e12599. pmid:26564062
- 10. Moorman S, Ahn J-R, Kao MH. Plasticity of stereotyped birdsong driven by chronic manipulation of cortical-basal ganglia activity. Curr Biol. 2021;31(12):2619–2632.e4. pmid:33974850
- 11. Panaitof SC. A songbird animal model for dissecting the genetic bases of autism spectrum disorder. Dis Markers. 2012;33(5):241–9. pmid:22960335
- 12. Berwick RC, Okanoya K, Beckers GJL, Bolhuis JJ. Songs to syntax: the linguistics of birdsong. Trends Cogn Sci. 2011;15(3):113–21. pmid:21296608
- 13.
Bentz C, Ferrer-i-Cancho R. Zipf’s law of abbreviation as a language universal. In: Jäger G, Yanovich I, editors. Proceedings of the Leiden Workshop on Capturing Phylogenetic Algorithms for Linguistics. University of Tübingen; 2016.
- 14.
Zipf GK. Human Behavior and the Principle of Least Effort. Cambridge, MA, USA: Addison-Wesley Press; 1949.
- 15.
Zipf GK. The psycho-biology of language: an introduction to dynamic philology. Boston: Houghton Mifflin Company; 1935.
- 16. Kanwal J, Smith K, Culbertson J, Kirby S. Zipf’s law of abbreviation and the principle of least effort: language users optimise a miniature lexicon for efficient communication. Cognition. 2017;165:45–52. pmid:28494263
- 17. Ferrer-i-Cancho R, Bentz C, Seguin C. Optimal Coding and the origins of Zipfian Laws. J Quant Linguist. 2020;29(2):165–94.
- 18. Linders GM, Louwerse MM. Zipf’s law revisited: Spoken dialog, linguistic units, parameters, and the principle of least effort. Psychon Bull Rev. 2023;30(1):77–101. pmid:35840837
- 19. Ferrer-i-Cancho R, Hernández-Fernández A. The failure of the law of brevity in two new world primates. Statistical caveats. Glottotheory. 2013;4(1).
- 20. Petrini S, Casas-i-Muñoz A, Cluet-i-Martinell J, Wang M, Bentz C, Ferrer-i-Cancho R. Direct and indirect evidence of compression of word lengths. Zip’s law of abbreviation revisited. Glottometrics. 2023;54:58–87.
- 21. Hernández-Fernández A, G. Torre I, Garrido J-M, Lacasa L. Linguistic laws in speech: the case of Catalan and Spanish. Entropy. 2019;21(12):1153.
- 22. Torre IG, Luque B, Lacasa L, Kello CT, Hernández-Fernández A. On the physical origin of linguistic laws and lognormality in speech. R Soc Open Sci. 2019;6(8):191023. pmid:31598263
- 23. Koshevoy A, Miton H, Morin O. Zipf’s Law of Abbreviation holds for individual characters across a broad range of writing systems. Cognition. 2023;238:105527. pmid:37364507
- 24. Shu H, Chen X, Anderson RC, Wu N, Xuan Y. Properties of school Chinese: implications for learning to read. Child Dev. 2003;74(1):27–47. pmid:12625434
- 25.
Kang TS. Linguistic laws and compression in a comparative perspective: a conceptual review and phylogenetic test in mammals. Durham: Durham University; 2021.
- 26. Semple S, Hsu MJ, Agoramoorthy G. Efficiency of coding in macaque vocal communication. Biol Lett. 2010;6(4):469–71. pmid:20106854
- 27. Clink DJ, Ahmad AH, Klinck H. Brevity is not a universal in animal communication: evidence for compression depends on the unit of analysis in small ape vocalizations. R Soc Open Sci. 2020;7(4):200151. pmid:32431905
- 28. Heesen R, Hobaiter C, Ferrer-i-Cancho R, Semple S. Linguistic laws in chimpanzee gestural communication. Proc R Soc B-Biol Sci. 2019;286(1896):20182900.
- 29. Huang M, Ma H, Ma C, Garber PA, Fan P. Male gibbon loud morning calls conform to Zipf’s law of brevity and Menzerath’s law: insights into the origin of human language. Anim Behav. 2020;160:145–55.
- 30. Bezerra BM, Souto AS, Radford AN, Jones G. Brevity is not always a virtue in primate communication. Biol Lett. 2011;7(1):23–5. pmid:20573617
- 31. Ferrer‐i‐Cancho R, Lusseau D. Efficient coding in dolphin surface behavioral patterns. Complexity. 2008;14(5):23–5.
- 32. Arnon I, Kirby S, Allen JA, Garrigue C, Carroll EL, Garland EC. Whale song shows language-like statistical structure. Science. 2025;387(6734):649–53. pmid:39913578
- 33. Youngblood M. Language-like efficiency in whale communication. Sci Adv. 2025;11(6):eads6014. pmid:39908378
- 34. Luo B, Jiang T, Liu Y, Wang J, Lin A, Wei X, et al. Brevity is prevalent in bat short-range communication. J Comp Physiol A Neuroethol Sens Neural Behav Physiol. 2013;199(4):325–33. pmid:23377576
- 35. Demartsev V, Gordon N, Barocas A, Bar-Ziv E, Ilany T, Goll Y, et al. The “Law of Brevity” in animal communication: Sex-specific signaling optimization is determined by call amplitude rather than duration. Evol Lett. 2019;3(6):623–34. pmid:31867122
- 36. Hailman JP, Ficken MS, Ficken RW. The ‘chick-a-dee’ calls of Parus atricapillus: a recombinant system of animal communication compared with written English. Semiotica. 1985;56(3–4).
- 37. Ferrer-i-Cancho R, Hernández-Fernández A, Lusseau D, Agoramoorthy G, Hsu MJ, Semple S. Compression as a universal principle of animal behavior. Cogn Sci. 2013;37(8):1565–78. pmid:23895349
- 38. Mol C, Chen A, Kager RWJ, Ter Haar SM. Prosody in birdsong: a review and perspective. Neurosci Biobehav Rev. 2017;81(Pt B):167–80. pmid:28232050
- 39. Conner RN. Vocalizations of common ravens in Virginia. The Condor. 1985;87:379–88.
- 40. Favaro L, Gamba M, Cresta E, Fumagalli E, Bandoli F, Pilenga C, et al. Do penguins’ vocal sequences conform to linguistic laws? Biol Lett. 2020;16(2):20190589. pmid:32019463
- 41. Lewis RN, Kwong A, Soma M, de Kort SR, Gilman RT. Java sparrow song conforms to Mezerath’s law but not to Zipf’s law of abbreviation. bioRXiv. 2023.
- 42. Youngblood M. Language-like efficiency and structure in house finch song. PsyArXiv. 2023.
- 43. Rehsteiner U, Geisser H, Reyer H. Singing and mating success in water pipits: one specific song element makes all the difference. Anim Behav. 1998;55(6):1471–81. pmid:9641992
- 44. Vallet E, Beme II, Kreutzer M. Two-note syllables in canary songs elicit high levels of sexual display. Anim Behav. 1998;55(2):291–7. pmid:9480696
- 45. Weiss M, Kiefer S, Kipper S. Buzzwords in females’ ears? The use of buzz songs in the communication of nightingales (Luscinia megarhynchos). PLoS One. 2012;7(9):e45057. pmid:23028759
- 46. Suthers RA, Vallet E, Kreutzer M. Bilateral coordination and the motor basis of female preference for sexual signals in canary song. J Exp Biol. 2012;215(Pt 17):2950–9. pmid:22875764
- 47. Lewis RN, Soma M, de Kort SR, Gilman RT. Like father like son: cultural and genetic contributions to song inheritance in an estrildid finch. Front Psychol. 2021;12:654198. pmid:34149539
- 48. Lewis RN, Kwong A, Soma M, de Kort SR, Gilman RT. Inheritance of temporal song features in Java sparrows. Anim Behav. 2023;206:61–74.
- 49. James LS, Mori C, Wada K, Sakata JT. Phylogeny and mechanisms of shared hierarchical patterns in birdsong. Curr Biol. 2021;31(13):2796–2808.e9. pmid:33989526
- 50. Fehér O, Wang H, Saar S, Mitra PP, Tchernichovski O. De novo establishment of wild-type song culture in the zebra finch. Nature. 2009;459(7246):564–8. pmid:19412161
- 51. Greig EI, Taft BN, Pruett-Jones S. Sons learn songs from their social fathers in a cooperatively breeding bird. Proc R Sci B-Biol Sci. 2012;279(1741):3154–60. pmid:22593105
- 52.
Hanley ML. Word index to James Joyce’s Ulysses. Madison, Wisconsin: University of Wisconsin Press; 1937.
- 53. Tanimoto AM, Hart PJ, Pack AA, Switzer R. Vocal repertoire and signal characteristics of ‘Alalā, the Hawaiian Crow (Corvus hawaiiensis). Wilson J Ornithol. 2017;129(1):25–35.
- 54. Cohen Y, Nicholson DA, Sanchioni A, Mallaber EK, Skidanova V, Gardner TJ. Automated annotation of birdsong with a neural network that segments spectrograms. Elife. 2022;11:e63853. pmid:35050849
- 55. Kaewtip K, Alwan A, O’Reilly C, Taylor CE. A robust automatic birdsong phrase classification: a template-based approach. J Acoust Soc Am. 2016;140(5):3691. pmid:27908084
- 56. Lachlan RF, Ratmann O, Nowicki S. Cultural conformity generates extremely stable traditions in bird song. Nat Commun. 2018;9(1):2417. pmid:29925831
- 57.
R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2023.
- 58. Arriaga JG, Cody ML, Vallejo EE, Taylor CE. Bird-DB: A database for annotated bird song sequences. Ecol Inform. 2015;27:21–5.
- 59. Valz PD, McLeod AI. A simplified derivation of the variance of Kendall’s rank correlation coefficient. Am Stat. 1990;44(1):39–40.
- 60. Bates D, Maechler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1–48.
- 61. Shahar DJ. Minimizing the variance of a weighted average. Open J Stat. 2017;07(02):216–24.
- 62. Makowski D, Ben-Shachar MS, Chen SHA, Lüdecke D. Indices of effect existence and significance in the Bayesian framework. Front Psychol. 2019;10:2767. pmid:31920819
- 63.
Grzybek P, Stadlober E. Do we have problems with Arens’ law? A new look at the sentenceword relation. In: Grzybek P, Köhler R, editors. Exact Methods in the Study of Language and Text: Dedicated to Gabriel Altmann on the Occasion of his 75th Birthday. Berlin, Boston: De Gruyter Mouton; 2007.
- 64. Ballentine B. Vocal performance influences female response to male bird song: an experimental test. Behav Ecol. 2004;15(1):163–8.
- 65.
Podos J, Sung HC. Vocal performance in songbirds: from mechanisms to evolution. In: Sakata JT, Woolley SC, Fay RR, Popper AN, editors. The Neuroethology of Birdsong. Springer Handbook of Auditory Research. Cham: Springer; 2020. pp. 245–68.
- 66. Hsu MJ, Chen L-M, Agoramoorthy G. The vocal repertoire of Formosan macaques, Macaca cyclopis: acoustic structure and behavioral context. Zool Stud. 2005;44(2):275–94.
- 67. McLachlan JR, Magrath RD. Speedy revelations: how alarm calls can convey rapid, reliable information about urgent danger. Proc R Soc B-Biol Sci. 2020;287(1921):20192772. pmid:32070259
- 68. Suzuki TN. Parental alarm calls warn nestlings about different predatory threats. Curr Biol. 2011;21(1):R15-6. pmid:21215927
- 69. Suzuki TN. Communication about predator type by a bird using discrete, graded and combinatorial variation in alarm calls. Anim Behav. 2014;87:59–65.
- 70. Tanimoto AM, Hart PJ, Pack AA, Switzer R, Banko PC, Ball DL, et al. Changes in vocal repertoire of the Hawaiian crow, Corvus hawaiiensis, from past wild to current captive populations. Anim Behav. 2017;123:427–32.