Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Convergent evolution in a large cross-cultural database of musical scales

Abstract

Scales, sets of discrete pitches that form the basis of melodies, are thought to be one of the most universal hallmarks of music. But we know relatively little about cross-cultural diversity of scales or how they evolved. To remedy this, we assemble a cross-cultural database (Database of Musical Scales: DaMuSc) of scale data, collected over the past century by various ethnomusicologists. Statistical analyses of the data highlight that certain intervals (e.g., the octave, fifth, second) are used frequently across cultures. Despite some diversity among scales, it is the similarities across societies which are most striking: step intervals are restricted to 100-400 cents; most scales are found close to equidistant 5- and 7-note scales. We discuss potential mechanisms of variation and selection in the evolution of scales, and how the assembled data may be used to examine the root causes of convergent evolution.

Introduction

Music, like language, can be described as a generative grammar consisting of basic building blocks, and rules on how to combine them [1, 2]. In melodies, these basic units are usually specified by two quantities: frequency and duration. We generally refer to this basic pitch unit as a note, and a set of notes as a scale. Thus, as far as pitch is concerned, a scale is to a melody what an alphabet is to writing. Despite their centrality to music and apparent ubiquity, we know surprisingly little about scales. Most studies focus on scales from a limited number of musical traditions [3, 4], and the main statistical findings are that scales are non-equidistant, and have 7 or fewer notes [5, 6]. There are many anecdotal reports that certain notes are commonly used [3, 4, 710], but this has only been quantitatively examined in one small cross-cultural sample [11]. Thus, we lack concrete understanding of the fundamental questions of why humans use scales, how diverse they are, or how they came to be that way. We suspect that this is simply due to a lack of suitable resources. Here, we address this problem by presenting and analyzing a data set of musical scales from many societies, extant and extinct, built upon a century of ethnomusicological enterprise.

Let us begin by clarifying a few key terms and ideas. We define a scale simply as a sequence of unique notes (Fig 1A). Notes are pitch categories described by a single pitch, but as pitch is continuous, notes are more realistically described as regions of semi-stable pitch centered around a representative (e.g., mean, median) frequency [12, 13]. However, humans process relative frequency much better than absolute frequency [14], so scales are practically specified by the intervals (frequency ratios) between notes. Thus, a scale is usually defined by the ratios made between all notes and the first note, called the tonic; we hereafter refer to these as scale notes. Intervals are determined by their frequency ratios, but since we perceive pitch logarithmically [15, 16], they are naturally measured in units of cents—obtained by a base-2 logarithm of the ratio of frequencies f1 and f2, cents = 1200 × log2 f1/f2. In what follows, steps will describe the intervals between adjacent notes, and intervals can refer to the interval between any pair of notes.

thumbnail
Fig 1. Basic notions.

A: Illustration of relevant terms: Notes in a scale can be represented symbolically (e.g., letters for notes), or quantitatively. As examples, we show: the Western major scale in the key of C on a piano (top); and the corresponding intervals in cents and as frequency ratios (bottom; in 5-limit just intonation tuning). We show three types of intervals: a scale note, a step, and an interval that is neither a note or a step. B: Venn diagram indicating how scale data can be classified.

https://doi.org/10.1371/journal.pone.0284851.g001

We explicitly define an important sub-class of scales, octave scales, which span an octave (frequency ratio 2:1; 1,200 cents) and have circular (or helical) structure such that notes repeat at every octave [17]; e.g., Fig 1A shows that the first and last notes (an octave apart) of such a scale have the same name—a property is called octave equivalence). It is often reported that octave equivalence is one of the most universal features of music [3, 4, 710]. However, this has been disputed [13, 16, 1820], and statistical study of octave prevalence is still lacking. Octaves are indeed salient in the harmonic spectra [21, 22]. Yet, experiments have indicated that octave equivalence is only a weak perceptual phenomenon [2326], that depends heavily on musical training [2729], and culture [16]. Nonetheless, several measurable phenomena related to harmonic tones may lead to preferential use of the octave: tonal fusion [26, 30, 31], hearing in noise [32], and memorability of complex tones [33]. For example, increased tonal fusion between octave intervals (likewise for fourths, fifths, which also undergo tonal fusion) may give rise to greater perception of synchrony when singing in parallel [34].

To address longstanding questions on scale diversity and evolution, we first document the creation of a database of scales. We then analyse the empirical scales to see which intervals are significantly frequent or infrequent, providing robust statistical evidence that the octave is prevalent among many societies. We examine the cross-cultural diversity of octave scales, finding that within-society variation can in some cases be as great as the total variation in scales—scales are surprisingly not that diverse. In particular, across all geographical regions step intervals are limited to 100–400 cents, and scales are clustered around 5- and 7-note equidistant scales (scales where step sizes are similar in size). As a result of these restrictions, humans tend to use only ∼1% of possible octave scales, which strongly suggests some degree of convergent evolution due to shared biases (in addition to convergence via cultural diffusion). Finally, we discuss the potential mechanisms of change and selection of scales, discuss the challenges in understanding scales evolution, and propose credible future directions for studying this evolution.

Materials and methods

Database curation

A total of 60 books, journals, and other ethnomusicological sources were found to have relevant data on scales (S1 Table in S1 File) [13, 3593]. We note one previous attempt to create a database of scales from the ethnomusicological literature [94], however there are no details on its construction, and it does not link scales directly to original sources. Our database improves on these issues through a stringent methodology (described below), sources for all scales and is presented as a digitized, open-access, resource which will continue to grow as new data is made available.

We can define scales (Fig 1B) either prescriptively (“these are the notes you can use in a melody”) or descriptively (“these are the notes that were used in the melody”). Theory scales consist of intervals with idealized, exact frequency ratios—they lack the random fluctuations that exist in the real world. These are mainly found in a limited set of cultures that exist along the old Silk Road route, and they are not necessarily played according to these theoretical ideals [93, 95]. Absence of theory scales does not mean that other societies lack musical theory, which can be found either explicitly through language, or implicitly through an understanding of what is ‘correct’ and ‘incorrect’ in a particular style [2, 96100]. Theory scales are by definition prescriptive scales, although descriptive scales can be found which closely match these. Measured scales are obtained where measurements have been made of the notes on an instrument, or a recording of a song has been analysed with computational tools to extract a scale. Instrument tunings are by default prescriptive, but can be descriptive if all notes are used in a melody. There is some error in these measurements—taken with tools that include tuning forks, the Stroboconn, and modern computational approaches—but it is always within 10 cents. Measured scales taken from song recordings are exclusively descriptive, and they make up the smallest part of the database. This is because it is still quite a challenge to reliably infer scales from a recording of a performance using algorithms [76], and thus it requires extensive manual labor. Still, we believe that the future of studies on musical scales lies in tackling these challenges, due to the prospect of extracting descriptive scales from archives of ethnographic recordings [101104]. However, it is clear that the appropriate computational tools to perform such large-scale analyses are still lacking [105107].

Another statistical regularity in musical pitch is tonalityi.e., the distribution of notes, how they are played (duration, ornamentation), and their transition probabilities [109, 110]. Of the various aspects of tonality, tonal hierarchies (probability distribution of notes) [111] have been best documented across cultures [73, 112115]. Tonality (or modality) is in some cases an integral part of a scale that determines how the notes should be played [116], to the extent that musicians can reliably tell apart two ragas from the duration of a single note [117]. We emphasise that our definition of scale does not imply any specific tonal hierarchy, while acknowledging that similar definitions (e.g., maqam, raga) are often used with the expectation that they include tonality [100]. In the database, we account for tonality in a limited way by noting the order of notes in the scale, starting from the first note (tonic). However, this information is often unavailable, and we lack the finer details of note distribution or transition probabilities. Future cross-cultural analyses of recordings ought to document these important details [116, 118].

To enable a broad range of analyses we collected information pertaining to, where applicable, (i) the society (country, language or ethnic group, musical tradition), (ii) geography (country, geographic region) (Fig 2), (iii) instrument type, (iv) tonic note, and (v) whether the scale is measured using harmonic or melodic intervals. We additionally linked societies, where appropriate, to identifiers from other ethnographic and linguistic databases, such as D-PLACE and Glottolog [119, 120]. D-PLACE contains information on social structure, economic, and environmental information, while Glottolog can be used to determine the linguistic distance between two groups—a common proxy for cultural similarity [121]. Some musical traditions that span multiple countries (such as Western classical music) are also taken as a unit of society. Identification of the society or geographic origins of a scale was required for inclusion of scale data. Other details were found at varying frequency; e.g., tonic was only identified in 121 out of 434 measured scales.

thumbnail
Fig 2. Breakdown of scales in the database according to: Geographical area; theoretical scales or measured scales.

The map shows the geographic origin of the measured scales; theory scales are not included since they are not always associated a single country; marker size indicates sample size. Made with Natural Earth [108].

https://doi.org/10.1371/journal.pone.0284851.g002

Inferring octave scales

The database exists in two forms: the raw data (434 theory scales, and 462 measured scales), and a set of octave scales that is generated procedurally from the raw data according to five choices. For a complete workflow from source to database, including examples, and the choices we made to generate octave scales, see the supplementary materials(S1 Appendix, S1-S3 Figs and S1, S2 Tales in S1 File).

In total, we infer 896 octave scales (434 theory octave-scales, 384 octave-scales from instrument tunings, and 78 octave-scales from song recordings), from 73 societies. The theory scales span 6 regions, while the measured scales span 8 regions, and 46 countries (Fig 2). When inferring octave scales from measured scales, a principle assumption is that they add up to an octave. Since the validity of the assumption is not clear, we first study the statistics of the measured scales before studying octave scales.

Results

One of our goals is to estimate whether certain notes and intervals appear more or less frequently than expected by chance. This is hard to quantify, since we lack a universally-correct method of calculating the probability of observing an interval. Instead, we propose three independent statistical models that reflect different ways of constructing scales:

Model: Lognorm

This model assumes that scale notes S are chosen independently from a lognormal distribution, P(S) = lnN(μ, σ2). The rationale behind this choice is that small intervals should be uncommon (P(S) → 0 as S → 0) due to limits in pitch perception, and large intervals should be uncommon (P(S) → 0 as S → ∞) due to physical constraints (human anatomy; instrument size). We expect that this model is inappropriate for within-scale analyses, since notes within a scale are unlikely to be chosen independently of each other, but rather according to specific choices made by a musician. However, if the choices made by many independent musicians are sufficiently diverse, then the lognormal distribution may be an appropriate description of the between-scales note distribution.

Model: Shuffle

This model assumes that specific step sizes are important, but their arrangement is not. Normally, both step size and order determine scale notes. But now consider a musician who cares only about the step sizes, not their order in which they are arranged. This can be approximated by sampling from the original data and reshuffling the order of step sizes.

Model: Resample

This model assumes that step sizes are chosen independently from a distribution, and arranged randomly into scales. This is equivalent to a musician who is indifferent to both the step size, and the order. A reasonable choice of a distribution in this case is the posterior distribution of all step sizes.

In any alternatively-sampled set of scales, the number of scales and the number of notes in these scales matches the original. In the following sections, we will use these three models to, first, compare intervals between scales, and second compare intervals within scales, searching for evidence of statistically significant counts of intervals.

Statistically significant intervals between scales

To search for statistically significant (due to an abundance or dearth of) intervals, we first plot the distribution of scale notes (Fig 3A, Original) and step sizes (Fig 3B, Original) for a sub-sample including only measured scales, without imposing octave equivalence, and controlling for society (214 scales sampled from a total of 434 scales, with no more than 5 per society, resampled 1,000 times to achieve convergence; S4 Fig in S1 File). We compare this empirical distribution to the maximum-likelihood lognormal distribution (Fig 3A, Lognorm), and the corresponding step size distribution (Fig 3B, Lognorm). As expected, we find that scales drawn from the lognormal distribution have step sizes that do not resemble real scales—since notes within a scale are not independent. Instead, notes within scales are more evenly distributed, such that step sizes are peaked at ∼200 cents, and rarely smaller than 100 cents (Fig 3B); this is found in every geographical region that we investigated (S5 Fig in S1 File). However, scale notes from different scales are independent of each other, resulting in a distribution that is approximately lognormal when many scales are sampled (Fig 3A).

thumbnail
Fig 3. Between-scales distributions of step sizes and scale notes.

A: Distribution of notes in measured scales (blue; histograms are shown as lines; bin size = 30 cents). Maximum likelihood lognormal distribution fitted to notes distribution (black). Distribution of notes obtained via alternative sampling: shuffling the step sizes within scales (orange); resampling step sizes from the full distribution of step sizes (green). The x-axis is truncated at 3,000 cents for clarity. B: Distribution of step sizes (blue). Distribution of step sizes in a set of scales with notes generated independently from a lognormal distribution. C-E: Probability that the counts observed in the original data were generated by the lognormal distribution (C), the shuffled scales distribution (D), or the resampled scales distribution (E). Empty circles indicate that the interval is found more than chance, while filled circles indicate the opposite. Dotted line indicates a p value of 0.05 after applying a Bonferroni correction. Dashed lines indicate the values of 200 and 700 cents. Shaded region shows bootstrapped 95% confidence intervals.

https://doi.org/10.1371/journal.pone.0284851.g003

The utility of fitting a lognormal distribution to the data is that it can serve as a null-hypothesis baseline for estimating the probability that a scale note appears more or less than chance. To this end, we integrate the lognormal distribution over the range of each histogram bin i in Fig 3A to get the probability of observing scale notes, pi. We then calculate the binomial probability, , where ki is the number of observations in bin i, and n is the total number of observations. We report either: the probability that ki or higher is observed if ki/n > pi, ; or else, the probability that ki or less is observed, , if ki/npi. Low probability implies significant deviation from the null lognormal hypothesis. We see that only a few intervals deviate significantly from the lognormal distribution (Fig 3C): 1,200, 700, 200 and 2,400 cents are found more frequently, while 600 cents is found less frequently than chance. This is strong evidence that the octave (and the fifth) are important intervals in many societies.

To corroborate these findings, we repeat the significance test with different assumptions on how scales are generated. We first repeatedly shuffle the step sizes in each scale to generate new scales, to examine whether the statistically significant intervals would arise if step sizes were ordered randomly (Fig 3D). Similarly, by resampling with new step sizes, we can test whether the significant intervals could plausibly have been produced by arranging randomly selected step sizes from the distribution in Fig 3B (Fig 3E). These analyses demonstrate that the peak at 200 cents (Fig 3C) is due to it being the most common step size. The values of 600 and 2,400 cents are found to be less significant than in Fig 3C, but are still much more significant than most intervals. The fact that the peak at 700 cents is significant in Fig 3D but not in Fig 3E is likely due to the presence of equidistant scales (S6 Fig in S1 File), where the order of the intervals is mostly irrelevant. By this logic, one might expect the peak at 1,200 cents to disappear in Fig 3D, but it survives because a large fraction of equidistant scales do not extend beyond the octave (S6 Fig in S1 File). Remarkably, the octave is extremely significant according to all three tests.

Statistically significant intervals within scales

The previous method was used to assess whether intervals are found more than expected by chance across a collection of scales. Since grouping together scales from different societies might be problematic, we require a test that can discriminate unusually frequent / rare intervals within individual scales. To this end, we compare the original scales with many alternative scales that are created by shuffling the original scales’ step sizes. We use the full range of intervals that can be generated with an instrument, not just the scale notes. For an instrument with N notes, this results in N × (N − 1)/2 intervals. By sampling many more intervals, this test is powerful enough to detect, in some cases, significant signals within individual scales.

To test for statistical significance of an interval I, we find all intervals in a scale that fall within w = 100 cents of I, and calculate their distance from I. We repeat this process for 50 shuffled versions of the scale, collecting all intervals within w of I in a second group. We then use a Mann-Whitney U test to examine whether either set of intervals, original or shuffled, is significantly closer (i.e., p < 0.05) to the octave than the other (repeating 100 times to get a converged average). We demonstrate this procedure for I = 1,200 cents (Fig 4A): in most scales the original intervals are closer to an octave than the shuffled scales, although only a fraction of results are significant. Reasons for non-significant results include small sample sizes and the tendency of equidistant scales to produce similar intervals when shuffled.

thumbnail
Fig 4. Within-scales tests of interval significance.

A: Mean deviation of original intervals from the octave compared to shuffled intervals for 162 measured scales; colour indicates statistical significance; black line indicates x = y. B: Fraction of tested scales in which intervals are found to be significantly closer or further from a particular size than expected by chance. Null distribution (grey region) is obtained by repeating the analysis with resampled sets of scales. C: Fraction of significant results indicating that the intervals are closer or further to the octave than chance, against intonation error σ, in a test set which maximises the number of octave intervals. Dotted line indicates the point at which 35% of results are significant. Shaded region shows bootstrapped 95% confidence intervals.

https://doi.org/10.1371/journal.pone.0284851.g004

We extend this analysis to all intervals over the range 200 ≤ I ≤ 2600 cents, showing the fraction of significant results indicating an interval is found more or less frequently than chance (Fig 4B). Without a doubt, the most significant interval is the octave (35% significantly close), and the intervals that are most significantly avoided in scales are the regions flanking the octave. In further agreement with the between-scales analysis, the next most significant regions are those around 500, 600 cents, and 700 cents.

To put these results in perspective, we repeat the test on sets of scales generated by resampling step sizes from Fig 3B (Fig 4B, Null). The null distribution converges to 5% as expected, which clearly demonstrates that the high fractions of significant results in the original scales are not artefacts due to testing multiple hypotheses. Additionally, we show that the results do not depend on our choice of w (S7 Fig in S1 File). The consistency of these test results reinforces the conclusion that these significant intervals are not chosen randomly.

Effect of tuning variability on the search for universal intervals

Observation of significant intervals likely reflects the intentions of the musician to tune to these intervals. However, non-significant results do not necessarily indicate lack of such intentions, as imprecision in tuning (and tuning measurements) may result in false negatives. To understand the impact of imprecision, we generate a test set of scales (by sampling step sizes from Fig 3B) and fix all scale notes that are ≥1,200 cents to be exactly an octave higher than one of the notes that are ≤1,200 cents. We then add to the test sets normally-distributed noise, N(μ = 0, σ2). Even without noise (σ = 0), this test can find significant results only 90% of the time (Fig 4C), which demonstrates the difficulty in inferring intentionality simply due to low sample sizes.

To estimate reasonable bounds on the noise σ, one may first consider that the measurement error varies from about 1 cents for computational methods (e.g., Stroboconn) [122], to 5 cents for tuning forks. Also, instruments vary in the stability of their notes: e.g., humans typically sing with a standard deviation of at least about 10–20 cents [12, 13, 114, 123127]. However, the main source of error is perception: humans make mistakes in discriminating intervals below ∼100 cents (although we lack replications with non-Western participants) [128131]. Thus, we can expect a reasonable upper bound to the proportion of significant results that can be detected of about 30 to 70%. Thus, when we find the octave to occur significantly more than chance about 35% of the time (Fig 4B), this is a rate that is within the range of the hypothetical maximum.

Ultimately, it is hard to say how exactly scales are chosen. Were some important intervals fixed first (e.g., the octave), and then the rest chosen to fill the gaps? Or were step sizes of a certain size (e.g., big and small size categories) chosen, and then arranged in some preferred order [132]? What we have conclusively shown is that independently of how scales are created, they show a significant, consistent bias towards including some notes, and avoiding others.

Qualitative evidence for preferential use of octaves

A more direct route to understanding the intentions of musicians is through detailed ethnography. We therefore examined each source for qualitative evidence indicating preferential use (or absence) of the octave (SI Dataset 1). We found qualitative evidence in support of octave use in 26 out of 60 sources: use of the octave to tune instruments (8 sources); performing melodies in parallel octaves (11 sources); using the same name for notes an octave apart (10 sources), which is also strong evidence for octave equivalence. We also find quantitative evidence (statistically significant octaves for at least one scale; Fig 4A) in 26 sources; in total, 40 sources contain either quantitative or qualitative evidence. For 6 of the remaining 20 sources, there is evidence from secondary sources for preferential use of the octave in those cultures; another 3 sources report on archaeological findings, which are difficult to investigate further [48, 57, 63].

Two sources (Georgian polyphonic singing) provided qualitative evidence that fifths are perceptually important, instead of octaves [13, 89]; however, we found statistically significant octaves in one of these sources, so the evidence here is mixed [13]. A third source (Colombian marimba) provided quantitative evidence that octaves were found significantly less than chance [85]. Overall, we find some evidence (either primary or secondary) in support of octaves being significant in 46 out of 60 sources, and evidence to the contrary in 3 sources. These findings strongly support the view that octaves are widespread, but not absolutely universal [19, 20].

Statistics of octave scales

After verifying that preferential use of octaves is indeed widespread, we proceed to studying octave scales; we infer octave scales by making some assumptions (e.g., octave equivalence; see section “Inferring octave scales”). While in the previous section we exclusively examined measured scales, here we look at a mix of theory and measured scales. Data is shown for four samples of scales: (i) all theory scales, (ii) all measured scales, (iii) sub-sample controlling for society (‘SocID’, no more than 5 scales per society), and (iv) a sub-sample controlling for geographical region (‘Region’, no more than 10 scales per region).

Most scales are found to have 7 or fewer notes (Fig 5A), in agreement with previous work [6]. 6-note scales are remarkably rare across all sampling schemes. The predominance of step sizes of ∼200 cents (Fig 3B) is also seen in octave scales (Fig 5B). Consistent with existing literature [2], most steps are between 100–400 cents, and this applies in all geographic regions studied (S5 Fig in S1 File). The most common notes (Fig 5C), in all sub-samples, are exact matches for the significant intervals in Fig 4B (500, 700 cents). The fact that the statistics of octave scales is consistent with the statistics of measured scales (and with previous work), to an extent, validates our methodology for extracting octave scales (SI section “Inferring octave scales”).

thumbnail
Fig 5. Statistics of octave scales.

A: Distributions of the number of notes in a scale for different samples of the scale database: sample balanced by region, sample balanced by culture, theory scales, and measured scales. B-C: Histograms of step sizes (B) and notes in scales (C) for different samples of the scale database: samples balanced by region, samples balanced by society. Whiskers (A) and shading (B,C) indicate bootstrapped 95% confidence intervals. Histogram bins are 20 (B) and 30 (C) cents.

https://doi.org/10.1371/journal.pone.0284851.g005

Theory scales show sharp peaks at intervals close to 12-TET intervals. In contrast, the distributions of steps and notes in measured scales are much more diffuse. This difference is expected, since theory scales consist of mathematically-exact, ideal intervals, while measured scales include natural sources prone to error and variation. There is, however, correspondence between theory and measured scales in the rarely used notes (270, 450, 550, 650, 930 cents), which is even clearer when controlling for the number of notes (S8 Fig in S1 File). Regardless of the number of notes in a scale, we find salient peaks at 200, 500 and 700 cents (S8 Fig in S1 File). Overall, despite some differences between theory and measured scales, both sets are in agreement over the most distinct features: (1) step sizes are usually ∼200 cents, and between 100–400 cents, and (2) the most salient notes are at 500 and 700 cents, while the regions around these notes are avoided in scales.

Variation across societies is comparable to variation within some societies

To quantify variation in scales across societies, we use t-distribution stochastic neighbour embedding (tSNE) to map the scales onto a reduced two-dimensional representation [133]. This method can compare only scales with the same number of notes, so we show results separately for 7-note scales (Fig 6A), and 5-note scales (S9 Fig in S1 File). We group scales into clusters (using the DBSCAN algorithm (eps = 2, minimum samples = 5) [134], and report note distributions and region distributions for the largest four clusters. The tSNE embedding is useful for visualizing diversity among high-dimensional objects, but note that tSNE dimensions are arbitrary. The main information in this plot is the distances between points: distances in the 2d-embedding are non-linearly related to the real distances, such that 2d distance from one scale to all other scales will have a high rank-correlation with the real distances.

thumbnail
Fig 6. Cross-cultural diversity of scales.

A: 2-dimensional embedding of 556 heptatonic scales, labelled with 7 Western diatonic modes (black), 10 North Indian thaat (red), and the four largest clusters (shaded areas). B: Note distributions for each cluster. Black lines are shown for means of each note, and grey shading indicates equidistant scale notes ±30 cents. C: Geographic distribution of each cluster. D-G: Embeddings are labelled with: equidistant scales (D; where notes are on average within 30 cents of the equidistant values), theory vs measured scales (E), Carnatic scales (F), Thai scales (G).

https://doi.org/10.1371/journal.pone.0284851.g006

The simplest and most salient feature is that scales tend to be almost equidistant (Fig 6D). For example, in the largest cluster (Fig 6A and 6B; blue), notes are all within 30 cents of the corresponding notes in the equiheptitonic scale. Although most examples (29 / 40) of this cluster (Fig 6D) come from three countries (Thailand, Guinea, Malawi), in total they are found in 13 countries (e.g., Colombia, Georgia, Zimbabwe, Indonesia). Surrounding this cluster (Fig 6A), we can find examples of theory scales: Western diatonic modes, and North Indian thaat (e.g., green cluster). In general, we see that most equidistant scales are measured scales, but there is also a lot of overlap between theory and measured scales (Fig 6E). Furthermore, when clustering societies by scale similarity, we find two main clusters—one dominated by societies with theory scales, and one dominated by societies with equidistant scales (S10, S11 Figs in S1 File). Examples of scales that are furthest from equiheptatonic include the Gamelan pelog scale (Fig 6A and 6B; yellow cluster), and the Carnatic mela salagam (Fig 6F), which both have many small step sizes in a row.

Surprisingly, there seems to be little variation overall, such that multiple societies that use theory scales exhibit levels of within-society variability comparable to the total variability (Fig 6F). In general, we find that geographical regions tend to contain overlapping subsets of scales (S12A Fig in S1 File), and distances between scales within-regions are of similar magnitudes to distances between scales between-regions (S12B Fig in S1 File). Even Thai scales, which are often cited as exclusively being in equiheptatonic tuning [47, 62, 69, 70]—although this is disputed [68]—exhibit substantial within-society variability (Fig 6F). Overall, there appears to be less variation in 7-note scales than might have been expected, and a similar analysis of 5-note scales reveals similar results (S8 Fig in S1 File), which suggests that they share the same organizing principles.

Statistical analysis shows that scales tend to be equidistant

To put the diversity of scales in a broader context, one may consider a hypothetical universe of possible scales by enumerating them on a grid (grid scales). To this end, we take 20 cents as a basic grid resolution, and enumerate all possible unique scales with step sizes within 60–320 cents; this limitation on step size already reduces the number of possible scales (at 20 cents resolution) by 98%. To conveniently compare grid and real scales, we examine their 2-dimensional embedding. Strikingly, those grid scales that correspond to real scales (i.e., within ±10 cents of a real scale) are clustered around the equiheptatonic scale (Fig 7A, cyan). For comparison, Thai scales are found to be as far as 43 cents from equiheptatonic. We find that 70% of real scales are within this boundary, compared to only 10% of grid scales (Fig 7B). Alternative definitions of equidistance based on step sizes rather than scale notes also support the finding that scales tend to be close to equidistant (S13 Fig in S1 File).

thumbnail
Fig 7. Comparison of real scales with all possible 7-note scales enumerated on a grid (20 cents resolution; with step sizes limited to 60–320 cents).

A: Two-dimensional embedding of grid scales. Grid scales that correspond to real scales (notes are on average within 10 cents of a real scale) are highlighted cyan; a black circle shows the equidistant scale. B: Mean note distance from 7-note scales and the equiheptatonic scale, for Thai scales, real scales, and all grid scales. C: Step size histograms (bin size = 20 cents) in real scales and grid scales. D: Scale note histograms (bin size = 20 cents) for notes 2–7 (no tonic and octave) for real scales and grid scles. E: Entropy of note distributions for: real scales; real scales, but rearranged with their steps arranged in order of size (Sorted); real scales, but rearranged with their steps in random order (Shuffled); grid scales.

https://doi.org/10.1371/journal.pone.0284851.g007

To gain further insight into the striking (near-) equidistance of scales, we look at the note distributions for notes 2–7 (Fig 7D) for real scales and grid scales. The grid scales show the broadest note distributions for the two notes (4 and 5) furthest from the fixed ends (tonic, octave), resulting in higher entropy (Fig 7E). This is expected, since summing two random distributions should result in a higher variance; e.g., note 4 is the sum of three steps drawn from Fig 7C. In contrast, distributions of notes 4 and 5 in real scales are most predictable (lowest entropy), once again illustrating the significance of intervals of size 500 and 700 cents.

To examine the possibility that the difference between real scales and grid scales is mainly due to differences in their step size distributions (Fig 7C), we analyse two other sets of note distributions. We first look at the note distributions that ought to be farthest from equidistant scales: we rearrange the steps in each real scale so that they are ordered low-to-high, and consider scales starting from every position. This results in note distributions with entropy similar to grid scales (Fig 7E, Sorted). We then look at the note distributions of alternative versions of real scales obtained by shuffling the step sizes in scales. Again, we find that the entropy is highest at notes 4 and 5 (Fig 7E, Shuffled), in contrast to real scales. The main difference between the real scales and grid scales appears to be that step sizes in real scales are well-mixed—small steps are found adjacent to large steps rather than small steps (and vice versa). We found similar results for 5-note scales (S14 Fig in S1 File), which again suggests that these findings may generalize to other scales.

Discussion

How diverse are scales?

Despite some differences, scales across cultures are remarkably similar. For example, the set of Carnatic scales alone is almost as varied as the total set of scales (Fig 6F). In particular, most scales observed in cultures—when compared to the universe of possible scales—are close to 5- and 7-note equidistant scales (Fig 7, S12 Fig in S1 File). Many authors have reported that equidistant scales are rare [2, 6, 10, 135137]. Yet, we find that they are more prevalent than expected by chance. This discrepancy may be due to the lack of a robust definition for equidistant scales—for example, the only previous statistical study of prevalence of equidistance does not explicitly account for natural variation in intonation [6]. It is not clear how much deviation from perfect equidistance renders a scale perceptually non-equidistant, or how variability in pitch affects perception of equidistance. As an illustration of the difficulties, consider the following example from Georgian singing: the step sizes in this scale are close to equidistant (163–202 cents) when viewing the melodic pitch histogram. But the intervals between notes in the melody are much less exact (90–240 cents) [13]; despite being statistically equidistant, it is not clear whether this scale would be perceived as equidistant. Perception of equidistance may also vary with culture and training, in which case it may be better to avoid binary measures of equidistance, and instead to construct a perceptually-relevant, continuous measure.

The most salient difference between societies is whether their data consisted of theory scales or measured scales (S9, S10 Figs in S1 File), considering that theory scales appear to be less equidistant than measured scales. One hypothetical explanation for the predominance of non-equidistant scales in societies using theory scales: the process of creating new scales by combining simple-integer-ratio intervals will inevitably result in more non-equidistant scales because only one combination of step sizes can result in an equidistant scale. We wonder whether this societal difference would persist if we were to only compare measured scales from societies, since theory scales will certainly exhibit intonation variability when performed. In the literature, Western scholars often discuss variation in intonation in societies that lack mathematical musical theory as being an intentional form of expression [64, 68, 138141], whereas studies of classical musics typically investigate to which theoretical tuning system the musicians conform [13, 123, 126, 127]. Ultimately, it is difficult to intuit the differences between societies, since there are only a few cross-cultural studies of interval discrimination [142, 143]. Here, we performed a preliminary study on intonation variability (S15, S16 Figs in S1 File), finding comparable levels of variation (with the possible exception of the pelog scale) in Gamelan orchestras, Thai xylophones, Turkish ney [91], Georgian singing [89], and a Belgian carillon [144]. But ultimately, to understand the differences between societies that use theory scales and those that do not, we will need cross-cultural perceptual experiments, and direct measurements of scales from recordings.

How did far-away societies come to use such similar scales? – Using a cultural-evolutionary framework [145], we can describe this process as a combination of cultural diffusion, and convergent evolution due to common factors biasing the use of scales across cultures. Diffusion can certainly account for some similarities, as some societies have documented shared history: e.g., societies with theory scales in the Middle East, or East Asia; court music in Thailand, Laos and Cambodia [100]. Some have suggested that cultural diffusion can be inferred based on two cultures using similar equiheptatonic scales [80, 85], but evidence seems circumstantial given how widespread equiheptatonic scales are. Instruments, on the other hand, are a richer source of information than scales, so organology studies are a much more direct way of inferring cultural diffusion [146149]. While there is undoubtedly some transmission of information across cultures, we reiterate that: within-region scale diversity is comparable to between-region diversity (S12B Fig in S1 File), step intervals consistently show approximate limits of 100–400 cents across regions (S12B Fig in S1 File), and scales are (against chance odds) overwhelmingly close to equidistance in all regions (S12 Fig in S1 File). Taken together, these facts point towards some non-negligible degree of convergent evolution due to shared biases.

How do scales evolve over time?

Scales can change, or persist, in a variety of ways. On a short time-scale, vocal (or other non-fixed-pitch instrument) scales are inherently stochastic, due to a lack of precision in motor control [150]. On a longer time-scale, vocal scales reside in memory, which introduces another mechanism of change. Unlike vocal scales, instrument tunings physically persist through time, but are still affected by multiple factors: environment (temperature, humidity), material (wood, metal, animal organs), and physical force [49]. To illustrate this point, we checked examples of repeated tunings of the same instruments across a specific time frame: Gamelan orchestra (metal idiophone), with standard deviation of notes, σ = 8 cents (slendro) and σ = 13 cents (pelog) cents over about 25 years [46]; Angolan likembe (plucked metal idiophone), where σ = 18 cents over a few weeks [49]; Gambian kora (chordophone), with σ = 27 cents over one week [52]. Technology offers more robust ways of keeping scales stable over time: monochords (ancient Greece) and pitch pipes (ancient China) enable reliable tuning by fourths and fifths [36]. Likewise, the perceptual phenomenon of tonal fusion may have enabled stable scales by providing knowledge of perceptual anchors (octaves and fifths) that can be reliably passed through generations. Recently, it seems to us that stability in scales has been reinforced though global tuning standards and inventions such as fixed pitch instruments and electric tuners. Thus, we can see that scales change in many ways, but it is possible for the rate of change to decrease through the use of technology and musical theory.

We can imagine some possible mechanisms of scale evolution, drawing analogies with biological evolution: We can think of a single scale as a gene, and a set of scales used by a population of people as a genome. Changes to the scale notes is akin to mutating a single nucleotide. Adding or removing notes from a scale is like insertions/deletions of nucleotides. Scales can be copied through interactions between populations; genes can be shared through horizontal transmission. The Western diatonic modes can be constructed using the same step intervals, but changing the tonic position; the same process occurs in protein sequences (circular permutants), likely through gene duplication and homologous recombination.

The analogy starts to break when you consider that scales can be invented, which is more akin to designing genomes. There are recent examples of invention of microtonal tunings in Western music [36, 64, 151], and many theory scales bear the hallmarks of design: Greek modes are all circular permutants, based on simple integer ratios [152]; the Carnatic melakarta result from combinatorial enumeration of a set of intervals, constrained by a set of rules [153]. Similarly, other cultures that have a long history of mathematical scholarship [154] use overlapping sets of scales based on theoretical divisions of the octave [95, 155159]. Typically, though, the theory scales that still survive are similar to the meaured scales, so it is likely that designed scales are constrained by similar selection criteria that applies to non-designed scales.

We propose that a detailed mechanistic model of scale evolution is possible: The mechanisms of spontaneous change need to be studied at the resolution of a single mutation step (through, e.g., transmission chain experiments) [160]. The relative importance of the different mechanisms could be investigated with sufficient long-term data (e.g., same individual/group/culture, over a period of months/years). Such a model could then be used for agent-based modelling to study the role of horizontal transmission, and cultural-evolutionary biases.

How are scales selected?

We propose three types of selection pressure: (i) cultural-evolutionary biases are based on how many people, or which people in one’s group are using the scale [161]; (ii) cognitive biases depend on how pitch is perceived by humans; and (iii) production biases depend on what is easy or difficult to sing, and physical constraints on instruments.

We can think of three examples of cultural-evolutionary biases that may apply to music: conformity, novelty and presitge biases. A bias towards conformity results in the most populous scale type being increasingly successful. In general, humans tend to synchronize [34, 162] when playing music (same meter, tonality), and over a longer time period, there is evidence of shifts towards the use of 12-TET [126, 163]. A conformity bias could arise due to other reasons, but we can probably rule out the difficulty of learning scales—studies on novel scale systems have shown that people rapidly learn their statistics [164, 165]. Novelty biases could be can describe choosing novel microtonal tunings as a means of expression [140, 151]. An example of a prestige bias is where tunings are copied from players/instruments that are acknowledged as good [46, 87].

Theories of scale selection are commonly based on cognitive biases, of which we will give a brief summary: liking harmonicity [166, 167], disliking sensory dissonance [168, 169] (or liking sensory dissonance [114]), tonal fusion [30], neurodynamic theory [170], transmissibility [12, 132, 160, 171, 172], memory [173], and theories based on mathematical properties of scales [174179].

Production constraints can apply to both singing and instruments [180]. When singing, large intervals cost more energy to produce than smaller ones, while small intervals are difficult to produce reliably due to limits on motor control [181]. Instruments are less constrained in this way, but still have physical limits to the number of notes, and interval range. On the other hand, instruments are constrained in how reliably they can be re-tuned to the same scale. There is a long history [152, 182] behind the practice of tuning using harmonic intervals [50, 52, 59, 67, 88], and reports exist of tuning according to the step sizes [38, 49, 51], tuning instruments visually [87, 88, 96, 183], and copying a reference instrument [46, 87].

How can we study the evolution of scales?

Tracing the evolutionary history of scales is challenging. Scales change at rates that depend on the instruments and technology, and new scales can be invented from scratch. The evolution is potentially driven by numerous selection pressures that vary in strength across societies. Nonetheless, we propose some approaches that seem feasible.

We can try to track evolutionary trajectories of scales using historical data. By restricting the scope to a single society, one can make simplifying assumptions by using ethnographic accounts to inform models. For example, about Gamelan orchestras we know that they are typically tuned in reference to another orchestra [140], and instrument intonation changes at a steady rate (due to similar materials and environment). This can be described as an evolving network of Gamelan orchestras with edges between orchestras that influence each other. Gamelan tunings are also extensively documented in the literature, and additional tunings can be inferred from recordings.

Another approach to infer the evolutionary trajectories or selection pressures is by analyzing a population of scales. This approach requires appropriate mathematical models where multiple selection pressures can be considered in tandem. Ultimately, many models may have convergent predictions, which means that additional experiments are needed to distinguish the relative importance of different selection pressures. One limitation of this approach is the dependence on the sample of scales studied, and it is hard to construct a priori a representative sample of scales. In this study we control for society, but other criteria are possible. Does it matter if societies have different population sizes? How does one deal with differences in within-society variation in scales? Should we take into account frequency of scale use within a society [100, 184]? These questions will surely unravel as more data becomes available for hypothesis testing.

Possible bias in the scale database

Relying on data collected by a limited number of ethnomusicologists, the database at this stage has sparse geographic coverage (Fig 2). Some have suggested that ethnomusicologists have a bias towards reporting findings that are considered ‘interesting’, thus exaggerating diversity [20]. Certain musical traditions, such as Gamelan and Thai, were very popular research topics, so they are over-represented. In contrast, there are very few quantitative measurements of scales or instruments with fewer than 5 notes, despite these being reported in many sources. Additionally, vocal scales (taken from recordings of singing) are scarce in the database, yet the voice is probably the most important instrument from an evolutionary point of view [12].

Unfortunately, in some rare instances there seem to be statistical irregularities in the reporting of tunings [46]: note that Jaap Kunst [185] reported gamelan tunings (not included in this database) where all the higher notes were exactly an octave above the lower ones, and that this is extremely unlikely; in a study of prehistoric bone flutes where the tunings are given to an accuracy of 1 cent, one flute is recorded as having a series of equal tempered intervals: [200, 200, 200, 300] [63]. Ultimately, these biases do not void the conclusions, but the results shown here will certainly need to be updated as more data becomes available.

Limitations to studying scale evolution

We may be witnessing a decline in diversity of scales due to the converging forces of globalization and technological change [186]. There is already evidence of homogenization, via the adoption of 12-TET [126, 163]. Thus, to understand how scales evolve, we must look to the past, but therein lies a different problem: the older the instrument, the less certain we are about how they were played. For prehistoric artefacts, we cannot be sure whether they (or their reconstructions) faithfully resemble the instrument in its original condition. So far, the only instruments that remain sufficiently intact to play are aerophones [48, 57, 63], which can produce many different scales depending on how they are played [183, 187189]. Thus, we believe that the best source of scales is in ethnographic recordings spanning the past century [103]. It is therefore imperative that methods be developed that can faithfully infer scales from large samples of songs. Algorithms must be developed to handle low-quality recordings [190], background noise, instrument / singing segmentation [191], polyphonic stream segmentation [192], note segmentation [193], and tonal drift [194].

Conclusion

Scales are a cornerstone of music across the world, upon which endless combinations of melodies can be generated. Surprisingly, despite a wealth of ethnomusicological research on the subject, we lacked a comprehensive, diverse synthesis of scales of the world. Here we remedy this issue, with a focus on quantitative data that will enable detailed statistical analyses about how scales evolve. Our own preliminary analyses have lent quantitative and qualitative support for the widespread (but not necessarily universal) use of the octave in some special capacity. Despite the rich diversity of scales, when put in context of how many scales are possible, what stands out in our analysis is how remarkably similar they are across the globe. Altogether, this work presents a treatise on the evolution of scales, and proposes promising avenues for future research.

Acknowledgments

We thank Olga Velichkina, Frank Scherbaum, Malinda McPherson for discussions and comments on the manuscript. We thank Polina Proutskova, Elizabeth Philips, Steven Brown, Patrick Savage, Kaustuv Kanti Ganguli, John Garzoli and Neil McLachlan for discussions. We thank Marcus Pearce for suggesting the name of the database.

References

  1. 1. Lerdahl F, Jackendoff RS. A Generative Theory of Tonal Music, Reissue, With a New Preface. MIT press; 1996.
  2. 2. Patel AD. Music, Language, and the Brain. Oxford university press; 2010.
  3. 3. Burns EM. Intervals, Scales, and Tuning. In: Deutsch D, editor. The Psychology of Music (Second Edition). second edition ed. San Diego: Academic Press; 1999. p. 215–264.
  4. 4. Carterette EC, Kendall RA. Comparative Music Perception and Cognition. In: Deutsch D, editor. The Psychology of Music (Second Edition). second edition ed. San Diego: Academic Press; 1999. p. 725–791.
  5. 5. Khe TV. Is the Pentatonic Universal? A Few Reflections on Pentatonism. The World of Music. 1977;19(1/2):76–84.
  6. 6. Savage PE, Brown S, Sakai E, Currie TE. Statistical Universals Reveal the Structures and Functions of Human Music. P Natl Acad Sci Usa. 2015;112(29):8987–8992. pmid:26124105
  7. 7. Kolinski M. Recent Trends in Ethnomusicology. Ethnomusicology. 1967;11(1):1–24.
  8. 8. Harwood DL. Universals in Music: A Perspective From Cognitive Psychology. Ethnomusicology. 1976;20(3):521–533.
  9. 9. Trehub SE. Human Processing Predispositions and Musical Universals. In: Wallin NL, Merker B, Brown S, editors. The origins of music; 2000. p. 427–448.
  10. 10. Brown S, Jordania J. Universals in the World’s Musics. Psychol Music. 2013;41(2):229–248.
  11. 11. Kuroyanagi J, Sato S, Ho MJ, Chiba G, Six J, Pfordresher P, et al. Automatic Comparison of Human Music, Speech, and Bird Song Suggests Uniqueness of Human Scales. In: Proceedings of the Folk Music Analysis (FMA 2019); 2019.
  12. 12. Pfordresher PQ, Brown S. Vocal Mistuning Reveals the Origin of Musical Scales. Eur J Cogn Psychol. 2017;29(1):35–52.
  13. 13. Scherbaum F, Mzhavanadze N, Arom S, Rosenzweig S, Müller M. Tonal Organization of the Erkomaishvili Dataset: Pitches, Scales, Melodies and Harmonies. Scherbaum F, editor. 1; 2020.
  14. 14. Levitin DJ, Rogers SE. Absolute Pitch: Perception, Coding, and Controversies. Trends Cogn Sci. 2005;9(1):26–33. pmid:15639438
  15. 15. Attneave F, Olson RK. Pitch as a Medium: A New Approach to Psychophysical Scaling. The American Journal of Psychology. 1971;84(2):147–166. pmid:5566581
  16. 16. Jacoby N, Undurraga EA, McPherson MJ, Valdés J, Ossandón T, McDermott JH. Universal and Non-Universal Features of Musical Pitch Perception Revealed by Singing. Curr Biol. 2019. pmid:31543451
  17. 17. Shepard RN. Geometrical Approximations to the Structure of Musical Pitch. Psychol Rev. 1982;89(4):305–333. pmid:7134331
  18. 18. Udo W. Two Types of Octave Relationships in Central Australian Vocal Music? Musicology Australia. 1997;20(1):6–14.
  19. 19. Nettl B. An Ethnomusicologist Contemplates Universals in Musical Sound and Musical Culture. In: Wallin NL, Merker B, Brown S, editors. The origins of music. vol. 3; 2000. p. 463–472.
  20. 20. Nettl B. The Study of Ethnomusicology: Thirty-One Issues and Concepts. University of Illinois Press; 2010.
  21. 21. Lloyd LS. Modern Science and Musical Theory. Journal of the Royal Society of Arts. 1942;90(4619):581–594.
  22. 22. Parncutt R, Hair G. A Psychocultural Theory of Musical Interval: Bye Bye Pythagoras. Music Percept. 2018;35(4):475–501.
  23. 23. Dowling WJ, Hollombe AW. The Perception of Melodies Distorted by Splitting Into Several Octaves: Effects of Increasing Proximity and Melodic Contour. Perception & Psychophysics. 1977;21(1):60–64.
  24. 24. Krumhansl CL, Toivanen P, Eerola T, Toiviainen P, Järvinen T, Louhivuori J. Cross-Cultural Music Cognition: Cognitive Methodology Applied to North Sami Yoiks. Cognition. 2000;76(1):13–58. pmid:10822042
  25. 25. Bonnard D, Micheyl C, Semal C, Dauman R, Demany L. Auditory Discrimination of Frequency Ratios: The Octave Singularity. J Exp Psychol Human. 2013;39(3):788–801. pmid:23088507
  26. 26. Demany L, Monteiro G, Semal C, Shamma S, Carlyon RP. The Perception of Octave Pitch Affinity and Harmonic Fusion Have a Common Origin. Hearing Res. 2021;404:108213. pmid:33662686
  27. 27. Allen D. Octave Discriminability of Musical and Non-Musical Subjects. Psychon Sci. 1967;7(12):421–422.
  28. 28. Kallman HJ. Octave Equivalence as Measured by Similarity Ratings. Perception & Psychophysics. 1982;32(1):37–49. pmid:7133946
  29. 29. Hoeschele M, Weisman RG, Sturdy CB. Pitch Chroma Discrimination, Generalization, and Transfer Tests of Octave Equivalence in Humans. Attention, Perception, & Psychophysics. 2012;74(8):1742–1760. pmid:23007203
  30. 30. Stumpf C. Tonpsychologie. Hirzel S, editor. Leipzig; 1890.
  31. 31. McPherson MJ, Dolan SE, Durango A, Ossandon T, Valdés J, Undurraga EA, et al. Perceptual Fusion of Musical Notes by Native Amazonians Suggests Universal Representations of Musical Intervals. Nat Commun. 2020;11(1):2786. pmid:32493923
  32. 32. McPherson MJ, Grace RC, McDermott JH. Harmonicity Aids Hearing in Noise. Attention, Perception, & Psychophysics. 2022;84(3):1016–1042. pmid:35102502
  33. 33. McPherson MJ, McDermott JH. Time-Dependent Discrimination Advantages for Harmonic Sounds Suggest Efficient Coding for Memory. P Natl Acad Sci Usa. 2020;117(50):32169–32180. pmid:33262275
  34. 34. Savage PE, Loui P, Tarr B, Schachner A, Glowacki L, Mithen S, et al. Music as a Coevolved System for Social Bonding. Behav Brain Sci. 2020; p. 1–36. pmid:32814608
  35. 35. Hewitt MJ. Musical Scales of the World. Note Tree; 2013.
  36. 36. Rechberger H. Scales and Modes Around the World: The Complete Guide to the Scales and Modes of the World. Fennica Gehrman Ltd.; 2018.
  37. 37. Ellis AJ. On the Musical Scales of Various Nations. Journal of the Society of arts; 1885.
  38. 38. Wachsmann KP. An Equal-Stepped Tuning in a Ganda Harp. Nature. 1950;165(4184):40–41.
  39. 39. Kubik G. Harp Music of the Azande and Related Peoples in the Central African Republic: (Part I—Horizontal Harp Playing). Afr Music. 1964;3(3):37–76.
  40. 40. Brandel R. The Music of Central Africa: An Ethnomusicological Study: Former French Equatorial Africa the Former Belgian Congo, Ruanda-Urundi Uganda, Tanganyika. Springer Science & Business Media; 1967.
  41. 41. Kunst J. Music in New Guinea. Brill; 1967.
  42. 42. Rouget G, Schwarz J. Sur Les Xylophones équiheptaphoniques Des Malinké. Rev Musicol. 1969;55(1):47–77.
  43. 43. Tracey A. The Matepe Mbira Music of Rhodesia. Afr Music. 1970;4(4):37–61.
  44. 44. Tracey A. The Nyanga Panpipe Dance. Afr Music. 1971;5(1):73–89.
  45. 45. Knight R. Kora Music From the Gambia, Played by Foday Musa Suso; 1976.
  46. 46. Surjodiningrat W, Susanto A, Sudarjana PJ. Tone Measurements of Outstanding Javanese Gamelans in Jogjakarta and Surakarta. Gadjah Mada University Press; 1972.
  47. 47. Morton D, Duriyanga C. The Traditional Music of Thailand. vol. 8. Univ of California Press; 1976.
  48. 48. Haeberli J. Twelve Nasca Panpipes: A Study. Ethnomusicology. 1979;23(1):57–74.
  49. 49. Kubik G. Likembe Tunings of Kufuna Kandonga (Angola). Afr Music. 1980;6(1):70–88.
  50. 50. Van Zanten W. The Equidistant Heptatonic Scale of the Asena in Malawi. Afr Music. 1980;6(1):107–125.
  51. 51. Zemp H. Melanesian Solo Polyphonic Panpipe Music. Ethnomusicology. 1981;25(3):383–418.
  52. 52. Aning BA. Tuning the Kora: A Case Study of the Norms of a Gambian Musician. J Afr Stud. 1982;9(3):164.
  53. 53. Lu-Ting H, Kuo-huang H. On Chinese Scales and National Modes. Asian Music. 1982;14(1):132–154.
  54. 54. Kubik G. A Structural Examination of Homophonic Multi-Part Singing in East and Central Africa. Anuario Musical. 1984;39:27.
  55. 55. Kubik G. African Tone-Systems: A Reassessment. Yearb Tradit Music. 1985;17:31–63.
  56. 56. Gottlieb R. Sudan Ii: Music of the Blue Nile Province; The Ingessana and Berta Tribes; 1986.
  57. 57. Yu-An R, Carterette EC, Yu-Kui W. A Comparison of the Musical Scales of the Ancient Chinese Bronze Bell Ensemble and the Modern Bamboo Flute. Percept Psychophys. 1987;41(6):547–562.
  58. 58. Keefe DH, Burns EM, Nguyen P. Vietnamese Modal Scales of the Dan Tranh. Music Percept. 1991;8(4):449–468.
  59. 59. Tracey A. Kambazithe Makolekole and His Valimba Group: A Glimpse of the Technique of the Sena Xylophone. Afr Music. 1991;7(1):82–104.
  60. 60. Carterette EC, Kendall RA, Vale SCD. Comparative Acoustical and Psychoacoustical Analyses of Gamelan Instrument Tones. Journal of the Acoustical Society of Japan (E). 1993;14(6):383–396.
  61. 61. Schneider A. Sound, Pitch, and Scale: From “Tone Measurements” to Sonological Analysis in Ethnomusicology. Ethnomusicology. 2001;45(3):489–519.
  62. 62. Attakitmongcol K, Chinvejkitvanich R, Sujitjorn S. Characterization of Traditional Thai Musical Scale. In: Proceedings of the 5th WSEAS International Conference on Acoustics and Music: Theory & Applications (AMTA’04); 2004.
  63. 63. Zhang J, Xiao X, Lee YK. The Early Development of Music. Analysis of the Jiahu Bone Flutes. Antiquity. 2004;78(302):769–778.
  64. 64. Sethares WA. Tuning, Timbre, Spectrum, Scale. Springer Science & Business Media; 2005.
  65. 65. McNeil LE, Mitran S. Vibrational Frequencies and Tuning of the African Mbira. J Acoust Soc Am. 2008;123(2):1169–1178. pmid:18247916
  66. 66. Strand JL. The Sambla Xylophone: Tradition and Identity in Burkina Faso. Wesleyan University, Connecticut; 2009.
  67. 67. Kuss M. Music in Latin America and the Caribbean: An Encyclopedic History: Volume 1: Performing Beliefs: Indigenous Peoples of South America, Central America, and Mexico. University of Texas Press; 2010.
  68. 68. Garzoli J. The Myth of Equidistance in Thai Tuning. Anal Approaches Music. 2015;4(2):1–29.
  69. 69. Wisuttipat N. Relative Nature of Thai Traditional Music Through Its Tuning System. International Journal of Creative and Arts Studies. 2015;2(1):86–97.
  70. 70. Morkonr A, Punkubutra S, et al. The Collecting Process of Xylophone’s Sound D (Ranād Xek) From Art to Numerical Data. In: 2018 International Conference on Engineering, Applied Sciences, and Technology (ICEAST); 2018. p. 1–4.
  71. 71. Bader R. Temperament in Tuning Systems of Southeast Asia and Ancient India. In: Computational Phonogram Archiving. Springer; 2019. p. 75–107.
  72. 72. Kimberlin CML. Masinqo and the Nature of Qanat. The University of California, Los Angeles; 1976.
  73. 73. Nam U. Pitch Distributions in Korean Court Music: Evidence Consistent With Tonal Hierarchies. Music Percept. 1998;16(2):243–247.
  74. 74. Weisser S, Falceto F. Investigating Qanat in Amhara Secular Music: An Acoustic and Historical Study. Annales d’Éthiopie. 2013;28(1):299–322.
  75. 75. Ambrazevičius R. The Perception and Transcription of the Scale Reconsidered: Several Lithuanian Cases. The World of Music. 2005;47(2):31–53.
  76. 76. Ambrazevičius R. Modelling of Scales in Traditional Solo Singing. Music Sci. 2006;10(1_suppl):65–87.
  77. 77. Cooke PR. Ludaya—A Transverse Flute From Eastern Uganda. Yearb Int Folk Music Council. 1971;3:79–90.
  78. 78. Garfias R. Preliminary Thoughts on Burmese Modes. Asian Music. 1975;7(1):39–49.
  79. 79. Jones AM. A KwaikÈr Indian Xylophone. Ethnomusicology. 1966;10(1):43–47.
  80. 80. Jones AM. Indonesia and Africa: The Xylophone as a Culture-Indicator. African Music: Journal of the International Library of African Music. 1960;2(3):36–47.
  81. 81. Kubik G. The Endara Xylophone of Bukonjo. African Music: Journal of the International Library of African Music. 1962;3(1):43–48.
  82. 82. Kubik G. Discovery of a Trough Xylophone in Northern Mozambique. African Music: Journal of the International Library of African Music. 1963;3(2):11–14.
  83. 83. Kubik G. Embaire Xylophone Music of Samusiri Babalanda (Uganda 1968). The World of Music. 1992;34(1):57–84.
  84. 84. Thrasher A. The Transverse Flute in Traditional Chinese Music. Asian Music. 1978;10(1):92–114.
  85. 85. Miñana Blasco C. Afinación De Las Marimbas en La Costa Pacífica Colombiana: Un Ejemplo De La Memoria Interválica Africana en Colombia; 1990.
  86. 86. Sundberg J, Tjernlund P. Computer Measurements of the Tone Scale in Performed Music by Means of Frequency Histograms. STL-QPS. 1969;10(2-3):33–35.
  87. 87. Copeland L. Pitch and Tuning in Beninese Brass Bands. Ethnomusicology Forum. 2018;27(2):213–240.
  88. 88. Miller TE. Traditional Music of the Lao: Kaen Playing and Mawlum Singing in Northeast Thailand. 13. Praeger; 1985.
  89. 89. Mzhavanadze N, Scherbaum F. Svan Funeral Dirges (Zär): Musicological Analysis. Musicologist. 2020;4(2):168–197.
  90. 90. Mokhtar M, Mosharrafa AM. Modes in Modern Egyptian Music. Nature. 1937;140(3543):548–549.
  91. 91. Tan A. Ney Açkısının Tarihi Ve Teknik Gelişimi. Yayınlanmamış Doktora Tezi Marmara Üniversitesi, Sosyal Bilimler Enstitüsü, İstanbul. 2011;.
  92. 92. Mendes M, et al. Fé No Pife: As Flautas De Pífano No Contexto Cultural Da Banda Cabaçal Dos Irmãos Aniceto. Universidade do Estado de Santa Catarina; 2012.
  93. 93. Panteli M, Purwins H. A Quantitative Comparison of Chrysanthine Theory and Performance Practice of Scale Tuning, Steps, and Prominence of the Octoechos in Byzantine Chant. J New Music Res. 2013;42(3):205–221.
  94. 94. Honingh A, Bod R. In Search of Universal Properties of Musical Scales. J New Music Res. 2011;40(1):81–89.
  95. 95. Signell KL. Makam: Modal Practice in Turkish Art Music. vol. 4. Da Capo Pr; 1977.
  96. 96. Zemp H, Malkus V. Aspects of’Are’are Musical Theory. Ethnomusicology. 1979;23(1):5–48.
  97. 97. Rice T. Aspects of Bulgarian Musical Thought. Yearb Int Folk Music Council. 1980;12:43–66.
  98. 98. Feld S. ‘Flow Like a Waterfall’: The Metaphors of Kaluli Musical Theory: For Carl Voegelin. Yearb Tradit Music. 1981;13:22–47.
  99. 99. Baily J. Anthropological and Psychological Approaches to the Study of Music Theory and Musical Cognition. Yearb Tradit Music. 1988;20:114–124.
  100. 100. Church M. The Other Classical Musics: Fifteen Great Traditions. Boydell & Brewer; 2015.
  101. 101. Cornelis O, Lesaffre M, Moelants D, Leman M. Access to Ethnic Music: Advances and Perspectives in Content-Based Music Information Retrieval. Signal Process. 2010;90(4):1008–1031.
  102. 102. Rosenzweig S, Scherbaum F, Shugliashvili D, Arifi-Müller V, Müller M. Erkomaishvili Dataset: A Curated Corpus of Traditional Georgian Vocal Music for Computational Musicology. Transactions of the International Society for Music Information Retrieval. 2020;3(1).
  103. 103. Wood ALC, Kirby KR, Ember CR, Silbert S, Passmore S, Daikoku H, et al. The Global Jukebox: A Public Database of Performing Arts and Culture. Plos One. 2022;17(11):1–22. pmid:36322519
  104. 104. Six J, Cornelis O, Leman M. Tarsos, a Modular Platform for Precise Pitch Analysis of Western and Non-Western Music. J New Music Res. 2013;42(2):113–129.
  105. 105. Cornelis O, Six J, Holzapfel A, Leman M. Evaluation and Recommendation of Pulse and Tempo Annotation in Ethnic Music. J New Music Res. 2013;42(2):131–149.
  106. 106. Benetos E, Holzapfel A. Automatic Transcription of Turkish Microtonal Music. J Acoust Soc Am. 2015;138(4):2118–2130. pmid:26520294
  107. 107. Ozaki Y, McBride J, Benetos E, Pfordresher PQ, Six J, T Tierney A, et al. Agreement Among Human and Annotated Transcriptions of Global Songs. In: ISMIR 2021—22nd International Society for Music Information Retrieval Conference; 2021.
  108. 108. Jordahl K, den Bossche JV, Fleischmann M, Wasserman J, McBride J, Gerard J, et al. Geopandas/Geopandas: V0.8.1; 2020.
  109. 109. Savage PE, Passmore S, Chiba G, Currie TE, Suzuki H, Atkinson QD. Sequence Alignment of Folk Song Melodies Reveals Cross-Cultural Regularities of Musical Evolution. Curr Biol. 2022;32(6):1395–1402.e8. pmid:35120658
  110. 110. Verosky NJ. Interpreting the Tonal Hierarchy Through Corpus Analysis. Psychomusicology: Music, Mind, and Brain. 2021;31(2):96–106.
  111. 111. Krumhansl CL, Shepard RN. Quantification of the Hierarchy of Tonal Functions Within a Diatonic Context. J Exp Psychol Human. 1979;5(4):579–594. pmid:528960
  112. 112. Castellano MA, Bharucha JJ, Krumhansl CL. Tonal Hierarchies in the Music of North India. Journal of Experimental Psychology: General. 1984;113(3):394–412. pmid:6237169
  113. 113. Kessler EJ, Hansen C, Shepard RN. Tonal Schemata in the Perception of Music in Bali and in the West. Music Percept. 1984;2(2):131–165.
  114. 114. Ambrazevičius R, Wiśniewska I. Tonal Hierarchies in Sutartinės. Journal of interdisciplinary music studies. 2009;3(1/2):45–55.
  115. 115. Harasim D, Moss FC, Ramirez M, Rohrmeier M. Exploring the Foundations of Tonality: Statistical Cognitive Modeling of Modes in the History of Western Classical Music. Humanities and Social Sciences Communications. 2021;8(1):5.
  116. 116. Pearson L. Coarticulation and Gesture: An Analysis of Melodic Movement in South Indian Raga Performance. Music Anal. 2016;35(3):280–313.
  117. 117. Ganguli KK, Rao P. On the Perception of Raga Motifs by Trained Musicians. J Acoust Soc Am. 2019;145(4):2418–2434. pmid:31046305
  118. 118. Ganguli KK, Rao P. On the Distributional Representation of Ragas: Experiments With Allied Raga Pairs. Transactions of the International Society for Music Information Retrieval. 2018;1(1).
  119. 119. Kirby KR, Gray RD, Greenhill SJ, Jordan FM, Gomes-Ng S, Bibiko HJ, et al. D-Place: A Global Database of Cultural, Linguistic and Environmental Diversity. Plos One. 2016;11(7):1–14. pmid:27391016
  120. 120. Hammarström H, Forkel R, Haspelmath M. Glottolog 4.1; 2019.
  121. 121. Evans CL, Greenhill SJ, Watts J, List JM, Botero CA, Gray RD, et al. The Uses and Abuses of Tree Thinking in Cultural Evolution. Philosophical Transactions of the Royal Society B: Biological Sciences. 2021;376(1828):20200056. pmid:33993767
  122. 122. Jones AM. On Using the Stroboconn. African Music: Journal of the International Library of African Music. 1970;4(4):122–124.
  123. 123. Hagerman B, Sundberg J. Fundamental Frequency Adjustment in Barbershop Singing. vol. 21. Citeseer; 1980.
  124. 124. Jers H, Ternström S. Intonation Analysis of a Multi-Channel Choir Recording. TMHQPSR Speech, Music and Hearing: Quarterly Progress and Status Report. 2005;47(1):1–6.
  125. 125. Devaney J, Mandel MI, Ellis DPW, Fujinaga I. Automatically Extracting Performance Data From Recordings of Trained Singers. PsychoMusicology. 2011;21(1-2):108–136.
  126. 126. Serra J, Koduri GK, Miron M, Serra X. Assessing the Tuning of Sung Indian Classical Music. In: ISMIR; 2011. p. 157–162.
  127. 127. D’Amario S, Howard DM, Daffern H, Pennill N. A Longitudinal Study of Intonation in an a Cappella Singing Quintet. J Voice. 2020;34(1):159.e13–159.e27. pmid:30149974
  128. 128. Siegel JA, Siegel W. Categorical Perception of Tonal Intervals: Musicians Can’t Tell Sharp From Flat. Percept Psychophys. 1977;21(5):399–407.
  129. 129. McDermott JH, Keebler MV, Micheyl C, Oxenham AJ. Musical Intervals and Relative Pitch: Frequency Resolution, Not Interval Resolution, Is Special. J Acoust Soc Am. 2010;128(4):1943–1951. pmid:20968366
  130. 130. Zarate JM, Ritson CR, Poeppel D. Pitch-Interval Discrimination and Musical Expertise: Is the Semitone a Perceptual Boundary? J Acoust Soc Am. 2012;132(2):984–993.
  131. 131. Larrouy-Maestri P, Harrison PMC, Müllensiefen D. The Mistuning Perception Test: A New Measurement Instrument. Behav Res Methods. 2019;51(2):663–675. pmid:30924106
  132. 132. Stumpf C. The Origins of Music. OUP Oxford; 2012.
  133. 133. van der Maaten L, Hinton G. Visualizing Data Using T-Sne. J Mach Learn Res. 2008;9(86):2579–2605.
  134. 134. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-Learn: Machine Learning in Python. J Mach Learn Res. 2011;12:2825–2830.
  135. 135. McDermott J, Hauser M. The Origins of Music: Innateness, Uniqueness, and Evolution. Music Percept. 2005;23(1):29–59.
  136. 136. Ball P. The Music Instinct: How Music Works and Why We Can’t Do Without It. Random House; 2010.
  137. 137. Ross B, Knight S. Reports of Equitonic Scale Systems in African Musical Traditions and Their Implications for Cognitive Models of Pitch Organization. Music Sci. 2017; p. 1029864917736105.
  138. 138. Rahn J. Javanese Pélog Tunings Reconsidered. Yearb Int Folk Music Council. 1978;10:69–82.
  139. 139. Coolen MT. The Fodet: A Senegambian Origin for the Blues? The Black Perspective in Music. 1982;10(1):69–84.
  140. 140. Vetter R. A Retrospect on a Century of Gamelan Tone Measurements. Ethnomusicology. 1989;33(2):217–227.
  141. 141. Perlman M. American Gamelan in the Garden of Eden: Intonation in a Cross-Cultural Encounter. The Musical Quarterly. 1994;78(3):510–555.
  142. 142. Arom S, Fürniss S. An Interactive Experimental Method for the Determination of Musical Scales in Oral Cultures. Contemp Music Rev. 1993;9(1-2):7–12.
  143. 143. Perlman M, Krumhansl CL. An Experimental Study of Internal Interval Standards in Javanese and Western Musicians. Music Percept. 1996;14(2):95–116.
  144. 144. Schneider A, Leman M. In: Schneider A, editor. Sound, Pitches and Tuning of a Historic Carillon. Cham: Springer International Publishing; 2017. p. 247–298.
  145. 145. Boyd R, Richerson PJ. Culture and the Evolutionary Process. University of Chicago press; 1988.
  146. 146. Kirby PR. The Indonesian Origin of Certain African Musical Instruments. Afr Stud-uk. 1966;25(1):3–22.
  147. 147. Blench R. Using Diverse Sources of Evidence for Reconstructing the Past History of Musical Exchanges in the Indian Ocean. Afr Archaeol Rev. 2014;31(4):675–703.
  148. 148. Aguirre-Fernández G, Blasi DE, Sánchez-Villagra MR. Panpipes as Units of Cultural Analysis and Dispersal. Evolutionary Human Sciences. 2020;2:e17. pmid:37588347
  149. 149. Aguirre-Fernández G, Barbieri C, Graff A, Pérez de Arce J, Moreno H, Sánchez-Villagra MR. Cultural Macroevolution of Musical Instruments in South America. Humanities and Social Sciences Communications. 2021;8(1):208.
  150. 150. Hutchins SM, Peretz I. A Frog in Your Throat or in Your Ear? Searching for the Causes of Poor Singing. Journal of Experimental Psychology: General. 2012;141(1):76–97. pmid:21875245
  151. 151. Partch H. Genesis of a Music. University of Wisconsin Press; 1950.
  152. 152. Crocker RL. Pythagorean Mathematics and Music. The Journal of Aesthetics and Art Criticism. 1963;22(2):189–198.
  153. 153. Vijayakrishnan KG. The Grammar of Carnatic Music. vol. 8. Walter de Gruyter; 2007.
  154. 154. Goodman MKJ. An Introduction to the Early Development of Mathematics. John Wiley & Sons; 2016.
  155. 155. Marcus S. The Interface Between Theory and Practice: Intonation in Arab Music. Asian Music. 1993;24(2):39–58.
  156. 156. Katz I. Henry George Farmer and the First International Congress of Arab Music (Cairo 1932). Brill; 2015.
  157. 157. Farhat H. The Dastgah Concept in Persian Music. Cambridge University Press; 2004.
  158. 158. te Nijenhuis E. Dattilam: A Compendium of Ancient Indian Music. vol. 11. Brill Archive; 1970.
  159. 159. von Falkenhausen L. On the Early Development of Chinese Musical Theory: The Rise of Pitch-Standards. J Am Oriental Soc. 1992;112(3):433–439.
  160. 160. Anglada-Tort M, Harrison PMC, Lee H, Jacoby N. Large-Scale Iterated Singing Experiments Reveal Oral Transmission Mechanisms Underlying Music Evolution. Curr Biol. 2023. pmid:36958332
  161. 161. Creanza N, Kolodny O, Feldman MW. Cultural Evolutionary Theory: How Culture Evolves and Why It Matters. P Natl Acad Sci Usa. 2017;114(30):7782–7789. pmid:28739941
  162. 162. Mehr SA, Krasnow MM, Bryant GA, Hagen EH. Origins of Music in Credible Signaling. Behav Brain Sci. 2020; p. 1–41. pmid:32843107
  163. 163. Moelants D, Cornelis O, Leman M. Exploring African Tone Scales. In: Proceedings of the 10th International Society for Music Information Retrieval Conference. Kobe, Japan: ISMIR; 2009. p. 489–494.
  164. 164. Loui P, Wessel DL, Kam CLH. Humans Rapidly Learn Grammatical Structure in a New Musical Scale. Music Percept. 2010;27(5):377–388. pmid:20740059
  165. 165. Rohrmeier M, Widdess R. Incidental Learning of Melodic Structure of North Indian Music. Cognitive Sci. 2017;41(5):1299–1327. pmid:27859578
  166. 166. Huron D. Interval-Class Content in Equally Tempered Pitch-Class Sets: Common Scales Exhibit Optimum Tonal Consonance. Music Percept. 1994;11(3):289–305.
  167. 167. Gill KZ, Purves D. A Biological Rationale for Musical Scales. Plos One. 2009;4(12):1–9. pmid:19997506
  168. 168. Aucouturier JJ. The Hypothesis of Self-Organization for Musical Tuning Systems. Leonardo Music J. 2008;18:63–69.
  169. 169. Berezovsky J. The Structure of Musical Harmony as an Ordered Phase of Sound: A Statistical Mechanics Approach to Music Theory. Sci Adv. 2019;5(5):eaav8490. pmid:31114802
  170. 170. Large EW, Kim JC, Flaig NK, Bharucha JJ, Krumhansl CL. A Neurodynamic Account of Musical Tonality. Music Percept. 2016;33(3):319–331.
  171. 171. Dowling WJ. The Cognitive Framework for Musical Scales in Various Cultures. J Acoust Soc Am. 1988;84(S1):S204–S204.
  172. 172. Phillips E, Brown S. Beyond Harmonicity: Toward a Vocal-Melodic Theory of the Origin of Musical Scales. Music in Human Experience: Perspectives on a Musical Species. 2022; p. 247.
  173. 173. Dowling WJ. Scale and Contour: Two Components of a Theory of Memory for Melodies. Psychol Rev. 1978;85:341–354.
  174. 174. Clough J, Engebretsen N, Kochavi J. Scales, Sets, and Interval Cycles: A Taxonomy. Music Theor Spectrum. 1999;21(1):74–104.
  175. 175. Harasim D, Schmidt SE, Rohrmeier M. Axiomatic Scale Theory. J Math Music. 2020;14(3):223–244.
  176. 176. Balzano GJ. The Group-Theoretic Description of 12-Fold and Microtonal Pitch Systems. Comput Music J. 1980;4(4):66–84.
  177. 177. Trehub SE, Schellenberg EG, Kamenetsky SB. Infants’ and Adults’ Perception of Scale Structure. J Exp Psychol Human. 1999;25(4):965. pmid:10464941
  178. 178. Pelofi C, Farbood MM. Asymmetry in Scales Enhances Learning of New Musical Structures. P Natl Acad Sci Usa. 2021;118(31). pmid:34326245
  179. 179. Verosky NJ. Hierarchizability as a Predictor of Scale Candidacy. Music Percept. 2017;34(5):515–530.
  180. 180. Tierney AT, Russo FA, Patel AD. The Motor Origins of Human and Avian Song Structure. P Natl Acad Sci Usa. 2011;108(37):15510–15515. pmid:21876156
  181. 181. Sundberg J, Titze I, Scherer R. Phonatory Control in Male Singing: A Study of the Effects of Subglottal Pressure, Fundamental Frequency, and Mode of Phonation on the Voice Source. J Voice. 1993;7(1):15–29. pmid:8353616
  182. 182. West ML. The Babylonian Musical Notation and the Hurrian Melodic Texts. Music Lett. 1994;75(2):161–179.
  183. 183. Velichkina O. Playing Panpipes in Southern Russia: History, Ethnography, and Performance Practices. Ohio State University; 1998.
  184. 184. Can MC. Geleneksel Türk Sanat Müziğinde Arel Ezgi Uzdilek Ses Sistemi Ve Uygulamada Kullanılmayan Bazı Perdeler. Gazi Üniversitesi Gazi Eğitim Fakültesi Dergisi. 2002;22(1).
  185. 185. Kunst J. Music in Java: Its History, Its Theory and Its Technique. Springer; 1949.
  186. 186. Cowen T. Creative Destruction: How Globalization Is Changing the World’s Cultures. Princeton University Press; 2009.
  187. 187. Santiago GAS. Los Artefactos Sonoros Del Oaxaca Prehispánico. vol. 3. Secretaría de Cultura del Estado de Oaxaca; 2005.
  188. 188. Rawcliffe S. Eight West Mexican Flutes in the Fowler Museum. The World of Music. 2007;49(2):45–65.
  189. 189. Yoshikawa S, Kajiwara K. Cross Fingerings and Associated Intonation Anomaly in the Shakuhachi. Acoustical Science and Technology. 2015;36(4):314–325.
  190. 190. Gfeller B, Roblek D, Tagliasacchi M, Li P. Learning to Denoise Historical Music. In: ISMIR 2020—21st International Society for Music Information Retrieval Conference; 2020.
  191. 191. Marolt M, Bohak C, Kavčič A, Pesek M. Automatic Segmentation of Ethnomusicological Field Recordings. Applied Sciences. 2019;9(3).
  192. 192. Benetos E, Dixon S, Duan Z, Ewert S. Automatic Music Transcription: An Overview. Ieee Signal Proc Mag. 2019;36(1):20–30.
  193. 193. Mauch M, Cannam C, Bittner R, Fazekas G, Salamon J, Dai J, et al. Computer-Aided Melody Note Transcription Using the Tony Software: Accuracy and Efficiency. In: Proceedings of the First International Conference on Technologies for Music Notation and Representation; 2015.
  194. 194. Mauch M, Frieler K, Dixon S. Intonation in Unaccompanied Singing: Accuracy, Drift, and a Model of Reference Pitch Memory. J Acoust Soc Am. 2014;136(1):401–411. pmid:24993224