Modifying Deutsch’s scale illusion for application in music

Issei Ichimiya; Hiroko Ichimiya

doi:10.1371/journal.pone.0280452

Abstract

Deutsch’s scale illusion demonstrates that the overall pitch range is the preferred organization when in competition with both local (note-to-note) pitch proximity and laterality (differences in the input ear). Such intricate factors can make it difficult to mimic this illusion. If a note is under a condition in which grouping by the overall pitch range and the local pitch proximity do not conflict, we hypothesized that an illusion would be perceived simply as the result of the competition between pitch proximity and laterality. In this paper, we aimed to replicate such a condition by modifying Deutsch’s scale illusion. Psychophysical studies were conducted with healthy subjects. In the first half of the study, the C major scale with successive tones was presented in ascending form, alternating between the right and left ears; counterpart notes were simultaneously presented to the opposite ear, and the subjects were asked to listen to these dichotic tone patterns. Several counterpart notes were applied; we found that when the sequences of counterpart notes were close in note-to-note pitch proximity and were not overlapped with the ascending scale in pitch, the subjects appeared to perceive the scale clearly. In the latter half of the study, we applied this condition in music and devised auditory illusions such that melodies of the passages of "Lightly Row," "Cherry Blossoms," and "Jingle Bells" were perceived by listening to "jagged" dichotic tone patterns. The method we described in this paper is simple, and it is possible to easily create auditory illusions in music by applying our method.

Citation: Ichimiya I, Ichimiya H (2023) Modifying Deutsch’s scale illusion for application in music. PLoS ONE 18(2): e0280452. https://doi.org/10.1371/journal.pone.0280452

Editor: Nicola Megna, Istituto di Ricerca e di Studi in Ottica e Optometria, ITALY

Received: July 29, 2021; Accepted: January 3, 2023; Published: February 1, 2023

Copyright: © 2023 Ichimiya, Ichimiya. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting information files.

Funding: The authors received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Scale illusion, which was first reported by Deutsch [1–8], results from illusory conjunctions of pitch and location. The pattern that gives rise to this illusion is shown in Fig 1a and is constructed from two diatonic major scales, one ascending and the other descending, when played simultaneously. The notes presented to each ear are alternately drawn from the ascending and descending scales, giving rise to "jagged" input patterns in each ear [5]. When listening to this pattern through earphones, most subjects experience the illusion shown in Fig 1b: a melody corresponding to the higher tones is heard as coming from one earphone, while a melody corresponding to the lower tones is heard as coming from the opposite earphone. It is also interesting that listeners mostly perceive the higher stream coming from the right and the lower stream coming from the left, and the apparent locations of the higher and lower tones often remain fixed when the earphone positions are reversed.

Download:

Fig 1. Schema of Deutsch’s scale illusion.

(a) The stimulus is constructed from two diatonic major scales, one ascending and the other descending. (b) Through earphones, the melody patterns most subjects experience hearing while listening to (a). Few subjects perceive a full ascending or descending scale, which are shown in (c).

https://doi.org/10.1371/journal.pone.0280452.g001

Such effects can occur when listening to music. At the beginning of the final movement of Tchaikovsky’s Sixth Symphony, the notes from the theme alternate between the first and second violin parts, and the notes from the accompaniment alternate reciprocally [6]. The passage, however, is not perceived as it is performed; rather, one violin part appears to be playing the theme and the other the accompaniment.

Indeed, scale illusion has an intriguing effect; however, only a few variations or musical pieces creating similar illusions have been reported [6]. The intricate factors that give rise to this illusion make it difficult to mimic. Listeners, it is believed, do not perceive the "jagged" patterns that are presented to each ear because grouping by pitch proximity is powerful when in competition with laterality (i.e., differences in the input ear). Listeners instead perceive tones that are reorganized in space in accordance with their melodic reorganization; however, this explanation does not fully explain why a melody corresponding to higher tones is heard as coming from one ear, while a melody corresponding to lower tones is heard as coming from the opposite ear. If the subjects follow the pattern purely based on local (note-to-note) pitch proximity, they should hear a full ascending or descending scale [2, 4], as shown in Fig 1c, but few listeners perceive this. This observation indicates that the subjects are invoking overall pitch range as well as local pitch proximity in making grouping judgments. The strength of the overall pitch range on grouping is described in a study by Tougas and Bregman [9]. In their study, they presented a pure-tone stimulus with an "X" pattern to subjects, which consisted of two simultaneously gliding tones, one ascending and the other descending. The pattern of tones that many listeners reported was the bouncing percepts (a high glide falling and then rising and a low glide rising and then falling). Also, in a follow-up study of a Deutsch’s scale illusion [5], the overall pitch range has been shown to have a large influence in grouping. In their study, the structure of the pattern of notes used in the original scale illusion study was altered slightly by adding or subtracting a pair of notes from the ends of the sequence. On hearing these notes, more listeners perceived hearing a full ascending or descending scale, but most listeners still heard the tones as two non-overlapping pitch streams.

If a note is presented under a condition in which grouping by the overall pitch range and by the local pitch proximity do not conflict, we believe that the illusion can be perceived simply as the result of the competition between pitch proximity and laterality. We have intended to demonstrate such notes in this study. In the first half of the study, the C major scale with successive tones was presented in ascending form, alternating between the right and left ears. Several counterpart notes, which were simultaneously presented to the opposite ear, were compared to find the condition in which the illusion can be perceived effectively. Further, in the latter half of the study, we applied this condition in music, and examined if auditory illusions of well-known melodies can be perceived by listening to "jagged" dichotic tone patterns. Returning to Tchaikovsky’s Sixth Symphony, we will never know whether it was his intention to produce a spatial illusion, or whether he expected the audience to hear the theme waft back and forth between the two sides of the space. However, we know that the passage he created is not the only one that can produce the illusion. The method we have described in this study is simple, and it is possible to easily create more illusions in music by applying our method.

In this paper, the note pattern was drawn as a line graph, as shown in Fig 2, for easier comparison with other note patterns. Fig 2 shows the scale illusion, which is equivalent to Fig 1a and 1b.

Download:

Fig 2. Schema of Deutsch’s scale illusion drawn as a line graph.

This figure is equivalent to Fig 1a and 1b and is shown here for easier comparison with other note patterns. Stimulus (R), which is shown by the red circles and solid line, represents the notes presented to the right ear. Stimulus (L), which is shown by the blue Xs and dotted line, represents the notes presented to left ear. The orange solid line and the green dotted line show the percept coming from the right and the left ears, respectively.

https://doi.org/10.1371/journal.pone.0280452.g002

Experiment 1a

Materials and methods

Subjects.

Twenty volunteers (11 men and nine women; mean age, 38.1 years) were included. Subjects in Experiment 1a, as well as in later experiments, had normal hearing, had no neurological conditions, and were right-handed. The study protocols in this paper were reviewed and approved by the Clinical Research Ethics Committee of Ichimiya Clinic, and the study was conducted in accordance with the Declaration of Helsinki. Written informed consent was obtained from all subjects prior to their inclusion in the study.

Equipment

The same equipment was used throughout the experiments in this study. The stimulation tones were made using publicly available software, Wave Editor TWE (Yamaha Corporation, Tokyo, Japan). The tones were sinusoids of equal amplitude, and the duration of a tone component was set at 250 ms. The rise and fall times for each tone component were 10 ms, and there were no gaps between the adjacent tone components. Their frequencies ranged from 208 to 554 Hz and were set according to the 12-tone equal temperament tuning system. Tones were saved in the form of Waveform Audio Files (Microsoft Corporation, Redmond, WA, USA) with a sampling rate of 44.1 kHz/16-bit resolution.

Using an Aspire S3 computer (Acer America Corporation, San Jose, CA, USA) with a universal serial bus audio processor (SE-U55SXII; Onkyo Digital Solutions, Tokyo, Japan), auditory stimuli were delivered through dynamic headphones (MDR-7506; Sony, Tokyo, Japan) at a level of 75 dB SPL.

Tasks

The four dichotic tone patterns in S1 Table and Fig 3 were used in this experiment. The C major scale with successive tones was presented in ascending form, alternating between the right ear and left ear. When a tone from the scale was presented to the right ear, the pitch of the tones presented to the left ear was one of the following: two whole tones lower (Fig 3a), one whole tone lower (Fig 3b), equal (Fig 3c), or one whole tone higher than that of the right ear (Fig 3d). The tonal sequence presented to the right ear was the same for all four dichotic tone patterns. When the note of the ascending scale was presented to the left ear, the tone that was two whole tones lower was presented to the right ear. The aforementioned four dichotic tone patterns were named 2TL, 1TL, 0T, and 1TH, respectively. Theoretically, if the subjects followed the note-to-note pitch proximity, these dichotic tone patterns might have been perceived as ascending scales. If the subjects judged these tone patterns based on the overall pitch range, 2TL, 1TL, and 0T might also have been perceived as ascending scales. However, 1TH may have been perceived as a "jagged" pattern because the pitch of the left ear was always higher than that of the right ear.

Download:

Fig 3. Dichotic tone patterns for Experiment 1a.

An ascending scale was presented alternately to the right and left ear. Stimulus (R) and Stimulus (L) show the tone patterns presented to the right and left ear, respectively. When a tone from the scale was presented to the right ear, the pitch of the tones presented to the left ear was one of the following: (a) two whole tones lower (2TL), (b) one whole tone lower (1TL), (c) equal (0T), or (d) one whole tone higher (1TH) than that presented to the right ear.

https://doi.org/10.1371/journal.pone.0280452.g003

Tasks were performed in a manner similar to those used in our previous study [10]. The computer monitor displayed two buttons that played two of the four above-mentioned tone patterns. The subjects were asked to click on the two buttons, listen, and choose the one where they heard the ascending scale more clearly by answering a questionnaire. They could click the buttons multiple times before responding. The tone patterns that switched between the right and left ears were tested in the same way. Thus, each subject was presented with 12 tasks. The order of the tone pairs and the order of the two buttons displayed were random.

Statistical analysis

For each of the four tone patterns, the ratio was obtained by dividing the number the subject chose by three (i.e., the number of the tone patterns used for comparison). The values of these four ratios were compared using the Friedman test, followed by post hoc Wilcoxon signed rank test with Bonferroni corrections. A p value < 0.05 was considered to be statistically significant. The statistical analyses in this paper were performed using EZR version 1.52 (Saitama Medical Center, Jichi Medical University, Saitama, Japan) [11], which is a graphical user interface for R version 4.02 (The R Foundation for Statistical Computing, Vienna, Austria). More precisely, it is a modified version of R commander designed to add statistical functions frequently used in biostatistics.

Results and discussion

The results are shown in S2 Table. Fig 4 shows that the mean ratio of the ascending scale was perceived more clearly when compared with the other tone patterns. Because there were no statistically significant differences in the mean ratio when the right and left ears were switched, the data shown here combine the results of when the right and left ears were switched. The ratio in the case of 2TL and 0T was statistically higher than that in the case of 1TL (2TL vs. 1TL, p = 0.023; 0T vs. 1TL, p = 0.022). Interestingly, 2TL, in which the tone patterns of the ascending form and the tones that are two whole tones lower are presented alternately, is not statistically different from 0T, in which the definite ascending scale (not a "jagged" tonal sequence) is presented in one ear. Few subjects perceived the ascending scale when they heard the tonal sequence 1TH. The ratio was significantly lower than that of all the other tone patterns (2TL vs. 1TH, p = 0.002; 1TL vs. 1TH, p = 0.001; 0T vs. 1TH, p = 0.001).

Download:

Fig 4. Mean ratios of perception of the ascending scale in Experiment 1a.

The graph shows the mean ratio of the ascending scale that was perceived more clearly when compared with the other tone patterns. *: p < 0.05. **: p < 0.05 against all other tone patterns.

https://doi.org/10.1371/journal.pone.0280452.g004

In contrast to the Deutsch’s scale illusion study [1, 2] that embedded both ascending and descending scales in the notes, we only embedded the scale in the ascending form. Thus, it was easy to devise dichotic tone patterns where the overall pitch range and the local pitch proximity do not conflict. From the results of this experiment, we speculated that the scale would be clearly perceived as described below. Fig 5 shows an additional line graph drawn on Fig 3. The additional line that connected the notes was the counterpart of the ascending scale, which was presented alternately from the opposite ears. The additional line of 2TL appears to be in good continuation; that is, when the sequences of counterpart notes were close in note-to-note pitch proximity, the subjects appear to perceive the ascending scale more clearly. To address this issue, another dichotic tonal sequence in which the counterpart notes would be in good continuation was examined in the next experiment.

Download:

Fig 5. Dichotic tone patterns showing the counterpart notes.

Images showing an additional line graph (green dotted line) in Fig 3. The notes that are connected with the green line are counterparts of the ascending scale that are presented alternately from opposite ears.

https://doi.org/10.1371/journal.pone.0280452.g005