Figures
Abstract
The objective of this study was to examine whether different types of mental-imagery training focused on the vocal apparatus can enhance awareness of the vocal tract and diaphragm (vocal awareness) in non-professional singers.Sixty participants with no singing education received one of three training conditions: following instructions based on 1) a description of the physiological changes that take place during phonation (physiological description), 2) imitating an action using the vocal apparatus (imitative action), and 3) a metaphorical narration. Imitative action and metaphorical narration were conceptualized as more imaginative forms of training. Vocal awareness was assessed with a questionnaire that participants completed before and after the training. The questionnaire measured three indices: vocal apparatus representation, vocal apparatus interoceptive awareness, and vocal self-regulation. Results showed that all three types of training program significantly enhanced vocal awareness, but imitative action and metaphorical narration were more effective for interoceptive awareness, and metaphorical narration was more effective for self-regulation. In conclusion, the two imaginative forms of training were more effective than physiological description for improving vocal awareness.
Citation: Biassoni F, Vismara G, Gnerre M (2025) Efficacy of imaginative vocal training for enhancing vocal awareness in non-professional singers. PLoS One 20(12): e0338789. https://doi.org/10.1371/journal.pone.0338789
Editor: Ramandeep Kaur, Father Muller Charitable Institutions, INDIA
Received: July 27, 2025; Accepted: November 27, 2025; Published: December 12, 2025
Copyright: © 2025 Biassoni et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data Availability Statement: The data that support the findings of this study are available at https://osf.io/aqfz4/?view_only=1a64cdaaab3e4fa69a5f4b273d33d7a3.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Knowledge of the body, and of the voice, is of the utmost importance when learning to sing, as singers are their own instrument [1]. Vocal awareness, an essential element of the vocal self-concept, encompasses knowledge and understanding of one’s own unique voice [1–3]. It demands a mindful grasp of the processes facilitated by the muscles of phonation that generate and modify the voice [4]. This understanding serves as the foundation for the most proficient use and control of the voice. The limited and fragmented body of literature on enhancing vocal awareness converges on three key abilities: (1) vocal apparatus representation, (2) developing interoceptive awareness of the vocal apparatus, and (3) exercising vocal self-regulation [5–14]. Proponents of the Feldenkrais method [10], the Linklater method [12], and the Magnani method [11] argue that attending to one’s own voice and learning through the direct experience of bodily sensations foster greater awareness of its power and qualities. Similarly, within the Borragán method, Agudo et al. [7,8] emphasize the importance of body intelligence, that is, the ability to represent and remain aware of one’s own posture and actions. In contrast, teachers of the Wilfart [15], Le Huche [5,6], and Weiss [14] methods maintain that there is a relationship between the perception and representation of subjective bodily experience, on the one hand, and the effective management of vocal performance, on the other.
Although these methods differ in their specific procedures and theoretical underpinnings, they converge in emphasizing the central role of bodily awareness, and particularly vocal awareness, in supporting performance in both speech and singing. Within the vocal pedagogy literature [16–21], the vocal tract and the diaphragm are consistently identified as key components of the vocal apparatus, and of the phonatory system more broadly, that warrant particular attention when cultivating bodily awareness. It has been proposed that variability in vocal performance depends in part on the degree of awareness of these components, even though they are often poorly understood by learners and performers [22,23]. It has also been proposed that mental imagery contributes to vocal awareness by enabling individuals to form and draw on mental representations of the anatomical structures involved in vocal production and how they move [24,25]. For example, mentally rehearsing the movements of the articulators involved in speech production, and anticipating the resulting auditory feedback, can be effective because it engages both the motor and perceptual systems [26].
Vocal awareness is as important for non-professional singers as it is for those who earn their living by using their voices in a professional capacity. Moreover, increasing awareness of the body, and particularly of the voice, generally enhances self-knowledge [27–29] and can improve the effectiveness with which emotions and other communicative intentions are conveyed [30–32]. Although several methods for teaching singing are grounded in bodily knowledge and vocal awareness [7–10,12,13,33–35], relatively few studies have evaluated their effectiveness [7,8,27,36], and even fewer have focused on their application with singers who do not aspire to professional careers.Evidence supports the effectiveness of mental-imagery training to improve the performance of athletes [37] and actors [38], but not in singers. Knowledge and awareness of the body – especially the vocal apparatus – are as important to singers as they are to athletes and actors. We were interested in the potential for using mental imagery in vocal training. We therefore investigated the effects of three types of training based on mental imagery on non-professional singers’ vocal awareness. We chose to focus on awareness of the vocal tract and diaphragm because these are the most important components of the vocal apparatus. Singers rely on mental representations of their body and specifically their vocal apparatus to control phonation, so it is essential that they can form and draw on these mental representations [39–42].
Factors involved in vocal awareness
Many studies have shown that people who attend to their own voluntary and involuntary bodily actions and responses can enhance their knowledge of the body and increase their motor-control efficiency [43–49]. We have already referred to three key abilities underlying vocal awareness: 1) the vocal apparatus representation; 2) interoceptive awareness of the vocal apparatus; and 3) self-regulation, which involves managing and controlling vocal behaviors—such as pitch, volume, quality, and resonance—through conscious monitoring, adjustment, and refinement of vocal techniques and habits.
To acquire adequate motor-perceptual knowledge of the functioning of the vocal system, one must be able to perceive the actions of its different parts [44,50–53] and to picture them mentally [46–49,51,54–56]. To be able to represent one’s own voice (i.e., to be aware of it and listen to it internally) one must process and make use of physical sensations in the body [44,57–60], especially the functions of its internal organs, such as blood pressure, heartbeat, thermoregulation, breathing rhythm, digestion, and feelings such as arousal, which is related to the activity of the autonomic nervous system [57,58,61,62]. Together, these are known as interoception. The perception of muscle tension, movement, posture, balance, and tendon joint activity [44,45,63] is known as proprioception [7–10,14]. It has been suggested that the mouth and the larynx contain proprioceptive receptors that aid their localization and muscle control [16–20], but research on the proprioceptive experience of the diaphragm has yielded inconclusive and disputed findings [17,21].
It has been hypothesized that, to self-regulate and modulate vocal performance, one must be able to mentally represent the vocal apparatus, including the phonatory system, to use interoceptive and proprioceptive awareness of the vocal apparatus in action, and to integrate this awareness and knowledge with executive control and emotional sensitivity [7,8,10,64,65]. Methods of teaching singing based on these hypotheses have been tested and shown to be effective [5–8,11,13,14].
Mental imagery and vocal awareness
One way of teaching singing is using the so-called empirical method [34,35,66]. This method uses language to evoke images that enable the singer to prepare themself both physically and mentally to sing [5,6,11–13,67]. Phonation can be activated in two ways: via imitative action or responding to a metaphorical narration, both of which require imagination on the part of the singer. Imitative action relies on analogical thinking, whereby parallels or similarities are drawn between two or more concepts, situations, or objects to understand or solve problems, while metaphor is a form of symbolic thinking, referring to symbols that may not be physically present [79]. Imitative action facilitates learning and understanding by encouraging learners to replicate observed behaviors and to create a connection between a model and the action to be performed, whereas metaphorical narration creates connections between concepts. Imitative action and metaphorical narration allow learners to gain a nuanced understanding by making connections between singing and familiar concepts. In short, they can both serve as cognitive strategies used to grasp and reproduce phonatory acts.
In imitative action training, learners are asked to create a mental image of how the vocal apparatus moves when they speak by simulating actions (doing); in metaphorical narration training they imagine carrying out particular behaviours that can affect the way they speak (imagining) [5,6,11–13,34,35,66,67]. Both imitative-action and metaphorical-narration training are forms of mental-imagery training that require imagination and draw on the connection between the use and understanding of abstract and metaphorical language and sensorimotor bodily experience [68–72]. Several studies have shown that metaphorical language may help one think about and explore one’s own body and its functions more easily [73–81].
Objectives
In this study we tested the effects of three training programs on three indices of vocal awareness collected through a self-report questionnaire completed before (Time 1) and after the training (Time 2). The three training programs required participants to follow instructions based on 1) a description of the physiological changes that take place during phonation (physiological description), 2) imitating an action using the vocal apparatus (imitative action), and 3) a metaphorical narration. The three indices of awareness of the vocal tract and diaphragm respectively were 1) vocal apparatus representation2) interoceptive awareness of the vocal apparatus (awareness), and 3) vocal self-regulation (self-regulation).
Hypotheses
Based on the reviewed literature, we developed three key hypotheses to guide our investigation. First, we anticipated a general effect of time on vocal awareness (H1). Specifically, we expected that participants, regardless of the training method they received, would exhibit higher vocal awareness scores after training compared to before. Second, we hypothesized that the type of training would play a significant role in shaping vocal awareness (H2). More specifically, we predicted that participants undergoing imaginative training—whether through imitative action or metaphorical narration—would demonstrate greater improvements in vocal awareness compared to those receiving physiological description training. Finally, we expected an interaction effect between time and training type (H3). In other words, we hypothesized that the increase in vocal awareness from pre- to post-training would be more pronounced among participants exposed to imaginative training (imitative action or metaphorical narration) compared to those who received training based on physiological descriptions.
Method
Participants
Sixty non-professional singers (30 F, 30 M) took part in the study. Their mean age was 30.5 years (SD = 7.54, range 19–45). The size of the sample was calculated using G*Power 3.1.9.2 [82]. To achieve a statistical power of 95%, with a medium effect size (0.25), and an α level of.01, the sample size of 60 is recommended. Participants were recruited through advertisements in the local media. Inclusion criteria were that: 1) participants had not attended any vocal course or training focused on vocal awareness nor singing courses before, and 2) participants had no vocal nor hearing impairment. Participants were divided into three groups of 20 matched for sex and age and then assigned quasi-randomly to a group in which they received one form of training (physiological description, imitative action, or metaphorical narration) to enhance awareness of the vocal tract and diaphragm.
Participants were recruited between November 15, 2023, and February 10, 2024. All participants provided written informed consent prior to taking part in the study. The research was approved by the Ethics Committee for Research in Psychology (CERPS) of the Catholic University of the Sacred Heart (Protocol No. 60−23, Approval Date: May 24, 2023). No minors were involved in the study.
Materials
My voice and me questionnaire.
Since no questionnaire measuring vocal awareness has been published, we developed an ad hoc questionnaire. First, the questionnaire measured participants’ ability to form avocal apparatus representation. Items 1, 4, and 8 were based on previous research [39–42], adapted from the Body Consciousness Questionnaire [83] and the Body Intelligence Scale [84] with an α of.53. In Item 8, participants were shown the silhouette of a human being and asked to locate the vocal apparatus on the silhouette; correct and incorrect responses were identified by means of a numbered grid. Second, it measured participants’ interoceptive and proprioceptive awareness of the vocal apparatus. Items 2, 5, and 7 were based on previous research [7–10,14] with an α of.75. Third, it measured participants’ vocal self-regulation. Items 3 and 6 were adapted from a voice perception questionnaire reported in Fussi [85] and the Voice Handicap Inventory [86] with an α of.74. Three experts in singing and voice education rated the comprehensibility, relevance, and consistency of the items testing the three forms of vocal awareness, and 10 participants in a pilot study (5 m, 5 f), aged 20–40 years, also rated the comprehensibility of the questions and responses. Other than for the item for which participants located the vocal apparatus, responses were given using 5-point Likert-type scales from 1 (strongly disagree) to 5 (strongly agree), so the maximum that could have been scored for the seven scored items was 35.
Mental-imagery training programs
Each of the three training programs was designed to be consistent with the literature on singing teaching [5–8,11–13,34,35,87,88] and focused on awareness of the vocal tract and diaphragm.
Physiological description.
According to Garcia [89] specialized singing training requires an in-depth understanding of vocal physiology, learned from description and practice [11,90]. Physiological descriptions are technical explanations of the functions of the vocal tract and diaphragm and may include instructions for specific exercises, such as “Move the base of your tongue back;” “Lower your diaphragm;” “Contract your pelvic floor” [11]. This was the most concrete of the three training programs in its use of descriptive, specialist terminology and required the participant to make least use of their imagination.
Imitative action.
According to Mercado et al., [91], vocal imitation involves synchronizing an auditory stimulus with the vocal motor system, whether this is achieved with or without effort. Rather than asking participants to imitate an auditory stimulus we asked them to act as if they were carrying out a task that would influence phonation, such as “Sing as if you were yawning;” “Sing as if you had a hot potato in your mouth” [34,35]. If the participant followed the instruction correctly, they would produce, more or less automatically, the correct position of tongue, mouth, and palate for singing (vocal posture). This training program was more abstract than physiological description in its use of language and required the participant to make more use of their imagination.
Metaphorical narration.
Participants were invited to listen to narratives evoking images to illustrate the physiological processes that take place in the vocal system, and to use mental imagery themselves while experiencing the physical sensations of speaking. The processes of breathing in and out, for example, are evoked in the instruction “Speak as if an umbrella was opening inside your belly,” which could raise the participant’s awareness of their abdominal muscles and thus help them lower their diaphragm, Similar instructions include “Speak as if the sound were coming out of your back;” “Smile internally when you sing;”;“Feel as though your body were a sandbag leaning against a marble column” [12,34,35]. Damasio [41] also emphasizes the importance of iconic and symbolic language in the representation of body, such as the limbs and parts of the vocal apparatus, including the phonatory system, and actions; he describes our capacity for creating mind maps, whether or not these are in the form of images, that help us imagine and represent the shape of our limbs and their position in space. This was the most abstract of the three training programs in its use of metaphorical language and required participants to make the most use of their imagination. See Table 1 for a summary of the three programs.
Procedure
Participants undertook single experimental sessions, individually, in a soundproofed room. They were asked to stand up while they were carrying out the pre- and post-training performance tasks, but they sat down while completing the pre- and post-task questionnaires and while receiving the training. They carried out each of the performance tasks three times.
The protocol was as follows for vocal tract awareness:
- Pre-training performance task (5 s): utter the phrase Ciao Aurora(using resonators and vowel articulation to produce appropriateprosody)
- Complete My Voice and Me questionnaire
- Training phase (5 min)
- Post-training performance task for vocal tract awareness (5 s): utter Ciao Aurora again
- Complete My Voice and Me questionnaire again.
The protocol was as follows for diaphragm awareness:
- Pre-training performance task: maintain the vowel sound ee for as long as possible (a vocalization depending on the proper management of breath and breathing)
- Complete My Voice and Me questionnaire
- Training phase (5 min)
- Post-training performance task for diaphragm awareness: maintain thevowel sound ee for as long as possible again
- Complete My Voice and Me questionnaire again.
Data analyses
Mixed-model 2 x 3 analyses of variance (ANOVA) were conducted. The within-participant variable was time (pre- vs. post-training) and the between-participants variables were the three training programs (physiological description, imitative action, metaphorical narration) for each of the three outcome measures (vocal apparatus representation, interoceptive awareness, and vocal self-regulation) relating to the vocal tract and diaphragm tasks respectively. Tukey post hoc analyses were conducted. The level of statistical significance was set at p ≤ .05. Bonferroni corrections were applied.
Results
Mean and standard deviation values for the training activities are reported in Table 2. We predicted a main effect of time, such that participants would score higher on measures of vocal awareness after taking the training, and a main effect of training type, such that participants would score higher on measures of vocal awareness (both for the diaphragm and vocal tract) following imaginative training, either through imitative action or metaphorical narration.
Vocal apparatus representation
For vocal tract, the analysis revealed a significant main effect of time, [F(1, 57) = 10.21, p < .001, η²ₚ = .15], indicating a decrease in vocal apparatus representation scores from pre-training (M = 6.97, SD = 1.76) to post-training (M = 6.73, SD = 1.91). This is inconsistent with H1. Moreover, no significant main effect of training type was found, suggesting that the three training methods did not differ significantly in their impact on vocal apparatus representation scores (p > 0.05). Moreover, no significant interaction effect was found (p > .05). Thus, H2 and H3 were not supported.
A chi-square test assessed changes in participants’ identification of phonatory parts before and after training, based on their selections on a human figure (Table 3). The results showed a statistically significant difference for several phonatory parts: Heart: χ²(1) = 5.84, p = .016; lungs: χ²(1) = 9.17, p = .002; head: χ²(1) = 33.22, p < .001; mouth (tongue, palate, lips): χ²(1) = 7.80, p = .005; diaphragm: χ²(1) = 23.47, p < .001. Conversely, no statistically significant difference was found for the vocal cords and throat, although a trend was observed (χ²(1) = 3.15, p = .076). Specifically, after the training, the mouth was more frequently recognized as a key part of the vocal tract responsible for producing vocal utterances than before the training (see Fig 1), and subjects who underwent imitative action training identified the throat and mouth as the origin of voice production less frequently in their post-training performance than in their pre-training performance (throat: 90% in pre-training performance vs. 70% in post-training performance; mouth: 30% in pre-training performance vs. 15% in post-training performance). In contrast, subjects who participated in physiological description training identified the throat and mouth as the origin of voice production more frequently in their post-training performance than in their pre-training performance (throat: 65% in pre-training performance vs. 70% in post-training performance; mouth: 20% in pre-training performance vs. 30% in post-training performance).
As far as diaphragm representation was concerned, no significant main effect of time emerged (p > .05); therefore, H1 was not supported. However, there was a significant main effect of training type, F(2, 57) = 24.36, p < .001, η²ₚ = .30. Post hoc analyses showed that participants in the imitative action training (M = 7.85, SD = 1.49) and metaphorical narration training (M = 8.15, SD = 1.41) conditions reported significantly higher diaphragm representation scores than those in the physiological description condition (M = 7.05, SD = 1.63), p < .05, supporting H2. No significant time × training interaction emerged (p > .05), indicating that the effect of training did not vary over time; thus, H3 was not supported. A chi-square test was conducted to compare participants’ identification of different phonatory parts before and after training, based on their selections on a human figure (Table 3). The results indicated a statistically significant difference for several phonatory parts: Vocal cords and throat: χ²(1) = 77.91, p < .001; Heart: χ²(1) = 80.53, p < .001; Lungs: χ²(1) = 82.65, p < .001; Head: χ²(1) = 101.40, p < .001; Diaphragm: χ²(1) = 74.53, p < .001. However, the diaphragm was identified as the main part involved in the proposed vocal exercise in the post-training phase (55%), unlike the pre-training phase, where this muscle was given little relevance (25%) (see Fig 2).
Vocal apparatus interoceptive awareness
Regarding the vocal tract, participants demonstrated significantly higher interoceptive awareness in the post-training performance compared to the pre-training performance [F(1, 57) = 5.77, p = .05, η²ₚ = .021], confirming H1. Specifically, participants reported significantly higher scores in the post-training phase (M = 10.68, SD = 2.82) compared to the pre-training phase (M = 10.00, SD = 2.48), p = .04. A significant main effect of training was also found [F(2, 57) = 3.96, p = .04, η²ₚ = .056], indicating that training type influenced interoceptive awareness scores, confirming H2. Post hoc analyses revealed that participants in the imitative action training group demonstrated significantly greater improvement in interoceptive awareness compared to those in the physiological description training group (p < .001). However, no significant difference was found between the Imitative Action and metaphorical narration training groups (p > .05). A significant time × training interaction effect was also found [F(2, 57) = 3.42, p = .05, η²ₚ = .03], confirming H3. This suggests that the effect of training on interoceptive awareness varied over time. Post hoc analysis for the interaction effect confirmed that interoceptive awareness scores significantly increased from pre-training (M = 10.3, SD = 2.79) to post-training (M = 12.3, SD = 2.26) in the imitative action training group, p < .05, and from pre-training (M = 10.9, SD = 1.83) to post-training (M = 12.7, SD = 1.92) in the metaphorical narration training group, p < .05. However, no significant improvement was observed in the physiological description training group, p > .05. These findings confirm that imaginative training methods (IA and MN) were more effective than PD in enhancing interoceptive awareness, both in overall comparison and in pre-post changes, supporting H1 and H2.
Regarding the diaphragm, participants reported significantly higher interoceptive awareness scores in the post-training phase (M = 12.00, SD = 2.36) compared to the pre-training phase (M = 10.00, SD = 2.48), F(1, 57) = 39.95, p < .001, η²ₚ = .41, confirming H1. The main effect of training type was also significant, F(2, 57) = 11.06, p < .001, η²ₚ = .28, indicating that training condition had a substantial impact on diaphragm interoceptive awareness, thus supporting H2. Post hoc analyses showed that participants in the imitative action training group (M = 12.30, SD = 2.26) and in the metaphorical narration training group (M = 12.70, SD = 1.92) reported significantly higher scores than those in the physiological description group (M = 11.00, SD = 2.90), p < .05, whereas no significant difference emerged between the imitative action and metaphorical narration groups (p > .05). The time × training interaction was not significant, F(2, 57) = 0.33, p = .72, η²ₚ = .01, indicating that the pattern of pre- to post-training change did not differ reliably across training conditions; therefore, H3 was not supported, and any apparent differential gains should be interpreted with caution.
Vocal self-regulation
Regarding the vocal tract, participants reported significantly lower vocal self-regulation scores in the post-training phase (M = 6.80, SD = 2.09) compared to the pre-training phase (M = 7.33, SD = 1.65), F(1, 57) = 5.68, p = .021, η²ₚ = .09, contradicting H1. The main effect of training type was not significant, F(2, 57) = 1.31, p = .28, η²ₚ = .02, indicating that the three training methods did not differentially affect overall vocal self-regulation scores; thus, H2 was not supported. Although pairwise comparisons suggested slightly higher self-regulation scores in the imitative action (M = 7.25, SD = 1.39) and metaphorical narration (M = 6.80, SD = 1.99) groups compared to the physiological description group (M = 6.25, SD = 2.66), p = .043, these differences should be interpreted with caution given the non-significant omnibus effect. The time × training interaction was not significant (p > .05), indicating that the pattern of pre- to post-training change did not differ reliably across training conditions; therefore, H3 was not supported for vocal self-regulation of the vocal tract. Regarding the diaphragm, participants reported significantly lower vocal self-regulation scores in the post-training phase (M = 7.08, SD = 2.07) compared to the pre-training phase (M = 7.33, SD = 1.74), F(1, 57) = 5.19, p = .027, η²ₚ = .08, again contradicting H1. In contrast, the main effect of training type was significant, F(2, 57) = 11.06, p < .001, η²ₚ = .16, indicating that training condition substantially influenced diaphragm-related self-regulation, supporting H2. Post hoc analyses showed that participants in the metaphorical narration group (M = 8.05, SD = 1.83) reported higher diaphragm self-regulation scores than those in the physiological description and imitative action groups (ps < .05). The time × training interaction was not significant, F(2, 57) = 0.33, p = .72, η²ₚ = .01, indicating that the effect of training on diaphragm self-regulation did not vary significantly over time; therefore, H3 was not supported.
Discussion
This study investigated the impact of three training approaches—physiological description, imitative action, and metaphorical narration—on vocal awareness, specifically focusing on vocal tract and diaphragm perception. Vocal awareness was assessed using three indices: vocal apparatus representation, vocal apparatus interoceptive awareness, and vocal self-regulation. Although not all measures showed a significant and consistent effect of training across indices of vocal awareness, overall the findings indicate that imaginative training methods (imitative action and metaphorical narration) significantly influenced the development of vocal awareness.
Importantly, training based on imitative action and metaphorical narration was more effective than physiological description in enhancing vocal awareness on most indices. This pattern of findings supports H2 and is consistent with previous research indicating that symbolic imagery and metaphorical language foster deeper and more integrated mental representations of the body [72,76,81]. Furthermore, vocal apparatus interoceptive awareness emerged as the most responsive index across both the vocal tract and diaphragm conditions, reinforcing the idea that interoceptive awareness is a key factor in vocal training and self-perception. Although counter-intuitive, the post-training decreasing in measures of vocal apparatus representation and vocal self-regulation suggests that an early “conscious incompetence” phase of skill acquisition may arise from the training. Indeed, the intervention may have enhanced metacognitive resolution, as indicated by gains in interoceptive awareness, prompting participants to apply stricter internal criteria to what counts as an accurate representation of their vocal apparatus and as effective self-regulation. In this view, lower self-ratings may reflect a response-shift (criterion tightening), rather than a decrease in the participants’ skills: it is plausible that they moved from a coarse, overconfident baseline to a more realistic appraisal. This interpretation of their skills is in line with the Response Shift Theory [92]. Specifically, our findings suggest that the imaginative training appears to have acted as a catalyst, triggering a recalibration of internal standards [92, p. 32], whereby participants began to judge their abilities against a more refined and demanding scale. This recalibration may be looked at as a positive, adaptive process that signifies a deeper engagement with the nuances of vocal production. Consequently, the initial decline in self-reported scores can be interpreted as a necessary and valuable step in the learning curve. It marks the transition from a vague, undifferentiated understanding of one’s voice to a more discerning self-awareness, which forms the essential foundation upon which more advanced vocal technique and reliable self-regulation can be built in future training.
Vocal awareness of vocal tract
In terms of vocal apparatus representation, no significant effect of time or training was observed, suggesting that none of the training methods significantly altered participants’ explicit representation of the vocal apparatus and that neither hypothesis was supported. The representation of the vocal apparatus may involve a conscious cognitive process that is relatively stable over time and may not be easily modified through short-term training [93].
For vocal apparatus interoceptive awareness, a significant main effect of time and training was found, corroborating both hypotheses. This suggests that imaginative techniques facilitate a deeper understanding of the internal bodily processes related to phonation.These findings align with the existing literature [13,25,46,47,94], which emphasizes that bodily imagery and mental representation can enhance bodily perception and interoceptive awareness. In terms of vocal self-regulation, the results did not support an overall improvement in self-regulation, as post-training scores were lower than pre-training scores, contrary to H1. We interpreted this result considering that vocal regulation is a metacognitive process beyond basic bodily awareness, requiring the ability to adapt and modulate voice use based on situational demands. In other words, we suggest that improvements in vocal regulation do not always translate into an immediate sense of control. Instead, they may initially increase awareness of errors and difficulties in voice modulation. However, training type did influence vocal self-regulation, consistently with H2. Participants in the imitative action and metaphorical narration groups indeed exhibited higher scores compared to those in the physiological description group. This suggests that methods relying on imagination and embodied experience may be more effective in fostering self-regulation abilities than those focused solely on explicit anatomical descriptions. This is supported by the theory of embodiment [95], which may suggest that active involvement of the body and imagination enhances the ability to regulate one’s vocal actions more fluidly and intuitively.
Vocal awareness of diaphragm
Regarding diaphragm awareness, training type significantly influenced vocal apparatus representation, with imitative action and metaphorical narration leading to greater improvements than physiological description. These results support H2, reinforcing the idea that more imaginative and experiential learning methods are more effective in fostering awareness of the vocal apparatus. Unlike for vocal tract awareness, however, not all the measures consistently indicated a significant effect of time, suggesting that improvements in diaphragm representation were primarily driven by training type rather than general exposure or repetition. For interoceptive awareness of the diaphragm, the results confirmed both H1 and H2, with significant improvements following training. The strongest effects were observed for metaphorical narration and imitative action, further highlighting the efficacy of these training methods in enhancing interoceptive abilities. These training methods may have facilitated a more vivid and accessible mental representation of diaphragm dynamics, making the perception of its activation more intuitive. Such result is consistent with previous studies on body representation, which highlight how the use of mental imagery and symbolic narratives can enhance awareness of body parts that may also not be immediately visible [96–98].
In terms of vocal self-regulation related to the diaphragm, on the other hand, the findings did not support H1, as post-training scores were lower than pre-training scores. However, H2 was confirmed, since the metaphorical narration group showed higher scores in the post-training phase compared to the pre-training phase. This suggests that the use of metaphorical imagery may help participants internalize self-regulation strategies more effectively than physiological description or imitative action. Our interpretation is that the symbolic language used in metaphorical narration helps foster greater conscious control of the voice, particularly in managing breathing and diaphragmatic support [5,6,11–13,34,35,66]. These results may be explained by the ability of metaphors to bridge abstract concepts with body sensations and diaphragm-related processes, as observed in studies investigating the role of metaphorical language in improving body awareness [73,98,99].
Conclusions
These findings emphasize the potential value of implementing imaginative techniques in vocal education programs to foster a deeper understanding of the physiological aspects of voice production and of the embodied inner perception related to vocal awareness. Notably, the study suggests that training programs that use non-technical language could be especially beneficial for non-professional singers. This opens the possibility of deploying this type of vocal-awareness training to strengthen awareness and the ability to mindfully use one’s vocal apparatus in different groups of people, not only those interested in singing but also professionals whose work relies on vocal performance, including actors, call-center operators, teachers, and radio operators. In other words, on the one hand this approach can democratize the access to effective vocal training, making it more approachable and understandable for individuals without a technical background in music or voice. At the same time, imaginative training designed to enhance vocal awareness may offer an opportunity not only to develop more effective vocal skills, but also to cultivate broader abilities related to bodily awareness, self-contact, and the capacity to express oneself and communicate effectively. Though the conclusions from this research provide some exciting insights to develop training for vocal education based upon the empowerment of awareness of the singing process, some limits merit being noted. Firstly, the My Voice and Me self-report questionnaire has not yet been validated. However, it was developed based on established theoretical frameworks and expert input, ensuring face validity. Additionally, future research should incorporate a more diverse sample to represent the full spectrum of expertise levels. For instance, examining the effectiveness of these training programs in both professional and non-professional singers would require a larger sample and constitutes a promising direction for future studies.
A key limitation concerns the low internal consistency of the Representation subscale. We observed a relatively low Cronbach’s alpha (α = .53). However, this is not unexpected, given that the subscale comprises only three items. Alpha is sensitive to both scale length and average inter-item covariance and tends to increase with the number of items, and for very short scales the coefficient may underestimate internal consistency [100–102].
Furthermore, the questionnaire assumes a direct correlation between body and voice; however, this connection may not naturally emerge in non-professional singers. Future research could further investigate vocal awareness by integrating physiological measurements to corroborate self-reported data, providing a more comprehensive understanding of the interplay between body and voice.
Additionally, while the study indicated improvements in vocal awareness and technique within the duration of the training, the durability of these effects beyond the study’s timeframe remains uncertain. Long-term follow-ups to assess the retention and practical application of the acquired vocal skills would provide a more comprehensive understanding of the training’s lasting impact. Lastly, the discrepancy in the length (number of words) of instructions among the three conditions, as well as the varying cognitive complexity of the mental operations required, may have influenced the study results and deserves further discussion.
As far as future directions for this research are concerned, it would be meaningful to compare the perception of increased awareness self-reported by the participants and the actual differences of the acoustic features of the vocal performance, which were not considered in this study, as well as to develop further instruments to measure the subjective perception of the change in vocal awareness.
References
- 1. O’Bryan J. “We ARE our instrument!”: Forming a singer identity. Research Studies in Music Education. 2015;37(1):123–37.
- 2.
Monks SJ. Perceptions of the singing voice [dissertation]. Sheffield (UK): University of Sheffield; 2007.
- 3. Schneider SL, Sataloff RT. Voice therapy for the professional voice. Otolaryngol Clin North Am. 2007;40(5):1133–49, ix. pmid:17765699
- 4. Crow KM, van Mersbergen M, Payne AE. Vocal Congruence: The Voice and the Self Measured by Interoceptive Awareness. J Voice. 2021;35(2):324.e15-324.e28. pmid:31558332
- 5.
Allali A, Le Huche F. The voice. Tome 2. Pathologies vocales d’origine fonctionnelle. Masson. 1990.
- 6.
Allali A, Le Huche F. The voice: therapy for vocal disorders. Elsevier Masson. 2002.
- 7.
Agudo M, Borragàn-Torre A, Gonzàles MJ. Proprioceptive-elastic method for the treatment of voice disorders. In: Acta V Simposium Nacional Logopedia, 2003. 69–78.
- 8.
Agudo M, Borràgan-Torre A, Gonzàles MJ, Lucchini E, Ricci-Maccarini A. The proprioceptive method of vocal re-education. In: Fussi F, editor. La voce del cantante. Omega Edizioni. 2007. p. 469–85.
- 9. Estill J. Belting and classic voice quality: some physiological differences. Med Probl Perform Art. 1988;3:37–43.
- 10.
Feldenkrais M. Body and mature behavior: A study of anxiety, sex, gravitation, and learning. Berkeley (CA): North Atlantic Books. 2005.
- 11.
Fussi F, Magnani S. La logopedic score: The logopedic management of the singing voice. Omega Edizioni. 2003.
- 12. Linklater K. Vocal Traditions: Linklater Voice Method. Voice and Speech Review. 2018;12(2):211–20.
- 13.
Rohmert G. The singer on the path to sound. Schmidt. 1996.
- 14.
Weiss W. The mobile voice: Method of minimal movements and spatialization. Masson. 1996.
- 15.
Wilfart S. The song of being: analyzing, building, harmonizing through the voice. Albin Michel. 1997.
- 16. Cooper S. Muscle spindles in the intrinsic muscles of the human tongue. J Physiol. 1953;122(1):193–202. pmid:13109753
- 17. Corda M, Voneuler C, Lennerstrand G. Proprioceptive innervation of the diaphragm. J Physiol. 1965;178(1):161–77. pmid:14298107
- 18. Weddell G, Harpman JA, Lambley DG. The innervation of the musculature of the tongue. J Anat. 1940;74(Pt 2):255–67. pmid:17104811
- 19. Matthews PB. Muscle spindles and their motor control. Physiol Rev. 1964;44:219–88. pmid:14152906
- 20. Rajagopal MD, Walker LB. Neuromuscular spindles in the human tongue. Anat Rec. 1959;133(2):438–438.
- 21. Yasargil GM. Proprioceptive afferent impulses in the phrenic nerve of the cat. Helv Physiol Pharmacol Acta. 1962;20:39–58. pmid:14009026
- 22.
Darby JK. Speech evaluation in psychiatry. New York: Grune & Stratton. 1981.
- 23.
Giles H, Scherer KR. Social markers in speech. Giles H, Scherer KR, editors. Paris: Maison des Sciences de l’Homme. 1979.
- 24. Cleveland TF. Vocal pedagogy in the twenty first century: mental imaging and the teaching of voice. The National Association of Teachers of Singing Journal. 1989;45(3):41-42,54.
- 25. Skoog W. Use of Image and Metaphor in Developing Vocal Technique in Choirs. Music Educators Journal. 2004;90(5):43–8.
- 26. Tian X, Poeppel D. Mental imagery of speech: linking motor and perceptual systems through internal simulation and estimation. Front Hum Neurosci. 2012;6:314. pmid:23226121
- 27. Paparo SA. Singing with awareness: A phenomenology of singers’ experience with the Feldenkrais Method. Research Studies in Music Education. 2021;44(3):541–53.
- 28. Colombo B, Anctil R, Balzarotti S, Biassoni F, Antonietti A. The Role of the Mirror System in Influencing Musicians’ Evaluation of Musical Creativity: A tDCS Study. Front Neurosci. 2021;15:624653. pmid:33897346
- 29. Colombo B, Rigby A, Gnerre M, Biassoni F. The Effects of a Dance and Music-Based Intervention on Parkinson’s Patients’ Well-Being: An Interview Study. Int J Environ Res Public Health. 2022;19(12):7519. pmid:35742768
- 30. Biassoni F, Gnerre M, Malaspina E, Di Tella S, Anzuino I, Baglio F, et al. How does prosodic deficit impact naïve listeners recognition of emotion? An analysis with speakers affected by Parkinson’s disease. Psychology of Language and Communication. 2022;26(1):102–25.
- 31. Fatigante M, Biassoni F, Marazzini F, Diadori P. Responsibility and Culpability in Apologies: Distinctive Uses of “Sorry” versus “I’m Sorry” in Apologizing. Discourse Processes. 2015;53(1–2):26–46.
- 32. Gnerre M, Malaspina E, Di Tella S, Anzuino I, Baglio F, Silveri MC, et al. Vocal emotional expression in parkinson’s disease: roles of sex and emotions. Societies. 2023;13(7):157.
- 33.
Battaglia-Damiani D. Anatomy of the voice: technique, tradition, and the science of singing. Milan: Ricordi. 2003.
- 34.
Juvarra A. Singing and its techniques. Milan: Ricordi. 1987.
- 35.
Juvarra A. Figurative reflections on singing. 500 aforismi. Padova: Armelin. 2002.
- 36. Kar T. The use of analogy, imagination and metaphors as an instructional tool in voice training: A case study. jiae. 2021;2(1):9–31.
- 37. Simonsmeier BA, Frank C, Gubelmann H, Schneider M. The effects of motor imagery training on performance and mental representation of 7- to 15-year-old gymnasts of different levels of expertise. Sport, Exercise, and Performance Psychology. 2018;7(2):155–68.
- 38.
Gabel-Krauch S. Visualization and actor training. Bowling Green (OH): Bowling Green State University. 1982.
- 39.
Ambrosini C. Psychomotor functional areas: body schema. In: Ambrosini C, Wille AM, editors. Manuale di terapia psicomotoria dell’età evolutiva. Cuzzolin. 2008. p. 132–51.
- 40. Bastin J, Calvin S, Montagne G. Muscular proprioception contributes to the control of interceptive actions. J Exp Psychol Hum Percept Perform. 2006;32(4):964–72. pmid:16846291
- 41.
Damasio A. Self comes to mind: constructing the conscious brain. Vintage. 2012.
- 42. Farrer C, Franck N, Paillard J, Jeannerod M. The role of proprioception in action recognition. Conscious Cogn. 2003;12(4):609–19. pmid:14656504
- 43. Baas LS, Beery TA, Allen G, Wizer M, Wagoner LE. An exploratory study of body awareness in persons with heart failure treated medically or with transplantation. J Cardiovasc Nurs. 2004;19(1):32–40. pmid:14994780
- 44. Mehling WE, Gopisetty V, Daubenmier J, Price CJ, Hecht FM, Stewart A. Body awareness: construct and self-report measures. PLoS One. 2009;4(5):e5614. pmid:19440300
- 45. Fogel A. Embodied Awareness: Neither Implicit nor Explicit, and Not Necessarily Nonverbal. Child Dev Perspectives. 2011;5(3):183–6.
- 46. Gallagher I I. Philosophical conceptions of the self: implications for cognitive science. Trends Cogn Sci. 2000;4(1):14–21. pmid:10637618
- 47.
Gallagher S. How the body shapes the mind. Oxford: Oxford University Press. 2005.
- 48. Tsakiris M, Schütz-Bosbach S, Gallagher S. On agency and body-ownership: phenomenological and neurocognitive reflections. Conscious Cogn. 2007;16(3):645–60. pmid:17616469
- 49. Berlucchi G, Aglioti S. The body in the brain: neural bases of corporeal awareness. Trends Neurosci. 1997;20(12):560–4. pmid:9416668
- 50. Coslett HB, Saffran EM, Schwoebel J. Knowledge of the human body: a distinct semantic domain. Neurology. 2002;59(3):357–63. pmid:12177368
- 51.
Gantchev GN, Massion J, Mori S. Motor control, today and tomorrow. Academic Publishing House. 1999.
- 52. Head H. Aphasia and kindred disorders of speech. Brain. 1920;43(2):87–165.
- 53. Head H. Studies in neurology. Vol. 2. Oxford: Oxford University Press; 1920.
- 54. Tsakiris M, Tajadura-Jiménez A, Costantini M. Just a heartbeat away from one’s body: interoceptive sensitivity predicts malleability of body-representations. Proc Biol Sci. 2011;278(1717):2470–6. pmid:21208964
- 55.
Schilder P. The image and appearance of the human body. New York: International Universities Press. 1950.
- 56.
Schilder P. The image and appearance of the human body. The self in social interaction. New York: Wiley. 1968. p. 107–14.
- 57. Cameron OG. Interoception: the inside story--a model for psychosomatic processes. Psychosom Med. 2001;63(5):697–710. pmid:11573016
- 58. Craig AD. How do you feel? Interoception: the sense of the physiological condition of the body. Nat Rev Neurosci. 2002;3(8):655–66. pmid:12154366
- 59.
Craig AD. Interoception and emotion: a neuroanatomical perspective. In: Feldman Barrett L, Haviland Jones JM, Lewis M, editors. Handbook of emotions. 3rd ed. New York: Guilford Press. 2007. p. 272–90.
- 60. Craig ADB. How do you feel--now? The anterior insula and human awareness. Nat Rev Neurosci. 2009;10(1):59–70. pmid:19096369
- 61. Barrett LF, Quigley KS, Bliss-Moreau E, Aronson KR. Interoceptive sensitivity and self-reports of emotional experience. J Pers Soc Psychol. 2004;87(5):684–97. pmid:15535779
- 62. Vaitl D. Interoception. Biol Psychol. 1996;42(1):1–27.
- 63. Roll JP. Physiology of kinesthesia. Muscle proprioception: sixth sense or primary sense?. Intellectica. 2003;36–37:49–66.
- 64. Biassoni F, Gandola M, Gnerre M. Grounding the Restorative Effect of the Environment in Tertiary Qualities: An Integration of Embodied and Phenomenological Perspectives. J Intell. 2023;11(11):208. pmid:37998707
- 65.
Tosto IM. The musical voice. Orientamenti per l’educazione vocale. EDT. 2009.
- 66.
Lamperti F. Theoretical practical elementary guide for the study of singing. Milan: Ricordi. 1864.
- 67.
Brozzi G, Giubilei F, Mazzocchi R. The use of metaphors in vocal teaching: correlations between mental images and clinical objectivity. In: Fussi F, editor. The singer’s voice. Omega Edizioni. 2007. p. 151–7.
- 68. Aveyard ME, Madden CJ, Yaxley RH, Zwaan RA. Moving words: Dynamic representations in language comprehension. Cogn Sci. 2004;28:611–20.
- 69. Barsalou LW. Language comprehension: Archival memory or preparation for situated action?. Discourse Processes. 1999;28(1):61–80.
- 70. Barsalou L, McRae K, Richardson D, Spivey M. Spatial representations are activated during real time comprehension of verbs. Cogn Sci. 2003;27(5):767–80.
- 71. Glenberg AM, Kaschak MP. Grounding language in action. Psychon Bull Rev. 2002;9(3):558–65. pmid:12412897
- 72. Glenberg AM, Robertson DA. Symbol grounding and meaning: A comparison of high-dimensional and embodied theories of meaning. J Mem Lang. 2000;43(3):379–401.
- 73. Boroditsky L, Ramscar M. The roles of body and mind in abstract thought. Psychol Sci. 2002;13(2):185–9. pmid:11934006
- 74. Francuzo E, Gibbs R, Lima P. Metaphor is grounded in embodied experience. J Pragmat. 2004;36:1189–210.
- 75. Gibbs RW Jr, Franks H. Embodied metaphor in women’s narratives about their experiences with cancer. Health Commun. 2002;14(2):139–65. pmid:12046796
- 76.
Gibbs R. The poetics of mind: figurative thought, language, and understanding. Cambridge University Press. 1994.
- 77. Gibbs RW Jr. Embodied experience and linguistic meaning. Brain Lang. 2003;84(1):1–15. pmid:12537948
- 78.
Gibbs R. Embodiment and cognitive science. Cambridge University Press. 2006.
- 79. Wilson NL, Gibbs RW Jr. Real and imagined body movement primes metaphor comprehension. Cogn Sci. 2007;31(4):721–31. pmid:21635315
- 80. Harding J, McGlone M. Back (or forward) to the future: The role of perspective in temporal language comprehension. J Exp Psychol Learn Mem Cogn. 1998;24(5):1211–23.
- 81. Johnson M. Knowing through the body. Philos Psychol. 1991;4(1):3–18.
- 82. Faul F, Erdfelder E, Lang A-G, Buchner A. G*Power 3: a flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav Res Methods. 2007;39(2):175–91. pmid:17695343
- 83. McKinley NM, Hyde JS. The Objectified Body Consciousness Scale. Psychology of Women Quarterly. 1996;20(2):181–215.
- 84. Anderson R. Body intelligence scale: defining and measuring the intelligence of the body. Humanist Psychol. 2006;34(4):357–67.
- 85.
Fussi F. La voce del cantante. Omega Edizioni. 2005.
- 86. Benninger MS, Grywalski C, Jacobson BH, Jacobson G, Johnson A, Silbergleit A. The Voice Handicap Index (VHI). Am J Speech Lang Pathol. 1997;6:66–70.
- 87.
Fonagy I. The metaphor in phonetics. Didier. 1979.
- 88.
Fonagy I. The living voice: essays on psycho-phonetics. Payot. 1983.
- 89.
Garcia MPR. Ècole de Garcia. Complete treatise on the art of singing. Brandus. 1847.
- 90.
Vennard W. Singing: the mechanism and the technic. Carl Fischer. 1967.
- 91. Mercado III E, Mantell JT, Pfordresher PQ. Imitating sounds: A cognitive approach to understanding vocal imitation. Comp Cogn Behav Rev. 2014;9.
- 92. Howard JS, Mattacola CG, Howell DM, Lattermann C. Response shift theory: an application for health-related quality of life in rehabilitation research and practice. J Allied Health. 2011;40(1):31–8. pmid:21399850
- 93. Meerschman I, Van Lierde K, Peeters K, Meersman E, Claeys S, D’haeseleer E. Short-term effect of two semi-occluded vocal tract training programs on the vocal quality of future occupational voice users: “resonant voice training using nasal consonants” versus “straw phonation”. J Speech Lang Hear Res. 2017;60(9):2519–36. pmid:28837727
- 94. Herbert BM, Muth ER, Pollatos O, Herbert C. Interoception across modalities: on the relationship between cardiac awareness and the sensitivity for gastric functions. PLoS One. 2012;7(5):e36646. pmid:22606278
- 95. Borghi AM, Cimatti F. Embodied cognition and beyond: acting and sensing the body. Neuropsychologia. 2010;48(3):763–73. pmid:19913041
- 96.
Connolly C, Williamon A. Mental skills training. In: Williamon A, editor. Musical excellence: strategies and techniques to enhance performance. Oxford: Oxford University Press. 2004. p. 221–45.
- 97. Goldschmidt H. Dancing With Your Head On: Mental Imagery Techniques for Dancers. Journal of Dance Education. 2002;2(1):15–22.
- 98. Longo MR, Schüür F, Kammers MPM, Tsakiris M, Haggard P. Self awareness and the body image. Acta Psychol (Amst). 2009;132(2):166–72. pmid:19286139
- 99.
McNabb SE. The singer’s understanding of imagery and metaphorical language as teaching tools in the secondary-school choral rehearsal. Auckland: University of Auckland. 2022.
- 100. Cortina JM. What is coefficient alpha? An examination of theory and applications. J Appl Psychol. 1993;78(1):98–104.
- 101.
Samuels P. Statistical methods: scale reliability analysis with small samples. Birmingham: Birmingham City University, Centre for Academic Success. 2017.
- 102. Tavakol M, Dennick R. Making sense of Cronbach’s alpha. Int J Med Educ. 2011;2:53–5. pmid:28029643