Perceived differences in social status between speaker and listener affect the speaker's vocal characteristics

Non-verbal behaviours, including voice characteristics during speech, are an important way to communicate social status. Research suggests that individuals can obtain high social status through dominance (using force and intimidation) or through prestige (by being knowledgeable and skilful). However, little is known regarding differences in the vocal behaviour of men and women in response to dominant and prestigious individuals. Here, we tested within-subject differences in vocal parameters of interviewees during simulated job interviews with dominant, prestigious, and neutral employers (targets), while responding to questions which were classified as introductory, personal, and interpersonal. We found that vocal modulations were apparent between responses to the neutral and high-status targets, with participants, especially those who perceived themselves as low in dominance, increasing fundamental frequency (F0) in response to the dominant and prestigious targets relative to the neutral target. Self-perceived prestige, however, was less related to contextual vocal modulations than self-perceived dominance. Finally, we found that differences in the context of the interview questions participants were asked to respond to (introductory, personal, interpersonal), also affected their vocal parameters, being more prominent in responses to personal and interpersonal questions. Overall, our results suggest that people adjust their vocal parameters according to the perceived social status of the listener as well as their own self-perceived social status.


Introduction
In hierarchical social relationships, individuals who are of high social status normally have privileges that other members of their group lack [1]. Examples of this type of relationship in human societies include the ranking system within the military and company organisation models (e.g. an employer is higher in social status than an employee) [1]. Recent research suggests that individuals can obtain high social status through one of two main ways: by using force and intimidation (dominance), or by being knowledgeable and skilful (prestige) [2,3]. Humans communicate their social status to others using a wide range of behaviours, often shared with non-human animals, such as facial expressions and body postures [4], linguistic cues, like the use of formal and informal linguistic tenses, as well as using spatial metaphors that make reference to hierarchies or imply a large personal space [1,5].
In terms of non-verbal behaviour, alongside facial expressions and body postures, voice characteristics are an important means to communicate socially relevant information, including social status (e.g. [6,7,8]). The acoustic qualities of the human voice, aside from linguistic elements such as syntax and semantic content, can communicate an important array of biological information about the speaker including sex, femininity, attractiveness, fertility and sexual maturity, physical strength, and body size [9][10][11][12][13][14][15][16][17][18][19][20][21][22][23]. Human voices are sexually dimorphic, with men, for example, having lower pitched voices than women. While the precise evolutionary reasons for this pronounced difference are unclear, it has been suggested that it could be a product of sexual selection [24], including dominance competition [25]. In fact, there is strong cross-species evidence for the influence of sexual selection on male fundamental frequency (F 0 ), the parameter most closely related to voice pitch [26].
While no research to our knowledge has explored vocal parameters with respect to prestige, the effects of dominance have been widely studied. In general, studies have found that voices low in F 0 are perceived as more dominant in both men [6,7] and women [8]; however, one study [27] found a significant positive correlation between F 0 and dominance judgments for male, but not female, speakers. This discrepancy can be due to several reasons: first, they used vocal recordings that were approximately 3.5 seconds in length, which may be too short to base dominance judgements on. Additionally, the authors' paradigm was very complex with several different contexts, which they suggest may affect the findings. Finally, the authors used an unspecified pitch contour to calculate their mean vocal parameters, and thus their calculations of pitch may have differed from others.
Perceptions of dominance appear to be based on multiple cues: F 0 , which is related to androgen levels, as well as formant dispersion (D f ), related to vocal tract length and skeletal size, affects dominance perceptions [28]. The information obtained from vocal cues can also predict real-world circumstances. In one study, voices of surgeons which were rated as higher in dominance and lower in concern/anxiety, perhaps reflecting an 'arrogant' and 'lack-of-care' approach, were also more likely to have been previously sued for malpractice, even when controlling for speech content [29]. Likewise, low-pitched CEOs have been shown to manage larger companies and make more money [30].
Vocal parameters, however, are not constant, and can be modulated during social interactions. Shouting during aggressive displays is a typical example, and, in humans and some nonhuman animals, intensity (loudness) modulations are associated with dominance [27] and hostility [31,32]. Similar to changes in body posture that increase perceived body size, changes in vocal parameters can affect perception of the speaker. Puts et al. [25] reported that men tend to lower their voices during interactions with a competitor when they perceive themselves as physically dominant, and raise it when they believe they are not, exemplifying how elements of selfperceived social status may affect social interactions. Furthermore, taller and more dominant men are less sensitive to visual cues of dominance in other men [33,34], indicating that hierarchical relationships appear to be dependent on perception of relative, rather than absolute, social status, perhaps in an analogous way to how male sensitivity to female attractiveness in humans is stronger towards women of similar, than to lower or higher, mate value [35].
To date, most studies have measured responses to voices with artificially manipulated acoustic parameters (typically F 0 and D f ) to investigate how these affect perceptions of dominance [8,25,28,36], but little is known regarding vocal modulations during interactions with dominant or prestigious individuals, particularly in free speech as opposed to individual phonemes or standardised sentences. Although one study examined male responses during interactions depending on their relative physical and social dominance, which in their study was described similarly to our description of prestige [25], whether men and women respond to these two forms of social status in similar ways remains largely unanswered. In our experiment, we aimed to address these questions by measuring within-subject vocal modulations, in both men and women's voices, in response to dominant, prestigious, or neutral (control) targets. We did this by using a simulated job interview scenario where participants were required to act as a candidate and answer three standardized interview questions (ranging from introductory to interpersonal).
We predicted, first, that participants' vocal characteristics would change based on whether they were talking to a dominant, prestigious, or neutral target, because signs of social status have been shown to affect vocal characteristics (e.g. [19]), and because dominant individuals appear to be less sensitive to dominance cues in other men [33,34]; and second, that these changes would also be related to the participant's own self-perceived dominance and prestige. We predicted that those participants rating themselves as more dominant would speak more loudly (i.e. with higher intensity) than those who rated themselves as low in dominance [27], especially when speaking to high-status individuals. Additionally, we expected these high dominance participants to lower their F 0 when speaking to the dominant target, as these targets may be more likely to be in direct competition with them, for mating opportunities or resources [25]. We had no a priori predictions about how participant prestige would affect their interaction with the targets, however as research suggests that both using a dominant or prestigious route leads to attainment of high status, and higher status individuals are more likely to acquire mating opportunities and resources [37,38], there may be reason to expect that responses will be similar with respect to both F 0 and intensity parameters. On the other hand, if the behaviours of prestigious and dominant individuals differ significantly, then there might be reason to predict that there will be variability like that reported previously (low F 0 and high intensity) towards dominant individuals but a different pattern of results for prestigious individuals. Additionally, little work has been done on how men and women would differ in their interactions with the male targets. However, recent findings suggest that men and women vary their vocal parameters with respect to the attractiveness of the people they are interacting with [39], perhaps as a strategy to be perceived as more attractive, and as high status men/those with more resources are perceived to be of greater mate value than low status men [40], we predicted that women would vary their F 0 more towards high status than low status men (but we had no specific predictions for prestige vs. dominance strategies).
Finally, as the three interview questions differed semantically (see full description of questions in methods) we hypothesized that there might be a question effect, with the greatest variation of vocal parameters found in the most interpersonal question (question 3), in which participants would imagine how they might engage with and approach the employer (target) with a problem. That is, those participants rating themselves high in dominance may not vary their F0 to a question simply asking them to introduce themselves, however they may vary (we predict a decrease) their F0 when explaining a situation to/interacting with an employer.

Ethics statement
All procedures obtained ethical approval from the Ethics Committee of the Department of Psychology, Faculty of Natural Sciences, University of Stirling. All participants provided written informed consent and were offered course credit for their participation.
Target stimuli. We used EvoFit software [41] to create the face stimuli used in this experiment. This software allows the user to 'evolve' a face from sets of available faces over successive iterations, in a holistic (whole face) process as opposed to featurally (adding single features to the face one-by-one). An independent group of 14 men (mean age ± SD = 21.8 ± 7.3) were asked to create same-sex faces using written descriptions of dominant and prestigious individuals based on definitions used in current literature [2,3,42]. Dominant individuals were described as 'An approximately 36-45 year old male. He is an extremely dominant individual. This person likes to be in control and to get their way. They will use force, coercion, and intimidation to achieve their goals if necessary.' Prestigious individuals were described as 'An approximately 36-45 year old male. He is a highly valued, prestigious and influential individual. He has many valued skills and qualities and others follow him freely. This ultimately leads to his achieving his goals.' These 28 novel faces were rated for dominance and prestige using a 7-point scale (1 = low dominance/prestige; 7 = high dominance/prestige) by 69 undergraduate students (19 men; mean age±SD = 29.0 ± 9.7). The two faces which received the highest dominance (mean ± SD = 5.1 ± 1.3) and highest prestige (mean ± SD = 3.99 ± 1.3) scores were used as stimuli (i.e., as the dominant and prestigious employers, respectively). For the 'neutral' employer, the face receiving the median rating on dominance (mean ± SD = 3.3 ± 1.3) and prestige (mean ± SD = 3.1 ± 1.3) was used.
We then created three different 'employer profiles', which contained a face image and text description, including a name, a job title, and an employee testimonial. The name, job title and testimonial were used to further manipulate the impression of targets as either dominant, prestigious, or neutral (Fig 1 shows the three profiles). The three profiles were also scored by an independent group of raters (see Target Stimuli in S1 Text) for prestige and dominance, confirming that in all cases the attributes of the dominant target were rated as more dominant, the attributes of the prestigious target as more prestigious, and the attributes for the neutral target were rated as neither high in dominance or prestige; faces were additionally rated for perceived attractiveness and age (results of these ratings are presented in S1 Table and S1 Fig). Finally, job descriptions were identical (i.e. administrative/secretarial assistant including filing, answering telephones, booking appointments and scheduling meetings).
Experimental procedure. Participants were first told that the 'experiment' they were participating in was in fact a 'pilot' to test the effectiveness of a new interviewing technique which did not require the interviewee and interviewer to be in the same room. After written informed consent was obtained, participants were presented with the experiment using Qualtrics software (Qualtrics, Provo, UT, 2013; www.qualtrics.com), on a desktop computer located in a quiet room. Monaural audio responses of the participants were digitally recorded using Praat 5.2.44 (P. Boersma and D. Weenink, 2011; www.praat.org), with a sampling frequency of 44.1 kHz, using a head mounted microphone positioned about 2 cm from the participant's mouth. Additionally, participants were video recorded to emphasise monitoring and assessment of behaviour as in real-world interviews; the experimenters highlighted the recordings by adjusting the videorecorder in the presence of participants, while they viewed a real-time recording of themselves on a monitor.
To control for any potential order effects, 24 male and 24 female participants were shown the three targets in one of six possible sequences (i.e. 1: Dominant (D)-Prestigious (N)-Neutral (N); 2: D-N-P; 3: P-D-N; 4: P-N-D; 5: N-D-P; 6: N-P-D; the sequences were counterbalanced across participants). For each of the three targets, participants were asked to record responses to three common interview questions; hence we recorded 9 instances of speech from each participant. The interview questions were: 1) 'please introduce yourself to this potential employer in a few sentences', 2) 'please tell this employer why you are a good candidate for the job', and 3) 'if you had a problem with a colleague at work how would you convey it to your boss?'. Aside from the generic nature of the questions, they were also selected to differ in their interpersonal characteristics. That is, while question 1 was purely a request for the subject to introduce themselves, question 2 added a personal component in requiring the participant to think about and articulate what personal attributes they believed would make them qualified for the job. Finally, question 3 had an interpersonal emphasis and required the participant to think about how they might engage with and approach the employer (target) with a problem. Although a simulated job interview may not completely reflect a real-life situation, mock interviews have been shown to increase anxiety levels before and during the interview [43] and in our design participants were aware of being video and audio recorded to increase the realism of the scenario.
After recording their responses, participants were asked to enter some basic demographic information, fill in a self-report scale of dominance and prestige [3], rate the dominance and prestige of the three targets, and explain what they thought the purpose of the study was (see Experimental Procedure in S1 Text). The entire experiment was presented using Qualtrics software, and was completed by participants while they were alone in a room. Once they had finished the experiment, participants were debriefed, given the opportunity to ask any remaining questions, and were asked to confirm whether they still consented to the use of their data.
In total, 429 recordings were obtained (3 were discarded due to background noise that affected audio quality), with length ranging from 4 to 107 seconds (mean ± SD = 25.02 ± 16.41s). Length of recording did not differ significantly depending on which target participants were responding to (repeated-measures GLM: F 2, 86 = 0.95, p = .39).
Manipulation check. As a final step, we conducted a manipulation check. Once participants had completed the experiment, we asked them to rate the full profiles for prestige and dominance. These ratings confirmed that the mean dominance rating of the dominant target (mean ± SD = 6.58 ± 0.65) was significantly higher than the ratings of both the prestigious (mean ± SD = 4.66 ± 1.46) and neutral (mean ± SD = 3.27 ± 1.32) targets (F 2,94 = 87.99, p < .001 ; Fig 2A), and the prestigious target was rated as more prestigious (mean ± SD = 6.06 ± 1.04) than the dominant (mean ± SD = 4.25 ± 1.49) and neutral (mean ± SD = 3.44 ± 1.22) targets (F 2,94 = 57.62, p < .001; Fig 2B).
Data analysis. We analysed each recording using Praat, obtaining values every 10 ms on intensity (dB) and F 0 (Hz). F 0 was measured using a noise-resistant autocorrelation method, between 75 and 300 Hz for male voices, and 100 and 500 Hz for female voices, as recommended by the software programmers. To ensure that intensity values were not affected by differences in the length or number of silent periods, and to control for background noise during these, we only used values which corresponded to times points in which the Praat algorithm produced a value of pitch.
For the statistical analysis, we calculated five variables from each of the 429 recording (9 recordings per participant), two of which were related to intensity: mean intensity and intensity variability (intensity SD), and three to F 0 : mean F 0 , F 0 variability (F 0 SD), and minimum F 0 . These final values were analysed using repeated-measures general linear models (GLM) for each parameter (with Holm-Bonferroni [44] adjustments for multiple tests, because we performed two analyses of intensity parameters, and three of F 0 parameters), using sex of the participant (PS) as a between-subject factor, target and question as within-subject factors, and participant dominance (PD) and participant prestige (PP) as covariates.
Because both PD and PP are time-invariant, thus in order to use them as covariates, we compared the effect of the target on the vocal parameters of the participants, centring the covariate values to their mean [45][46][47]. First, we created a model including both PD and PP as covariates, for each dependent variable (each acoustic parameter). Significant interactions between a covariate and a within-subject factor, suggest that there is a difference in the slope of the regression relating the covariate to the dependent variable, for each level of the within-subject factor; for example, a significant interaction between target (neutral, dominant, prestigious) and PD, represents that there are different regression slopes of PD and the studied acoustic parameter, for each target. Then, for covariates significantly interacting with withinsubject factors, we performed further analyses including only one covariate (PD or PP), centred to the mean, as well as low and high values (10 th , and 90 th percentiles). In such cases, comparing the main effect of a within-subject factors across different levels of the covariate is of particular interest [46], as it shows predicted values for the dependent variable, for participants on different levels of the covariate (low, mean and high self-rated prestige or dominance). Selfrated PD ranged from 1.5 to 5.0, and the mean, 10 th , and 90 th percentiles were 2.89, 1.74, and 4.00 respectively; equivalent values for PP, which ranged from 3.56 to 6.44, were 4.73, 3.78, and 5.69. All tests are two-tailed.
While statistical analyses were performed using acoustical data (in dB and Hz for intensity and F 0 parameters, respectively), most figures were created using standardised values (z scores) for each participant to account for between-subject differences (most noticeably sex differences in F 0 as well as F 0 SD [39]) and represent within-subject trends.

Results
First, we tested whether individuals' self-rated status (prestige and dominance) predicted their vocal parameters, in response to each target. Then we tested if individuals altered their vocal parameters in speech directed at dominant or prestigious individuals. We conducted separate analyses testing within-subject differences in parameters related to intensity (mean intensity and intensity SD) and F 0 (mean F 0 , F 0 SD, and minimum F 0 ), with planned contrasts (Helmert) comparing responses to the neutral versus the high-status targets (dominant and prestigious), and between the two high-status targets (dominant versus prestigious). Descriptive statistics of the acoustic vocal parameters of the responses of the participants to each type of target are presented in S2 Table. Relationships between vocal parameters and self-rated status As we predicted participants would adjust their vocal characteristics based on their self-rated status (prestige and dominance), in our analyses we used these self-ratings as covariates, and tested whether there were relationships between each acoustic parameter, in response to each target, and the participants' own ratings of dominance (PD) and prestige (PP; Table 1). Mean (± SD) self-rated scores of PD were 3.07 ± 0.56 and 2.71 ± 0.91 for men and women, respectively; scores for PP were 4.66 ± 0.59 and 4.79 ± 0.83. As there were no significant differences in PD or PP between men and women (t-tests: PD: t 46 = 0.63, p = .11; PP: t 46 = 1.67, p = .53), we pooled these data in the analyses below.
As expected, participants who rated themselves as higher in dominance had lower F 0 , as well as lower F 0 SD, and minimum F 0 , although these trends did not reach significance in all cases. There was also a trend for more prestigious participants to vary their intensity less, particularly when responding to the dominant target.

Intensity parameters
Previous research showed that voices with higher mean amplitude and amplitude SD (amplitude is directly proportional to intensity) are perceived as more dominant [27]. Because of this, we anticipated that participants would adjust the intensity of their voices depending on the perceived status (dominance or prestige) of the targets, and their self-perceived dominance (PD) and prestige (PP). However, the analysis of intensity parameters revealed no significant differences in the mean intensity or intensity SD of the participants' responses depending on the target, even when including PP and PD (as covariates, centred to their mean), nor a significant interaction between participant sex and target (for detailed results, see S3 Table).

Fundamental frequency (F 0 ) parameters
The analysis of F 0 parameters revealed that mean F 0 was particularly sensitive to our manipulation (Table 2). Although the main effect of target did not reach significance, it showed a trend in which the mean F 0 of the participants progressively increased in responses to the neutral  (Table 2). When including PD and PP as covariates, the interaction between target and PD did reach significance (p = .01), suggesting that the slope of the regression relating PD to F 0 , differs between the different targets ( Table 2). Centring PD to the mean, 10 th and 90 th percentile, revealed differences in F 0 according to who the target participants were responding to, for participants with low self-perceived dominance (F 2,82 = 4.56, p = .01), but not for participants with mean (F 2,82 = 0.81, p = .45), or high (F 2,82 = 0.75, p = .48) self-perceived dominance. This suggests that only individuals who perceive themselves as low in dominance modulate their voices, depending on the status of the person they are speaking to (Fig 3).
Planned contrasts, however, revealed that in the case of the main effect of target, there was a significant difference in the mean F 0 of the participants between the neutral versus the highstatus targets (dominant, prestigious), but not between the two high-status targets (dominant versus prestigious; Table 3), predicted for participants with low self-perceived dominance as well as high self-perceived dominance (Fig 3); participants who perceived themselves as low in dominance tended to speak with a higher mean F 0 towards high-status targets, in comparison to the neutral target, while the opposite trend was found for high self-perceived participants. No such tendency was found for participants with mean dominance. Similarly, for the interaction between target and participant sex, F 0 SD was significantly different when including PD as a covariate, and comparing responses to the neutral and high-status targets (F 1, 41 = 5.65, p = .02), but not between the two high-status targets (Fig 4C, S4 Table). Thus, it appears that women varied F 0 more when talking to neutral targets than dominant and prestigious targets, while the opposite effect was evident in men: they varied their F 0 less when speaking to neutral targets than dominant and prestigious targets.
In addition, the general analysis and planned contrasts revealed the importance of the effects of question in the vocal parameters of spoken responses: there was a significant main effect of question, as well as significant interactions between question and PP on the mean F 0 of the participants (Table 2), and a significant interaction between question and participant sex for both mean F 0 and F 0 SD (Table 2); furthermore, the interaction between target, question and PD was significant, suggesting that the specific characteristics of the questions   Results are split by sex of the participants, and estimated for participants with low (10th percentile), mean, and high (90% percentile) dominance (panels a, c, e) and prestige (panels b, d, f). LD = low dominance; MD = mean dominance; HD = high dominance; LP = low prestige; MP = mean prestige; HP = high prestige. Standard deviation (SD) was used as a measure of variability. Results were standardised (to z scores) for each participant to make results equivalent and account for between-subject's differences. For panned contrasts (Table 3), shapes above the bars represent a significant difference (introductory, personal, interpersonal) had an effect on the vocal parameters of the responses ( Table 2). Planned contrasts revealed that in the cases of the interactions between target and question (for mean F 0 , including either PD or PP as covariate), there was a significant difference between the neutral versus and high-status targets, but not between the high-status targets, for participants with high PD and high PP ( Table 3).

Analysis of Fundamental frequency (F 0 ) parameters by question
Paralinguistic parameters thus vary depending on the target and self-rated status of the speaker, but participants changed their vocal characteristics of their responses according to the question they were responding to. To further explore this connection, we split the analysis by question in order to test the effect that the specific context of each question had on the responses. This analysis revealed that in the case of question 1 (Introductory), there were no significant differences in the vocal parameters of the participants depending on the target they were responding to (Table 4 and Fig 5).
However, participants responding to questions 2 (Personal) and 3 (Interpersonal), showed a significant interaction between target and participant dominance for F 0 . In short, this means that the slope of the association between PD and F 0 , was different for the three targets (Table 4 and Fig 5A). Planned contrasts revealed that in responses to questions 2 (Personal) and 3 between responses to neutral versus high-status targets (dominant and prestigious). *p < .05, **p < .01. Bars  Perceived differences in social status affect vocal characteristics (Interpersonal), mean F 0 was significantly higher when responding to high-status versus neutral targets for participants with low self-perceived dominance, while no such difference existed for participants with mean PD, and participants who perceive themselves as high in dominance showed the opposite trend: they had a lower F 0 in responses to high-status targets, than to the neutral target (Table 5 and Fig 5A). Similarly, participants with high PD, tended to speak with higher F 0 SD to the neutral than the high-status targets in question 3, while the opposite trend was found (though not significant), for participants with low PD (Table 5 and Fig 5B). In terms of minimum F 0 , in question 3 participants with low PD reached a significantly lower F 0 when responding to high-status than the neutral target (Table 5 and Fig 5C).
account for between-subject's differences. For planned contrasts ( Additionally, responses to question 3 (Interpersonal) were significantly different for male and female participants depending on the target: while the mean F 0 of male participants was lower in responses to the neutral target, female participants had lower mean F 0 in responses to the dominant target (Table 4).
In general, most differences in vocal parameters were found between the neutral and highstatus targets, but not between the two high-status targets; this was particularly true for mean F 0 , in which the interaction between PD and target was significant, and planned contrasts revealed differences in the predicted F 0 towards the targets, for participants with low and high, but not mean, PD. In fact, the correlation between PD differences in F 0 towards high-status minus the neutral target-i.e. a difference score (Fig 6), was negative and significant for questions 2 (Personal; Fig 6B) and 3 (Interpersonal; Fig 6C), and only marginally non-significant for question 2, when correlating F 0 with PP ( Fig 6E). This shows that participants who perceive themselves as low in PD tend to speak with higher mean F 0 , or pitch, to high status targets, and conversely participants with high PD tend to lower their mean F 0 .

Discussion
Previous studies have suggested that manipulations of vocal parameters, particularly F 0 , affect perceived dominance [28], that men adjust their voices during interaction with competitors Perceived differences in social status affect vocal characteristics depending on their perceived relative dominance [25] and, more generally, that hierarchical relationships are dependent on relative, rather than absolute, social status perceptions [33,34]. Such studies have, however, focused on dominance, and predominantly on men's voices. Our experimental design of a job interview scenario provides new insights into the specific nature of hierarchical relationships and into the vocal differences when addressing dominant and prestigious individuals of both men and women.
Firstly, we found that male and female participants who judged themselves to be more dominant lowered their F 0 when speaking to all targets, in line with previous research on men [25]. We also found a tendency for more prestigious participants to respond with lower intensity variability, and dominant participants to decrease variability in fundamental frequency (F 0 SD), which would perhaps make them sound calmer and more in control of situations; in fact, decreased F 0 variability is associated with lower aggressiveness in industrial as well as foraging societies [48], and it is known to occur in contexts involving competition [7,49].
Differences in vocal parameters between responses to the different targets were strongest in mean F 0 ( Table 2, Table 3), and when including self-perceived dominance in the models. As predicted by previous research [25], participants, and particularly persons who perceive themselves as low in dominance, responded with a relative higher F 0 when speaking to the highstatus targets, the opposite trend to what participants with high dominance tended to do. Additionally, variation in voice modulation arose when comparing responses to the neutral versus the high-status targets, suggesting that the status of the target, whether high in dominance or prestige, is the key factor. In fact, differences between responses to the dominant and prestigious targets were not significantly different when analysing all questions together.
Contextual vocal modulations, however, were not found to occur in mean intensity or intensity SD. This suggests that while these parameters may be a robust cue of social status, as shown above in the effects of self-perceived prestige and intensity variability, or even contextdependent (e.g. shouting) interactions, speakers do not modulate their voice intensity during free speech depending solely on the relative social status of the listeners. This is likely due to the nature of our interview scenario, as participants were not directly competing, and were not trying to signal aggression in front of a potential employer, but rather to make themselves appear favourable for a position.
Furthermore, the use of a job interview scenario allowed us to include questions with different characteristics: introductory, personal, and interpersonal. The analysis of the vocal characteristics by question revealed significant vocal differences dependent on the perceived social status of the target listener, mostly when personal and interpersonal questions are answered, but not during introductory responses. In these cases, the effects of target, especially when including PD as a covariate, were significant (Table 4). In general, participants' mean F 0 was raised when responding to the dominant or prestigious targets (Fig 3), and this was stronger in low self-perceived status participants (Fig 4), supporting previous results [25]. However, we also found a tendency of participants who feel high in dominance to lower the their mean F 0 when responding to the high-status targets The apparent differences between responses to personal and interpersonal questions in comparison to the introductory one may be because each participant tended to introduce him or herself in a similar manner to all targets (e.g. "my name is. . .", "I am currently studying. . .", "I live in. . ."), but when confronted with questions that required them to discuss their specific skills to the target (personal), and even more so when asked to imagine a hypothetical interaction with the target (interpersonal), the nature of the questions themselves may have induced participants to improvise and respond more naturally.
Differences in vocal parameters between the responses to these questions are apparent in our analysis. Although it could be argued that this is a product of the order in which the questions were presented, we suggest that this is unlikely because of the different characteristics of the questions and, furthermore, because participants participated in three interviews, which meant that they responded to question one (introductory) after question 3 (interpersonal) twice during the experiment. The possibility of order effects could be tested in future experiments, to disentangle responses to different types of questions. In addition, the 28 faces available, from which targets were selected, were all initially constructed under instructions to create a highly prestigious, or highly dominant individual. This could mean that the neutral target, being the median rated face, displayed some level of dominance and/or prestige as opposed to being 'neither' dominant nor prestigious. Although more noticeable differences between the targets would likely make differences stronger, our manipulation seems to have been enough to elicit vocal modulations in the participants. Additionally, there is evidence suggesting that men are generally perceived as more dominant than women (see, e.g. [50]), which could be a confounding factor on the issue of dominance for female participants; future studies could address vocal modulations in response to men, but also women, of varying social status. Also, although we created names, personas, and even faces that were high in either prestige or dominance, the prestigious target was perceived as significantly more dominant than the neutral target, and the dominant target as more prestigious than the neutral (Fig 2). This could be because they are both high-status strategies and there is some ambiguity in the literature, and perhaps in real life, about what the differences are between dominance and prestige, or even because are they intrinsically linked, and manoeuvring oneself to high status may require a partly 'prestigious' approach and a partly dominant approach, or some combination of the two. We believe that this may be one of the reasons why we see the neutral face as being lower in both, as this person is not likely to attain high status, be it through dominance or prestige. Finally, there is a potentially confounding effect of attractiveness; although the dominant and prestigious targets did differ in perceived attractiveness (S1 Table), it is important to highlight that there were no differences in attractiveness between the neutral and either of the high-status targets (of the two high-status targets, one has lower, and the other has higher, attractiveness that the neutral one). Furthermore, most differences in vocal parameters are found when comparing responses to the neutral versus the two high-status targets (S1 Table  and S1 Fig). This problem, however, could be further addressed in future studies, presenting more targets of each social status.
In conclusion, using a novel job interview scenario, we found that self-perceptions of dominance and prestige affected vocal parameters such that the higher an individual's self-perceived dominance, the lower their mean F 0 , F 0 SD, and minimum F 0 , and the higher their self-perceived prestige, the lower their intensity variability. Additionally, regardless of self-perceived status, participants changed their vocal characteristics when talking to neutral versus high-status targets, displaying a relatively higher mean F 0 when talking to high-status targets. The context of questions (i.e. introductory, personal, or interpersonal) also affected participants' vocal characteristics with the greatest changes in F 0 according to status of the listener observed for the responses to the personal and interpersonal questions. These F 0 effects were most pronounced when including participant self-perceived dominance in the models. Ultimately our findings suggest that individuals' vocal characteristics are influenced, whether consciously or non-consciously, by the relative difference between their self-perceived social status and the social status of the listeners.