Mimicry and automatic imitation are not correlated

It is widely known that individuals have a tendency to imitate each other. However, different psychological disciplines assess imitation in different manners. While social psychologists assess mimicry by means of action observation, cognitive psychologists assess automatic imitation with reaction time based measures on a trial-by-trial basis. Although these methods differ in crucial methodological aspects, both phenomena are assumed to rely on similar underlying mechanisms. This raises the fundamental question whether mimicry and automatic imitation are actually correlated. In the present research we assessed both phenomena and did not find a meaningful correlation. Moreover, personality traits such as empathy, autism traits, and traits related to self- versus other-focus did not correlate with mimicry or automatic imitation either. Theoretical implications are discussed.

Within the domain of imitation, social psychologists are applying different paradigms than cognitive psychologists. With reference to the chameleon effect, social psychologists investigate mimicry-individuals' tendency to imitate others in social settings. Mimicry has most often been investigated by applying naturalistic paradigms that measure the frequency of imitation in interactions between a participant and a confederate (e.g., [6,16,[19][20][21][22]). For example, Chartrand and Bargh [6] let participants engage in a card-sorting task with a confederate who repeatedly touched his face in one half of the experiment and repeatedly waggled his food in the other half of the experiment. Importantly, participants' behavior was videotaped during the experiment so that the researchers could code how often participants engaged themselves in face touching and food waggling. The typical result of such a paradigm is that participants PLOS ONE | https://doi.org/10.1371/journal.pone.0183784 September 6, 2017 1 / 21 a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 more often touch their face than waggle their foot when the confederate touches his face and vice versa when the confederate waggles his foot. Besides such one-to-one interactions, researchers have also used adapted versions (e.g., [11][12][13]18,21,22,23,[24][25][26][27]) in which participants observe a video of a model who engages in two classes of different behaviors (e.g., nose touching vs. hair touching). Similar to the aforementioned studies, participants are videotaped while they watch the videos. Afterwards, coders measure the degree of mimicry by coding how often participants performed the two target actions themselves.
In contrast to such naturalistic paradigms, researchers in cognitive psychology focus on automatic imitation, which can be seen as the laboratory model of imitation [28]. In a typical experiment, participants respond over many trials with two different movements to two different symbolic cues while seeing a congruent movement, an incongruent movement, or no movement on a computer screen. For example, in the task developed by Brass, Bekkering, and Prinz [10], participants have to lift their index or middle finger in response to the number '1' or '2'. At the same time, participants also see a hand performing either the same finger movement (i.e. congruent), the other finger movement (i.e. incongruent), or no finger movement (i.e. neutral). Automatic imitation in this paradigm refers to the finding that individuals respond faster and with fewer errors to congruent trials than to incongruent trials (i.e., congruency effect). However, besides the congruency effect, researchers can also compute the facilitation effect-individuals' tendency to respond faster and with fewer errors to congruent trials than to neutral trials. Finally, the experimental setup allows measuring individuals' tendency to inhibit imitative responses (i.e., interference effect). That is, individuals respond slower and with more errors to incongruent trials than to neutral trials (e.g., [10,29]).
In the literature, the terms mimicry and automatic imitation are often used synonymously [28]. Thus, there seems to be wide agreement that mimicry and automatic imitation are correlated. But is this indeed the case? The answer to this fundamental question may depend on whether one compares the methods or the theoretical assumptions of mimicry and automatic imitation. While a methodological analysis suggests that the two paradigms measure different constructs, a theoretical analysis instead points towards a common ground.

Methodological differences between mimicry and automatic imitation
Paradigms on mimicry and automatic imitation differ in many methodological aspects (for an overview see Table 1). First, mimicry tasks have higher ecological validity than automatic imitation tasks, because mimicry, but not automatic imitation, assesses imitation in natural settings (cf. [28]). Second, the two tasks differ in terms of dependent variables. That is, paradigms on automatic imitation measure responses in the form of reaction times and error rates to congruent and incongruent observed movements on a trial-by-trial basis. In mimicry paradigms, however, researchers count how often participants engage in a certain behavior within a predefined time period and then calculate a sum score that indicates how often the confederate was imitated. Third, the two imitation paradigms differ in the degree to which participants become aware about the effect. Mimicry effects remain unaware for participants (cf. [30]). That is, in posttest interviews, participants usually report no awareness of the model's focal behavior (e.g., nose touching), no awareness of an intention to mimic, and no awareness that they mimicked the confederate's behavior (e.g., [6]). In contrast, in automatic imitation paradigms more conscious awareness seems to be involved. Although it has not yet been empirically tested, it is our experience that participants are well aware that the other person's finger movements influence their performance. Fourth, the two paradigms differ in the psychological processes needed to work on the task. While automatic imitation tasks need explicit cognitive control and executive functioning (e.g., [31]), mimicry tasks need less explicit cognitive control because mimicry is unconsciously driven and no actions need to be suppressed [6].

Theoretical similarities between mimicry and automatic imitation
Although the paradigms used in research on mimicry and automatic imitation differ in several aspects, there are also important similarities with regard to the theoretical assumptions and moderators of mimicry and automatic imitation. Theoretical assumptions. Irrespective of how imitation is assessed, it is generally agreed that automatic imitation and mimicry are both based on shared representations of observed and executed actions. For example, ideomotor theory [32][33][34][35]-a theory shaped in cognitive psychology-proposes that the visual image of an action is part of its motor representation. As a result, the observation of a certain action primes the execution of the same action. A similar proposition can be derived from research on mimicry, which puts forward the so-called perception-behavior link [6,[36][37][38]. This link implies that the observation of an action evokes the same representation as the execution of that action. This common representation then increases the likelihood of the execution of the perceived action. Regardless of its framing, the idea that the observation of an action leads to the activation of the corresponding motor plan in the observer has now been confirmed extensively in behavioral studies (e.g., [10,29,39,40]), fMRI studies (e.g., [41,42]), motor TMS studies (e.g., [43,44]), and single-cell recordings in both monkeys [45] as well as humans [46].
Moderating influences. Past research on moderating influences suggests that similar factors facilitate mimicry and automatic imitation. For example, research has repeatedly found that empathy positively correlates with different kinds of imitation paradigms [47][48][49]. Other research suggests that the link between empathy and imitation is specific to a subfactor of empathy-namely perspective taking. In this respect, it has been shown that watching actions from the first-person perspective increases imitation as compared to the third-person perspective [12,50,51]. Similarly, when assessing perspective taking as a personality trait with the Interpersonal Reactivity Index (IRI; [52]), research found that the ability to take another person's perspective increases mimicry [6,53] as well as automatic imitation [54,55].
Another often reported facilitator of imitation is the degree to which individuals focus on others as compared to themselves. To explore the role of self-construal [56], it has been tested whether individuals who perceive themselves as dependent on others (i.e., interdependent selfconstrual) imitate others more strongly than individuals who perceive themselves as unique individuals (i.e., independent self-construal). For example, van Baaren, Maddux, Chartrand, De Bouter, and van Knippenberg [57] found that priming an interdependent, as compared to an independent self-construal, increases mimicry. Moreover, the authors demonstrated that individuals from eastern societies, who are known for their interdependent self-construal, mimic others more strongly than individuals from western societies, who are known for their independent self-construal. Similar findings have been found in research on automatic imitation. Hogeveen and Obhi [58], for instance, used the same priming as van Baaren and colleagues and found larger congruency effects following an interdependent self-construal manipulation, as compared to an independent self-construal manipulation. Relatedly, research has found reduced automatic imitation effects when increasing the self-focus of participants by letting them sit in front of a mirror [59].
Another moderator that has been investigated is autism. For example, Cook, Swapp, Pan, Bianchi-Berthouze and Blakemore [60] found reduced automatic imitation effects in individuals with autistic spectrum disorder as compared to healthy controls. Similarly, research has repeatedly found decreased intentional mimicry for individuals with high autistic traits (for a review, see [61]).
Based on the above-reviewed literature, it has been claimed that certain personality factors influence imitation. That is, high empathic persons and high perspective-takers, individuals with an interdependent self-construal, as well as people scoring low on autistic traits should imitate others more strongly. However, the empirical evidence for this claim is less clear than what has often been assumed. First, research on autism did not only find reduced automatic imitation for individuals with high autistic traits, but also intact automatic imitation [62][63][64] or even increased automatic imitation [65]. Second, some researchers were not able to replicate the findings on the moderating role of empathy on imitation [54,66]. Third, although it is claimed that self-construal moderates imitation, there is to the best of our knowledge no study that actually showed self-construal as part of a personality trait to moderate imitation.

Present research
Taken together, the literature on imitation does not offer a clear answer to the question whether mimicry and automatic imitation are correlated and to which degree personality traits moderate imitation. Theoretical models on the mechanisms of imitation suggest that mimicry and automatic imitation are linked to each other and that certain personality traits should moderate mimicry and automatic imitation in the same way. However, some studies did not find support for these moderating influences. Moreover, methodological differences between mimicry and automatic imitation suggest that the two tasks might not be correlated. Therefore, in the present research we tested for the first time whether mimicry and automatic imitation are correlated and to which degree personality traits that have previously been linked to mimicry and automatic imitation moderate both phenomena. To this end, we ran a highly powered study with two hundred participants. To assess mimicry and automatic imitation, we applied the most often reported tasks in the literature.

Ethics statement
The study was conducted in accordance with the ethical standards of the 1964 Declaration of Helsinki and approved by the rules of the Institutional Review Board from the Faculty of Psychology and Educational Science of Ghent University. All participants provided informed consent at the beginning of the experiment and were informed that participation was voluntary and that all answers were processed and stored anonymously.
Data availability statement. The data file of the study is available from the Open Science Framework database. The URL necessary to access our data is: https://osf.io/v3afy/.

Participants
In return for partial course credit, two hundred students from Ghent University (Belgium) participated in this study. All participants reported normal or corrected-to-normal vision and hearing as well as confirmed to speak and comprehend Dutch. Prior to data analysis, we excluded two participants. One participant did not allow us to analyze her recorded videotape and was thus excluded from all analyses. Another participant was excluded, because she misunderstood the task. Moreover, two participants who had a mimicry score or a congruency effect higher than 4 SD above the sample mean were identified as outliers and discharged as well (note: leaving these outliers in the sample does not change the results). The reason why we discharged these outliers is that the removal of outliers leads to more accurate correlations between measures-that is they become closer to the population correlation (e.g., [67]). Thus, the final sample contained 196 participants (135 women, 61 men) with an age ranging from 17 to 44 (M = 18.74; SD = 2.74).

Procedure
After being welcomed, participants were seated in separate cubicles. The setup of each cubicle consisted of an Asus Eee PC 1215N laptop containing an integrated webcam, an external 17-inch Dell monitor, two speakers and an apple MB110FN/B AZERTY mac keyboard. Participants sat at a viewing distance of approximately 35 cm to the monitor. After participants were seated, they signed a written informed consent, and then ran through the mimicry and automatic imitation task. The order of these two tasks was counterbalanced between participants. Afterwards, participants filled in a couple of questionnaires measuring personality traits and indicated demographic data. At the end, the experimenter fully debriefed participants. Thereby, she told participants that they were videotaped during the experiment and asked whether we were allowed to use the videos for scientific purposes. Finally, participants were thanked and dismissed.

Materials/ Stimuli
Mimicry task. The mimicry task was an adaption of previously used video-based mimicry tasks [e.g., [11][12][13]18,21,22,23,[24][25][26][27]. Participants were told that they were going to watch two video clips of a model reading a story and that they should listen carefully to the story as questions about the text had to be answered at the end of the experiment. Participants then watched two video clips in which a female model read a story about a rabbit from the Dutch children book titled "365 konijnenverhaaltjes voor het slapengaan (365 rabbit stories to go off to sleep)" [68]. Each video lasted 10 minutes. In both videos, the model engaged in a specific action every 20 seconds. That is, in one video she was touching her nose and in the other video she was touching her hair (see Fig 1 for screenshots of the videos). The sequence of nose touching versus hair touching videos was counterbalanced between participants. In line with the cover story, participants had to answer ten story-related questions about the story after watching the videos. Throughout the task, a webcam was used to videotape participants. In order to prepare data for analyses, a coder blind to the conditions counted for each video how often participants engaged in nose touching and hair stroking actions. To cross-validate the coding, a second coder coded the videos of 50 randomly selected participants. The intra-class correlation coefficient was r = .77, p < .001.
Automatic imitation task. The automatic imitation task was based on Brass and colleagues' [10] task and was programmed using Tscope5 software [69]. In the beginning of each trial, participants were instructed to press down the 'G' and 'H' key with their right index and middle finger. Then, a fixation cross appeared for 500 ms. Afterwards, a picture of a mirrored left hand in resting position was presented for 200 ms. This picture was followed by a second picture that was presented for a maximum of 2000 ms or until participants gave a response. Depending on the condition, the picture either indicated an index finger movement, a middle finger movement, or no movement. At the same time, a number was displayed between the model's middle and index finger. Participants were instructed to lift their index finger as fast as possible in response to the number '1' and to lift their middle finger as fast as possible in response to the number '2' (see Fig 1 for an overview of the stimuli). This setup resulted in three different trials: In congruent trials participants responded with the same finger as the model, in incongruent trials participants responded with another finger than the model, and in neutral trials the participants responded with a finger while the model did not respond at all.
The automatic imitation task contained a practice and an experimental phase. The practice phase contained 12 trials. The experimental phase contained 360 trials resulting in 120 congruent trials, 120 incongruent trials and 120 neutral trials. In the experimental phase, participants were allowed to take a break after every 90 trials. In order to reduce the influence of spatial compatibility, participants were instructed to place their right hand rotated 90˚counterclockwise to the model's hand displayed on the screen (e.g., [70,71,72]).
To prepare the data for analysis, we removed extremely slow reaction times. That is, latencies below (1.20%) and above (0.05%) 3 SD of the participants' mean.
Personality scales. In order to test the degree to which personality traits that have previously been reported to moderate imitation correlate with mimicry and automatic imitation, we administered a couple of questionnaires.
Perspective taking and empathy: To assess perspective taking and empathy, we administered the Dutch version [73] of the Interpersonal Reactivity Index (IRI; [52]). The IRI contains 28 items that were scored on 5-point rating scales ranging from 1 (does not describe me well) to 5 (describes me very well). The IRI is divided into four dimensions, each containing 7 items: Besides Perspective Taking (Cronbach's α = .68), it administers Fantasy (Cronbach's α = .86), Empathic Concern (Cronbach's α = .64), and Personal Distress (Cronbach's α = .79). In order to prepare data for analyses, we computed mean scores of all subscales as well as a mean score over all subscales in order to compute a total score of empathy (Cronbach's α = .83).
Self-versus other focus: We assessed multiple scales to get insight into participants' focus on themselves versus others. First, we used a self-translated Dutch version of the Self-Construal Scale (SCS; [74]). The SCS measures the strength of an individual's interdependent and independent self-construal. These two self-construals are conceptualized as reflecting the emphasis on connectedness and relations often found in non-Western cultures (interdependent) and the separateness and uniqueness of the individual (independent) stressed in Western societies. The SCS contains 30 items, divided into an independence self-construal (Cronbach's α = .70) and an interdependence self-construal (Cronbach's α = .72). Participants answered on a 7-point scale ranging from 1 ("strongly disagree") to 7 ("strongly agree") to which degree they agreed to statements related to their self. In order to compute a total score of self-construal, we subtracted the interdependence self-construal subscale from the independence self-construal subscale. High values indicate a strong relative independence self-construal.
Second, we administered the Dutch version of the short Individualism and Collectivism Scale (ICS; [75]). The shortened version of the ICS contains eight items, divided into four subscales: Horizontal Individualism, Horizontal Collectivism, Vertical Individualism and Vertical Collectivism. For individuals high on horizontal collectivism, the well-being of their in-groups (e.g., family, tribe, coworkers, nation) is of central importance. However, they do not feel subordinate to their in-groups. In contrast, individuals high on vertical collectivism submit to the norms of their in-groups and are even willing to sacrifice their personal identities for their ingroups. Horizontal individualists, however, are characterized by seeking individuality rather than distinctiveness by doing their own thing and not to compare themselves with others. Vertical individualists are especially concerned with comparing themselves with others. That is, they believe that competition is the law of nature, and they desire to win in all kinds of competitions. Participants indicated on 7-point scales ranging from 1 ("definitely not true") to 7 ("definitely true") the degree to which the 8 statements apply to them. To compute a total score of the scale, we subtracted the mean of the collectivism items (Cronbach's α = .45) from the mean of the individualism items (Cronbach's α = .63). A high score reflects a relative high individualism as compared to collectivism.
Third, we assessed participants' need to belong. Need to belong refers to people's desire for interpersonal acceptance and belonging and is associated with greater sensitivity to interpersonal cues in order to foster connections with others [76]. In order to assess need to belong, we applied a self translated Dutch version of the Need to Belong Scale (NTBS; [77]). Participants indicated the degree to which each of the 10 statements applied to them on a 5-point scale (1 = strongly disagree, 5 = strongly agree). Example items of the scale are "I try hard not to do things that will make other people avoid or reject me," and "I want other people to accept me." Cronbach's alpha for the NTBS was α = .85.
Demographic data and further personality scales. Besides gender and age, we assessed further scales that had not yet been tested to moderate mimicry and automatic imitation. That is, we assessed participants' engagement in social activities, socioeconomic status, number of friends, learning styles and regulatory focus.
Social engagement. To assess social engagement, participants answered with yes or no whether they had been engaging in the following social activities: "donating blood," "registered as stem cell donor," "engagement in voluntary work," "member of a charitable institution (e.g., UNICEF, Greenpeace, etc.)," and "donating for a charitable institution (e.g., UNICEF, Greenpeace, etc.)." In order to compute a total score we summed up the amount of yes-answers participants gave on these items.
Socioeconomic status. We assessed participants' objective as well as subjective socioeconomic status (SES; [81]). To assess the subjective SES, participants responded to the following six statements on a 5-point scale (1 = strongly disagree; 5 = strongly agree): "My family usually had enough money for things when I was growing up," "I grew up in a relatively wealthy neighborhood," "I felt relatively wealthy compared to the other kids in my school," "I have enough money to buy things I want," "I don't worry too much about paying my bills," "I don't think I'll have to worry about money too much in the future." Cronbach's alpha for the mean score of this subjective SES was α = .77.
To assess participants' objective SES, they indicated on two 6-point scales (1= €1,000 or less; 2 = €1,001 -€2,000; 3 = €2001 -€3,000; 4 = €3,001 -€4,000; 5 = €4,001 -€5,000, 6 = more than €5,000) their income per month: "What was your annual gross household income when you were growing up? ," "What is your present annual gross household income?" In addition, participants indicated on three 10-point scales the following questions: "What is the highest educational level completed by your father?" "What is the highest educational level completed by your mother?" "What is your highest educational level?". To compute a total objective SES score, we calculated the mean of all items. Cronbach's alpha of the objective SES was α = .62.

Amount of friends:
To have some further indication about participants' social life, they were asked how many friends they have. In addition, they also gave an estimation of their number of Facebook friends, in case they had Facebook.
Learning style: With a self-translated Dutch version, we assessed participants' learning style [82]. The scale contained eight items, divided into four subscales: Active Learning, Concrete Experience, Reflective Observation, Abstract Conceptualization and Active Experimentation. All items were answered on a 5-point scale (1 = does not describe me well; 2 = describes me very well).
Regulatory Focus: We also assessed participants' regulatory focus. Regulatory Focus Theory [83] distinguishes between the promotion focus, which regulates nurturance needs and goals related to aspiration and accomplishment (i.e., ideals), and the prevention focus, which regulates security needs and goals related to safety and responsibilities (i.e., oughts). We assessed the Dutch version (cf. [84]) of the 18-item regulatory focus scale developed by Lockwood et al. [85]. Participants indicated on 7-point scales (1 = definitively not true; 7 = definitively true) how different promotion-and prevention-related statements apply to them. In line with previous research [85][86][87][88], we computed a difference score by subtracting the score of the prevention focus subscale (Cronbach's α = .70) from the promotion focus subscale (Cronbach's α = .67). High values indicate a relative strong promotion focus compared to a prevention focus.

Results
To test our hypotheses we conducted inference statistics using SPSS. In addition, we applied Bayesian statistics using JASP, an open source statistical package [89]. Specifically, we report the Bayes factors (BF) calculated with the default JASP priors. BF 10 gives the ratio with which the alternative hypothesis is favored over the null hypothesis (i.e., larger BFs argue in favor of the alternative hypothesis; see [90] for an overview), whereas BF 01 gives the ratio with which the null hypothesis is favored over the alternative hypothesis (i.e., a larger BFs argue in favor of the null hypothesis; see [90] for an overview). BF 01 is defined as the inverse of BF 10 .

Mimicry
Mimicry. First, we ran a 2 (observed video: nose touching video vs. hair touching video) x 2 (performed action: nose touching action vs. hair touching action) repeated measures ANOVA to test the presence of a mimicry effect. The ANOVA yielded no significant main effects, F(1,195) < 1.54, p > .21. However, more importantly, we found a significant interaction between observed video and performed action, F(1,195) = 11.69, p = .001, η 2 p = .06 indicating the presence of a substantial mimicry effect (see Fig 2 for means).
In an additional analysis, we calculated a nose-mimicry score by subtracting the amount of nose touching actions in the hair video from the amount of nose touching actions in the nose video. Likewise, we computed a hair-mimicry score by subtracting the amount of hair touching actions in the nose video from the amount of hair touching actions in the hair video. In order to compute an overall mimicry score, we averaged the nose-mimicry score and the hairmimicry score. To test the strength of this mimicry effect, we then conducted a Bayesian t-test in which we tested the hypothesis that the mimicry score (M =1.24; SD = 5.09) was larger than 0. The Bayesian t-test yielded BF 10 = 43.12, indicating that the data is 43.12 times more likely to have occurred under the alternative hypothesis than under the null hypothesis. A Bayes factor < 100 and > 30 is conventionally considered to be very strong evidence [91]. Reliability.
In a second analysis we tested the split-half reliability of the mimicry effect. That is, we first computed two different mimicry scores. One score was calculated from the even minutes and the other score was calculated from the odd minutes. In order to test the reliability of the mimicry task, we computed the Spearman-Brown coefficient. The coefficient was negative, ρ Ã = -.11 suggesting non-reliability.
In an additional analysis we computed a mimicry score for hair actions and a mimicry score for nose actions. The mimicry score for hair actions was computed by subtracting participants' hair actions during the nose video from participants' hair actions in the hair video. Likewise, the mimicry score for nose actions was computed by subtracting participants' nose actions in the hair video from participants' nose actions in the nose video. The Spearman-Brown coefficient for these two mimicry scores was negative, ρ Ã = -.59, suggesting non-reliability. This indicates that the low reliability is not restricted to one of the two actions. Moreover, we computed separately for even and odd minutes a nose-mimicry score and a hairmimicry score. Spearman-Brown coefficients were ρ Ã =.13 for nose-mimicry and ρ Ã =.15 for hair-mimicry indicating low reliability. In additional analyses, we tested the strength of the automatic imitation effects with Bayesian t-tests. Specifically, we conducted Bayesian t-tests and tested the hypothesis that the congruency effect (M = 46.19, SD = 29.62), the facilitation effect (M = 25.00, SD = 15.78), and the interference effect (M = 21.19, SD = 19.71) were larger than 0. The Bayesian t-tests yielded BF 10 = 1.18Á10 51 for the congruency effect, BF 10 = 1.01Á10 52 for the facilitation effect and BF 10 = 3.22Á10 31 for the interference effect. A Bayes factor > 100 is conventionally considered to be extreme evidence [91].
Error rates. As can be seen in  larger than zero yielded BF 10 = 3.68Á10 27 for the congruency effect, BF 10 = 105.05 for the facilitation effect and BF 10 = 1.94Á10 21 for the interference effect. A Bayes factor > 100 is considered to be extreme evidence [91].
Reliability. In a further analysis, we tested the split-half reliability of the automatic imitation task. First, we computed the congruency effect, the facilitation effect, and the interference effect for even trials and for odd trials separately for reactions times and for error rates. Second, we computed the Spearman-Brown coefficient in order to test the reliability of the automatic imitation task. For reaction times, the results yielded ρ Ã = .86 for the congruency effect, ρ Ã = .50 for the facilitation effect, and ρ Ã = .68 for the interference effect. For error rates, the results yielded ρ Ã = .61 for the congruency effect, ρ Ã = .001 for the facilitation effect, and ρ Ã = .62, for the interference effect.

Relationship between mimicry and automatic imitation
To test the relationship between mimicry and automatic imitation, we ran correlational analyses and will report both the corresponding p-values and BFs. In order to test the relationship  between mimicry and automatic imitation, we correlated the mimicry score with all the scores of the automatic imitation task (i.e., congruency effect, facilitation effect, and interference effect for reaction times as well as for error rates). However, none of the correlations involving the mimicry score reached significance, r < -.13 p > .08 (for an overview of all correlations, see Table 2). We then computed the BFs of all correlations to investigate whether the null hypothesis (i.e., mimicry and automatic imitation are not positively correlated) could be supported. When correlating the mimicry score with the reaction time scores of the automatic imitation task, the analyses yielded BF 01 = 29.59 for the congruency effect, BF 01 = 25.06 for the facilitation effect, and BF 01 = 27.15 for the interference effect. When correlating the mimicry score with the error rates of the automatic imitation task, the analyses yielded BF 01 = 9.51 for the congruency effect, BF 01 = 9.08 for the facilitation effect, and BF 01 = 10.55 for the interference effect. These results indicate moderate to strong evidence for the null hypothesis, which confirms that mimicry and automatic imitation are not positively correlated (cf., [91]).

Relationship between personality traits and mimicry as well as automatic imitation
In further correlation analyses, we tested whether mimicry and automatic imitation correlated with the assessed personality traits and demographic data. That is, we ran three separate correlational analyses. In the first analysis, we ran a correlation analysis between the mimicry score and all assessed personality scales as well as demographic data. In the second analysis, we correlated all reaction-time-based scores from the automatic imitation task (i.e. congruency effect, facilitation effect, and interference effect) with the assessed personality scales and demographic data. Finally, in the third analysis, we calculated the same correlations for the error-rate-based scores of the automatic imitation task. For each of the three correlational analyses, we corrected for multiple comparisons by the means of FDR [92]. An overview of all the correlations is provided in Table 3. Relation between mimicry and personality scales as well as demographics. The correlations between the mimicry score and the assessed personality scales did not yield any significant results, r < .15, p > .99. Moreover, when testing the influence of gender on mimicry, an independent sample t-test did not yield a significant effect either, t(194) = .81, p = .42.
Relation between automatic imitation and personality scales as well as demographics. Firstly, we correlated the reaction-time-based scores of the automatic imitation task with all assessed scales. The congruency effect correlated significantly with the Personality Distress subscale of the Interpersonal Reactivity Index (IRI; [52]), r = .26, p = .02, but not with any other scale (rs < .21, ps > .07). The facilitation effect did not correlate with any of the scales either (rs < .22, ps > .07). Likewise, all correlations between the interference effect and the assessed scales were non-significant (rs < .23, ps > .07).
Secondly, we tested for gender differences in the reaction-time-based imitation measures. Thirdly, we ran the same analyses for the error-based automatic imitation scores. However, significant correlations were neither found for the congruency effect (r < .21, p > .22), nor for the facilitation effect (r < .16, p > .46), nor for the interference effect (r < .20, p > .38). Also, there was no gender difference for the congruency effect, the facilitation effect, or the interference effect, ts(194) < 1.45, ps > .14.

Discussion
When assessing individuals' tendency to imitate others, researchers have been using different tasks. While social psychologists assess mimicry, cognitive psychologists assess automatic imitation. Both forms of imitation share similarities, but also differ in important aspects. Although it is largely assumed that mimicry and automatic imitation are grounded in similar underlying processes, the assessment of mimicry and automatic imitation differs in many methodological aspects. This raises the fundamental question to which degree these two forms of imitation are In line with past research, we detected significant imitation as well as mimicry effects. However, we did not find a significant correlation between the two tasks. This was further supported by Bayesian analyses showing that mimicry and automatic imitation were indeed not correlated. In addition, none of the assessed personality scales correlated with mimicry or automatic imitation in a meaningful way. When controlling for multiple comparisons, there was only one single significant relation, namely a correlation between the reaction time based congruency effect and the Personality Distress subscale of the Interpersonal Reactivity Index (IRI; [52]).
The absence of a correlation between mimicry and automatic imitation might be due to different reasons. First, it could be caused by differences in the methodological setup of the two paradigms. While automatic imitation is measured trial-by-trial, is based on reaction times or error rates, elicits awareness in participants, and needs explicit cognitive control, mimicry is based on behavioral observations, is rather subjectively detected, remains unaware for participants, and does not need explicit cognitive control. Although mimicry and automatic imitation did not correlate with each other, we do not argue that these two constructs are not related to each other at all. Rather, we suggest that both paradigms are part of the same construct (i.e., imitation), but measure different aspects of it. That is, while mimicry measures imitation in a social context, automatic imitation measures imitation in a laboratory context instead (for a similar distinction see [28]).
Second, another reason for the missing correlation could be found in the reliability of the two tasks. While the automatic imitation task produced good reliability, the mimicry task was not reliable at all. It is important to note that in line with past research on the mimicry task (e.g., [6,19,27]), the inter-judge reliability of coded actions was high in our study. However, as far as we know, we measured for the first time the actual reliability of the mimicry taskthat is the internal consistency of mimicry performance between odd and even minutes-and detected very low reliability. Psychometric theory has shown that unreliable tasks are less likely to correlate with other tasks (cf., [93,94]). Thus, it might well be that mimicry and automatic imitation are strongly linked to each other, but that the low reliability of the mimicry task reduces the likelihood to detect such a relation. Despite the low reliability, it is important to note, however, that a reduced reliability does not necessarily indicate that the task is not replicable (e.g., [95]). Indeed, mimicry tasks have been replicated numerous times by different researchers in different labs (for an overview see [30]).
Since past research has not reported internal consistency of mimicry performance, we do not know how reliable mimicry tasks are in general. But, if mimicry tasks are indeed rather unreliable, one should interpret past correlations of this task with caution. Future research may, thus, consider other tasks that assess mimicry with multiple trials in order to achieve higher internal consistency. For example, one could use tasks that measure mimicry of facial expression. It has been found that such tasks have good sensitivity to ASD related behaviors [96], and are sensitive to social variables [97].
The second finding that most of the assessed personality scales did not correlate with mimicry and automatic imitation in a meaningful way may be surprising. There might be different reasons for such a finding. First, given that unreliable tasks are less likely to correlate with other constructs (cf., [93,94]), it should be not surprising that mimicry did not correlate with any of the assessed personality scales. Moreover, since some of the assessed scales were not very reliable either, it is also not surprising that we did not find more significant results. Second, it might be that controlling for multiple comparisons reduced statistical power and as such masked some potential effects. Indeed, when not controlling for multiple comparisons, some correlations were significant. It is important to note, however, that all correlation coefficients were very low (i.e., rs < .27). Since irrelevant effects can become significant with large samples (e.g., [98,99]), one should be careful not to overinterpret such low correlations. Third, it might be that the assessed scales actually do not correlate with imitation. This interpretation is in line with other recent studies that did not find significant relations between imitation and different personality scales either. For instance, Butler and Ramsey [66] did not find a relation between automatic imitation and extraversion, agreeableness, disorders of social cognition (i.e., autistic-like and schizotypal traits), narcissism, and empathy. A crucial difference between studies that found a relation with personality scales and studies that did not is the sample size. While previous studies used rather low numbers of participants, we, but also Butler and Ramsey [66], aimed for high power. As underpowered studies in combination with publication bias increase the likelihood of false positives (e.g., [99]), one should regard previous findings suggesting a relation between personality traits and imitation with caution.
Moreover, the finding that the automatic imitation task does not correlate with mimicry and most social related personality traits raises the question to which degree automatic imitation actually taps into social processes. While previous research suggests that automatic imitation relates to social factors such as eye contact [100], social attitudes [101], human-like actions (e.g., [102,103]), or a pro-social focus [57][58][59]101,104,105], there are also a few studies that do not support this view. For instance, some studies did not find increased automatic imitation in human as compared to non-human agents (e.g., [54,106,107]). Moreover, Butler and Ramesey [66] did not find a relationship between social components of personality traits and automatic imitation. Likewise, some researchers did not find reduced automatic imitation in individuals with autism [62][63][64]. Finally, Farmer et al. [108] did not find an influence of social status and power on automatic imitation. Thus, in light of these contradictory findings, more research is needed in order to draw specific conclusions about the degree to which social factors facilitate automatic imitation.
Despite the lack of correlations, there are some other findings that merit further discussion. First, we found a significant correlation between the reaction-time-based congruency effect and the Personality Distress subscale of the Interpersonal Reactivity Index (IRI; [52]). While this finding goes in line with other research suggesting a positive relationship between automatic imitation and empathy [47][48][49], our effect should be interpreted with caution because this relation was only significant between one subscale of the IRI and one automatic imitation score. Second, when not controlling for multiple comparisons, some correlations between automatic imitation and the assessed personality scales would actually have correlated significantly. Although due to our large sample size [98,99] the correlations that were significant at the uncorrected level should be regarded with caution, future research may follow up these correlations in order to test to which degree they are nonetheless reliable. Third, we found a significant gender difference in automatic imitation with female participants showing stronger automatic imitation effects than male participants. While this finding is in line with previous studies (e.g., [66]), it raises the question of why this is the case. It might well be that differences in personality traits account for this difference. However, personality traits including empathy, autism-spectrum quotient, and traits related to self-versus other-focus cannot explain the sexdifference, as we did not find a relation between these traits and the automatic imitation task. Thus, future research may test other personality traits that are more prone to sex-differences.

Limitations
First, it is possible that our null results could be explained by the quality of the assessed scales. Indeed, some subscales had rather low reliability. However, most scales that have been previously found to correlate with imitation (i.e., perspective taking, self-construal, autism total scale) had an overall good reliability. Moreover, all measures that were used had been validated and are well established in the literature. Nevertheless, future research should use additional scales and scales with more reliable long-formats.
Second, one could argue that automatic imitation in the current study was confounded with spatial compatibility. To control for left-right spatial compatibility, we used a well-known method in which the stimulus hand is presented orthogonal to the response hand (e.g., [70,71,72]). Nevertheless, a potential concern with this method is that it fails to control for orthogonal spatial compatibility [109]. As a result, it is possible that orthogonal spatial processes introduced noise to the data that in turn masked potential correlations. Given that automatic imitation can still be observed even when spatial processes cannot contribute [28], future research should use tasks that control not only left-right, but also orthogonal spatial compatibility [110] in order to reduce noise in the data and as such increase statistical power.
Third, as it is common in psychological research, most of our participants were female. Although 61 male subjects is atypically high for psychological research, it may still be that the lower percentage of men, as compared to women, reduced the variability within the tasks causing null effects. In a related vein, it is important to note that all of our subjects were young psychology students. This, again, may have caused low levels of variance within the tasks, increasing the likelihood of finding null effects. Future research should, thus, aim at assessing more intermixed samples.

Conclusion
The present study provides novel insights into the understanding of imitation. First, we found no relationship between mimicry and automatic imitation suggesting that, despite similar underlying mechanisms, the two forms of imitation are less related to each other than initially suggested. Second, we did not find meaningful relationships between any form of imitation and most personality traits including empathy, autism-like personality traits, and traits related to self-versus other-focus. Therefore, we suggest in line with other research (i.e., [66]) that the relationship between social components of personality traits and imitation is less universal than previously reported in the literature. As previous studies on the relation between personality traits and imitation tested their predictions on rather low numbers of participants, our research stresses the importance to study larger samples in order to replicate and extend previously established findings.