Learning new content and vocabulary in a foreign language can be particularly difficult. Yet, there are educational programs that require people to study in a language they are not native speakers of. For this reason, it is important to understand how these learning processes work and possibly differ from native language learning, as well as to develop strategies to ease this process. The current study takes advantage of emotionality—operationally defined as positive valence and high arousal—to improve memory. In two experiments, the present paper addresses whether participants have more difficulty learning the names of objects they have never seen before in their foreign language and whether embedding them in a positive semantic context can help make learning easier. With this in mind, we had participants (with a minimum of a B2 level of English) in two experiments (43 participants in Experiment 1 and 54 in Experiment 2) read descriptions of made-up objects—either positive or neutral and either in their native or a foreign language. The effects of language varied with the difficulty of the task and measure used. In both cases, learning the words in a positive context improved learning. Importantly, the effect of emotionality was not modulated by language, suggesting that the effects of emotionality are independent of language and could potentially be a useful tool for improving foreign language vocabulary learning.
Citation: Frances C, de Bruin A, Duñabeitia JA (2020) The effects of language and emotionality of stimuli on vocabulary learning. PLoS ONE 15(10): e0240252. https://doi.org/10.1371/journal.pone.0240252
Editor: Koji Miwa, Nagoya University, JAPAN
Received: March 16, 2020; Accepted: September 22, 2020; Published: October 7, 2020
Copyright: © 2020 Frances et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data used for this manuscript are available at https://osf.io/64b3e/?view_only=eac93c2fc41b42fb9cd3519466cd62af.
Funding: This research has been partially funded by the following entities: Spanish Government [https://ciencia.sede.gob.es/]: PGC2018-097145-B-I00 (JAD), RED2018-102615-T (JAD), SEV-2015-0490, and BES-2016-077169 (CF) Comunidad de Madrid [https://www.comunidad.madrid/]: H2019/HUM-5705 (JAD) The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Our current study addresses the question of whether emotionality—specifically, positive valence and high arousal—affects word learning in a native and a foreign language. In particular, the focus is on cases in which one’s only experience with the to-be-learned object is a definition without having seen or touched that object in person. Although this question may seem odd, in reality, when we learn new information in academic settings, our experience with the content we learn is quite limited. If we take the example of biology or history class, the information we learn there, regardless of how concrete, will hardly ever be a part of our sensory experience. According to Kousta et al. , this should lead to poorer representations and more difficulty in acquisition. In cases like Content and Language Integrated Learning (CLIL), new information is also conveyed in a foreign language, adding to the complexity of learning and remembering information that is abstracted from sensory experience. Given this increased difficulty in learning, it is important to find tools that can improve or facilitate this process. One possible tool is emotionality, as it has been suggested to improve learning in other cases [1,2], but it is not clear whether it also be useful in this kind of experience. In particular, would having only verbal and not sensory experience with the content make emotionality useful even for acquiring new concrete nouns? Kousta et al.’s  theory would suggest that emotionality, given its independence from concreteness, would indeed play a facilitatory role in that case.
In this study, we therefore examined whether vocabulary learning in a native and foreign language can be facilitated by emotionality. We use the terms ‘native’ and ‘foreign’ rather than ‘first language’ and ‘second language’ because we focused on bilinguals living in a bilingual society who speak multiple languages in the community but also acquire another language (English) at school. In many (monolingual and bilingual) societies it is now common for children to acquire foreign languages at school. In some cases, the foreign language is taught through content courses in that language. Currently, most of the research addressing this type of learning in a foreign language comes from classroom studies on CLIL using children. This literature focuses on language outcomes more so than on content learning and has found mostly positive results in this area [3–11]. The literature on adult foreign language medium learning is more limited but also focuses on benefits associated with language  and not content. The research on content learning is both more restricted and less clear-cut, with studies showing positive, negative, and null results . Overall, studies tend to focus on the efficiency of courses in general, evaluating overall performance at the end of the term. Very few studies compare the immediate understanding and learning of new content in a foreign and a native language. Those who do have found either detrimental or null effects of foreign language use, depending on the task [14,15], particularly without foreign language support . These results have been accounted for in the context of cognitive load theory, which suggests a working memory overload for individuals trying to learn content in a language they are not proficient in . Importantly, contributing to this literature would improve our understanding of whether foreign and native language learning differ as well as providing insights into possible improvements on foreign language teaching methods.
As mentioned before, we will focus on emotionality as a potential way of improving word learning. In particular, emotionality provides greater richness to lexical items, aiding their processing  and increasing familiarity at the time of retrieval. This process is facilitated by an enhancement of attention during encoding, leading to more durable memory . This idea has been supported by studies on language processing in participants’ native language . These studies have shown that emotionality facilitates learning in the native language—with emotional words having lower ages of acquisition—as well as aiding processing and memory . But this phenomenon is still poorly understood, especially in the case of bilinguals. In addition, research so far has focused mostly on how we process or respond to known emotional words, but we do not have a clear idea of how learning in general and vocabulary learning in particular are affected by the valence of the new word or concept. Therefore, it is unclear whether we learn words that refer to new emotional concepts better or easier than those referring to new neutral concepts.
In the foreign language, the story is not as simple. This is in part because of a decrease in emotionality in the foreign language . Focusing on memory for known words, the effects of this decreased emotionality are very inconsistent. Some studies have found enhanced memory for emotional as compared to neutral words in a foreign language [19–23] while others have not . In addition, others have even found inconsistent  or inverse effects .
The literature on emotionality and word acquisition in a foreign language is quite scarce. Most studies have focused on acquiring new lexical forms for known concepts. For example, Ferré, Ventura, Comesaña, and Fraga  studied new lexical form learning in a foreign language through word paring—associating a new item with the native language word. Results showed an improvement in later word recognition and translation for abstract words that were emotional (positive or negative). This shows that new labels for known concepts carry the same emotionality effects as the original terms, but in that context, emotionality played a role only in the acquisition of new forms for abstract words.
The question of whether emotionality affects word learning in a non-native language relates to the foreign language effect: the idea that people are less biased and more emotionally distant in their foreign language . It is hypothesized that this effect may be a consequence of learning the foreign language in an emotionally attenuated academic environment . Therefore, understanding how emotionality affects new word learning in a foreign language—i.e., new concepts with their associated form, not just learning new labels for known words or remembering known words—has implications for understanding the foreign language effect. To test this explanation, we have to manipulate the circumstances or context of the first exposure to a word or concept. In the current study, we do exactly that: we teach participants new concepts for which they do not have an equivalent native language word. This allows us to see what the effects of emotionality are as it is conveyed only by the definition of a word, without any prior exposure to the word or concept, or knowledge of the object.
Most studies so far have examined how emotionality affects the acquisition of new labels for known objects rather than learning of new words for new concepts. One notable exception is the study by Brase and Mani , which showed emotionality effects for new negative words in an emotional Stroop task and a sentence completion task. In the foreign language condition, this occurred only in the emotional context, whereas in the native language, they showed an effect in both emotional and neutral contexts. But, it is worth noting that their recall results are in contrast with prior studies, with no emotionality effect in the native language and a restricted effect in the foreign language—only for negative words in the emotional context.
Frances et al.  took a similar approach to the one in the present study, where participants learn new information—in their case new content, instead of new vocabulary—that was embedded in a positive or a neutral semantic context. This contrasts with prior studies that focused either on memory for known words or on the acquisition of new lexical items for known concepts. They found that positively valenced content was remembered better than neutral content and, importantly, that this effect occurred in the native and foreign languages equally. As we are using a similar strategy of teaching new information—in this case, new objects and their names—in the foreign or native language, we expect similar results. Namely, we expect enhanced memory for positive items and a similar improvement in both languages.
The foreign language literature has shown inconsistent effects of emotionality and is unclear about whether emotionality helps increase vocabulary in that language. Therefore, the purpose of the current study is to understand new concept acquisition, removing the influence of prior experience with that concept or item. Importantly, in the current study, new information is taught implicitly through descriptive texts, which are more naturalistic than single, isolated words. This provides a more accurate understanding of what happens in more realistic learning situations.
Other studies have manipulated the emotionality of the context through association with emotional images  or paralinguistic information . For the purposes of the present work, we have operationally defined emotionality as positive valence and high arousal manipulated through the semantic context of the new word. Therefore, we manipulated emotionality semantically by conveying descriptions of objects in either a positive or neutral manner.
Given that one of the possible applications for this research is classroom contexts with students of different ages, only positive and neutral descriptions were used, as negative materials would not be advisable for educational settings. Furthermore, some studies have found that negative stimuli do not produce the same effects as positive stimuli and can hinder performance [19,21], particularly for the surrounding neutral words as they capture attention, unlike positive words which aid performance in these cases [32–34].
In short, this study aims to understand the effects of positive valence—manipulated semantically—on vocabulary learning. One of the priorities in the current study is to use naturalistic classroom-type stimuli than most prior research. The experiments here use novel words and concepts conveyed through paragraph-long definitions. We would like to emphasize that although the results of this study may have practical applications, our goal is to understand how learning can be influenced from a theoretical point of view. The ultimate goal is to understand how we process and incorporate this type of information in a foreign language, in particular as it compares to our native language.
Participants were recruited from the Participa platform at the Basque Center on Cognition, Brain and Language. This database includes extensive demographic information, information on language history and use, and any physical or cognitive impairments. All eligible participants received an email inviting them to participate. Participants were typical young adults from the Basque Country.
Fifty-six participants were tested, three participants were removed for low accuracy (< 50%) in the old/new recognition task (two from the English condition and one from the Spanish condition) and 10 were removed for low accuracy (< 33%) in the name matching task (five from the English condition and five from the Spanish condition). The remaining participants were 43 native Spanish speakers (24.14 years old, SD = 3.99) with an intermediate to high level of English (BEST—Basque, English, and Spanish Test—Interview scores of four or five out of five ).
Language was a between-subjects factor. Participants did only one language condition: the native language condition (i.e., Spanish: 23) or the foreign language condition (i.e., English: 20 participants). Groups were matched for age (see Table 1), gender (13 females in the Spanish group and 12 females in the English group), education level (see Table 2), student status (17 students in the Spanish group and 16 in the English group), verbal and nonverbal IQ (Kbit) , age of acquisition of English, and Spanish and English LexTALE [37,38] (see Table 2). In addition, all participants were trilingual, with the exception of two participants in the Spanish condition who were only bilingual.
The experiment was approved by the Ethical and Scientific committees at the Basque Center on Cognition, Brain and Language (BCBL) and all participants provided written consent to participate in the study (Approval number: 7209). They were compensated 8€ for their time.
The stimuli consisted of 46 images of invented objects that had been used in another study  and 92 disyllabic pseudowords (46 for names of objects, 46 for fillers used in testing phase—see S1 Table for the full list of pseudowords and their corresponding bigram frequencies by language). Object names were equally likely to be an English or Spanish word (e.g., chefio) as measured by average token bigram frequency for each word (MSP = 491, SDSP = 411; MEN = 594, SDEN = 374; t(45) = 1.22, p = .230, BF01 = 3.14, error % = 8.24 x 10−6) using B-Pal for Spanish  and N-Watch for English .
The object names were presented in 46 descriptions of made-up objects, their origin, and their use (see Fig 1 for an example). A positive and a neutral version of each description was created. The details changed between versions, but there was always at least one characteristic in common between the two descriptions (e.g., “it is a type of maze”). Each description was translated to create an English and a Spanish version. Descriptions were matched on length by language (MSpa = 45.2, SDPos = 3.21; MEng = 45, SDNeu = 3.07; t(45) = .349, p = .728, BF01 = 5.90, error % = 1.15 x 10−5) and emotionality (MPos = 46, SDPos = 3.82; MNeu = 44.8, SDNeu = 3.52; t(45) = 1.62, p = .111, BF01 = 1.85, error % = 7.93 x 10−4). Paragraphs had a length of 39 to 54 words per description, M = 45.7, SD = 3.79. Length was also matched using a 2-way ANOVA with language and emotionality on length of paragraph (p’s > .05, see Table 3).
All four versions of an example stimulus (neutral English, neutral Spanish, positive English, and positive Spanish) as well as the corresponding image. Due to copyright restrictions, the image is not one of the ones from our stimulus set, but it is similar to those we used.
Each description was controlled for word frequency to match emotion conditions. Word frequency was computed by taking the average frequency of every adjective, noun, adverb, and verb (excluding auxiliary and modal verbs) in the paragraph. For Spanish, we used EsPal  and matched the two emotionality conditions (t(45) = .507, p = .614, BF01 = 5.54, error % = 1.12 x 10−5, see Table 3 for means and standard deviations). For English, we used the frequency from the Hyperspace Analogue to Language  using the English Lexicon Project  and matched the two emotionality conditions (t(45) = .525, p = .602, BF01 = 5.49, error % = 1.11 x 10−5, see Table 3).
Positive and neutral descriptions had significantly different valence (ANEW ) for both Spanish (t(45) = 4.26, p < .001, BF01 = 218, error % = 2.41 x 10−5) and English (t(45) = 7.86, p < .001, BF01 = 1.96 x 107, error % = 8.24 x 10−13), see Table 3. Positive and neutral descriptions also had significantly different arousal (ANEW—Bradley & Lang ) for both Spanish (t(45) = 5.01, p < .001, BF01 = 2.02 x 103, error % = 2.46 x 10−6) and English (t(45) = 7.38, p < .001, BF01 = 4.19 x 106, error % = 1.85 x 10−12), see Table 3. In addition, and in order to validate the emotionality (both valence and arousal) of the descriptions, we carried out a norming study with 18 participants from the same population as the study evaluating the descriptions in Spanish (see S1 Fig for instructions), on a scale from 0 to 4. The by item analysis found that items were considered significantly higher valenced in the positive version (MPos = 3.27, SDPos = .335) than the neutral version (MNeu = 2.21, SDNeu = .352), t(45) = 15.48, p < .001, BF01 = 3.96 x 1016, error % = 5.76 x 10−21). Similarly, items were considered to cause significantly higher arousal in the positive version (MPos = 1.32, SDPos = .454) than the neutral version (MNeu = .459, SDNeu = .394), t(45) = 8.56, p < .001, BF01 = 1.86 x 108, error % = 6.17 x 10−11).
Participants were recruited from a database with some basic questionnaires: K-Bit—Kaufman Brief Intelligence Test, —, LexTALE in English  and Spanish , linguistic background, socioeconomic background, and experience with bilingual education. These were used to select participants and match the groups. Then, they had a learning and a testing phase, which were either in English or Spanish (language was a between-subjects factor).
The learning phase consisted of seeing the picture of each object along with its description (containing the object name 4 times), one by one. Participants were instructed to read for comprehension. After seeing the object and reading the description for a minimum of 15 seconds (no Next button visible until the timeout) participants were allowed to move on to the next screen at their own pace. Participants took an average of 27.10 seconds (SD = 6.99 seconds) to read each description. The next screen asked them to type in the object's name. If they typed it incorrectly, they were taken back to the previous screen and then asked to type it again. This happened very infrequently (on average for two words for every three participants) and equally in every condition (MNeu-Eng = .186 words, SDNeu-Eng = .546; MPos-Eng = .163 words, SDPos-Eng = .374; MNeu-Spa = .163 words, SDNeu-Spa = .374; MPos-Spa = .163 words, SDPos-Spa = .374).
After seeing all 46 descriptions (23 positive and 23 neutral descriptions randomly mixed), they moved on to a filler task in order to reduce recency effects and assess working memory. The filler task consisted of a Corsi task (forwards and backwards)  which was timed to last 15 minutes. The purpose of the filler task was to decrease serial position effects [47,48] and reflect only information that had been more permanently registered in memory .
After the filler task, the testing phase started. The first task consisted of a cued recall in which participants were shown the image of an object and asked to recall its name (Fig 2A). This task was not timed. The second task was an old/new recognition task in which participants were presented the 46 object names along with 46 fillers one at a time and had 3000ms to say whether they had seen the word before or not (Fig 2B). The main task we were interested in, third task, consisted of a cued name-matching recognition task in which participants had 5000ms to select the correct name of the objects out of three choices (Fig 2C). For this task, the two distractors were pseudowords that had also been presented during the learning phase chosen randomly and matched for emotionality. This was our main task of interest because it was fully language independent—only the image and the name were presented, which were the same in both language conditions—and thus theoretically equally difficult. Finally, the last task was an attentional check that consisted of matching the correct characteristic that goes with the object (Fig 2D), with 5000ms to respond. The characteristic that had been held constant in both versions of the object description—positive and neutral—was used for this task. The two fillers were chosen at random from the other emotion-matched objects.
(a) Name recall task where participants were asked to type in the name of the object, (b) old/new recognition task where participants had to respond whether they had seen the word before or not, (c) name matching task where participants selected the correct name for the object, and (d) attentional check where participants selected the correct characteristic for the object. All responses were given using the keyboard (F and J keys for Old/New and A, F, and J keys for the matching tasks). Due to copyright restrictions, the image is not one of the ones from our stimulus set, but it is similar to those we used.
The order of tasks was chosen so that the earlier tasks would not affect the later tasks and held constant across participants. The name recall task was placed first because it was likely to be influenced by seeing the words again in the recognition tasks, but was unlikely to affect the other tasks. Then, in order to prevent increased familiarity with the learned items, we kept the Old/New task—which is purely based on familiarity—before the name matching task. Given that the attentional check was not of theoretical interest, we kept that task last.
Results and discussion
The following are the linear mixed effect model analyses for both experiments. In all cases, analyses were run using linear mixed-effects models in R, lme4  and lmerTest packages . Both two-level categorical predictors (emotionality and language) were coded as -0.5 and 0.5 (neutral/positive and English/Spanish). Subjects and items were included as random effects, but due to convergence issues, random slopes varied by analysis. Models on response times have the log transformation of response time as the dependent variable.
The models included all fixed effects of interest, as well as random intercepts for participants and items, unless otherwise stated. When models did not converge, all correlations between the random slopes and the random intercepts were removed . If the model still did not converge after removing the random correlations, we built down the random effects structure by removing the item slopes that explained the least variance until convergence was reached and singular fits were fixed.
Only the LME results are reported here. For the ANOVA results and the comparison between the two, please see (S1 Appendix).
Name matching task.
The first model had accuracy (0 incorrect; 1 correct) as the dependent variable and language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction as fixed effects (see Table 4 for means, standard deviations, and 95% confidence intervals and Fig 3A for percentage of errors by condition). In addition, we included the random intercepts for subject and item. In the model, there was a significant main effect of emotionality (β = .217, SE = .092, z = 2.35, p = .019), such that positive emotionality led to higher accuracy, and of language (β = .245, SE = .092, z = 2.65, p = .008), such that Spanish led to higher accuracy. There was no interaction between emotionality and language (β = .020, SE = .183, z = .108, p = .914).
The graphs show (a) percentage of errors and (b) response time. Error bars show 95% confidence intervals.
The model with log transformed response time as the dependent variable had language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction as fixed effects (see Table 4 for means, standard deviations, and 95% confidence intervals and Fig 3B for average response times by condition). In addition, we included the random intercepts for subject and item and random slopes for emotionality by subject. In the model, there was no main effect of emotionality (β = -.042, SE = .027, t = -1.53, p = .137) or of language (β = -.005, SE = .025, t = -.215, p = .830) and no interaction between emotionality and language (β = -.023, SE = .047, t = -.496, p = .620). (See S1.2 Table in S1 Appendix for the 90% confidence intervals of the effect sizes.)
Given the null interactions, we followed up the analyses using Bayesian statistics on the by-subject averages. Analyses were run using JASP statistical software (version 0.12.2) . We used the default settings for all model comparisons (the priors were r scale fixed effects of .5, r scale random effects of 1, and r scale covariates of .354). For the t-test the prior was a Cauchy distribution of scale .707.
First, we analyzed the effects on accuracy. We ran a comparison between a model predicting accuracy from Language, Emotionality, and Subject to one which also included the interaction (Language x Emotionality). The model with the additional factor—the interaction—showed evidence that the data was approximately three times more likely to be observed under the null model without the interaction compared to the one with it, BF01 = 3.33, error % = .485 (by item, BF01 = 4.65, error % = 2.26), suggesting that accuracy was not affected by an interaction between language and emotionality. In a similar vein, we ran a t-test comparing the emotionality effect (accuracy on the positive condition minus accuracy on the neutral condition) and found moderate evidence that the emotionality effect was in fact the same between languages, BF01 = 3.31, error % = .016 (by item, BF01 = 6.08, error % = 1.67 x 10−5).
We then ran the same analyses on response time. We ran a comparison between a model predicting response time from Language, Emotionality, and Subject to one which also included the interaction (Language x Emotionality). The model with the additional factor—the interaction—did not show an improvement, BF01 = 3.29, error % = .365 (by item, BF01 = 2.80, error % = 4.39), suggesting that the interaction did not have an effect on response time and that the data was moderately more likely to be observed under the model without the interaction rather than the one with it. In a similar vein, we ran a t-test comparing the emotionality effect (response time on the positive condition minus response time on the neutral condition) and found moderate evidence that the emotionality effect was in fact the same between languages, BF01 = 3.30, error % = .016 (by item, BF01 = 4.43, error % = 1.09 x 10−5).
Name recall task.
The name recall task was evaluated in two ways: exact correct recalls and Levenshtein distance (LD)  between the produced string and the original item. For the first analysis, the number of exact recollections (correct word with the correct object) were counted for each participant within each emotionality condition. As expected, given the difficulty of the task, recall was very low , on average fewer than two words (M = 1.91 words, SD = 2.75 words).
The first model had accuracy (0 incorrect; 1 correct) as the dependent variable and language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction as fixed effects (see Table 5 for means, standard deviations, and 95% confidence intervals by condition). In addition, we included the random intercepts for subject and item. In the model, there was no main effect of emotionality (β = .429, SE = .296, z = 1.45, p = .147) but there was one of language (β = 1.21, SE = .306, z = 3.94, p < .001), such that participants recalled more words in Spanish than in English, and no interaction between emotionality and language (β = .577, SE = .586, z = .985, p = .325).
Then, we calculated the normalized LD for each response. All responses that were shorter than 3 characters or contained a real word related to the object were removed as they were not considered real attempts. When calculating the LD, the number of insertions, deletions, and alterations needed to get from the produced word—the recalled string—to the original word—the studied string—is taken into account to calculate a standardized value. Then, these values are divided by the word length in order to normalize the value [55–57]. We also ran a model on the LD between the produced word and the correct response. The dependent variable was the normalized LD and language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction were the fixed effects. In addition, we included the random intercepts for subject and item. In the model, there was no main effect of emotionality (β = -.179, SE = .131, t = -1.30, p = .195) but there was one of language (β = -.374, SE = .142, t = -2.63, p = .008), such that participants produced strings closer to the correct one in Spanish than in English, and no interaction between emotionality and language (β = .082, SE = .249, t = .330, p = .742).
Old/new recognition task.
The first model had accuracy (0 incorrect; 1 correct) as the dependent variable and language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction as fixed effects (see Table 6 for means, standard deviations, and confidence intervals by condition). In addition, we included the random intercepts for subject and item. In the model, there was no main effect of emotionality (β = -.085, SE = .103, z = -.829, p = .407) or of language (β = -.024, SE = .108, z = -.227, p = .820) and no interaction between emotionality and language (β = .115, SE = .204, z = .561, p = .575).
The model with log transformed response time as the dependent variable had language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction as fixed effects. In addition, we included the random intercept for subject but had to remove the random effects by item in order for the model to converge. In the model, there was a significant main effect of emotionality (β = -.046, SE = .018, t = -2.55, p = .011), such that positive emotionality led to faster response times, and of language (β = .079, SE = .020, t = 3.96, p < .001), such that Spanish led to higher accuracy. There was no interaction between emotionality and language (β = -.017, SE = .036, t = -.488, p = .626).
The effects of language and emotionality did not appear consistently in all measures—they were present for name matching accuracy, but not response time; old/new response time, but not accuracy; and name recall only showed an effect of language. Importantly, we consistently found no interaction between language and emotionality, but need further evidence to verify that this interaction in fact does not exist. In addition, the language comparison was between subjects and the task was quite difficult in general. This suggests that the results need to be conceptually replicated. The next experiment addresses this and provides a close conceptual replication of this experiment.
The main goal of Experiment 2 was to replicate the results of Experiment 1 with a slightly different design. Besides that main goal, there were several minor goals. First, we wanted to have a fully within-subjects design in order to increase power, in case there was an interaction we were unable to detect before. Second, we wanted to improve the design and performance—in particular in the name recall task—by reducing the number of items to be learned per session. Although the total number of items was similar, participants learned only half in each of 2 sessions and the duration of the distractor task was reduced. Third, we wanted to remove the possible influence of the old/new task, which was not essential for answering our question. The attentional check also was not essential, but was maintained (at the end of the experiment) in order to (1) continue to assess whether participants had read the texts in their entirety and (2) to make sure that between session 1 and session 2 they were equally motivated to pay attention to the full texts, not just the object names.
Sixty participants from the same pool as Experiment 1 were tested. One participant was excluded for not following directions, four more were removed for low accuracy in the name matching task (<50% accuracy in either language), and one more was removed for a technical error. The remaining participants were 54 (17 male) native Spanish speakers (25.98 years old, SD = 6.19) with an intermediate to high level of English (as in experiment 1). Participants were not significantly different (p’s > .05) from those in Experiment 1 on any of the following variables: age, gender, education level, student status, verbal and nonverbal IQ (Kbit) , age of acquisition of English, and Spanish and English LexTALE [37,38]. As in Experiment 1, all participants were at least trilingual, with the exception of three participants that were only bilingual.
The experiment was approved by the Ethical and Scientific committees at the BCBL and all participants provided written consent to participate in the study (Approval number: 11709). They were also compensated for their time with 12€ after finishing the second session.
Stimuli were the same as in Experiment 1, except for six items (bangel, dela, fortor, mova, testor, and trequi). These items were removed in order to reduce the stimuli to 10 items per condition (neutral Spanish, positive Spanish, neutral English, and positive English).
Participants were recruited from the same database as in Experiment 1. Each participant completed two sessions: one in English (foreign language) and one in Spanish (native language). The two sessions were at least 5 days apart and the order of sessions was counterbalanced between participants. The learning phase was the same as in Experiment 1. Participants took an average of 32.07 seconds (SD = 8.57 seconds). Again, participants had to retype the words very infrequently and equally in every condition (MNeu-Eng = .63, SDNeu-Eng = 1.12; MPos-Eng = .52, SDPos-Eng = .86; MNeu-Spa = .50, SDNeu-Spa = 1.13; MPos-Spa = .24, SDPos-Spa = .61). After seeing the 20 descriptions (10 positive and 10 neutral descriptions randomly mixed) for that session, they moved on to a filler task—forwards Corsi in session 1 and backwards Corsi in session 2 —which was programmed to last 7 minutes.
After the filler task, the testing phase started and concerned only the 20 items from that session. The first task consisted of a cued recall identical to that of Experiment 1 (Fig 4A). The second task consisted of a cued name-matching recognition task similar to that of Experiment 1, except that participants had 2500ms to select the correct name of the objects out of two choices (Fig 4B). The response options were reduced to 2 in order to increase the reliability of response times. For this task, the distractors were pseudowords that had also been presented during the learning phase, chosen randomly. For each item there were 2 trials: one with a distractor of equal emotionality and one with the distractor of opposite emotionality.
(a) Name recall task in which participants were asked to type-in the name of the object, (b) name matching task in which participants selected the correct name for the object, and (c) attentional check in which participants selected the correct characteristic for the object. In Tasks 2 and 3, participants responded using the keyboard keys F for left and J for right. Due to copyright restrictions, the image is not one of the ones from our stimuli, but it is similar to those we used.
The last task, the attentional check, was modified to display the object name rather than the image when asking for the correct characteristic (Fig 4C). The reason behind this change is that some of the characteristics were quite visual, making the discrimination deducible by the image and not dependent on the description. In any case, this task was kept only as a check. Participants had only two response options and 3500ms to respond. As with the name matching task, there were 2 trials for each item, one with a distractor of equal emotionality and one with the distractor of opposite emotionality.
Results and discussion
All analyses were run as in Experiment 1. Only the LME results are reported here. For the ANOVA results and the comparison between the two, please see (S1 Appendix).
Name matching task.
The first model had accuracy (0 incorrect; 1 correct) as the dependent variable and language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction as fixed effects (see Fig 5A and Table 7 for means, standard deviations, and 95% confidence intervals). In addition, we included the random intercepts for subject and item and the language slopes by participant (uncorrelated). In the model, there was a significant main effect of emotionality (β = .446, SE = .150, z = 2.98, p = .003), such that positive emotionality led to higher accuracy, but no effect of language (β = .170, SE = .135, z = 1.26, p = .209) and no interaction between emotionality and language (β = -.112, SE = .153, z = -.732, p = .464).
The graphs show (a) percentage of errors and (b) response time in milliseconds. Error bars show 95% confidence intervals.
The model with log transformed response time as the dependent variable had language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction as fixed effects (see Fig 5B and Table 7 for means, standard deviations, and 95% confidence intervals). In addition, we included the random intercepts for subject and item and random slopes for emotionality by subject. In the model, there was a main effect of emotionality (β = -.065, SE = .020, t = -3.21, p = .003), such that positive items were responded to faster, and of language (β = -.026, SE = .009, t = -3.00, p = .003), such that participants responded faster in Spanish than in English, but no interaction between emotionality and language (β = .007, SE = .017, t = .403, p = .687).
As in Experiment 1, we followed up the null interactions using Bayesian statistics on the by-subject averages using the same software and priors.
First, we analyzed the effects of the interaction on accuracy. We ran a comparison between a model predicting accuracy from Language, Emotionality, and Subject to one which also included the interaction (Language x Emotionality). The model with the additional factor—the interaction—showed evidence that the data was almost four times more likely to be observed under the model without the interaction compared to the one with it, BF01 = 3.84, error % = .50 (by item, BF01 = 2.13, error % = 1.22), suggesting that accuracy was not affected by an interaction between language and emotionality. We ran a t-test comparing the emotionality effect (accuracy on the positive condition minus accuracy on the neutral condition) and found moderate evidence that the emotionality effect was in fact the same between languages, BF01 = 5.40, error % = .009.
We then ran the same analyses on response time. We ran a comparison between a model predicting response time from Language, Emotionality, and Subject to one which also included the interaction (Language x Emotionality). The model without the additional factor—the interaction—was almost four times more likely compared to the one with it, BF01 = 3.81, error % = 3.75 (by item, BF01 = 2.81, error % = .563), suggesting that the interaction did not have an effect on response time and that the data was moderately more likely to be observed under the model without the interaction rather than the one with it. We ran a t-test comparing the emotionality effect (response time on the positive condition minus response time on the neutral condition) and found moderate evidence that the emotionality effect was in fact the same between languages, BF01 = 4.69, error % = .019.
Name recall task.
The name recall task was evaluated as in Experiment 1. For the first analysis, the number of exact recollections (correct word with the correct object) were counted for each participant within each emotional condition. As expected, given the difficulty of the task, recall was very low ; on average fewer than four words out of the 40 (M = 3.80 words, SD = 3.80 words) were correctly recalled. The first model had accuracy (0 incorrect; 1 correct) as the dependent variable and language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction as fixed effects (see Table 8 for means, standard deviations, and confidence intervals). In addition, we included the random intercepts for subject and item. In the model, there was no main effect of emotionality (β = .385, SE = .281, z = 1.37, p = .170) nor of language (β = .195, SE = .153, z = 1.27, p = .203), and no interaction between emotionality and language (β = -.107, SE = .305, z = -.349, p = .727).
We also ran a model on the Levenshtein distance between the produced word and the correct response. The dependent variable was the normalized Levenshtein distance and language (English or Spanish) and emotionality (Neutral or Positive), as well as their interaction were the fixed effects. In addition, we included the random intercepts for subject and item. In the model, there was no main effect of emotionality (β = -.030, SE = .023, t = -1.29, p = .204) but there was one of language (β = -.043, SE = .013, t = -3.23, p = .001), such that participants produced strings closer to the correct one in Spanish than in English, and no interaction between emotionality and language (β = -.009, SE = .026, t = -.338, p = .735).
Finally, the average score on the attentional check for English was 73.2% (SD = 11.5%) and for Spanish it was 73.5% (SD = 11.0%). We used both a frequentist and a Bayesian paired samples t-test of language on accuracy by subject to verify that participants read equally well in both language conditions. The test showed moderate evidence that the null model was more likely than the alternative model, t1(53) = .22, p = .829, Cohen’s D = .029, BF01 = 6.59, error % = .001, t2(38) = .64, p = .525, Cohen’s D = .203, BF01 = 5.72, error % = 9.91 x 10−6.
Overall, the results of Experiment 2 show a more consistent effect of emotionality in recognition, but not in recall. The effect of emotionality appears in the name matching task, both for accuracy and response time, but not in the name recall task. The effects of language are not very consistent, they appear in the name recall task—but, only for the Levenshtein distance measure—and for response time in the name matching task. Importantly, there was consistently no interaction between language and emotionality.
The motivation for this study was to shed light on a highly common phenomenon: learning new content in a foreign language. This occurs for example in CLIL-based school environments or in university programs taught in language other than the official one in the region. In particular, the focus was on using naturalistic materials and tests and to understand whether learning in a foreign language differs from learning in a native language. We focused on the effects of emotional context—here defined as positive valence and high arousal—during the process of learning new words (in this case, pseudowords) attached to new content, here represented by novel objects.
Importantly, all of the strings that were learned had no prior associations for the participants, as they corresponded to concepts invented for the purpose of this study, and they were equally likely in both languages. This means that any emotionality effects were caused directly by the semantic context created in the experiment, removing the possible impact of a priori developed language-emotion connections. We focused specifically on the acquisition of lexical forms and their association with an object image. Furthermore, these new words or strings were embedded in descriptions or definitions of the objects they represented, much like how we normally acquire new information.
We hypothesized that new lexical items would be more difficult to learn in a foreign language—even though the target stimuli were exactly the same between language conditions—and that positive emotionality would help compensate for this difficulty. This was addressed through two experiments in which participants carried out a name matching task between the images of the objects they studied with their names (among other tasks). This task tested the associations that were created during the learning phase. We focused on this task because it was language independent, and thus exactly the same for both language conditions and not as taxing for participants. In addition, participants did a lexical decision task (old/new recognition task) in Experiment 1 and a name recall task in both experiments.
In Experiment 1, we found foreign language effects in the three main tasks: name matching accuracy, old/new response time, and name recall (both accuracy and Levenshtein distance). The critical task—the name matching task—showed a language effect on accuracy in Experiment 1, when there was more information to be remembered. This effect was present in response times—but not accuracy—in Experiment 2, when the task was made easier and the to-be-remembered information was reduced. These results align with prior studies that found improved performance in the native language when different pieces of information need to be associated , but the diverging findings depending on the tasks and measures used also provide a possible explanation as to why other studies do not find the same language effects [3,5,9,59]. Although there were several differences between the two versions of the task, most differences related to making the task less demanding, for example by reducing the number of items to be remembered, the retention period, the number of tasks, and the number of answer options. Given that the language effects remained in the Levenshtein measure in recall—with recall being the most demanding of the tasks—, one could interpret that the difficulty of the task overall affected accuracy disproportionately in the foreign language. It is worth noting that, although the two groups in Experiment 1 were very well matched, Experiment 2 was within subjects, which could have also influenced the results. Nevertheless, given that the only task in Experiment 2 that showed a language effect was the most difficult one—namely, recall—, this would support our idea that difficulty drove the effect. In order to conclude this with certainty, this hypothesis needs to be explored further in future studies.
With respect to emotionality, we found that positive semantic contexts aided recognition memory and, in particular, in the name matching task. These data fit well with preceding studies showing that learning performance is better when the information they are tested on is positive . These effects of positive semantic context were present in both experiments, but absent in the recall tasks—possibly due to a lack of power in this task. In this case, unlike with the effect of foreign language, it was not modulated by the amount of information that had to be learned. This means that it is a more stable effect that is robust to some manipulations of other variables—such as language (as we discuss below) or amount of information. This suggests that this manipulation—positive valence and arousal of the semantic context—can be used in several different circumstances to increase familiarity with learned items. It should be noted though, that these results do not inform our understanding of the effects of negative valence on word learning and are not definitive with respect to recall.
When analyzing the size of the observed effects, the partial eta squared of the effects found in this study are considered to be medium to large effects . Depending on the task and experiment, 7 to 46% of the variance was explained by emotionality (see S1 Appendix). For accuracy in the name matching task, this was 13% in Experiment 1 and 34% in Experiment 2—with the latter more than doubling the 14% benchmark of a large effect. For response times in that task, effect sizes were 15% for Experiment 1 and 46% in Experiment 2, both considered large effects. In practical terms, emotionality lead to a 5 (Experiment 1) and 7% (Experiment 2) difference in accuracy, which equated to roughly remembering 2 to 3 items more. Although a relatively discrete boost in performance, this could be the difference between passing and failing an exam for a student that is struggling in a class and could become more meaningful as it accumulates. With respect to response times, there was a 165 ms (Experiment 1) and 98 ms (Experiment 2) difference for this task. In more general terms, this study suggests that emotionally loaded semantic contexts—not just emotional content—conveying new pieces of information can improve short term recognition.
With respect to the relationship between emotionality and foreign language, we found no interaction in either accuracy or response times in the name matching task—or in any of the other tasks or measures—in either experiment. This means that, although learning new words and concepts may be more demanding in a foreign language—depending on the task—, the effects of emotionality remain constant across languages. Similar results have been found by other studies in the context of memory for known words [19–23], but the current study is the first to demonstrate this in the acquisition of new words and concepts as well as using more naturalistic stimuli.
These results may seem surprising, as the foreign language effect suggests that people are less emotional—and thus less affected by emotional biases—in a foreign language . Nevertheless, our results do not contradict this idea, but rather expand on possible causes for the effect as they suggest that these biases are not present during acquisition. In other words, in the first encounter with a word or concept, the emotional context around it can have the same effect in the foreign as in the native language. Therefore, the later observed effects might relate to the way in which this information or vocabulary was acquired, rather than with the use of a foreign language itself. In addition, our results have ramifications for our understanding of how we learn a foreign language. In particular, they support the idea that the reduced emotionality found in foreign language contexts has its origins in the way we learn a foreign language—namely, in a classroom and not in our more informal social environment—, rather than being a characteristic of how we process languages that are not our native one . Similar claims have in fact been made by other authors . Although practical applications of this research can be easily extracted, this is not our main goal. Therefore, we only provide the suggestion of looking into manipulating emotionality and semantic context as a way to improve learning, but do not provide practical advice for teachers.
It is worth noting that our study focuses on acquisition for the lexical item in the short term. Although we did not test indicators of word acquisition beyond the item itself and its association with a visual object, we provide evidence on the essential first steps towards word learning. Future research should expand our results to other areas of word acquisition such as semantics and word use. Despite the limitations of the experiments, this study provides ecological validity insofar as participants were taught new words in the same way they might learn them naturally in multilingual classroom contexts: by reading a textbook or asking for the definition of the word. Nevertheless, further research using even more naturalistic situations—e.g., a classroom setting, but maintaining the same level of experimental control—, is necessary in order to understand learning in these situations in more depth.
The current results get us one step closer to understanding the intricacies of language learning by non-balanced bilinguals and the complex relationship between emotionality and each of the languages they know. Our results point to an increased importance of chunking information—or splitting it up into multiple sessions—in the foreign language with respect to the native language. We highlight the importance of emotional semantic context when learning new words, equally in both the native and foreign language. And, finally, although not our main goal, this provides a practical teaching tool, where simply by adding a positive valence to texts, teachers might be able to help students improve their memory and learning of new vocabulary.
S1 Fig. Example trial from norming task.
Each description was given followed by the questions “How positive or negative is the description of this object?” [¿Cómo de positiva o negativa es la descripción de este objeto?] and “How intense is the description of this object?” [¿Cómo de intensa es la descripción de este objeto?] with a clarification underneath saying “Please select how intense the emotional activation you feel is, regardless of whether the description is positive or negative” [Por favor selecciona cómo de intensa es la activación emocional que sientes, más allá de que la descripción sea positiva o negativa]. After each question there was a dropdown menu. In the valence question the answer options were very negative, somewhat negative, neutral, somewhat positive, and very positive [muy negativa, algo negativa, neutral, algo positiva, muy positiva]. For the arousal question, the options were: not at all intense, a little intense, somewhat intense, very intense, extremely intense [para nada intensa, un poco intensa, bastante intensa, muy intensa, extremadamente intensa].
S1 Table. Pseudowords and their average bigram frequencies.
The following is a list of all the pseudowords used for the main tasks—learning and testing—by type, i.e. whether they were used as learn/test stimuli or as foils. Reported are the average token-type bigram frequencies for the word as reported by B-Pal  and N-Watch software .
We thank Julen Cristti© for the creation of part of the materials that were used in this set of experiments.
- 1. Kousta ST, Vigliocco G, Vinson DP, Andrews M, Del Campo E. The representation of abstract words: Why emotion matters. J Exp Psychol Gen. 2011;140(1):14–34. pmid:21171803
- 2. Kousta ST, Vinson DP, Vigliocco G. Emotion words, regardless of polarity, have a processing advantage over neutral words. Cognition. 2009;112(3):473–81. pmid:19591976
- 3. Admiraal W, Westhoff G, De Bot K. Evaluation of bilingual secondary education in the Netherlands: Students’ language proficiency in English 1. Educ Res Eval. 2006;12(1):75–93.
- 4. Aguilar M, Rodríguez R. Lecturer and student perceptions on CLIL at a Spanish university. Int J Biling Educ Biling. 2012;15(2):183–97.
- 5. Bergroth M. Immersion students in the matriculation examination three years after immersion. In: Björklund S, Mard-Miettinen K, Bergström M, Södergard M, editors. Exploring Dual-Focussed Education: Integrating language and content for individual and societal needs [Internet]. Vaasan yliopisto; 2006. p. 123–34. Available from: http://www.uva.fi/materiaali/pdf/isbn_952-476-149-1.pdf
- 6. Dalton-Puffer C. Discourse in Content and Language Integrated Learning (CLIL) Classrooms [Internet]. John Benjamins Publishing. John Benjamins Pub.; 2007. (Language Teachning; vol. 20). Available from: https://books.google.es/books?id=Y-deXDJJ9-gC
- 7. Jiménez Catalán RM, Ruiz de Zarobe Y. The receptive vocabulary of EFL learners in two instructional contexts: CLIL versus non-CLIL instruction. In: Content and language integrated learning: Evidence from research in Europe. 2009. p. 81–92.
- 8. Ouazizi K. The effects of CLIL education on the subject matter (Mathematics) and the target language (English). Lat Am J Content Lang Integr Learn. 2016;9(1):110–37.
- 9. Serra C. Assessing CLIL at primary school: A longitudinal study. Int J Biling Educ Biling. 2007;10(5):582–602.
- 10. Xanthou M. The impact of CLIL on L2 vocabulary development and content knowledge. English Teach. 2011;10(4):116–26.
- 11. Dallinger S, Jonkmann K, Hollm J, Fiege C. The effect of content and language integrated learning on students’ English and history competences—Killing two birds with one stone? Learn Instr [Internet]. 2016;41:23–31. Available from: http://dx.doi.org/10.1016/j.learninstruc.2015.09.003
- 12. Yang S-W. The study of variables influencing the effect of English medium instruction on academic content learning and English proficiency development. The Pennsylvania State University; 2014.
- 13. Dalton-Puffer C. Content-and-language integrated learning: From practice to principles? Annu Rev Appl Linguist. 2011;31:182–204.
- 14. Vander Beken H, Brysbaert M. Studying texts in a second language: The importance of test type. Bilingualism. 2018;21(5):1062–74.
- 15. Dirix N, Vander Beken H, De Bruyne E, Brysbaert M, Duyck W. Reading Text When Studying in a Second Language: An Eye-Tracking Study. Read Res Q. 2019;1–27.
- 16. Roussel S, Joulia D, Tricot A, Sweller J. Learning subject content through a foreign language should not ignore human cognitive architecture: A cognitive load theory approach. Learn Instr. 2017;52:69–79.
- 17. Bennion KA, Ford JH, Murray BD, Kensinger EA. Oversimplification in the study of emotional memory. J Int Neuropsychol Soc. 2013;19(9):953–61. pmid:24007950
- 18. Caldwell-Harris CL. Emotionality differences between a native and foreign language: Theoretical implications. Front Psychol. 2014;5(1055):1–4.
- 19. Ayçiçeǧi A, Harris CL. Bilinguals’ recall and recognition of emotion words. Cogn Emot [Internet]. 2004;18(7):977–87. Available from: : http://search.ebscohost.com/login.aspx?direct=true&db=psyh&AN=2004-19292-006&site=ehost-live&scope=site%5Cnhttp://firstname.lastname@example.org%5Cnhttp://email@example.com
- 20. Ayçiçeǧi-Dinn A, Caldwell-Harris CL. Emotion-memory effects in bilingual speakers: A levels-of-processing approach. Biling Lang Cogn [Internet]. 2009;12(3):291–303. Available from: http://journals.cambridge.org
- 21. Ferré P, Anglada-tort M, Guasch M. Processing of emotional words in bilinguals: Testing the effects of word concreteness, task type and language status. Second Lang Res. 2018;34(3):371–94.
- 22. Ferré P, Garcia T, Fraga I, Sanchez-Casas R, Molero M. Memory for emotional words in bilinguals: Do words have the same emotional intensity in the first and in the second language? Cogn Emot [Internet]. 2010;24(5):760–85. Available from: http://www.informaworld.com/openurl?genre=article&doi=10.1080/02699930902985779&magic=crossref%7C%7CD404A21C5BB053405B1A640AFFD44AE3
- 23. Ponari M, Rodriguez-Cuadrado S, Vinson D, Fox N, Costa A, Vigliocco G. Processing advantage for emotional words in bilingual speakers. Emotion. 2015;15(5):644–52. pmid:25893450
- 24. Anooshian LJ, Hertel PT. Emotionality in free recall: Language specificity in bilingual memory. Cogn Emot [Internet]. 1994;8(6):503–14. Available from: http://digitalcommons.trinity.edu/psych_faculty
- 25. Ferré P, Ventura D, Comesaña M, Fraga I. The role of emotionality in the acquisition of new concrete and abstract words. Front Psychol. 2015;6(976):1–10.
- 26. Altarriba J, Basnight-Brown DM. The Acquisition of Concrete, Abstract, and Emotion words in a Second Language. Int J Biling. 2011;16(4):446–52.
- 27. Costa A, Vives M-L, Corey JD. On language processing shaping decision making. Curr Dir Psychol Sci [Internet]. 2017;26(2):146–51. Available from: https://doi.org/10.1177/0963721416680263
- 28. Costa A, Foucart A, Arnon I, Aparici M, Apesteguia J. “Piensa” twice: On the foreign language effect in decision making. Cognition. 2014;130(2):236–54. pmid:24334107
- 29. Brase J, Mani N. Effects of learning context on the acquisition and processing of emotional words in bilinguals. Emotion. 2017;17(4):628–39. pmid:27991815
- 30. Frances C, De Bruin A, Dunãbeitia JA. The influence of emotional and foreign language context in content learning. Stud Second Lang Acquis. 2019;
- 31. Erk S, Kiefer M, Grothe J, Wunderlich AP, Spitzer M, Walter H. Emotional context modulates subsequent memory effect. Neuroimage. 2003;18(2):439–47. pmid:12595197
- 32. Dolcos F, Katsumi Y, Denkova E, Dolcos S. Factors Influencing Opposing Effects of Emotion on Cognition: A Review of Evidence from Research on Perception and Memory. In: Opris I, Casanova MF, editors. The Physics of the Mind and Brain Disorders Springer Series in Cognitive and Neural System, vol 11. Cham: Springer; 2017. p. 297–341.
- 33. Hurlemann R, Hawellek B, Matusch A, Kolsch H, Wollersen H, Madea B, et al. Noradrenergic Modulation of Emotion-Induced Forgetting. J Neurosci. 2005;25(27):6343–9. pmid:16000624
- 34. Iordan AD, Dolcos F. Brain Activity and Network Interactions Linked to Valence-Related Differences in the Impact of Emotional Distraction. Cereb Cortex. 2017;27(1):731–49. pmid:26543041
- 35. de Bruin A, Carreiras M, Duñabeitia JA. The BEST dataset of language proficiency. Front Psychol. 2017;8.
- 36. Kaufman AS, Kaufman NL. Kaufman Brief Intelligence Test. Encyclopedia of Special Education: A Reference for the Education of Children, Adolescents, and Adults with Disabilities and Other Exceptional Individuals. En. 2013.
- 37. Lemhöfer K, Broersma M. Introducing LexTALE: A quick and valid lexical test for advanced learners of English. Behav Res Methods. 2012;44(2):325–43. pmid:21898159
- 38. Izura C, Cuetos F, Brysbaert M. Lextale-Esp: A test to rapidly and efficiently assess the Spanish vocabulary size. Psicológica. 2014;35:49–66.
- 39. Antón E, Thierry G, Duñabeitia JA. Mixing Languages during Learning? Testing the One Subject—One Language Rule. PLoS One. 2015;10(6):1–45.
- 40. Davis CJ, Perea M. BuscaPalabras: A Program for deriving orthographic and phonological neighborhood statistics and other psycholinguistic indices in Spanish. Behav Res Methods. 2005;37(4):665–71. pmid:16629300
- 41. Davis CJ. N-Watch: A program for deriving neighborhood size and other psycholinguistic statistics. Behav Res Methods. 2005;37(1):65–70. pmid:16097345
- 42. Duchon A, Perea M, Sebastián-Gallés N, Martí A, Carreiras M. EsPal: One-Stop Shopping for Spanish Word Properties. Behav Res Methods. 2013;45(4):1246–58. pmid:23468181
- 43. Lund K, Burgess C. Producing High-Dimensional Semantic Space from Lexical Co-Occurence. Behav Res Methods, Instruments, Comput. 1996;28(2):203–8.
- 44. Balota D, Yap MJ, Cortese MJ, Hutchinson KA, Kessler B, Loftis B, et al. The English Lexicon Project. Behav Res Methods. 2007;39(3):445–59. pmid:17958156
- 45. Bradley MM, Lang PJ. Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings. 1999.
- 46. Milner B. Interhemispheric Differences in the Localization of Psychological Processes in Man. Br Med Bull. 1971;27(3):272–7. pmid:4937273
- 47. Cohen RL. The Effects of Interference Tasks on Recency in the Free Recall of Action Events. Psychol Res. 1989;51(4):176–80. pmid:2616696
- 48. Postman L, Phillips LW. Short-Term Temporal Changes in Free Recall. Q J Exp Psychol [Internet]. 1965;17(2):132–8. Available from: http://www.tandfonline.com/doi/abs/10.1080/17470216508416422
- 49. Atkinson R. C, Shiffrin RM. The Control of Short-Term Memory. Sci Am [Internet]. 1971;225(2):1–43. Available from: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.398.2237&rep=rep1&type=pdf
- 50. Bates D, Maechler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1–48.
- 51. Kuznetsova A, Brockhoff PB, Christensen RHB. lmerTest package: Tests in linear mixed effects models. J Stat Softw. 2017;82(13):1–26.
- 52. Barr DJ. Random effects structure for testing interactions in linear mixed-effects models. Front Psychol. 2013;4(328):3–4.
- 53. JASP Team. JASP (Version 0.9). 2018.
- 54. Kruskal JB. An Overview of Sequence Comparison: Time Warps, String Edits, and Macromolecules. SIAM Rev [Internet]. 1983;25(2):201–37. Available from: http://www.jstor.org/stable/info/2030214
- 55. Casaponsa A, Antón E, Pérez A, Duñabeitia JA. Foreign Language Comprehension Achievement: Insights from the Cognate Facilitation Effect. Front Psychol. 2015;
- 56. Duñabeitia JA, Ivaz L, Casaponsa A. Developmental Changes Associated with Cross-language Similarity in Bilingual Children. J Cogn Psychol [Internet]. 2016 Jan 2;28(1):16–31. Available from: http://dx.doi.org/10.1080/20445911.2015.1086773
- 57. Duñabeitia JA, Dimitropoulou M, Morris J, Diependaele K. The Role of Form in Morphological Priming: Evidence from Bilinguals. Lang Cogn Process. 2013;28(7):967–87.
- 58. Nott CR, Lambert WE. Free recall of bilinguals. J Verbal Learning Verbal Behav [Internet]. 1968;7(6):1065–71. Available from: https://search.proquest.com/docview/1297342710?accountid=17248
- 59. Stohler U. The acquisition of knowledge in bilingual learning: An empirical study on the role of language in content learning. VIEWS-Vienna english Work Pap [Internet]. 2006;15(3):41–6. Available from: https://boris.unibe.ch/18531/1/views15_3_Stohler.pdf
- 60. Richardson JTE. Eta squared and partial eta squared as measures of effect size in educational research. Educ Res Rev [Internet]. 2011;6(2):135–47. Available from: http://dx.doi.org/10.1016/j.edurev.2010.12.001
- 61. Keysar B, Hayakawa SL, An SG. The foreign-language effect: Thinking in a foreign tongue reduces decision biases. Psychol Sci [Internet]. 2012;23(6):661–8. Available from: http://pss.sagepub.com/ pmid:22517192
- 62. Duñabeitia JA. Emotional Diglossia in Multilingual Classroom Environments: A Proposal. Psychol Cogn Sci Open J. 2017;3(3):74–8.