Complexity, Training Paradigm Design, and the Contribution of Memory Subsystems to Grammar Learning

Mark Antoniou; Marc Ettlinger; Patrick C. M. Wong

doi:10.1371/journal.pone.0158812

Abstract

Although there is variability in nonnative grammar learning outcomes, the contributions of training paradigm design and memory subsystems are not well understood. To examine this, we presented learners with an artificial grammar that formed words via simple and complex morphophonological rules. Across three experiments, we manipulated training paradigm design and measured subjects' declarative, procedural, and working memory subsystems. Experiment 1 demonstrated that passive, exposure-based training boosted learning of both simple and complex grammatical rules, relative to no training. Additionally, procedural memory correlated with simple rule learning, whereas declarative memory correlated with complex rule learning. Experiment 2 showed that presenting corrective feedback during the test phase did not improve learning. Experiment 3 revealed that structuring the order of training so that subjects are first exposed to the simple rule and then the complex improved learning. The cumulative findings shed light on the contributions of grammatical complexity, training paradigm design, and domain-general memory subsystems in determining grammar learning success.

Citation: Antoniou M, Ettlinger M, Wong PCM (2016) Complexity, Training Paradigm Design, and the Contribution of Memory Subsystems to Grammar Learning. PLoS ONE 11(7): e0158812. https://doi.org/10.1371/journal.pone.0158812

Editor: Etsuro Ito, Waseda University, JAPAN

Received: March 16, 2016; Accepted: June 22, 2016; Published: July 8, 2016

Copyright: © 2016 Antoniou et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: This research was supported by the US National Institutes of Health grants R01DC008333 (P.W.) and R01DC013315 (B.C. & P.W.), the Hong Kong University Grants Committee (GRF 477513 and 14117514) (P.W.), Dr. Stanley Ho Medical Development Foundation (P.W.), VA RR&D Grant 1IK2RX000974 (M.E.), and Australian Research Council Discovery Early Career Research Award DE150101053 (M.A).

Competing interests: The authors have declared that no competing interests exist.

Introduction

Second language (L2) learning is characterized by great variability in learning outcomes, and there is increasing interest in the contribution made by memory subsystems to L2 learning success [1,2]. Research has shed light on the contribution made by memory subsystems in the learning of consonants [3], vowels [4], lexical tones [5], vocabulary [6], and overall language ability [7], but much remains unknown about their role in grammar learning (but see [8–15]). This is somewhat surprising because mastery of grammar is a necessary component of L2 learning that distinguishes proficient from non-proficient L2 speakers [16], and poses particular difficulty for L2 learners [17]. Furthermore, while there is a growing body of work that examines the effectiveness of different training methods, there have been few systematic examinations of the relationship between training methods and memory subsystems [18]. The purpose of the present study is to conduct an exploratory analysis of the role of memory subsystems in grammar learning. We seek to determine if known factors such as the ordering of the complexity of training and the presentation of trial-by-trial feedback contribute to grammar learning success, and how these paradigms differentially recruit memory subsystems. Our ultimate goal is to lay the foundations for future studies to proactively tailor training based on individual cognitive profiles with the aim of maximizing grammar learning outcomes.

Grammar refers to the rules governing how linguistic units may combine in a given language, including how phonemes are combined (phonology), how words are created (morphology), and how words combine to form sentences (syntax). Non-native learners often experience difficulty in learning these grammatical rules [19]. Grammar learning depends on the process of abstracting patterns from input [20], but to date, little work has been conducted on variability in grammar learning for L2 learners. There is, however, evidence linking domain general cognitive abilities (e.g., auditory working memory, and declarative and procedural memory) and their associated brain structures to grammar learning. Thus, we might expect L2 grammar learning to vary across individuals as a function of these memory subsystems.

Working memory capacity has long been associated with L2 learning success. Baddeley et al. [21] define auditory working memory as a phonological loop, or buffer, that mediates between auditory input and complex higher-order learning and language abilities. Greater working memory availability would improve language learning by allowing the learner to incorporate a larger amount of input into the learning process [22]. Working memory may also facilitate learning by allowing relevant information to be actively attended to during processing [23]. Empirical studies support the importance of working memory in language learning (for reviews see [2,7,24]). Working memory has been shown to play an important role in a number of language skills, including reading comprehension [25], sentence comprehension [26], resolving lexical ambiguity [27,28], modifying output in the L2 [29], acquiring L2 vocabulary knowledge [30,31], and grammar learning [32].

Additionally, Ullman's [33,34] Declarative/Procedural model specifies the roles of declarative and procedural memory in language learning and use. Procedural memory underlies both motor and cognitive skill and habit learning, and is considered to be a type of implicit memory, associated with acquiring sequences [35]. Declarative memory comprises knowledge about facts and events related to the world or knowledge of events that one has experienced [36]. Simple grammatical processes are sequence-oriented, and therefore procedural, in nature. Thus, it would be reasonable to expect procedural memory to correlate with grammar learning. In contrast, the lexicon relies on declarative memory, which is specialized for arbitrary associations. In Ullman's model, the role of working memory is to allow for maintenance and structuring of rule-governed patterns (in service of procedural memory), and to manipulate selected lexical items (in service of declarative memory). However, when it comes to L2 learning, there is evidence that the relationships between the different memory systems and language become more complex. The L2 lexicon will rely on declarative memory (as was the case for the native language), but the L2 grammar, unlike the grammar of the native language, will draw on both declarative and procedural memory. At low L2 grammar proficiency, there is a greater reliance on declarative memory, whereas as L2 proficiency increases, the L2 grammar will rely more on procedural memory (as does the native language grammar), and working memory demands will be reduced. From a cognitive perspective, declarative memory will influence the initial stages of L2 grammar learning and procedural memory may determine ultimate attainment.

There is a growing number of studies that are investigating the contribution of declarative and procedural memory subsystems to grammar learning success [37,38]. Morgan-Short et al. [9] familiarized subjects with an artificial language and then had them complete a grammaticality judgement task in which some sentences violated word order. At early stages of acquisition (after two training sessions), grammar learning correlated with declarative, but not procedural, memory. At later stages of acquisition (after six sessions), grammar learning correlated with procedural memory, and no longer with declarative. The findings lend support to the Declarative/Procedural view [33,34] that declarative and procedural memory predict L2 grammatical development at the early and late stages of acquisition, respectively. Recently, Ettlinger et al. [8] examined the acquisition of a morphophonological grammar and how this relates to procedural, declarative, and working memory. There was considerable variation in grammar learning outcomes across subjects, but importantly, this was accounted for by declarative and procedural memory, whereas working memory did not correlate. A role for the domain-general memory system in language learning is further supported by neuroimaging evidence. Neuroimaging and lesion studies have identified a shared neural substrate consisting of a frontostriatal network incorporating Broca's area and the basal ganglia, linking procedural memory and grammar [39–41].

Given the posited roles of procedural, declarative, and working memory in L2 grammar learning, these domain-general cognitive systems may be of particular importance when assigning individuals to different types of training methods. External factors, such as training parameters, may differentially recruit memory systems. For example, one interesting contribution of Ettlinger et al. [8] is the finding that learners differed in their grammar learning success depending on whether the structure being learned was grammatically simple or complex. Learners encountered more difficulty in learning the complex morphophonological pattern than the simple one. However, acquisition of the complex pattern entailed acquisition of the simple pattern for some learners. This finding was not consistent with the well-established view that simple linguistic structures should be learned first before moving on to more advanced, complex structures [42]. For example, scaffolding in learning has been a fundamental teaching principle, according to which students incrementally extend their abilities and level of understanding [43–45]. Compatibly, connectionist modelling of language development has advocated that starting small will lead to superior learning [46–48], and this has also been demonstrated for several aspects of grammar learning [49–53]. The Ettlinger et al. [8] finding is, however, consistent with research on childhood language development and treatment of communication disorders which suggests that use of complex language material in training leads to improved language learning and treatment outcomes. For example, Gierut [54] has shown that learning phonologically complex segments leads to greater improvements and generalization in children with phonological delays. Specifically, training children to produce more complex (i.e., marked) structures such as affricates results in learning of simpler (i.e., unmarked) but related structures such as fricatives. 
Similarly, training clusters generalizes to singletons, and training clusters with greater sonority (e.g., /kw/) affects learning of clusters with less sonority (e.g., /bl/). Importantly, for all of the above cases, training of simpler segments did not result in gains for complex segments. Kiran [55] provides converging evidence from the lexical-semantic domain. Training of complex, atypical items within a semantic category (e.g., a penguin as an example of the category birds) boosted access of simpler, typical items in that category, whereas training of simpler items did not yield benefits for complex items. Thompson and Shapiro [56] highlight the benefits of training complex sentence structures in agrammatic aphasics. Training production and comprehension of complex sentences generalized to simpler sentence structures with the same movement, whereas training simple sentences did not generalize to complex ones. The above findings converge to suggest that greater language learning outcomes may be achieved when complex language material is used in training, rather than when simple material is used. Thus, we might expect similar benefits to be observed in the learning of grammar.

An additional factor that contributes to learning is the presentation of corrective feedback. Feedback refers to information provided to learners that verifies the occurrence of learning [57]. Learning with trial-by-trial feedback is considered to be easier than without, and improves the speed of learning as well as overall performance [58]. However, for some tasks, feedback leads to a decrement in performance [59]. Moreover, neuroimaging studies have shown that the frontostriatal network plays a crucial role in feedback-based learning [60,61]. Given that the frontostriatal network is linked to procedural memory [62], we might expect subjects with greater procedural memory availability to benefit most from feedback. It has also been suggested that presenting feedback taxes working memory [63], and therefore, learners with greater working memory capacity will benefit most from feedback [64,65]. One possible explanation is that working memory affects what information learners attend to when feedback is presented [7,29]. With regard to language training specifically, presenting feedback leads to better learning of speech sounds [66], and is generally considered to improve L2 grammar learning outcomes [67]. It is not clear how feedback might interact with the complexity of the training material, although individuals with greater procedural memory and working memory should be most likely to benefit.

In sum, past research suggests that there is a link between memory subsystems (working, declarative, and procedural) and L2 grammar learning outcomes, although the nature and mechanisms of these relationships are still unclear. Crucially, there is no clear way to translate the reviewed findings on the relationship between working, declarative, and procedural memory and L2 learning into improved second language pedagogy. To address this, in the present study, we examined how working, declarative, and procedural memory subsystems relate to different training methods of grammar learning. Specifically, we manipulated whether feedback was incorporated, how it was incorporated, and whether item ordering affects acquisition, to better understand the role that memory subsystems play in L2 grammar learning. The results may help practitioners to identify in advance which training methods work best with which language learners, based on pretesting of these memory subsystems. That is, beyond extant research that predicts L2 learning ability based on working memory, we may be able to identify which language learners will learn best with different language learning methods incorporating different types of feedback and item ordering.

We exposed subjects to the same artificial language used in Ettlinger et al. [8], containing both simple and complex grammatical processes of word-formation. Half of the words were formed using a simple grammatical pattern, and the other half were formed via a complex pattern. Following training, subjects were asked to generalize these newly learned grammatical processes to new words. The training conditions were manipulated across three experiments. In Experiment 1, the contribution of training was established by comparing grammar learning following passive, exposure-based training and test versus a condition in which training was not administered and subjects were required to learn during the test phase based on feedback alone. In Experiment 2, the contribution of feedback was assessed by comparing learning when feedback was or was not provided during test. In Experiment 3, we examined the contribution of the ordering of training by first presenting either the simple or complex grammar to assess the robustness of generalization and whether learning the complex pattern automatically entails the simple one.

Experiment 1: The Influence of Passive, Exposure-Based Training on Grammar Learning

The present series of experiments seeks to address key issues in cognitive science concerning the role of domain-general memory subsystems in grammar learning. Before examining the contribution of feedback (Experiment 2) or ordering of complexity in training (Experiment 3), it is first necessary to demonstrate that memory subsystems play a role in the learning processes that occur during training.

For instance, it is well known in the field of artificial grammar learning that it is possible for subjects to perform above chance without having learned the grammatical rules during training, either because of biases in the stimuli or because of potential learning and reasoning strategies used during test. To address this, we asked a test-only control group to complete the same artificial language tests but without having first completed training. Crucially, both groups completed a battery of cognitive tests to determine if domain-general memory subsystems relate to the initial training and not the test-specific strategies and reasoning in which subjects may potentially engage.

The aim of Experiment 1 was to test the contribution of passive, exposure-based training to grammar learning outcomes. It was hypothesized that learning outcomes would be greater when training preceded test relative to the test-alone condition (with no training). Further, if, as we expect, memory abilities correlate with learning of the simple and complex grammatical rules, then these relationships should be diminished in the test-alone controls.

Method

Ethics Statement.

This study was performed in strict accordance with an approved protocol. Subjects provided informed written consent in accordance with the Institutional Review Board and all experimental procedures were approved by the Northwestern University Institutional Review Board.

Subjects.

In total across Experiments 1–3, one hundred and twenty-two native English speakers who were students at Northwestern University took part in the study. Some subjects reported that they possessed experience with another language (Arabic n = 1, Bulgarian n = 1, French n = 8, German n = 2, Hebrew n = 3, Mandarin n = 3, and Spanish n = 26), but none considered themselves to be native speakers (self-ratings ≤ 4 out of 7), and none of these languages have structures similar to those present in our study. Subjects gave informed consent and were monetarily compensated for their time. All were free of neurological deficits and passed a pure tone audiological screening at 25 dB HL at 500, 1,000, 2,000, and 4,000 Hz.

In Experiment 1, to assess the contribution of passive, exposure-based training to grammar learning, 36 subjects completed the training followed by the test phase with feedback (baseline group) (M_age = 23.1; SD = 2.3; 21 females), and 25 subjects completed only the test phase with feedback, but no training (test-only group) (M_age = 21.9; SD = 2.3; 17 females). As is shown in Table 1, the groups were matched for declarative memory, procedural memory, and working memory.

Table 1. Group sizes, mean ages, and memory measures for baseline and test-only groups in Experiment 1.

Bottom row shows p-values from t-tests confirming that the groups were matched on each memory measure.

https://doi.org/10.1371/journal.pone.0158812.t001

Stimulus materials.

The artificial language comprised 30 noun-stems and two affixes that combined to form 120 words. The nouns were consonant-vowel-consonant monosyllables and represented 30 different animals (e.g., [pag] represented dog). The prefix [ka-] signalled the diminutive (e.g., doggy), and the suffix [-il] signalled the plural (e.g., dogs). The phonemic inventory consisted of American English consonants and the three vowels [a, e, i]. Each vowel was used in 10 noun-stems (e.g., pag, zek, kij represented dog, sheep, rooster, respectively).

The grammar of the artificial language had two types of word formation rules, one simple and the other complex, and was modeled after grammar rules in a natural language, Shimakonde. In Shimakonde, plural and diminutive affixes combine with noun stems to create new words (Liphola 2001; Ettlinger, 2008). In our artificial grammar, the simple rule involved concatenating noun-stems with the suffix [-il] and/or prefix [ka-] (e.g., pag, pagil, kapag represent dog, dogs, doggy, respectively). The complex rule required concatenation and changing the vowels of both the stem and the affix, reflecting two phonological processes that do not occur in English. The first process is vowel harmony, which changes the vowel of the suffix to match that in the noun-stem (e.g., the plural of pag is pagil (simple), but the plural of zek is zekel). The second process is reduction, which changes noun-stem vowels to match the prefix [ka-] (e.g., the diminutive of zek is kazak). These two processes may combine to form complex words (e.g., kazakel meaning many little sheep) and these contrast with simple words (e.g., kapagil meaning many little doggies).
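The word-formation rules described above can be sketched in a few lines of Python. This is only an illustration built from the examples given in the text, not the procedure used to construct the stimuli. The function name `inflect` is hypothetical, and the sketch assumes that only stems containing [e] undergo the complex rule. The behavior of [i] stems is not described here, so treating them as simple concatenation is an assumption.

```python
PREFIX = "ka"   # diminutive prefix [ka-]
SUFFIX = "il"   # default plural suffix [-il]


def inflect(stem, diminutive=False, plural=False):
    """Return the surface form of a CVC stem (hypothetical helper)."""
    onset, vowel, coda = stem[0], stem[1], stem[2]

    # Assumption: only [e] stems are complex; all other stems concatenate.
    is_complex = vowel == "e"

    suffix = SUFFIX
    surface_vowel = vowel
    if is_complex:
        if plural:
            # Vowel harmony: the suffix vowel copies the underlying stem vowel.
            suffix = vowel + "l"
        if diminutive:
            # Reduction: the stem vowel changes to match the prefix vowel [a].
            surface_vowel = "a"

    word = onset + surface_vowel + coda
    if diminutive:
        word = PREFIX + word
    if plural:
        word = word + suffix
    return word


for stem in ("pag", "zek"):
    forms = [
        inflect(stem),
        inflect(stem, diminutive=True),
        inflect(stem, plural=True),
        inflect(stem, diminutive=True, plural=True),
    ]
    print(stem, forms)
```

Note that in the combined form (kazakel) the suffix harmonizes with the underlying stem vowel [e] even though that vowel surfaces as [a]. This interaction is what makes the complex pattern harder than simple concatenation.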

For the experiment, each word was paired with a picture of a common animal (for a full stimulus list see [8]). A native English speaker produced the words at a normal rate with English prosody and phonology. Recordings were made using a Shure SM58 microphone and were digitized in Praat (16-bit, 22.05 kHz).

Procedure.

Subjects were instructed that they would be tested on a new language. No instructions were given concerning the rules of the language or that there were any rules to learn. Subjects who underwent training were presented with picture-spoken word pairings (e.g., a picture of a dog was shown and [pag] was heard). Twelve nouns were presented in all four forms in the following order: singular, diminutive, plural, and diminutive plural. Half of the nouns were simple, and the other half were complex, and their presentation order was randomized. Each exposure block was repeated four times, resulting in 192 training trials in total (12 nouns × 4 forms × 4 repetitions = 192 exposures). Each noun was presented onscreen as a picture for 3 s. The spoken word naming the picture was played 500 ms after the picture had appeared. At the end of the 3 s exposure, the picture disappeared and the screen remained blank for 500 ms before the next noun was presented. Upon completion of training, subjects were given a short break before moving on to the test phase.
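The structure of the training phase can be illustrated with a short Python sketch. It is not the original E-Prime script. The function name, the placeholder noun labels, and the choice to shuffle the noun order separately within each block are assumptions made for illustration. The trial counts and timings are taken from the description above.

```python
import random

# Fixed within-noun order of forms, as described in the procedure.
FORMS = ("singular", "diminutive", "plural", "diminutive plural")

PICTURE_SECONDS = 3.0   # picture remains onscreen for 3 s
BLANK_SECONDS = 0.5     # blank screen before the next item


def build_training_schedule(nouns, repetitions=4, seed=None):
    """Return a list of (block, noun, form) training trials (hypothetical helper)."""
    rng = random.Random(seed)
    schedule = []
    for block in range(1, repetitions + 1):
        order = list(nouns)
        rng.shuffle(order)  # assumption: noun order is reshuffled in every block
        for noun in order:
            for form in FORMS:
                schedule.append((block, noun, form))
    return schedule


# Placeholder labels standing in for the 12 training nouns.
nouns = [f"noun{i:02d}" for i in range(1, 13)]
schedule = build_training_schedule(nouns, seed=0)

total_seconds = len(schedule) * (PICTURE_SECONDS + BLANK_SECONDS)
print(len(schedule), "trials,", total_seconds / 60, "minutes")
```

Under these timings, the 192 exposures amount to roughly 11 minutes of passive training, excluding any breaks.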

Subjects were tested on their ability to apply the newly learned grammatical rules to novel words in a modified wug test [68]. A wug test requires subjects to modify a previously unencountered word from its singular form (e.g., wug) to produce it in a different form (e.g., plural form = wugs). In our version of the wug test, subjects were exposed to a new picture (out of 18 unencountered nouns) for 1.5 s and 500 ms after the new picture appeared, the spoken word naming the singular form of the picture was played. The picture then disappeared and the screen remained blank for 1 s. Subjects were then presented with a picture of the same noun but in a different form (either plural, diminutive, or diminutive plural). For example, if the subject had been exposed to a lion, they might now be presented with a picture of many small lions. Two spoken words were provided as response options and subjects were required to select the correct word by pressing one of two buttons on a response box within a 5 s response time limit. The plural, diminutive, and diminutive plural forms of each of the 18 unencountered nouns were tested in random order, with singular forms always used as the prompt. This resulted in a total of 54 test trials (18 nouns × 3 forms = 54 test trials). Foil responses were always the alternative affix (i.e., [-el] vs. [-il] for plurals, [a] vs. [e] for diminutives, and [-el] vs. [-il] for diminutive plurals). The order of the foil and correct words as response options 1 and 2 was counterbalanced. After a response, feedback was provided indicating whether they had made a correct or incorrect response. For incorrect trials, subjects were also played back the correct answer. Stimulus presentation was controlled by a computer running E-Prime (Psychology Software Tools, Pittsburgh, PA). Auditory stimuli were presented at about 72 dB SPL via Sennheiser HD 280 PRO headphones and visual stimuli were presented on a computer monitor. 
Responses were recorded using a low-latency button box.

Cognitive tests.

A cognitive test battery was administered to measure subjects' procedural, declarative and working memory subsystems. The tests were selected because they have been used in closely related past work [8,9].

Procedural memory was assessed using a computerized version of the Tower of London (TOL) test [69]. In the TOL, subjects are presented with an arrangement of balls stacked on pegs and are required to move balls one at a time in order to arrive at a goal arrangement. As the test progresses, trials become more difficult, such that the minimum number of moves required increases. The same start-goal sequences are repeated later in the test, and any improvement in performance is taken as a measure of procedural learning [70,71]. Each participant's score on the second repetition of sequences was normalized relative to the rest of the group.

Declarative memory was assessed using the Visual-Auditory Learning subtest of the Woodcock-Johnson III (WJ-III) Tests of Cognitive Abilities [72]. Subjects are required to learn to associate new visual symbols with orally presented words. Sequences of the newly learned symbols are then presented and the subject is required to 'read' them aloud. The score is standardized such that the population mean corresponds to a score of 100.

Working memory was assessed using the Auditory Working Memory subtest of the WJ-III. In this subtest, subjects are required to form categories of words and digits while retaining the appropriate sequence of the items. A series of intermixed digits and words are presented via audio recorded stimuli (e.g., “dog, 1, shoe, 8, 2, apple”). The subjects' task is to first repeat the words in sequential order (e.g., dog, shoe, apple) and then the digits in order (e.g., 1, 8, 2). The score is standardized such that the population mean corresponds to a score of 100.

Results and Discussion

We examined the contribution of training to grammar learning by conducting a 2 × (2) ANOVA with the between-subjects factor of group (baseline vs. test-only) and a within-subjects factor of grammar (simple vs. complex). The S1 File lists subject codes for Experiment 1, their grammar learning scores, and measures of declarative, procedural, and working memory. A main effect of group, F(1, 59) = 15.3, p < .001, ηp² = .206, revealed that overall the baseline group (61.2%) outperformed the test-only group (43.4%). A main effect of grammar, F(1, 59) = 23.3, p < .001, ηp² = .283, revealed that overall the simple grammar (64.3%) was easier to learn than the complex (42.5%). There was no significant interaction, p = .314. As shown in Fig 1, these findings suggest that passive exposure to a nonnative language boosts learning of both simple and complex grammatical rules. The only learning effect that did not exceed chance level was learning of the complex rule by the test-only group.
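The effect sizes reported for the two main effects are consistent with partial eta squared, which can be recovered from an F statistic and its degrees of freedom as F × df_effect / (F × df_effect + df_error). The following Python sketch is a reader-side check, not part of the original analysis, and the function name is hypothetical.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared computed from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# F values and degrees of freedom as reported for Experiment 1.
group_effect = partial_eta_squared(15.3, 1, 59)
grammar_effect = partial_eta_squared(23.3, 1, 59)

print(f"group: {group_effect:.3f}, grammar: {grammar_effect:.3f}")
```

Both values round to the reported effect sizes of .206 and .283.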

Fig 1. Grammar learning accuracy (%) of simple and complex grammar rules by subjects who underwent passive, exposure-based training + test (baseline group) versus those who immediately completed the test without first undergoing training (test-only group).

Error bars depict the standard error of the mean.

https://doi.org/10.1371/journal.pone.0158812.g001

To test if domain-general measures of memory subsystems are linked to grammar learning, a series of stepwise multiple regression analyses were conducted for both simple and complex grammar learning, examining the unique contributions made by measures of procedural, declarative, and working memory. As is shown in Table 2, for subjects in the baseline condition, a significant correlation was found between learning of the simple grammar and procedural memory, but not declarative memory or working memory, and a significant correlation was also found between learning of the complex grammar and declarative memory, but not procedural memory or working memory. The test-only group did not show the same patterns of correlations (see Fig 2), suggesting that these memory systems are recruited during learning that occurs prior to the test phase. For the test-only group, the only significant correlation was a negative correlation between declarative memory and learning of the simple grammar.

Fig 2.

Scatterplots depicting correlations for baseline group between (A) procedural memory and simple grammar learning, and (B) declarative memory and complex grammar learning. For test-only group, procedural and declarative memory did not correlate with simple or complex grammar learning (C and D, respectively).

https://doi.org/10.1371/journal.pone.0158812.g002

Table 2. Pearson correlations between the memory measures and learning of the simple and complex grammatical patterns by the baseline and test-only groups.

https://doi.org/10.1371/journal.pone.0158812.t002

For the baseline group, the multiple regressions revealed that for the simple grammar the only significant predictor was procedural memory, B = .437, β = .531, R² = .282, adjusted R² = .261, p = .001, and for the complex grammar the only significant predictor was declarative memory, B = .837, β = .669, R² = .448, adjusted R² = .431, p < .001. For the test-only group, declarative memory was the only significant predictor for the simple grammar, B = –.273, β = –.430, R² = .185, adjusted R² = .150, p = .032, and there were no significant predictors for the complex grammar. It appears that the complex rule was too difficult for memory effects to emerge in the test-only condition.
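The regression statistics above can be cross-checked with two standard identities: in a model with a single predictor, R² equals the squared standardized coefficient β, and adjusted R² equals 1 − (1 − R²)(n − 1)/(n − p − 1). The Python sketch below assumes that each final stepwise model retained exactly one predictor (p = 1) and that all subjects in each group entered the analysis (n = 36 for the baseline group, n = 25 for the test-only group). Both are assumptions made for this check, and the function name is hypothetical.

```python
def adjusted_r_squared(r_squared, n_subjects, n_predictors=1):
    """Adjusted R-squared for a linear model with an intercept."""
    return 1 - (1 - r_squared) * (n_subjects - 1) / (n_subjects - n_predictors - 1)


# (label, beta, R^2, reported adjusted R^2, assumed n)
models = [
    ("baseline, simple grammar", 0.531, 0.282, 0.261, 36),
    ("baseline, complex grammar", 0.669, 0.448, 0.431, 36),
    ("test-only, simple grammar", -0.430, 0.185, 0.150, 25),
]

for label, beta, r2, reported_adj, n in models:
    # With one predictor, beta squared should reproduce R^2.
    print(label, round(beta ** 2, 3), round(adjusted_r_squared(r2, n), 3), reported_adj)
```

Under these assumptions the recomputed values agree with the reported ones to within rounding of the published R² values.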

The findings of Experiment 1 demonstrate that (a) exposure-based learning prior to test leads to better grammar learning than test alone, and (b) procedural and declarative memory abilities correlate with learning of simple and complex grammatical patterns, respectively. Crucially, these relationships between memory subsystems and grammar learning were only observed for the baseline group and not the test-only group, which suggests that these memory systems are implicated in grammar learning as we hypothesized, and rules out alternative explanations, such as that subjects' improvements were due to their ability to detect biases in the stimuli or to learning and reasoning strategies used during test.

Experiment 2: The Contribution of Feedback to Grammar Learning

Having demonstrated that exposure-based training leads to superior learning of grammatical rules than learning that occurs solely due to feedback during test, we next examined the contribution of feedback during the test phase. Feedback is thought to improve grammar learning [67]. Thus, one might expect that the presentation of feedback during the test phase would boost learning outcomes relative to a condition in which no feedback is provided during testing. Furthermore, the frontostriatal network plays a crucial role in feedback-based learning [60,61] and has also been linked to procedural memory [62]. It has also been suggested that presenting feedback taxes working memory [63], and learners with greater working memory capacity will benefit most from feedback [64,65]. Therefore, those individuals with better procedural memory and working memory might benefit most from feedback. If the presentation of feedback during test does not significantly boost learning outcomes, this would suggest that most learning occurs during the exposure phase, rather than during test.