Figures
Abstract
We investigated interactions between morphological complexity and grammaticality on electrophysiological markers of grammatical processing during reading. Our goal was to determine whether morphological complexity and stimulus grammaticality have independent or additive effects on the P600 event-related potential component. Participants read sentences that were either well-formed or grammatically ill-formed, in which the critical word was either morphologically simple or complex. Results revealed no effects of complexity for well-formed stimuli, but the P600 amplitude was significantly larger for morphologically complex ungrammatical stimuli than for morphologically simple ungrammatical stimuli. These findings suggest that some previous work may have inadequately characterized factors related to reanalysis during morphosyntactic processing. Our results show that morphological complexity by itself does not elicit P600 effects. However, in ungrammatical circumstances, overt morphology provides a more robust and reliable cue to morphosyntactic relationships than null affixation.
Citation: Mehravari AS, Tanner D, Wampler EK, Valentine GD, Osterhout L (2015) Effects of Grammaticality and Morphological Complexity on the P600 Event-Related Potential Component. PLoS ONE 10(10): e0140850. https://doi.org/10.1371/journal.pone.0140850
Editor: Benjamin Xu, The National Institutes of Health, UNITED STATES
Received: May 14, 2015; Accepted: October 1, 2015; Published: October 21, 2015
Copyright: © 2015 Mehravari et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The authors received no funding specifically designated for this work. Portions of this research were conducted while ASM was supported by National Institutes of Health (http://www.nih.gov/) grant F31 DC013700 and National Institutes of Health grant T32 DC005361, while DT was supported by National Science Foundation (http://www.nsf.gov/) grant BCS-1349110 and National Science Foundation grant BCS-1431324, and while LO was supported by National Science Foundation grant BCS-1261501. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Prior research has identified several event-related brain potentials (ERPs) that are robustly sensitive to aspects of real-time language comprehension. One of these potentials, the P600 effect, is a positive-going wave that onsets at about 500 ms after presentation of the anomalous word and persists for several hundred ms. P600 effects are routinely elicited by anomalies involving grammatical agreement, tense, case, and verb subcategorization [1–9]. By contrast, the N400 component is sensitive to properties of words (such as lexicality and word frequency), a word’s predictability, and the “semantic fit” between a word and its context (see [10]). These ERP responses have been used to study a wide array of topics pertaining to real-time language comprehension, including the processing of dissociation between sentence structure and meaning (e.g., [3,11], cf. [12,13]) and neurocognitive changes that accompany increasing proficiency in a second-language [14–16].
However, fundamental questions remain about the range of stimulus and task properties that elicit and modulate these language-sensitive ERP effects. Here, we focus on a possible confounding between two stimulus variables, namely, ungrammaticality and morphological complexity. The stimuli from Osterhout and Nicol [3], a frequently-cited paper, make this issue clear. In that study, ERPs elicited by critical words in grammatically correct sentences like “The cat will eat the food” (critical word underlined) were compared to ERPs elicited by critical words in grammatically incorrect sentences like “The cat will *eating the food”. “Eating” is both grammatically incorrect and morphologically more complex than “eat”; “eating” contains the extra morpheme “-ing”. The Osterhout and Nicol paper is only one of many in which morphological complexity and grammaticality have been conflated (see [17] for discussion and review). Note that the confound in Osterhout and Nicol’s study and in similar studies is not a fatal flaw, in that the logic of the interpretations were valid regardless of whether such a confounding was present. Nonetheless, the confound does make it difficult to identify the exact cause of the difference in brain responses between conditions, i.e., whether the P600 resulted solely from the effects of ungrammaticality, or a combination of ungrammaticality plus complexity.
A possible remedy to this confound is to hold the morphological form of the critical word constant, while varying the pre-critical context (e.g., The cat was eating…versus The cat will *eating…). However, this remedy immediately runs into two problems. First, the pre-critical word varies across conditions. This can have important consequences for interpretation of ERPs time-locked to the critical word, because ERPs elicited by the pre-critical word can spill over and contaminate the signal elicited by the word of interest. This is especially a problem in designs where the pre-critical word differs across conditions in length or word category (see [17,18] for critical discussions of this issue). Second, these sorts of contextual manipulations can lead to predictability of a sentence’s grammaticality before the critical word is encountered. In the example provided above, assuming that other sentences in this grammatical/ungrammatical comparison were constructed similarly, the auxiliary verb will is fully predictive of the upcoming verb’s grammaticality. That is, over the course of the experiment, participants would be able to develop expectations about upcoming words’ grammaticality before they are actually encountered. Including a sufficient number of filler stimuli that mask the predictability of ungrammatical forms is one remedy to this issue. However, given the large number of stimuli needed for ERP research, this can quickly lead to experimental paradigms that are unduly long and taxing for participants.
One solution to these problems is to use balanced (factorial) designs that are fully crossed in terms of grammaticality and morphological complexity (e.g., have four conditions in which the simple and complex critical words are each used in both grammatical and ungrammatical sentences). In analyzing data from balanced designs, one would simply average across levels of complexity at the target word, and then test for the main effect of grammaticality. In many cases balanced designs are appropriate and adequate to overcome the problems associated with baseline confounds for context manipulations and complexity/predictability confounds for target manipulations (see [19] for an example). However, even in cases where ungrammatical stimuli at both levels of complexity elicit P600 effects, there may be theoretical reasons to investigate interactions between grammaticality and complexity. For example, affixation itself might have an influence on the brain response to words as they are encountered during reading, and the brain response to ungrammatical features might be influenced by whether or not the feature is expressed through overt inflectional morphology.
Numerous electrophysiological and electromagnetic studies have indicated that readers decompose a morphologically complex word into its component morphemes (e.g., [20–28]), though these studies typically used either single-word or priming paradigms, where decomposition is indexed by modulations of the N400 or other earlier components. Grammatical complexity at the sentence level typically impacts the P600 component, even in the absence of any type of anomaly [5,29]. What is unclear from this work is whether morphologically complex sentence-embedded words may also elicit a P600-like effect in the same way as words in syntactically complex sentences. That is, decomposition processes engaged during sentence reading may impact brain responses, leading to at least partially additive effects of grammaticality and complexity on the P600.
On the other hand, some sentence processing models predict that grammaticality will interact with morphological complexity. Such interactions are predicted by models that posit both active prediction of grammatical affixes, and memory searches for the “controller” of verb tense (the entity in the sentence that determines verb tense; [6]) once the affixed verb has been encountered in the sentence. It is now well established that comprehenders make active predictions about upcoming linguistic material at multiple levels, including word forms, semantics, morphology, and syntax [30–41]. For example, if a sentence contains a verb in the past progressive tense (e.g. The sheep were …), the reader could predict a subsequent verb marked with the–ing suffix (e.g., grazing). Conversely, if the sentence contained a modal verb (e.g., The sheep should …), the reader could predict an upcoming verb without overt inflection (e.g., graze). In cases where these predictions are met, no processing difficulty is predicted, and brain responses should not vary based on the morphological complexity of the target word. The newly encountered word can easily be integrated into the predicted linguistic representation, leading to a lack of P600 effects (see also [42]). However, in cases where bottom-up detected words mismatch with features generated by the prediction, the reader searches through his or her representation of the sentence in working memory to find the entity acting as the grammatical “controller” of verb morphology (in this case, the verb were) [8,43–52].
From this perspective, the ease of sentence comprehension varies as a function of how well the successive words in the sentence allow accurate predictions about upcoming words, and also the quality of cues provided by words triggering retrievals and the degree of overlap with potential targets. (see [43] for an explicit, computationally-implemented model of this). This prediction/memory search account therefore predicts interactions between grammaticality and complexity that might be manifested in P600 amplitude. For the ungrammatical sentence “The sheep were graze…”, the context should lead to a prediction that the verb will be in the present participle form (grazing). The absence of the expected grammatical affix should elicit a P600 effect. However, because the verb appeared in its base form with no inflectional morphology, there may be no subsequent search through working memory to identify the “controller” of the verb form (were; i.e., a lack of a search due to a lack of a triggering morpheme), or because the form graze is a base verb form lacking for features, it will not clash in features with any potential controllers held in working memory. By contrast, for the ungrammatical sentence “The sheep should grazing…”, the modal form of the verb leads to the prediction of the uninflected base verb form (graze). However, the presence of the unexpected grammatical affix -ing will also initiate the search through working memory for a controller of agreement (a verb in progressive tense). Because grazing is specified for tense and aspectual features, no such controlling verb can be found in the sentence. This sentence is therefore “anomalous” in two ways: by virtue of the ungrammaticality, and by virtue of an unsuccessful working-memory search for an appropriate licensing verb. This model therefore predicts an interaction between grammaticality and the morphological complexity, such that the P600 effect to critical words in the anomalous sentences will be larger for the inflected form of the anomalous verb.
Method
Participants
Participants were 14 native English speaking college students. All participants were all strongly right-handed, as assessed by an abridged version of the Edinburgh Handedness Inventory [53] and had normal or corrected-to-normal vision. One participant reported having become fluent in German at age 16; all other participants were only fluent in English. All participants were of age 18 or over. Specific age and gender information was not collected from 1 of the 14 subjects; of the 13 subjects from whom this information was collected, 4 were female (self-reported gender), and the average age was 19.6 years (range: 18–25 years). Participants provided written informed consent and received course extra credit for taking part in the study. All experimental procedures were approved by the University of Washington Institutional Review Board.
Materials
Stimuli were 120 sentence quadruplets in a fully crossed 2 (morphological complexity) by 2 (grammaticality) design. Sentences were either grammatically correct or contained a violation of constraints on tense morphology, and the critical word was either morphologically simple or morphologically complex, resulting in four versions of each sentence (see Table 1 for an example). Critical words were verbs in either the base form (e.g., graze) or the present participle form (e.g., grazing). 120 different verbs were used, and the verbs were chosen so that the average written word-form log frequency (provided in the CELEX2 database [54]) of the base form and present participle form of all 120 verbs was not significantly different (base form average frequency = 1.233, present participle form average frequency = 1.230, t = 0.067, p = 0.946). The word immediately preceding the critical word was either was/were or a modal verb (should/could/would/will/can/must/may). The four versions of each sentence were distributed across four experimental lists, such that each participant only saw one version of each sentence, and there were 30 sentences per condition in each list. Each list contained an additional 60 filler sentences, all of which were grammatically correct. In total, each list contained 180 sentences. The sentence order in each list was randomized, and lists were divided into 3 blocks of 60 sentences each. S1 Table lists all experimental and filler sentences.
Procedure
Participants were tested in a single session lasting no more than 120 minutes. Each participant was randomly assigned to one of the stimulus lists. During testing, participants were seated in a comfortable recliner in front of a CRT monitor. Participants were instructed to relax and minimize movements and eye blinks while silently reading each sentence. Each trial consisted of the following events: a blank screen for 1000 ms, followed by a fixation cross, followed by a stimulus sentence presented one word at a time. The fixation cross appeared on the screen for 500 ms followed by a 400 ms ISI. Each word of the sentence appeared on the screen for 300 ms followed by a 350 ms ISI. After the final word of the sentence, there was a 1000 ms blank screen, followed by a “yes/no” prompt. Participants were instructed to give a sentence acceptability judgment at the “yes/no” prompt, where “yes” was the response for sentences that were correct in all ways and “no” was the response for sentences that contained any sort of error. The “yes/no” prompt remained on the screen until participants responded “yes” or “no”; as soon as a response was given, presentation of the next sentence began. Participants were randomly assigned to use either their left or right hand for the “yes” response.
Data acquisition and analysis
Continuous EEG was recorded from 19 tin electrodes attached to an elastic cap (Electro-cap International) in accordance with the 10–20 system [55]. Eye movements and blinks were monitored by two electrodes, one placed beneath the left eye and one placed to the right of the right eye. Electrodes were referenced to an electrode placed over the left mastoid. EEG was also recorded from an electrode placed on the right mastoid to determine if there were differential experimental effects detectable on the mastoids. No such effects were found. EEG signals were amplified with a bandpass of 0.01–40 Hz (-3db cutoff) by an SAI bioamplifier system. ERP waveforms were filtered offline below 30 Hz. Impedances at scalp and mastoid electrodes were held below 5 kΩ and below 15 kΩ at eye electrodes.
Continuous analog-to-digital conversion of the EEG and stimulus trigger codes was performed at a sampling frequency of 200 Hz. ERPs, time-locked to the onset of the critical word in each sentence (underlined in the examples in Table 1), were averaged offline for each participant at each electrode site in each condition. Trials characterized by eye blinks, excessive muscle artifact, or amplifier blocking were not included in the averages. 3.70% of all trials were rejected, and the rejection rate did not differ significantly between experimental conditions: F(3,52) = 0.024, p = 0.995). ERPs were quantified as mean amplitude within a given time window. All artifact-free trials were included in the ERP analyses. In accordance with previous literature and visual inspection of the data, the following time windows were chosen: 300–500ms (LAN/N400), and 500–900ms (P600), relative to a 100ms poststimulus baseline. A poststimulus baseline was used because there were moderate differences in the prestimulus baseline, likely a result of needing to use different pre-critical words across conditions.
Differences between conditions were analyzed using a repeated-measure ANOVA with two levels of morphological complexity (simple, complex) and two levels of grammaticality (grammatical, ungrammatical). Data from midline (Fz, Cz, Pz), medial (right hemisphere: Fp2, F4, C4, P4, O2; left hemisphere: Fp1, F3, C3, P3, O1), and lateral (right hemisphere: F8, T8, P8; left hemisphere: F7, T7, P7) electrode sites were treated separately in order to identify topographic and hemispheric differences. ANOVAs on midline electrodes included electrode as an additional within-subjects factor (three levels), ANOVAs on medial electrodes included hemisphere (two levels) and electrode pair (five levels) as additional within-subjects factors, and ANOVAs over lateral electrodes included hemisphere (two levels) and electrode pair (three levels) as additional within-subjects factors. The Greenhouse-Geisser correction for inhomogeneity of variance was applied to all repeated measures on ERP data with greater than one degree of freedom in the numerator. In such cases, the corrected p-value is reported.
Results
Results from the end-of-sentence judgment task showed that participants were highly accurate in judging the acceptability of the sentences (grammatical mean proportion correct = .92, SE = .02; ungrammatical mean proportion correct = .96, SE = .01). Accuracy in judging the simple versus complex sentences (Table 2) was not significantly different (grammatical simple v. complex stimuli: t(13) = 0.775, p = 0.452; ungrammatical simple v. complex stimuli: t(13) = 0.660, p = 0.520).
Visual inspection of the grand mean ERP waveforms showed that relative to grammatical stimuli, ungrammatical stimuli elicited a widely-distributed positivity beginning around 500ms with a posterior maximum (a P600 effect [56]) in both the morphologically complex (Fig 1) and morphologically simple (Fig 2) conditions. The P600 was larger for complex ungrammatical stimuli than for simple ungrammatical stimuli (Fig 3). As can also be seen in Fig 3, there was no difference in the 500–900ms time window between simple and complex grammatical stimuli. Statistical analysis confirmed these observations. In the 500–900ms time window, there was a main effect of grammaticality (midline: F(1,13) = 8.278, p = 0.013; medial: F(1,13) = 6.486, p = 0.024) that was strongest over posterior electrodes (grammaticality x electrode interaction: midline: F(2,26) = 28.196, p < 0.001; medial: F(4,52) = 15.606, p < 0.001; lateral: F(2,26) = 14.038, p = 0.001). Also in the 500–900ms time window, there was an interaction between complexity and grammaticality (complexity x grammaticality interaction: midline: F(1,13) = 11.467, p = 0.005; medial: F(1,13) = 14.767, p = 0.002; lateral: F(1,13) = 5.594, p = 0.034), which was again strongest over posterior electrodes (complexity x grammaticality x electrode interaction: midline: F(2,26) = 5.818, p = 0.015; medial: F(4,52) = 7.683, p = 0.007; lateral: F(2,26) = 13.855, p = <0.001).
Grand mean ERP waveforms for sentences with morphologically complex, grammatical critical words (black line) and sentences with morphologically complex, ungrammatical critical words (red line). Onset of the critical word in the sentence is indicated by the vertical bar. Calibration bar shows 3μVof activity; each tick mark represents 100ms of time. Negative voltage is plotted up.
Grand mean ERP waveforms for morphologically simple, grammatical critical words (black line) and sentences with morphologically simple, ungrammatical critical words (red line). Onset of the critical word in the sentence is indicated by the vertical bar. Calibration bar shows 3μVof activity; each tick mark represents 100ms of time. Negative voltage is plotted up.
Grand mean ERP waveforms for all four sentence conditions over central parietal and occipital electrodes: grammatical, morphologically simple (black dashed line), grammatical, morphologically complex (black solid line), ungrammatical, morphologically simple (red dashed line), and ungrammatical, morphologically complex (red solid line). Onset of the critical word in the sentence is indicated by the vertical bar. Calibration bar shows 3μVof activity; each tick mark represents 100ms of time. Negative voltage is plotted up.
To further explore the interaction between complexity and grammaticality, we computed the mean activity in all four conditions over a central-posterior region of interest (Cz, P3, Pz, P4, O1, O2) where P600 effects are typically largest [57]. We then performed four paired-samples t-tests and used a Bonferroni correction (alpha level α = 0.0125). These t-tests confirmed our initial observations. Relative to grammatical stimuli, ungrammatical stimuli elicited a significant P600, in both the simple and complex conditions (simple grammatical vs. simple ungrammatical: t(13) = -3.160, p = 0.008; complex grammatical vs. complex ungrammatical: t(13) = -5.336, p < 0.001) (uncorrected p-values are reported). The P600 was significantly larger for complex ungrammatical stimuli than for simple ungrammatical stimuli (simple ungrammatical vs. complex ungrammatical: t(13) = -5.564, p < 0.001). When comparing the simple and complex grammatical conditions in the P600 time window, one sees that the complex grammatical stimuli were numerically more negative going than the simple grammatical stimuli (note the difference in polarity, relative to the effect of complexity in the ungrammatical sentences). However, this numerical difference was not significant, nor did it approach significance considering the Bonferroni-corrected alpha level (simple grammatical vs. complex grammatical: t(13) = 2.013, p = 0.065).
Statistical analysis also revealed effects of grammaticality in the 300–500ms time window. Relative to grammatical stimuli, ungrammatical stimuli elicited more negative responses in frontal electrodes and more positive responses in posterior electrodes from 300–500ms (grammaticality x electrode interaction: midline: F(2,26) = 7.582, p = 0.008; medial: F(4,52) = 6.257, p = 0.006). There was no effect of hemisphere on this interaction. Independent of electrode site, ungrammatical stimuli produced greater negativities from 300–500ms in the left hemisphere than in the right hemisphere (grammaticality x hemisphere interaction: medial: F(1,13) = 6.452, p = 0.025; lateral: F(1,13) = 6.124, p = 0.028). This is similar to some previous reports of left anterior negativity (LAN) effects in response to morphosyntactically anomalous stimuli (see e.g., [9]); however, in the present case left and frontal negativity seemed to be driven by variability in the quality of brain responses (N400 versus P600) to ungrammatical stimuli from individual participants [57,58]. Moreover, LAN effects are highly variable across studies, and recent findings suggest that these left hemisphere negativities can result from component overlap (either within or across individuals) between centrally-distributed N400 effects and right hemisphere-dominant P600 effects [57,58]. Our complexity manipulation did not modulate this negativity, despite visual differences in the magnitude of the negativity (see Figs 1 and 2), likely due to this variability. Instead, it modulated the P600 effect, which, across studies, is a more reliable index of sensitivity to morphosyntactic relations (e.g., [6,8,59]).
Discussion
This experiment investigated the effects of morphological complexity and grammaticality on the P600 during sentence comprehension. We asked whether manipulations of morphological complexity and stimulus grammaticality would have independent or additive effects on the P600 ERP component, which has been associated with processing both ungrammatical stimuli and grammatical but complex stimuli. Our results showed a strong interaction between these two factors, with no discernible unique effects of morphological complexity on the P600. Specifically, we found that whereas both morphologically simple and complex ungrammatical words in sentences elicited a P600 (as expected), the P600 was larger for morphologically complex ungrammatical words than morphologically simple ungrammatical words. There was no reliable difference in brain responses between the morphologically simple and complex grammatical conditions. These results have both methodological and theoretical implications.
Methodologically, these results indicate that care must be taken when comparing the effects of grammaticality when the stimuli being compared vary in morphological complexity. This is especially relevant when comparing the magnitude of the P600 across different experiments, and should be taken into account when stimuli are designed. As advocated for by Steinhauer & Drury [17], using a balanced stimulus design, in which simple and complex forms of the critical verb are each used in grammatical and ungrammatical conditions, overcomes the confound of morphological complexity (example from [17]: Grammatical: 1) “The boys play a game.”, 2) “The boy plays a game.”; Ungrammatical: 3) “The boys plays a game”, 4) “The boy play a game.”). While resolving the morphological complexity confound, this type of stimulus design may lead researchers to miss subtle differences in processing associated with different levels of complexity on the target. For example, Marcinek and colleagues [60] recently reported subtle differences in ERP responses across complexity conditions in sentences like those described above. Specifically they report that in comparisons of grammatical simple and ungrammatical complex subject-verb agreement manipulations (The boys walk/*walks…), ERP results showed a LAN followed by a P600, but when comparing grammatical complex to ungrammatical simple agreement manipulations (The boy walks/*walk…), there was no apparent LAN, and the P600 had an earlier onset.
These cautions are not simply limited to cases of morphological complexity, but extend to other linguistic features as well. Importantly, in some cases, averaging across sub-conditions within balanced designs could lead to spurious theoretical conclusions when sub-conditions elicit qualitatively different brain responses. For example, Marcinek and colleagues [60] further report that asymmetric word category violations in a balanced design elicit qualitatively different brain responses: nouns substituted for verbs (… to enjoy/*meal…) elicit N400 effects, whereas verbs substituted for nouns (…the meal/*enjoy…) elicit P600 effects. Nieuwland, Martin and Carrieras [61] show that animacy alternations, which require differential object marking in Spanish, elicit qualitatively different brain responses: animate object nouns preceded by determiners unmarked for animacy (…al/*el obispo…, “the bishop”) elicited N400 effects, whereas inanimate object nouns preceded by animate-marked determiners (…el/*al suelo…, “the floor”) elicited P600 effects. In either of these cases, simply averaging across levels within the balanced design would have resulted in either spurious biphasic effects or null effects, depending on the level of spatiotemporal overlap of the N400 and P600 effects. Thus, as the present data additionally suggest, some caution is warranted when interpreting data averaged across conditions within balanced designs. In some cases this averaging would obscure subtle differences in processing (as in the present data), or more gravely result in averaged ERP waveforms not reflective of participants’ processing in the sub-conditions of the balanced design (see [15,57,62] for further cautions about averaging across individual participants within conditions who show qualitatively different ERP responses to linguistic violations). This is not to say the balanced designs should be abandoned altogether, simply that researchers must consider the consequences that the averaging procedure may have on their final interpretation of the data.
At a theoretical level, the present data clearly show that the processing of verb ungrammaticality is not a unitary phenomenon. Instead, the P600 elicited by ungrammatical verbs is sensitive to the morphological complexity of the target word. One possible theoretical account for this pattern is provided by the prediction/memory search account of morphosyntactic processing outlined in the Introduction. On this account, predicted features, even in complex contexts, do not trigger processing difficulties. In the two grammatical contexts of the present experiment, there was no statistically reliable difference in P600 amplitude and no other evidence of processing difficulty in the ERP record. This is despite differences in the stimuli at the level of morphological complexity, as well as more superficial levels (e.g., word length). However, when bottom-up encountered input requires retrieval of previously encountered input (e.g., in the case of unexpected or ungrammatical morphosyntactic features, or in the processing of verb-argument relations in filler-gap configurations, or other complex dependencies), P600 amplitude should index salience of the anomaly and also difficulty in establishing a licit dependency. Among the factors implicated in this process are the strength to which comprehenders commit to the initial prediction and the level of cue overlap between the critical word and target. All else being equal, more difficulty in processing (indexed by larger P600 amplitudes) should occur when initial predictions were stronger (and thus more difficult to overcome) and when salient retrieval cues of the probe do not match with any potential targets (see [8,42,51,52] for other recent similar results in this vein).
In the present study P600 amplitude was significantly larger for ungrammatical stimuli in the morphologically complex condition (…should *grazing…) than in the morphologically simple condition (…were *graze…). We can account for this by appealing to both of the above-mentioned constraints: predictability and cue overlap. It is well-established that comprehenders predict upcoming information, and that the constraints imposed by the linguistic context guide not only comprehenders’ expectations of upcoming material, but also the strength to which they commit to those expectations. This has been most thoroughly explored in the realm of sentential constraint on semantic and lexical predictions. Several studies have now shown that comprehenders actively use semantic context to predict both upcoming lexical items, as well as semantic, syntactic, and phononlogical features of those items [34,35,63–69]. Importantly, the level of lexical and semantic constraint that a sentence context provides is approximately linearly associated with N400 amplitude, and even varies as a function of the number of alternate completions even across sentences with similar constraints (as measured by cloze probability) [63]. The role of prediction is made even clearer in work by Brothers, Swaab and Traxler [70]. They show that accuracy in prediction shows earlier effects on brain responses than contextual effects, measured by cloze probability for discourse-final words. Prediction accuracy for final words modulated the N250, whereas contextual constraint effects were not present until the N400 time window.
Effects of prediction strength have also been demonstrated in the realm of grammatical processing, particularly when evaluating the role of statistical probabilities on the processing of verbal complements. Several studies have shown effects of verb-complement co-occurrence bias both behaviorally [71–73] and using ERPs [7]. Specifically, Osterhout, Holcomb and Swinney [7] demonstrated that the statistical probability of a given verb occurring with a sentential versus NP complement modulated P600 amplitudes. Osterhout and colleagues showed graded P600 amplitudes to disambiguating verbs in complement clauses based on probability that main clause verb will occur with complement clauses: the largest P600s were found following verbs that never occur with sentential complements, intermediate P600s were found following verbs that occur most frequently with NP complements (but still occur occasionally with sentential complements), and no P600 effects were found following verbs lacking a particular complement bias. This demonstrates that linguistic experience can shape processing behavior at the syntactic level, where P600s elicited by violations of expectations show amplitudes in inverse proportion to the strength of those expectations.
In the present study we can see the effects of predictability by considering the morphosyntactic constraints imposed by the pre-critical auxiliary verbs in our stimuli. English modal verbs (e.g., can, could, shall, should, will, would) occur primarily in frames where they subcategorize for verbs in their bare stem form, whereas finite forms of the verb be (e.g., is, are, were) are far less restrictive in the types of complements they occur with. They can co-occur with participial verbal complements (as in our grammatical stimuli), as well as with predicative adjectives or nouns, with prepositional phrases, etc. To obtain a course-grained approximation of the level of constraint imposed by the two types of verbs, we queried the Corpus of Contemporary American English [74] to estimate the proportion of tokens where finite forms of be (am, is, are, was, were) are followed within two words by a present participle (-ing form) as well as the proportion of tokens where modal auxiliary verbs (can, could, shall, should, will, would, must, might, may) are followed within two words by a verb in its base form. The proportion for finite forms of be was 0.135; the proportion for modals was 0.346. That is, encountering a finite form of be predicts that a verbal present participle will be encountered within two words 13.5% of the time, whereas encountering a modal predicts that a base verb form will be encountered within two words 34.6% of the time. The local subcategorization constraint imposed by modals is thus approximately three times that imposed by finite forms of be. This suggests that, as demonstrated by Osterhout, Holcomb and Swinney, participants’ prior experience with English co-occurrence probabilities shape their expectations about upcoming material and the strength to which they commit to those expectations. Violation of these stronger predictions thus results in enhanced P600 effects.
The second determinant of P600 amplitude within the prediction/working memory search account implicates cue overlap between retrieval probes and potential targets. Within a content-addressable memory retrieval architecture (e.g., [44,48]), in order to be available as a retrieval target, the cues available on an item must match a subset of those available in working memory. Work within the sentence comprehension literature has shown that both syntactic and semantic features can serve as retrieval cues when establishing morphosyntactic and other verb-argument dependencies [8,49,50]. Prior ERP work in this vein has shown that P600 amplitude is related to potential cue-overlap when processing subject-verb agreement: P600 amplitudes are larger in ungrammatical contexts when there are no potential targets for cues than when a potential target is held in memory, even when it is not a syntactically licensed agreement controller [8,75]. In the context of the current experiment, a search of working memory is triggered upon encountering ungrammatical verbs: when encountering syntactically unexpected verb forms, comprehenders search working memory to seek out an appropriate licensor of those features (i.e., to potentially recover a mis-comprehension). In our stimuli, verbs in the simple condition are underspecified for features, relative to verbs in the complex condition. Participial verbs contain specifications for aspectual features (e.g., telicity, boundedness), which must be syntactically and/or semantically licensed [76]. Based on these featureal specifications, participial verbs must “seek out” particular targets that license their features; as participles mismatch in licensing features with preceding modal verb contexts (…should *grazing…), large ungrammaticality effects results. In contrast, base verbal forms occur in many syntactic contexts (finite, nonfinite, singular, plural, etc) and thus are relatively free in their syntactic distribution. Being underspecified for features, base verb forms thus neither match nor mismatch with features in preceding context, resulting in reduced effects of ungrammaticality (cf. [76,77] for formal approaches to underspecification).
The results from the present study cannot determine unambiguously whether strength of prediction or feature mismatch may have contributed independently or in an interactive fashion in shaping the results here. Note also that the prediction/memory search account is only one of a number of possible explanations for the observed interaction in our data. However, our present data align nicely with a growing body of research implicating both prediction and retrieval as key determinants of sentence processing success and difficulty [8,36–38,42,44,46,51,52,70,78,79]. Future research will be needed to more clearly establish the exact cognitive locus of these effects.
More importantly, our results clearly demonstrate that ungrammaticality of verb forms is not a unitary processing phenomenon. Instead, neural responses to ungrammatical verbs clearly varied as a function of the morphological cues and complexity of the critical word, whereas brain responses to grammatical verbs did not show a similar sensitivity. A further limitation of this experiment is that the morphologically complex words (e.g., eating), are not only more complex but also longer than the morphologically simple words (e.g., eat). This could result in ungrammaticalities in the complex condition being simply visually more salient than in the simple condition. However, there is sufficient reason to believe that length alone cannot account for the effects we see in our data, and in particular, the enhanced P600 wave in the complex ungrammatical condition compared to the simple ungrammatical condition. First, if length had any unique effect on ERPs in our data, it should be visible in a comparison of the two grammatical conditions (e.g., should graze versus were grazing). In our data, there were no significant differences between these two conditions in the P600 time window, suggesting that length alone cannot explain the effects we saw in the ungrammatical conditions. Moreover, to the extent that length had any effect on the data, it was numerically opposite with respect to the grammaticality condition. That is, the complex (and thus longer) word in the grammatical condition was numerically (but non-significantly) more negative-going, but in the ungrammatical condition it was very reliably more positive-going. If there were systematic effects of length on the P600 effect, they should have shown the same polarity with respect to the grammaticality manipulation; instead, we found opposite polarity effects, indicating that length alone cannot explain the significant P600 amplitude differences in the two ungrammatical conditions.
In summary, this experiment has shown that whereas there is no unique effect of morphological complexity on the P600, overt morphological cues (e.g., -ing) elicit a larger P600 when stimuli are ungrammatical, compared to when the overt cues are not present. These results have important methodological considerations for ERP research, and also provide more information about what types of cognitive processes the P600 may be indexing.
Supporting Information
S1 Table. All experimental and filler sentences.
All four versions of each experimental sentence, as well as the 60 filler sentences used in every list.
https://doi.org/10.1371/journal.pone.0140850.s001
(XLSX)
S2 Table. ERP mean amplitudes and participant characteristics.
Mean ERP amplitudes for each participant in the 300–500ms and 500–900ms time windows in all four sentence conditions at all electrodes, as well as each participant’s gender and age.
https://doi.org/10.1371/journal.pone.0140850.s002
(XLSX)
Author Contributions
Conceived and designed the experiments: ASM DT GDV. Performed the experiments: ASM EKW. Analyzed the data: ASM. Contributed reagents/materials/analysis tools: LO. Wrote the paper: ASM DT LO.
References
- 1. Osterhout L, Holcomb PJ (1993) Event-related potentials and syntactic anomaly: Evidence of anomaly detection during the perception of continuous speech. Lang Cogn Process 8: 413–437.
- 2. Hagoort P, Brown CM, Groothusen J (1993) The syntactic positive shift as an ERP measure of syntactic processing. Lang Cogn Process 8: 439–484.
- 3. Osterhout L, Nicol J (1999) On the distinctiveness, independence, and time course of the brain responses to syntactic and semantic anomalies. Lang Cogn Process 14: 283–317.
- 4. Friederici AD, Hahne A, Mecklinger A (1996) Temporal structure of syntactic processing: Early and late event-related potential effects. J Exp Psychol Learn Mem Cogn 22: 1219–1248. pmid:8805821
- 5. Kaan E, Swaab TY (2003) Repair, revision and complexity in syntactic analysis: An electrophysiological differentiation. J Cogn Neurosci 15: 98–110. pmid:12590846
- 6. Osterhout L, Mobley L (1995) Event-related brain potentials elicited by failure to agree. J Mem Lang 34: 739–773.
- 7. Osterhout L, Holcomb PJ, Swinney DA (1994) Brain potentials elicited by garden-path sentences: Evidence for the application of verb information during parsing. J Exp Psychol Learn Mem Cogn 28: 786–803.
- 8. Tanner D, Nicol J, Brehm L (2014) The time-course of feature interference in agreement comprehension: Multiple mechanisms and asymmetrical attraction. J Mem Lang 76: 195–215. pmid:25258471
- 9. Molinaro N, Barber H, Carreiras M (2011) Grammatical agreement processing in reading: ERP findings and future directions. Cortex 47: 908–930. pmid:21458791
- 10. Kutas M, Federmeier KD (2011) Thirty years and counting: Finding meaning in the N400 component of the event-related brain potential (ERP). Annu Rev Psychol 62: 621–647. pmid:20809790
- 11. Allen M, Badecker W, Osterhout L (2003) Morphological analysis in sentence processing: An ERP study. Lang Cogn Process 18: 405–430.
- 12. Brouwer H, Fitz H, Hoeks J (2012) Getting real about semantic illusions: Rethinking the functional role of the P600 in language comprehension. Brain Res 1446: 127–143. pmid:22361114
- 13. Guajardo LF, Wicha NYY (2014) Morphosyntax can modulate the N400 component: Event related potentials to gender-marked post-nominal adjectives. Neuroimage 91: 262–272. pmid:24462934
- 14. Morgan-Short K, Steinhauer K, Sanz C, Ullman MT (2012) Explicit and implicit second language training differentially affect the achievement of native-like brain activation patterns. J Cogn Neurosci 24: 933–947. pmid:21861686
- 15. Tanner D, McLaughlin J, Herschensohn J, Osterhout L (2013) Individual differences reveal stages of L2 grammatical acquisition: ERP evidence. Biling Lang Cogn 16: 367–382.
- 16. Osterhout L, McLaughlin J, Pitkänen I, Frenck-Mestre C, Molinaro N (2006) Novice learners, longitudinal designs, and event-related potentials: A means for exploring the neurocognition of second language processing. Lang Learn 56: 199–230.
- 17. Steinhauer K, Drury JE (2012) On the early left-anterior negativity (ELAN) in syntax studies. Brain Lang 120: 135–162. pmid:21924483
- 18.
Osterhout L, McLaughlin J, Kim A, Greewald R, Inoue K (2004) Sentences in the brain: Event-related potentials as real-time reflections of sentence comprehension and language learning. In: Carreiras M, Clifton C, editors. The on-line study of sentence comprehension: Eyetracking-ERPs, and beyond. New York: Psychology Press. pp. 271–308.
- 19. White EJ, Genesee F, Steinhauer K (2012) Brain responses before and after intensive second language learning: Proficiency based changes and first language background effects in adult learners. PLoS One 7: e52318. pmid:23300641
- 20. Solomyak O, Marantz A (2009) Evidence for early morphological decomposition in visual word recognition. J Cogn Neurosci 22: 2042–2057.
- 21. Rodriguez-Fornells A, Münte TF, Clahsen H (2002) Morphological Priming in Spanish Verb Forms: An ERP Repetition Priming Study. J Cogn Neurosci 13: 443–454.
- 22. Lavric A, Clapp A, Rastle K (2007) ERP evidence of morphological analysis from orthography: A masked priming study. J Cogn Neurosci 19: 866–877. pmid:17488210
- 23. Münte TF, Say T, Clahsen H, Schiltz K, Kutas M (1999) Decomposition of morphologically complex words in English: Evidence from event-related brain potentials. Cogn Brain Res 7: 241–253.
- 24. Royle P, Drury JE, Bourguignon N, Steinhauer K (2012) The temporal dynamics of inflected word recognition: A masked ERP priming study of French verbs. Neuropsychologia 50: 3542–3553. pmid:22975192
- 25. McKinnon R, Allen M, Osterhout L (2003) Morphological decomposition involving non-productive morphemes: ERP evidence. Neuroreport 14: 883–886. pmid:12858053
- 26. Morris J, Grainger J, Holcomb PJ (2008) An electrophysiological investigation of early effects of masked morphological priming. Lang Cogn Process 23: 1021–1056. pmid:19779574
- 27. Morris J, Frank T, Grainger J, Holcomb PJ (2007) Semantic transparency and masked morphological priming: An ERP investigation. Psychophysiology 44: 506–521. pmid:17498223
- 28. Morris J, Stockall L (2012) Early, equivalent ERP masked priming effects for regular and irregular morphology. Brain Lang 123: 81–93. pmid:22917673
- 29. Kaan E, Harris A, Gibson E, Holcomb PJ (2000) The P600 as an index of syntactic integration difficulty. Lang Cogn Process 15: 159–201.
- 30. Fine AB, Jaeger TF, Farmer TA, Qian T (2013) Rapid expectation adaptation during syntactic comprehension. PLoS One 8: e77661. pmid:24204909
- 31. Dikker S, Rabagliati H, Farmer TA, Pylkkänen L (2010) Early occipital sensitivity to syntactic category is based on form typicality. Psychol Sci 21: 629–634. pmid:20483838
- 32. Dikker S, Rabagliati H, Pylkkänen L (2009) Sensitivity to syntax in visual cortex. Cognition 110: 293–321. pmid:19121826
- 33. Farmer TA, Christiansen MH, Monaghan P (2006) Phonological typicality influences on-line sentence comprehension. Proc Natl Acad Sci U S A 103: 12203–12208. pmid:16882728
- 34. Federmeier KD, Kutas M (1999) A rose by any other name: Long-term memory structure and sentence processing. J Mem Lang 41: 469–495.
- 35. Federmeier KD, Wlotko EW, De Ochoa-Dewald E, Kutas M (2007) Multiple effects of sentential constraint on word processing. Brain Res 1146: 75–84. pmid:16901469
- 36. Federmeier KD (2007) Thinking ahead: The role and roots of prediction in language comprehension. Psychophysiology 44: 491–505. pmid:17521377
- 37. Dell GS, Chang F (2014) The P-chain: Relating sentence production and its disorders to comprehension and acquisition. Philos Trans R Soc 369: 20120394.
- 38. Pickering MJ, Garrod S (2013) An integrated theory of language production and comprehension. Behav Brain Sci 36: 329–347. pmid:23789620
- 39. Lew-Williams C, Fernald A (2007) Young children learning Spanish make rapid use of grammatical gender in spoken word recognition. Psychol Sci 18: 193–198. pmid:17444909
- 40. Altmann GTM, Kamide Y (1999) Incremental interpretation at verbs: Restricting the domain of subsequent reference. Cognition 73: 247–264. pmid:10585516
- 41. Levy R (2008) Expectation-based syntactic comprehension. Cognition 106: 1126–1177. pmid:17662975
- 42. Tanner D, Bulkes NZ (in press) Cues, quantification, and agreement in language comprehension. Psychon Bull Rev.
- 43. Lewis RL, Vasishth S (2005) An activation-based model of sentence processing as skilled memory retrieval. Cogn Sci 29: 375–419. pmid:21702779
- 44. Lewis RL, Vasishth S, Van Dyke JA (2006) Computational principles of working memory in sentence comprehension. Trends Cogn Sci 10: 447–454. pmid:16949330
- 45. Gordon PC, Hendrick R, Johnson M (2001) Memory interference during language processing. J Exp Psychol Learn Mem Cogn 27: 1411–1423. pmid:11713876
- 46. Van Dyke JA, Johns CL (2012) Memory interference as a determinant of language comprehension. Lang Linguist Compass 6: 193–211. pmid:22773927
- 47. Van Dyke JA, Johns CL, Kukona A (2014) Low working memory capacity is only spuriously related to poor reading comprehension. Cognition 131: 373–403. pmid:24657820
- 48.
McElree B (2006) Accessing recent events. In: Ross BH, editor. Psychology of Learning and Motivation. San Diego: Academic Press, Vol. 46. pp. 155–200.
- 49. Van Dyke JA, McElree B (2011) Cue-dependent interference in comprehension. J Mem Lang 65: 247–263. pmid:21927535
- 50. Van Dyke JA (2007) Interference effects from grammatically unavailable constituents during sentence processing. J Exp Psychol Learn Mem Cogn 33: 407–430. pmid:17352621
- 51. Wagers MW, Lau EF, Phillips C (2009) Agreement attraction in comprehension: Representations and processes. J Mem Lang 61: 206–237.
- 52. Dillon B, Mishler A, Sloggett S, Phillips C (2013) Contrasting intrusion profiles for agreement and anaphora: Experimental and modeling evidence. J Mem Lang 69: 85–103.
- 53. Oldfield RC (1971) The assessment and analysis of handedness: The Edinburgh Inventory. Neuropsychologia 9: 97–113. pmid:5146491
- 54.
Baayen RH, Piepenbrock R, Gulikers L (1995) The CELEX lexical database (CD-ROM).
- 55. Jasper HH (1958) The ten–twenty system of the International Federation. Electroencephalogr Clin Neurophysiol 10: 371–375.
- 56. Osterhout L, Holcomb PJ (1992) Event-related brain potentials elicited by syntactic anomaly. J Mem Lang 31: 785–806.
- 57. Tanner D, Van Hell JG (2014) ERPs reveal individual differences in morphosyntactic processing. Neuropsychologia 56: 289–301. pmid:24530237
- 58. Tanner D (2015) On the left anterior negativity (LAN) in electrophysiological studies of morphosyntactic agreement. Cortex 66: 149–155. pmid:24928489
- 59. Nevins A, Dillon B, Malhotra S, Phillips C (2007) The role of feature-number and feature-type in processing Hindi verb agreement violations. Brain Res 1164: 81–94. pmid:17658491
- 60.
Marcinek BT, Steinhauer K, Ullman MT, Drury JE (2014) Asymmetric ERP responses within sub-conditions of “balanced” syntactic violation paradigms: P600s, LANs, and additivity. Poster presented at the 2014 Neurobiology of Language Conference. Amsterdam.
- 61. Nieuwland MS, Martin AE, Carreiras M (2013) Event-related brain potential evidence for animacy processing asymmetries during sentence comprehension. Brain Lang 126: 151–158. pmid:23735756
- 62. Tanner D, Inoue K, Osterhout L (2014) Brain-based individual differences in online L2 grammatical comprehension. Biling Lang Cogn 17: 277–293.
- 63. Wlotko EW, Federmeier KD (2012) So that’s what you meant! Event-related potentials reveal multiple aspects of context use during construction of message-level meaning. Neuroimage 62: 356–366. pmid:22565202
- 64. Thornhill DE, Van Petten C (2012) Lexical versus conceptual anticipation during sentence processing: Frontal positivity and N400 ERP components. Int J Psychophysiol 83: 382–392. pmid:22226800
- 65. Rommers J, Dijkstra T, Bastiaansen M (2013) Context-dependent semantic processing in the human brain: Evidence from idiom comprehension. J Cogn Neurosci 25: 762–776. pmid:23249356
- 66. Wicha NYY, Moreno EM, Kutas M (2004) Anticipating words and their gender: An event-related brain potential study of semantic integration, gender expectancy, and gender agreement in Spanish sentence reading. J Cogn Neurosci 16: 1272–1288. pmid:15453979
- 67. DeLong KA, Urbach TP, Kutas M (2005) Probabilistic word pre-activation during language comprehension inferred from electrical brain activity. Nat Neurosci 8: 1117–1121. pmid:16007080
- 68. DeLong KA, Quante L, Kutas M (2014) Predictability, plausibility, and two late ERP positivities during written sentence comprehension. Neuropsychologia 61: 150–162. pmid:24953958
- 69. Van Berkum JJA, Brown CM, Zwitserlood P, Kooijman V, Hagoort P (2005) Anticipating upcoming words in discourse: Evidence from ERPs and reading times. J Exp Psychol Learn Mem Cogn 31: 443–467. pmid:15910130
- 70. Brothers T, Swaab TY, Traxler MJ (2015) Effects of prediction and contextual support on lexical processing: Prediction takes precedence. Cognition 136: 135–149. pmid:25497522
- 71. Garnsey SM, Pearlmutter NJ, Myers E, Lotocky MA (1997) The contributions of verb bias and plausibility to the comprehension of temporarily ambiguous sentences. J Mem Lang 37: 58–93.
- 72. Trueswell JC, Kim A (1998) How to prune a garden path by nipping it in the bud: Fast priming of verb argument structure. J Mem Lang 39: 102–123.
- 73. Trueswell JC, Tanenhaus MK, Kello C (1993) Verb-specific constraints in sentence processing: Separating effects of lexical preferences from garden-paths. J Exp Psychol Learn Mem Cogn 19: 528–553. pmid:8501429
- 74.
Davies M (2008-) The Corpus of Contemporary American English: 450 million words, 1990-present.
- 75. Kaan E (2002) Investigating the effects of distance and number interference in processing subject-verb dependencies: An ERP study. J Psycholinguist Res 31: 165–193. pmid:12022794
- 76. De Swart H (1998) Aspect shift and coercion. Nat Lang Linguist Theory 16: 347–385.
- 77. Prévost P, White L (2000) Missing surface inflection or impairment in second language acquisition? Evidence from tense and agreement. Second Lang Res 16: 103–133.
- 78. Luke SG, Christianson K (n.d.) Predicting inflectional morphology from context. Lang Cogn Neurosci.
- 79. Luke SG, Christianson K (2012) Semantic predictability eliminates the transposed-letter effect. Mem Cognit 40: 628–641. pmid:22160873