Figures
Abstract
Recent studies have investigated the role of semantic processing in the parafovea using the Rapid Parallel Visual Presentation (RPVP) paradigm, which involves the simultaneous presentation of two words, one in the fovea (W1) and one in the parafovea (W2). Results have shown that both accuracy and response times are influenced by the semantic relatedness between the words. These findings suggest that semantic information can be extracted from the parafovea and in parallel with the processing of the foveal word. In the present study, we aimed to provide further evidence of parallel semantic processing and to gain deeper insight into the availability and spatial distribution of attentional resources when semantic relatedness is present. Two experiments were conducted. In both, the first part replicated the previous RPVP setup: two words were simultaneously presented and participants were required to read them aloud. Frequency and semantic relatedness between the two words were manipulated. Results replicated previous findings. Crucially, each experiment included a second task: as participants began reading the two words, a probe (an asterisk) could appear to the right of the parafoveal word (Experiment 1) or above the foveal word (Experiment 2). Probe detection times and accuracy were equally facilitated at both positions when the two words was of high-frequency and semantically related. In contrast, when W1 was of low-frequency, neither accuracy nor reaction times in probe detection benefited from parafoveal processing, suggesting that an increased load on the foveal word hinders parafoveal processing. Finally, the same probe detection results were obtained – both in terms of mean reaction times and accuracy – across the two spatial positions, indicating an even distribution of attentional resources across foveal and parafoveal words.
Citation: Bandiera V, Arduino LS, Daini R, Martelli M, Primativo S (2026) Distribution of visuo-attentional resources while reading multiple words. PLoS One 21(2): e0341917. https://doi.org/10.1371/journal.pone.0341917
Editor: Yiu-Kei Tsang, Hong Kong Baptist University, HONG KONG
Received: June 25, 2025; Accepted: January 14, 2026; Published: February 2, 2026
Copyright: © 2026 Bandiera et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data will be available on Open Science Framework (OSF) through this link: https://osf.io/khtc7/overview?view_only=d53516d66fac42c88386cc86fffa743d.
Funding: SP received funding from The PRIN GRANT 2022S9ZMAY (by Italian Ministry of University and Research). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Reading is an everyday activity, yet it remains a highly complex cognitive task that involves multiple components, including linguistic processes (orthography, phonology, lexical and semantic access), visual perception, attention and eye movements. To efficiently extract information, readers move their eyes to bring text into the fovea, the area of the retina with the highest visual acuity. Considerable research has investigated the nature of the information that can be extracted from the parafoveal region [1–3], which in left-to-right reading languages, extends rightwards approximately 15–20 characters beyond the fovea [4]. Despite reduced visual acuity and attentional resources, readers can extract useful information from the parafovea [5,6].
A key debate in the reading literature concerns: i) the serial vs. parallel distribution of attention to multiple words and ii) what types of information (orthographic, phonological, lexical, or semantic) can be extracted from parafoveal vision and how it interacts with foveal processing.
Understanding how attention is allocated and distributed across text during reading is central to both experimental and theoretical research on eye movement control [7]. Cognitive models of reading are often categorized based on how they assume attention is allocated. The Sequential Attention Shift (SAS) models posit that attention shifts serially from one word to the next, tightly linking eye movements to lexical access. Although SAS models allow some parafoveal preview—where readers extract partial information from upcoming words— they assume full lexical processing only occurs once a word is fixated (e.g., E-Z Reader: [8–11]). For example, [12] examined perceptual capacity limits by comparing semantic and color judgments in a dual-task paradigm. Their results seem to indicate that, whereas change detection for low-level non-linguistic features can approach unlimited capacity, semantic judgments exhibit a clear dual-task cost. This pattern could be compatible with serial processing under certain conditions. Similarly, the authors [13] reported the same pattern when using sentences instead of words.
In contrast, parallel models (e.g., SWIFT [14–17]) propose that multiple words can be processed simultaneously. These models suggest that attention is distributed across multiple words in the visual field, with the extent of parallel processing depending on cognitive and linguistic constraints (see for example [15]). Unlike serial models, parallel models assume that multiple words can be processed at the same time, with processing depth decreasing as words move away from fixation [18–20]. This graded allocation of attention explains the parafoveal-on-foveal effect – where parafoveal word properties influence foveal processing (PoF; [21]) often measured through the foveal inspection time [22–24]. Notably, Snell et al. [24], by using a syntactic categorization task within a flanking paradigm, showed that syntactic information can be integrated across foveal and parafoveal words. Similarly, in a semantic categorization task, bilingual participants showed semantic processing of the parafoveal words [25]. Indeed, in both studies, the authors showed that decision response times were influenced by the syntactic and the semantic congruency of flanking words, respectively. Overall, these competing models differ in how much and what kind of information is assumed to be accessible from the parafovea.
Another critical issue concerns the role of foveal load, namely how the cognitive demand associated with processing the foveal word may modulate parafoveal processing. According to the foveal load hypothesis [26], increased processing difficulty in the fovea (e.g., due to low-frequency words) reduces the amount of information that can be extracted from the parafovea. Indeed, Henderson and Ferreira [26] first demonstrated this effect: when the foveal word was more difficult to process (e.g., a low-frequency word), the parafoveal preview benefit (PPE) was reduced compared to an easier-to-process word (e.g., a high-frequency word).
The Foveal Load Effect (FLE) aligns well with serial models, where the processing of the parafoveal word is strongly influenced by the attentional resources required to process the foveal word and, only once this is at least partially processed, attention can move forward to the parafoveal word, through an attentional moving window acting similarly to the spotlight hypothesized by Posner [27]. Conversely, the parallel models posit that the focus of attention can be shifted independently of the eye’s fixation position and allocated to a spatially extended region of the text, to support the processing of multiple words at once. Attention could be narrowed or widened due to a failure or a successful recognition of words [14,15]. Thus, on the basis of serial and parallel models’ assumptions, the foveal load can modulate parafoveal processing in two different ways: affecting the linguistic depth of processing according to serial models [9] and the spatial extent of processing according to the parallel models [28]. However, it should be noted that recent research questioned the robustness of the FLE. For example, although foveal load can influence saccade targeting [29], this does not necessarily reflect reduced parafoveal processing [30]. Replications of the FLE are limited and sometimes contradictory [31], and the effect may vary across writing systems [30,32].
The current study has two aims: i) to replicate prior evidence of fast semantic processing in parafovea with the Rapid Parallel Visual Presentation (RPVP) paradigm [33–35]; and ii) to examine the spatial distribution of attentional resources and any changes due to semantic relatedness by introducing a non-linguistic probe-detection task. In our previous work, we employed the Rapid Parallel Visual Presentation (RPVP) paradigm in Italian, where participants read aloud two simultaneously presented words—one in the fovea and one in the parafovea—that were either semantically related or unrelated. Across different exposure durations, we consistently observed both parafoveal preview benefits (PPE) and PoF effects: participants were more accurate and faster when words were semantically related and when W1 was of high-frequency. These effects occurred even under brief exposure (≈100 ms), suggesting rapid access to semantic information and supporting parallel processing mechanisms [33,34]. Crucially, even under high foveal load (i.e., low-frequency W1), semantic relatedness facilitated foveal reading times, implying that semantic processing of the parafoveal word occurred despite increased cognitive demand.
In the present study, we explored more directly the distribution of visual attention immediately after a two-words reading task (RPVP paradigm). To this end, we adopted a probe detection task, with the probe placed either in the foveal or parafoveal region. To ensure that the stimuli fell on the fovea and parafovea, an eye-tracker was used. If attention is distributed serially, we might observe a good performance for probe detection in all conditions in fovea and an advantage in parafoveal probe detection for semantically related words. On the other hand, if attention is distributed in parallel, we might observe a faster probe detection in case of semantic relatedness between the two words, regardless of whether the probe appears in the foveal or parafoveal location.
Materials and methods
Participants
The experimental protocol was approved by the local ethic committee of LUMSA University (CERS) with a protocol number 16/2023. Two different groups of participants from April 1st to December 15th 2024, were recruited for each of the two experiments described below, and all provided written informed consent. Based on a G*power analyses ([36]; power = 0.95, α = 0.05, effect size = 0.6), the required sample size for each experiment was estimated to be N = 32 participants.
All participants self-reported normal or corrected-to-normal vision and no history of dyslexia. In Experiment 1, 32 participants took part (mean age = 22.4 years, range = 20–29 years, SD. = 2.3, 21 females, mean education = 14 years). In Experiment 2, 32 participants were also tested (mean = 21 years, range = 18–25 years, SD. = 2.3, 23 females, mean education = 14 years). All participants were unaware of the purpose of the study.
Stimuli
Eighty pairs of Italian nouns were selected from the CoLFIS database [37]. All words were nouns 4–5 letters in length (mean = 4.5). Written word frequency (high and low) and semantic relatedness between the two words were manipulated orthogonally, in order to obtain two lists of 40 word pairs each (see Table 1). High and low-frequency words were selected according to (cf. [38]). Half of the stimuli were of high-frequency (>92 occurrences per million, average 447; SD 326,3; Range 97–1469) and half were of low-frequency (<92 occurrences per million, average 43; SD 19,4; Range 25–87). 40 pairs of words were semantically related (SR) and half were semantically unrelated (SU) (see S1 Appendix). Semantic relatedness was determined via 7-point Likert scale questionnaire administered to 33 independent participants (mean age = 21.4 years, range = 20–23 years, SD = 0.8, 30 females) who did not take part in the main experiments. We established that pairs of words receiving a rate of 4.5 or higher were considered semantically related (e.g., PANE-VINO; BREAD-WINE; mean rate = 5.8, SD = 1.2; range = 2.5–7.0). Pairs of words receiving a rate of 3 or lower were considered semantically unrelated, e.g., RIVA-ARTE; SHORE-ART; mean rate = 1.7, SD = 1.2; range = 1.0–5.3) (see Table 2). The word pairs did not involve synonyms or antonyms.
Software and apparatus
The Experiment Builder software (SR Research Ltd., Mississauga, ON, Canada) was used for programming and running the experiments. In order to ensure the fixation stability and retinal position of the stimuli prior to their presentation on the screen, eye movements were recorded via an SR Research Ltd. EyeLink 1000 Plus eye-tracker, sampling at 1000 Hz. Participants were seated at around 57 cm from a 27-inch LED monitor (1366 × 768 pixels, 60 Hz), with their head movement restricted by a chin-rest. Vocal responses were recorded via a one-way microphone connected to an external sound card (M-track 22).
Experimental procedure
At the beginning of each experiment, a standard nine-point calibration and validation procedure was conducted. Each trial began with a fixation cross (subtending a visual angle of 0.5°) displayed on the left side of the screen. When fixation was maintained for at least 250 ms, the cross disappeared and a pair of words (W1 and W2) appeared for 150 ms. The fixation cross was positioned between the second and the third letters of W1, corresponding to the optimal viewing position [39].
The order of presentation of the word pairs was pseudo-randomized across participants, with W1 displayed in the fovea and W2 in the parafovea. Words were presented in the Courier New, a monospaced font ensuring equal center-to-center letter spacing, and each letter subtended 0.5° of visual angle. The total horizontal extent of the word pairs ranged from 7.5° (both words 4 letters) to 9.5° (both words 5 letters).
In 50% of trials (20 SR and 20 SU), a red asterisk (the probe) appeared immediately (0 ms delay) after the onset of the participant’s vocal response. In Experiment 1, the probe appeared just to the right of W2 (7 pixels/ 0.18°), and in Experiment 2, the probe appeared 7 pixels/ 0.18° above W1. Participants were instructed to (1) read both words aloud and (2) press a button on the button-box as quickly as possible when the probe appeared, with both speed and accuracy being emphasized. Each trial ended with a response screen, followed by a 3000 ms intertrial interval. The following dependent measures were recorded: 1) Vocal response times (vRTs) to W1; 2) Accuracy in reading W1 and W2; 3) Probe detection accuracy and 4) probe reaction times (pRTs), defined as the interval from probe onset to button press. Notably, the procedure allowed pRTs to be measured independently of vRTs.
Data cleaning and statistical procedure
As reported in the previous section, the following dependent measures underwent statistical analysis: vRTs to W1; accuracy in reading W2; probe detection accuracy and pRTs. We excluded from the analysis the participants who, because of low accuracy (below 30%) and recording failures, did not have data values in at least one of the experimental conditions (5.4%). We also excluded participants who detected fewer than half of the probe trials (i.e., missing ≥ 10 trials) in at least one experimental condition (8%). Furthermore, by visual inspection of each trial, we excluded trials containing saccades directed towards the parafoveal word. Analyses for W2 accuracy, probe accuracy, and pRTs included only trials where W1 was correctly named. vRTs were analyzed only for trials in which both W1 and W2 were correctly read. Outliers (vRTs and pRTs > 2.5 SDs <from each participant’s mean) were removed.
We applied mixed-effects models to the dependent variables reported above by using the 2.6.44 Jamovi version and the library package GAMLj3. Linear mixed-effects models were fitted to vocal response time data and probe response time data, with W1 frequency (high- vs. low-frequency), W2 frequency (high- vs. low-frequency), and semantic relatedness (unrelated vs. related) as fixed factors, and subjects as random intercept. The model assumed Gaussian residuals and used Satterthwaite’s approximation for degrees of freedom. Bonferroni post-hoc was adopted for exploring the interactions. Generalized mixed-effects models (logistic) were fitted to binary accuracy data – accuracy on W2 and probe accuracy – with W1 frequency, W2 frequency, and semantic relatedness entered as fixed factors, and subjects entered as random factors. The model used a logit link function and Wald confidence intervals.
Results
Experiment 1 – probe to the right of parafoveal word
Four participants were excluded from the analysis: one due to low accuracy in reading W2 (>30%) and three due to probe detection performance ≤10 trials in at least one experimental condition. Full descriptive statistics for all dependent variables across all experimental conditions are reported in S2 Appendix.
Accuracy in reading W2.
The generalized linear mixed-effects model converged successfully, and the model fit was adequate (Conditional R² = 0.219; Marginal R² = 0.097; χ²/df = 0.98), indicating that both fixed and random effects contributed meaningfully to explaining accuracy variance.
Overall mean accuracy was 96% for W1 (range = 84–100%) and 68% for W2 (range = 44–94%). The analysis revealed significant main effects of W1 frequency (χ²(1) = 6.81, p = 0.009), W2 frequency (χ²(1) = 10.12, p = 0.001), and semantic relatedness (χ²(1) = 96.91, p < 0.001). Accuracy was higher for high- vs. low-frequency words (W1: 73% vs 68%; β = –0.25, SE = 0.09, OR = 0.78, p = 0.009; W2: 74% vs 68%; β = –0.30, SE = 0.09, OR = 0.74, p = 0.001). Conversely, semantic relatedness increased the likelihood of a correct response (80% vs 60%; β = 0.93, SE = 0.09, OR = 2.54, p < 0.001).
Significant interactions were also observed for W1 × W2 frequency (χ²(1) = 12.00, p < .001), W2 frequency × semantic relatedness (χ²(1) = 28.74, p < .001), and the three-way interaction (χ²(1) = 27.02, p < .001). The W1 × semantic relatedness interaction was not significant (p = 0.35).
The three-way interaction (β = 1.96, SE = 0.38, OR = 7.11, p < 0.001) is reported in Fig 1 and it revealed that the joint facilitation of frequency and semantic relatedness was particularly pronounced when both words were high-frequency and semantically related. The random intercept for subjects accounted for little variability (Var = 0.52, SD = 0.72, ICC = 0.14), indicating that about 14% of variance in response accuracy was due to between-subject differences.
The figure illustrates the significant three-way interaction in accuracy between W1 frequency (HF = high-frequency; LF = low-frequency), W2 frequency (HF = high-frequency; LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated). The asterisk (*) indicates statistically significant differences (p < 0.05). Error bars represent the standard error.
Vocal reaction times on W1.
The model converged successfully, the conditional and marginal R² were 0.53 and 0.02, respectively, and significant main effects were found for W1 frequency [F(1,1591) = 5.61, p = 0.018], W2 frequency [F(1,1592) = 11.34, p < 0.001], and semantic relatedness [F(1,1592) = 19.02, p < 0.001]. Parameter estimates revealed that RTs were slower for low-frequency words compared to high-frequency ones for both W1 (766 vs 745 ms; β = 21.67, SE = 9.15, p = 0.018) and W2 (771 vs 740 ms; β = 30.83, SE = 9.16, p < 0.001). In addition, semantically related pairs yielded shorter RTs than unrelated pairs (735 vs. 775; β = –40.06, SE = 9.19, p < 0.001).
Importantly, the interaction between W2 frequency and semantic relatedness was also significant [F(1,1592) = 9.78, p = 0.002] indicating that the semantic facilitation effect was stronger for high-frequency W2 items (β = 57.31, SE = 18.32, p = 0.002), as represented in Fig 2. All other two-way and three-way interactions were non-significant (ps > 0.32).
The figure illustrates a significant interaction in vocal reaction times on W1 between W2 frequency (HF = high-frequency; LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated). The asterisk (*) indicates statistically significant differences (p < 0.05). Error bars represent the standard error.
The random intercept for subjects accounted for moderate variability (Var = 35,586, SD = 189; ICC = 0.52), indicating that about half of the variance in RTs was attributable to between-subject differences.
Accuracy in probe detection.
The model successfully converged, the model fit was satisfactory (Conditional R² = .211; Marginal R² = .124; χ²/df = 0.92), indicating that both fixed and random effects accounted for meaningful variance in accuracy. Significant main effects emerged for W1 frequency (χ²(1) = 4.06, p = 0.044), with higher accuracy for high- vs. low-frequency words (88% vs 83%; β = –0.35, SE = 0.17, OR = 0.71, p = 0.044) and semantic relatedness (χ²(1) = 19.05, p < .001) that increased the odds of a correct response (89% vs 80%; β = 0.75, SE = 0.17, OR = 2.11, p < 0.001). W2 frequency did not reach significance (p = 0.132).
The two-way interaction between W1 × semantic relatedness (χ²(1) = 14.46, p < 0.001) and between W2 × semantic relatedness (χ²(1) = 24.07, p < 0.001) were statistically significant. Also, the three-way interaction among W1 frequency, W2 frequency and semantic relatedness was significant (χ²(1) = 5.20, p = 0.023). The interaction (β = 1.56, SE = 0.68, OR = 4.76, p = 0.023) is reported in Fig 3 and it indicated that the joint effect of word frequency and semantic relatedness on probe accuracy was most pronounced when both words were high-frequency and semantically related.
The figure illustrates the significant three-way interaction in probe detection accuracy between W1 frequency (HF = high-frequency, LF = low-frequency), W2 frequency (HF = high-frequency, LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated). The asterisk (*) indicates statistically significant differences (p < 0.05). Error bars represent the standard error.
The random intercept for subjects showed moderate variability (Var = 0.36, SD = 0.60, ICC = 0.10), suggesting that approximately 10% of the variance in probe accuracy was attributable to between-subject differences.
Reaction times for probe detection.
The model successfully converged (N = 1049 observations, 32 participants; optimizer = bobyqa), the conditional and marginal R² were 0.371 and 0.052, respectively, indicating that both fixed and random effects together explained 37.1% of the variance in RTs. Significant main effects were found for W1 frequency, F(1,1010) = 10.06, p = 0.002, and semantic relatedness, F(1,1010) = 8.06, p = .005. The main effect of W2 frequency was not significant (p = 0.096). Probe reaction times were slower for low-frequency W1 compared to high-frequency W1 (518 vs 499 ms; β = 19.1, SE = 6.02, p = 0.002), and semantic relatedness facilitated responses (500 vs 517 ms; β = –17.1, SE = 6.02, p = 0.005). However, these effects were further modulated by several significant interactions: W1 × W2 frequency, F(1,1011) = 6.51, p = 0.011; W1 frequency × semantic relatedness, F(1,1011) = 36.58, p < 0.001; W2 frequency × Semantic relatedness, F(1,1011) = 17.40, p < 0.001; and the three-way interaction W1 frequency × W2 frequency × Semantic relatedness, F(1,1011) = 7.83, p = 0.005. The significant three-way interaction (β = –67.4, SE = 24.1, p = 0.005) is reported in Fig 4 and it revealed that facilitation patterns were strongest when both words were high-frequency and semantically related.
The figure illustrates the significant three-way interaction in probe detection reaction times between W1 frequency (HF = high-frequency; LF = low-frequency), W2 frequency (HF = high-frequency; LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated). The asterisk (*) indicates statistically significant differences (p < 0.05). Error bars represent the standard error.
The random intercept for subjects (Var = 4739, SD = 68.8, ICC = 0.34), indicated that about one-third of the variability in RTs was attributable to between-subject differences.
Discussion of Experiment 1 results.
The results of Experiment 1 showed that participants were more accurate in reading the parafoveal word (W2) when the foveal and the parafovea words were of high-frequency and when a semantic relatedness existed between the two words. Similarly, vocal reaction times for W1 were faster when W2 was of high-frequency and when it was semantically related to W1. These findings are consistent with both the parafoveal preview effect (PPE) and the parafoveal-on-foveal (PoF) effect previously reported in the literature (see [33,34]). Importantly, we also found that probe detection was both faster and more accurate when both words were high-frequency and semantically related.
Experiment 2 – probe above the foveal word
Six participants were excluded from the analyses due to low accuracy on W2 (three participants with W2 accuracy <30%) or an insufficient number of valid trials for probe detection (three participants with ≤10 trials per condition). Full descriptive statistics for all dependent variables across all experimental conditions are reported in S2 Appendix.
Accuracy in reading W2.
Overall accuracy on W1 was 96% (range = 86–100%) while accuracy on W2 was 63% (range = 38–93%). Model fit was adequate (conditional R² = 0.19; marginal R² = 0.09), the overdispersion was minimal (χ²/df = 0.98), and significant main effects were found for W1-F [χ²(1) = 7.45, p = 0.006], W2-F [χ²(1) = 17.79, p < 0.001], and semantic relatedness [χ²(1) = 59.95, p < 0.001]. Responses were less accurate for LF words (W1: 69% vs 63%; β = −0.25, SE = 0.09, z = −2.73, p = 0.006; W2: 70% vs 61%; β = −0.38, SE = 0.09, z = −4.22, p < 0.001). Accuracy was higher for semantically related pairs (73% vs 58%; β = 0.70, SE = 0.09, z = 7.74, p < 0.001).
There were significant two-way interactions between W1 frequency and W2 frequency [χ²(1) = 22.75, p < 0.001], and W2 frequency and semantic relatedness [χ²(1) = 64.64, p < 0.001]. The three-way interaction among W1 frequency, W2 frequency and semantic relatedness was also significant [χ²(1) = 4.71, p = 0.030].
The three-way interaction (β = 0.78, SE = 0.36, z = 2.17, p = .030; OR = 2.19) is reported in Fig 5 and it revealed an increase in W2 accuracy when there is a semantic relatedness between the couple, especially when W2 is high-frequency.
The figure illustrates the significant three-way interaction in accuracy between W1 frequency (HF = high-frequency; LF = low-frequency), W2 frequency (HF = high-frequency; LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated). The asterisk (*) indicates statistically significant differences (p < 0.05). Error bars represent the standard error.
Random effects showed low between-subject variance (σ² = 0.40; SD = 0.64; ICC = 0.11), indicating that about 11% of total variance in accuracy was attributable to subject-level differences.
Vocal reaction times on W1.
The model showed good overall fit (conditional R² = 0.54; marginal R² = 0.03), and significant main effects emerged for W1 frequency, [F(1, 1542) = 31.79, p < 0.001], W2-F [F(1, 1542) = 7.96, p = 0.005], and semantic relatedness [F(1, 1543) = 21.03, p < 0.001]. Responses were slower for LF compared to HF words for both positions [W1: 752 vs. 710 ms; β = 42.30, SE = 7.50, t(1542) = 5.64, p < 0.001; W2: 742 vs. 720 ms; β = 21.16, SE = 7.50, t(1542) = 2.82, p = 0.005). Responses were also faster for semantically related pairs compared to unrelated ones (748 vs. 714 ms; β = −34.53, SE = 7.53, t(1543) = −4.59, p < 0.001).
A significant interaction between W2 frequency and semantic relatedness was found [F(1, 1542) = 36.35, p < 0.001]. The interaction is reported in Fig 6 and it revealed that the semantic facilitation effect was larger when the second word had HF (β = 90.54, SE = 15.02, t(1542) = 6.03, p < 0.001). No other interactions were statistically significant (all ps > 0.05). Random effects indicated substantial variance across subjects (σ² = 24,109; SD = 155), with an intraclass correlation of 0.53, suggesting that about half of the total variance was attributable to between-session differences.
The figure illustrates a significant interaction in vocal reaction times on W1 between W2 frequency (HF = high-frequency; LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated). Asterisk (*) indicates statistically significant differences (p < 0.05). The bars indicate the standard error.
Accuracy in probe detection.
The model showed good convergence and explained a moderate portion of variance (marginal R² = 0.117; conditional R² = 0.176). There were significant main effects of W1 frequency (χ²(1) = 7.54, p = 0.006); W2 frequency (χ²(1) = 25.76, p < 0.001), and semantic relatedness (χ²(1) = 5.57, p = .018; OR = 1.47). Accuracy was higher for HF-W1 (HF:87% vs LF: 81%), LF-W2 (HF:78% vs LF: 89%) and semantically related couples (SR:87% vs SU: 82%). A significant interaction emerged between W1 frequency and semantic relatedness (χ²(1) = 14.11, p < 0.001). For HF-W1, semantic relatedness strongly increases accuracy (from mean = 0.8 to mean = 0.92). Conversely, for LF-W1, semantic relatedness actually reduces accuracy (from mean = 0.83 down to mean = 0.79). Also the interaction between W2 frequency and semantic relatedness was significant (χ²(1) = 13.44, p < 0.001), revealing that only for HF-W2 relatedness dramatically boosts accuracy (from mean = 0.68 to mean = 0.85).
The three-way interaction among the two frequency variables and semantic relatedness did not reach significance (χ²(1) = 2.83, p = 0.09). The random intercept variance for subjects was 0.24 (SD = 0.49), suggesting small interindividual variability. The results of the two significant interactions are reported in Fig 7.
The figure illustrates two statistically significant interactions: (A) the interaction between W1 frequency (HF = high-frequency; LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated); and (B) the interaction between W2 frequency (HF = high-frequency; LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated). The asterisk (*) indicates statistically significant differences (p < 0.05). Error bars represent the standard error.
Reaction times for probe detection.
The model showed good convergence and explained a substantial proportion of variance (conditional R² = 0.49; marginal R² = 0.05). Significant main effects were observed for semantic relatedness [F(1, 998) = 21.48, p < 0.001] with shorter reaction times for semantically related pairs than for unrelated ones (472 vs 495 ms; β = −23.47, SE = 5.06, t(998) = −4.63, p < 0.001), whereas the main effect of W1or W2 frequency were not statistically significant [ps > 0.1].
There were significant two-way interactions between W1 frequency and W2 frequency [F(1, 998) = 5.43, p = 0.020], W1 Frequency and Semantic Relatedness [F(1, 999) = 53.23, p < 0.001], and W2 Frequency and Semantic Relatedness [F(1, 998) = 14.55, p < 0.001].
Importantly, the three-way interaction between W1 frequency, W2 frequency, and semantic relatedness was also significant, F(1, 998) = 16.45, p < 0.001. The three-way interaction (β = −82.25, SE = 20.28, t(998) = −4.06, p < 0.001) revealed that the effect of semantic relatedness depended jointly on the frequencies of both words: facilitation for related pairs was strongest when both words were of high-frequency, and weakest (or reversed) when frequencies were low, as shown in Fig 8.
The figure illustrates the significant three-way interaction in probe detection reaction times between W1 frequency (HF = high-frequency; LF = low-frequency), W2 frequency (HF = high-frequency; LF = low-frequency) and semantic relatedness (SR = semantically related; SU = semantically unrelated). The asterisk (*) indicates statistically significant differences (p < 0.05). Error bars represent the standard error.
Random effects suggested moderate variability across sessions (σ² = 5633; SD = 75.1), with an intraclass correlation coefficient (ICC) of.46, indicating that approximately half of the total variance in RTs was attributable to between-session differences.
Discussion of Experiment 2 results.
Consistent with Experiment 1, accuracy in reading the parafoveal word was higher when the two words were of high-frequency and when the two words were semantically related. Likewise, vocal reaction times revealed faster naming of the foveal word when W2 was of high-frequency and semantically related to the W1. Crucially, and in line with the results of Experiment 1, both accuracy and reaction times for probe detection were enhanced when the W1 and W2 were of high-frequency and semantically related. Conversely, when the foveal word was of low-frequency, no semantic facilitation effects were observed in either accuracy or reaction times, suggesting a reduced influence of parafoveal semantic information under conditions of increased foveal processing demands.
Discussion
In this study, we investigated whether semantic processing in the parafovea can occur in parallel with foveal word processing, as well as how spatial attentional resources are distributed during brief, simultaneous word presentations.
Building on previous research using the Rapid Parallel Visual Presentation (RPVP) paradigm [33–35] we conducted two experiments in which participants read aloud two words—one centrally located and one in parafoveal vision—and subsequently performed a non-linguistic probe detection task. This design enabled us to jointly assess how lexical-semantic variables affect attentional load and spatial allocation of attention.
In terms of reading performance, we observed two significant main effects: W1 frequency and semantic relatedness. Indeed, participants were more accurate in reading the parafoveal word (W2) and faster in naming the foveal word (W1) when W1 was a high-frequency word and also when it was semantically related to W2. These two effects did not interact. These results are consistent with the predictions of parallel models of reading (e.g., [14–16]), which posit that multiple words can be processed simultaneously and that attention is distributed across a gradient extending beyond the point of fixation. Moreover, our findings align with parafoveal-on-foveal (PoF) effects [21,22,24], showing that the semantic properties of parafoveal words can influence the processing of foveal words.
Importantly, our study contributes novel evidence by coupling the RPVP paradigm with a non-linguistic probe detection task, which allowed us to directly test the attentional load and the spatial distribution of attention during reading-like tasks. In Experiment 1, the probe was positioned to the right of the parafoveal word; in Experiment 2 it appeared above the foveal word. Across both experiments, semantic facilitation in probe detection emerged reliably when W1 was high-frequency, and it was strongest when both words were high-frequency.
According to parallel models of reading (e.g., SWIFT [18]), if attention is spatially extended across the entire region occupied by the words, both in terms of spatial allocation and depth of semantic processing, then probe detection should: (i) benefit from semantic relatedness between the words, and (ii) show no difference based on the probe’s spatial position. Consistent with these predictions, we observed enhanced probe detection when W1 and W2 were of high-frequency and semantically related, regardless of probe location. This semantic facilitation in a non-linguistic task suggests that attention was allocated not only on the foveal word but also on the parafoveal word, supporting the idea of graded, overlapping attentional fields that can integrate visual and semantic information across multiple words [15,28]. These findings further reinforce claims that semantic processing is not confined to foveal vision and can occur rapidly even for words not yet fixated [16,20].
Crucially, the semantic advantage disappeared under conditions of high foveal load—when W1 was a low-frequency word—suggesting that attentional resources were more heavily taxed by the demands of foveal lexical access. This finding supports the foveal load hypothesis [26], and is consistent with serial attention models (e.g., [8,9,12,13]), which assume that parafoveal processing is constrained by the availability of attentional resources, which are primarily engaged with the fixated word. Under high load, the allocation of attention to the parafovea—and thus parafoveal semantic processing—appears to be limited. Nevertheless, the fact that semantic parafoveal processing was observed under low load conditions (see reading accuracy and vocal reaction times results) and that probe detection performance did not vary between spatial positions (Experiments 1 and 2) presents a challenge to strictly serial models. These results instead support a more flexible, dynamic allocation of attention, as proposed by the SWIFT model and other parallel frameworks [14,17], which allow for the simultaneous processing of multiple words to varying degrees depending on context and cognitive load.
Moreover, our findings contribute to ongoing controversies regarding the robustness of the foveal load effect (FLE). Although early work provided evidence for FLE [26], more recent studies have raised questions about its reliability and generalizability [30,31]. Our results confirm that foveal load can reduce semantic preview and attentional spread, yet they also demonstrate that such modulation is not all-or-none, and that parallel processing may still occur under favorable conditions, such as when the foveal word is of high-frequency.
A further possible speculation might help account for the results. Indeed, it is possible that the probe detection efficiency depends on the cognitive resources available immediately after the reading task: the presence of semantic facilitation makes more attentional resources available to detect the probe. According to this hypothesis, when the first task imposes a high cognitive load (i.e., low-frequency of W1 and no semantic relatedness) the probe should be less detectable compared to the condition of lower cognitive load (i.e., high-frequency and semantic relatedness between the two words). This effect due to overall cognitive load would be spatially independent, justifying the similar results obtained in terms of probe detection when presented on foveal or parafoveal locations.
One strength of our study lies in the use of a non-linguistic probe task as a proxy for attentional allocation. This method minimizes confounds associated with linguistic decision-making and provides a more direct behavioral measure of attentional deployment across the visual field. However, future studies should consider extending this paradigm by incorporating eye-tracking or EEG to further examine the temporal dynamics and neural correlates of parallel processing.
Finally, the present study has some limitations. Our mixed-effects models included random intercepts but not random slopes; although this modeling choice represents a limitation, the consistency of the observed effects across experiments and dependent measures supports their robustness. Furthermore, our design was limited to two-word displays with fixed spatial locations, real-world reading involves more complex syntactic and contextual constraints.
Future research should investigate whether similar effects can be observed in continuous text and whether parafoveal semantic processing generalizes to more naturalistic settings, potentially using gaze-contingent paradigms or more ecologically valid stimuli.
Supporting information
S1 Appendix. List of the stimuli used in Experiments 1 and 2.
https://doi.org/10.1371/journal.pone.0341917.s001
(DOCX)
S2 Appendix. Main descriptive statistics for all dependent variables in Experiments 1 and 2.
https://doi.org/10.1371/journal.pone.0341917.s002
(DOCX)
References
- 1. Inhoff AW, Rayner K. Parafoveal word processing during eye fixations in reading: effects of word frequency. Percept Psychophys. 1986;40(6):431–9. pmid:3808910
- 2. Rayner K. Eye movements and attention in reading, scene perception, and visual search. Q J Exp Psychol (Hove). 2009;62(8):1457–506. pmid:19449261
- 3. Schotter ER, Angele B, Rayner K. Parafoveal processing in reading. Atten Percept Psychophys. 2012;74(1):5–35.
- 4. Rayner K. Eye movements in reading and information processing: 20 years of research. Psychol Bull. 1998;124(3):372–422. pmid:9849112
- 5. Legge GE, Mansfield JS. The visual span for reading decreases in peripheral vision. Invest Ophthalmol Vis Sci. 1997;38(Suppl):S223.
- 6. Miellet S, O’Donnell PJ, Sereno SC. Parafoveal magnification: visual acuity does not modulate the perceptual span in reading. Psychol Sci. 2009;20(6):721–8. pmid:19470124
- 7. Schad DJ, Engbert R. The zoom lens of attention: simulating shuffled versus normal text reading using the SWIFT model. Vis cogn. 2012;20(4–5):391–421. pmid:22754295
- 8. Reichle ED, Rayner K, Pollatsek A. The E-Z reader model of eye-movement control in reading: comparisons to other models. Behav Brain Sci. 2003;26(4):445–76; discussion 477-526. pmid:15067951
- 9.
Reichle ED. Serial-attention models of reading. Oxford University Press; 2011.
- 10. Reichle ED, Pollatsek A, Fisher DL, Rayner K. Toward a model of eye movement control in reading. Psychol Rev. 1998;105(1):125–57. pmid:9450374
- 11. Miellet S, Sparrow L, Sereno SC. Word frequency and predictability effects in reading French: an evaluation of the E-Z Reader model. Psychon Bull Rev. 2007;14(4):762–9. pmid:17972746
- 12. White AL, Palmer J, Boynton GM. Evidence of serial processing in visual word recognition. Psychol Sci. 2018;29(7):1062–71. pmid:29733752
- 13. Brothers T. Capacity limits in sentence comprehension: evidence from dual-task judgements and event-related potentials. Cognition. 2022;225:105153. pmid:35551040
- 14. Engbert R, Longtin A, Kliegl R. A dynamical model of saccade generation in reading based on spatially distributed lexical processing. Vision Res. 2002;42(5):621–36. pmid:11853779
- 15. Snell J, van Leipsig S, Grainger J, Meeter M. OB1-reader: a model of word recognition and eye movements in text reading. Psychol Rev. 2018;125(6):969–84. pmid:30080066
- 16. Snell J, Grainger J. Readers are parallel processors. Trends Cogn Sci. 2019;23(7):537–46. https://doi.org/10.1016/j.tics.2019.04.006 pmid:31138515
- 17. Meeter M, Marzouki Y, Avramiea AE, Snell J, Grainger J. The role of attention in word recognition: results from OB1-reader. Cogn Sci. 2020;44(7):e12846. pmid:32564419
- 18. Engbert R, Nuthmann A, Richter EM, Kliegl R. SWIFT: a dynamical model of saccade generation during reading. Psychol Rev. 2005;112(4):777–813. pmid:16262468
- 19. Snell J, Mathôt S, Mirault J, Grainger J. Parallel graded attention in reading: A pupillometric study. Sci Rep. 2018;8(1):3743. pmid:29487414
- 20. Snell J, Yeaton J, Mirault J, Grainger J. Parallel word reading revealed by fixation-related brain potentials. Cortex. 2023;162:1–11. pmid:36948090
- 21. Kennedy A, Pynte J. Parafoveal-on-foveal effects in normal reading. Vision Res. 2005;45(2):153–68. pmid:15581917
- 22. Angele B, Tran R, Rayner K. Parafoveal-foveal overlap can facilitate ongoing word identification during reading: evidence from eye movements. J Exp Psychol Hum Percept Perform. 2013;39(2):526–38. pmid:22866764
- 23. Grainger J, Mathôt S, Vitu F. Tests of a model of multi-word reading: effects of parafoveal flanking letters on foveal word recognition. Acta Psychol (Amst). 2014;146:35–40. pmid:24370788
- 24. Snell J, Meeter M, Grainger J. Evidence for simultaneous syntactic processing of multiple words during reading. PLoS One. 2017;12(3):e0173720. pmid:28278305
- 25. Snell J, Declerck M, Grainger J. Parallel semantic processing in reading revisited: effects of translation equivalents in bilingual readers. Lang Cogn Neurosci. 2018;33(5):563–74. https://doi.org/10.1080/23273798.2017.1392583
- 26. Henderson JM, Ferreira F. Effects of foveal processing difficulty on the perceptual span in reading: implications for attention and eye movement control. J Exp Psychol Learn Mem Cogn. 1990;16(3):417–29. pmid:2140401
- 27. Posner MI. Orienting of attention. Q J Exp Psychol. 1980;32(1):3–25. pmid:7367577
- 28.
Engbert R, Kliegl R. Parallel graded attention models of reading. Oxford University Press; 2011.
- 29. Liu Y, Reichle ED, Li X. The effect of word frequency and parafoveal preview on saccade length during the reading of Chinese. J Exp Psychol Hum Percept Perform. 2016;42(7):1008–25. pmid:27045319
- 30. Zhang L, Kang L, Chen W, Xie F, Warrington KL. Parafoveal processing of orthography, phonology, and semantics during Chinese reading: effects of foveal load. Brain Sci. 2024;14(5):512. https://doi.org/10.3390/brainsci14050512 pmid:38790490
- 31. Findelsberger E, Hutzler F, Hawelka S. Spill the load: Mixed evidence for a foveal load effect, reliable evidence for a spillover effect in eye-movement control during reading. Atten Percept Psychophys. 2019;81(5):1442–53.
- 32. Sun Y, Li S, Zhang Y, Wang J. Parafoveal word frequency does not modulate the effect of foveal load on preview in Chinese reading: evidence from eye movements. Brain Sci. 2024;14(4):360. pmid:38672012
- 33. Rusich D, Arduino LS, Mauti M, Martelli M, Primativo S. Evidence of semantic processing in parafoveal reading: a rapid parallel visual presentation (Rpvp) study. Brain Sci. 2020;11(1):28. pmid:33383778
- 34. Primativo S, Rusich D, Martelli M, Arduino LS. The Timing of Semantic Processing in the Parafovea: Evidence from a Rapid Parallel Visual Presentation Study. Brain Sci. 2022;12(11):1535. pmid:36421859
- 35. Massol S, Mirault J, Grainger J. The contribution of semantics to the sentence superiority effect. Sci Rep. 2021;11(1):20148. pmid:34635695
- 36. Faul F, Erdfelder E, Buchner A, Lang A-G. Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses. Behav Res Methods. 2009;41(4):1149–60. pmid:19897823
- 37.
Bertinetto P, Burani C, Laudanna A, Marconi L, Ratti D, Rolando C, et al. Corpus e lessico di frequenza dell’italiano scritto (CoLFIS). 2005.
- 38. Barca L, Burani C, Arduino LS. Word naming times and psycholinguistic norms for Italian nouns. Behav Res Methods Instrum Comput. 2002;34(3):424–34. pmid:12395559
- 39.
O’Regan JK. Optimal viewing position in words and the strategy-tactics theory of eye movements in reading. 1992. 333–54.