Skip to main content
Advertisement
  • Loading metrics

The superficial layers of the primary visual cortex create a saliency map that feeds forward to the parietal cortex

  • Chen Liu ,

    Contributed equally to this work with: Chen Liu, Chengwen Liu

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft

    Affiliations State Key Laboratory of Cognitive Science and Mental Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China, College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China

  • Chengwen Liu ,

    Contributed equally to this work with: Chen Liu, Chengwen Liu

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Writing – review & editing

    Affiliation Institute of Interdisciplinary Studies, Cognition and Human Behavior Key Laboratory of Hunan Province, School of Educational Science, Hunan Normal University, Changsha, China

  • Laurentius Huber,

    Roles Methodology, Software

    Affiliation Functional MRI Core, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland, United States of America

  • Li Zhaoping,

    Roles Writing – review & editing

    Affiliation University of Tübingen, Max Planck Institute for Biological Cybernetics, Tübingen, Germany

  • Peng Zhang

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    zhangpeng@ibp.ac.cn

    Affiliations State Key Laboratory of Cognitive Science and Mental Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China, College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China

Abstract

A salient visual object with a distinct feature from the surrounding environment automatically captures attention. While the saliency signals have been found in many brain regions, their source remains highly controversial. Here, we investigated the neural origin of visual saliency using cortical layer-dependent functional magnetic resonance imaging (fMRI) of cerebral blood volume (CBV) at 7 Tesla. Behaviorally, human observers were better at detecting salient foreground bars with a larger orientation contrast from uniformly oriented background bars. Saliency-sensitive signals were strongest in the superficial layers of the primary visual cortex (V1) and in the middle layers of the intraparietal sulcus (IPS) of the parietal cortex. Layer-dependent effective connectivity revealed the transmission of saliency signals along the feedforward pathway from V1 to IPS. Furthermore, behavioral sensitivity to the foreground stimulus correlated significantly with the fMRI response in the superficial layers of V1. Our findings provide mesoscale evidence that a visual saliency map is created by iso-feature suppression through lateral inhibition in the superficial layers of V1, and then feeds forward to attentional control brain regions to guide attention and eye movements.

Introduction

Only a small fraction of sensory inputs can be selected by our attention for further processing. Attention is guided both endogenously by top–down factors and exogenously by bottom-up factors [15]. Although the bottom-up guidance is simpler, it remains controversial which brain area computes saliency from exogenous visual input. It is believed that the brain generates a saliency map from feature contrast to guide attention shifts and eye movements [69]. According to a traditional and popular computational model [7,10,11], different feature maps (e.g., color, luminance, orientation, etc.) are generated by center-surround contrast mechanisms and then combined into a feature-agnostic master map of saliency, presumably in a high-order brain region such as the parietal cortex. However, it has been shown that saliency behavior can also be explained by the V1 Saliency Hypothesis (V1SH) [8,12], according to which the saliency map is generated by intracortical mechanisms in the primary visual cortex (V1) and represented by the firing rates of feature-selective neurons.

Visual saliency-related signals have been observed in the frontal cortices [13,14], parietal cortices [2,15], V4 [16], V1 [17,18], as well as subcortical regions including the pulvinar of the thalamus [19] and the superior colliculus (SC) of the midbrain [2022]. An important question is which area generates the saliency signals originally and which areas merely receive and utilize the saliency signals generated in another area. For example, a recent study showed that inactivation of the posterior parietal cortex in monkeys significantly weakened the relationship between image salience and visually guided behavior [15], this could suggest that the parietal cortex is the original source of the saliency signals, but this is also consistent with the idea that parietal cortex utilizes the saliency signals and combine them with top–down signals for guiding behavior [2]. Using salient visual stimuli that were made subliminal by backward masking, a human neuroimaging study found responses to saliency in the earliest (C1) component of event-related potentials, and functional magnetic resonance imaging (fMRI) responses to saliency in V1 but not in the parietal cortex [18]. However, backward masking may not fully abolish top–down attention, since attention can modulates stimulus processing even without awareness [23,24] and C1 amplitude [2527]. Meanwhile, supraliminal inputs do evoke parietal responses, as demonstrated by both fMRI and neurophysiological studies [28,29]. Hence, the lack of a saliency signal in IPS could be due to the limited spatial resolution of 3T fMRI to isolate the retinotopic signal corresponding to the spatially localized salient location. Furthermore, a monkey electrophysiological study reported that the latency of the saliency signals in SC is shorter than that in V1 [22], but this short SC latency is longer than the latency of the saliency signals in V1 reported in another monkey study [17]. Therefore, neuroscience studies have found saliency-related signals in multiple brain regions, and it remains controversial as to which brain regions are, respectively, the sources and recipients of the saliency signals. In this study, we aim to shed light on the origin of saliency signals through high-resolution fMRI, which allows us to examine laminar patterns of responses to saliency signals. As will be explained later, the laminar patterns are diagnostic of the sources of the saliency signals, and we will focus on the question of whether the source for the saliency signals is in V1, IPS, or SC.

Recent advances in high-resolution fMRI at ultra-high magnetic field have enabled noninvasive imaging of mesoscale functional units, such as cortical layers and columns, in the human brain at submillimeter resolution [3032]. Using submillimeter blood oxygen level dependent (BOLD) fMRI techniques, feedforward and feedback pathways of stimulus-driven activity and top–down attentional modulation have been investigated in the human visual cortex [3336]. However, the laminar specificity of T2* weighted BOLD fMRI is limited by the draining vein effect [3740]. The Vascular Space Occupancy (VASO)-based cerebral blood volume (CBV) technique is an inversion recovery MRI technique that detects changes in blood volume by nulling the signal from blood water [41,42], particularly in small vessels such as arterioles and capillaries. Therefore, compared to T2*w BOLD signals that are sensitive to blood oxygen in large veins, CBV changes are more closely related to the site of neuronal activation, offering higher spatial specificity to resolve layer-dependent signals across cortical depth, despite having relatively lower functional sensitivity [43,44].

Using laminar fMRI techniques, one can examine the laminar patterns of responses to visual saliency in various brain regions to diagnose the source of the saliency signals. Fig 1 schematizes the laminar patterns of saliency-related signals in V1 and parietal cortex according to each of the three possible origins of the saliency signal: parietal cortex, V1, and SC, based on the canonical neural circuits in the primate visual system [4548]. If the parietal cortex generates the saliency map and modulates V1 activity through feedback connections, the laminar pattern of saliency-related signals should be consistent with cortico-cortical feedback, stronger in the deep and superficial layers of V1 and in the deep layer of the parietal cortex. If, according to the V1SH, saliency signals emerge in V1 through intracortical interactions mediated by horizontal connections in the superficial layers of V1, the strongest saliency-related activity should be observed in the superficial layers of V1 and in the middle layers of high-level cortex that receive feedforward input. Finally, if the saliency signals originate in the SC, they would modulate activity more strongly in the middle and superficial cortical layers through the tecto-thalamo-cortical pathway [19,48,49]. Specifically, this pathway may relay through the pulvinar, terminating in the middle and superficial cortical layers, or through the LGN, which primarily targets the middle layers of V1. Overall, the middle layers receive stronger thalamic input than the superficial layers [49]. Thus, although the SC was not recorded in the current study due to the limited field-of-view of VASO fMRI, the laminar profiles of saliency signals recorded in the cortical regions can still be used to test the hypothesis whether these signals originate from a subcortical source.

thumbnail
Fig 1. Three possible hypotheses on the origins of saliency signals and their consequent laminar patterns of the saliency signals through feedforward and feedback neural pathways.

In the feature combination hypothesis, saliency signal initially emerged in frontoparietal areas and then modulate neural activity of parietal and early visual cortices through feedback connections (Red lines); In V1SH, saliency signals initially emerge via intracortical interactions mediated by the horizontal connections in superficial layers of V1 (primary visual cortex). These signals can then feed forward to middle layers of parietal cortex (Green lines); In the subcortical computation hypothesis, saliency signals are generated in superior colliculus (SC) and then modulate middle and superficial layer activity of cerebral cortex through thalamo-cortical pathways. On the right, relative strengths of saliency signals in the deep (D), middle (M), and superficial (S) layers of V1 and parietal cortex that should arise from each hypothesis are shown. These indicated relative strengths are schematic representations derived from existing literature on the hierarchical connections of brain regions.

https://doi.org/10.1371/journal.pbio.3003159.g001

In the current study, to reveal the neural source of visual saliency, we employed submillimeter fMRI at 7 Tesla with a CBV-weighted VASO fMRI technique to investigate cortical depth-dependent responses to a foreground stimulus consisting of four bars presented at different orientation contrasts (θ = 90°, 15°, or 0°) relative to a background of iso-oriented bars (Fig 2). To link the fMRI responses with the bottom-up saliency of the foreground region rather than top–down task goals, the fMRI responses were collected from participants who were performing a demanding fixation task at a central fixation cross away from the foreground region. Consequently, the peripheral texture stimuli, including the foreground region, were task-irrelevant. The laminar profile of the saliency-sensitive fMRI response, its correlation with behavioral sensitivity to saliency, and the layer-dependent functional connectivity collectively support the hypothesis that the saliency map is generated in the superficial layers of V1 and subsequently projects to attention-control brain regions, such as the parietal cortex.

thumbnail
Fig 2. Stimuli and procedures for the behavioral and fMRI experiments. (A) Stimulus and procedure of the psychophysical experiment outside the MRI scanner. The orientation contrast between foreground and background bars is defined as θ. (B) Luminance contrast sensitivity () for detecting the 90° and 15° foreground. Each dot represents data from one participant. Error bars represent standard errors of the mean (SEM). *** p < 0.001. (C) Stimulus and procedure of the fMRI experiment. Texture stimuli with three θ conditions were presented in separate blocks. The foreground bars were randomly presented to the lower-left or lower-right of fixation in each block while participants performed a central fixation task. (D) Localizer activations to the retinotopic location of the foreground bars in V1 and V2 of the early visual cortex and IPS of the parietal cortex (thresholded at p < 0.05 uncorrected in a representative participant), as indicated by the dashed circles. Dashed ellipses enclose the regions of interests to the foreground region in the contralateral visual field. Dashed lines mark the borders of the brain areas. The data underlying this Figure can be found in data/main_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.g002

Results

Behavioral sensitivity was higher to a foreground region with a larger orientation contrast against the background

In a separate psychophysical experiment outside the MRI scanner, we measured participants’ behavioral sensitivity to detect foreground bars with different orientation contrasts against the iso-oriented background bars. In each trial, the texture stimulus, containing the foreground and background bars, was presented for 200 ms (Fig 2A). The foreground bars were presented either to the lower-left or the lower-right of the fixation, at an eccentricity of 4.3°, and participants pressed one of two buttons to report whether the foreground bars were at lower-left or lower right. The orientation of the uniformly oriented background bars was randomly chosen for each trial, while the foreground bars had an orientation contrast θ of either 90° or 15° relative to the background bars. Across trials, the Michaelson luminance contrast between the bars (all of which had the same luminance) and the uniform gray background was adaptively adjusted using a 3-down-1-up staircase procedure, implemented independently for the two θ conditions, to identify a threshold contrast at which the participant can perform the task at 80% accuracy. The results showed that luminance contrast sensitivity was significantly higher for a foreground with a larger orientation contrast θ (Fig 2B, 90° versus 15°, t19 = 13.829, p < 0.001). Specifically, the behavioral sensitivity to detect the 90° foreground () was approximately twice that of the sensitivity to detect the 15° foreground () (Fig 2B). It should be noted that, at our luminance contrast sensitivity to detect the foreground, the luminance of all the bars were presented at a suprathreshold level. Hence, the luminance contrast threshold measured in our task does not reflect the visibility of the entire stimulus but rather the saliency of the foreground bars in the context of the background texture. When the orientation contrast θ or saliency is higher, the foreground is more easily detected and localized, so that our participants could do our task at a lower luminance contrast of the bars. Hence, our luminance contrast sensitivity is an assessment of the saliency of the foreground bars, and we also call our behavioral saliency score. Furthermore, to correlate with the fMRI signals for saliency later, we define

Here, is the average of and .

Saliency-sensitive responses were strongest in the superficial layers of V1 and in the middle layers of IPS

In the 7T fMRI experiment, a slice-saturation slab-inversion VASO (SS-SI-VASO) sequence with 0.82-mm isotropic resolution was used to measure CBV-weighted signals across cortical depth from the posterior part of the brain [44,50]. In each inversion TR, blood-nulled and not-nulled 3D-EPI volumes were acquired to minimize T2*-weighted BOLD contamination. The imaging slab was oriented in an oblique-coronal orientation to cover the early visual cortices and the parietal cortex from the dorsal part of the occipito-parietal lobe (S1C Fig). According to the literature and hypotheses on the neural origin of visual saliency (Fig 1), our analyses focused on V1 and V2 of the early visual cortex, and the intraparietal sulcus (IPS) of the parietal cortex. As an intermediate level in the cortical hierarchy between V1 and IPS, V2 was expected to exhibit a laminar profile in-between those of the two brain regions. Thus, the laminar pattern of V2 can provide further information to distinguish the laminar hypotheses schematized in Fig 1.

The stimulus and procedure of the fMRI experiment are illustrated in Fig 2C. Bar stimuli with three orientation contrast conditions (θ = 90°, 15°, and 0°, randomly clockwise or counterclockwise to the orientation of the background bars) were presented in the lower visual field in separate 30-s blocks, interleaved with 15-s fixation period. Within a stimulus block, θ is fixed, and each stimulus was presented for 200 ms, with a random inter-stimulus interval ranging from 200 to 400 ms. In each stimulus, the iso-oriented background bars had a random orientation, and the foreground bars appeared randomly either to the lower left or lower right of the fixation. During the experiment, participants were instructed to maintain fixation and to count the number of luminance changes of the fixation cross. The fixation task was designed to be attentionally demanding by the subtlety and the large number of the luminance changes in each run. Consequently, the reported number of luminance changes was correct in only 32.0% of the runs, and deviated from the correct number by 4.2% on average. The performance indicates that the task was sufficiently challenging and yet participants remained engaged and performed well (S2 Fig). Furthermore, eye-tracking data confirmed that participants maintained stable fixation without systematically biased toward the foreground stimuli (S3 Fig).

In two localizer fMRI runs, naturalistic stimuli of the same size and location as the foreground region were presented in the lower-left and lower-right visual fields in separate blocks while participants performed a same fixation task (S4 Fig). The fMRI responses to the natural stimuli were used to identify the cortical regions of interest (ROIs) for the foreground bars, as shown in Fig 2D for a representative participant.

To quantify our results and to account for individual differences in mean response amplitude, we define the normalized CBV foreground response as . In which, is the original CBV response for subject layer at foreground orientation contrast , and . CBV-weighted fMRI responses in the foreground ROIs in V1, V2, and IPS are plotted as a function of cortical depth in Fig 3A (see S5 Fig for the BOLD response profiles and S6 Fig for VASO response profiles extended into the white matter [WM] and cerebrospinal fluid [CSF]).

thumbnail
Fig 3. Laminar fMRI responses in V1, V2 and IPS. (A) Upper: Normalized CBV responses ( in percent signal change) to the orientation foregrounds in V1, V2, and IPS. Lower: CBV response difference between the 90° and 15° foreground conditions (). Each gray dot represents data from one participant. Error bars represent SEM. *, ** and *** indicate p < 0.05, p < 0.01, and p < 0.001, respectively. D: deep, M: middle, S: superficial. (B) Volume activation maps of the saliency-sensitive response in the foreground ROIs of V1, V2, and IPS for a representative participant. Red lines indicate the boundary between gray matter (GM) and cerebrospinal fluid (CSF). Yellow lines indicate the boundary between GM and white matter (WM). (C) Left: Surface activation maps of saliency-sensitive response ( in percent signal change) in different cortical depths of V1 in a representative participant (same participant as in B). Dashed circles indicate the location of foreground on the cortical surface. Right: Saliency maps (averaged across all participants) in image space. The data underlying this Figure can be found in data/main_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.g003

A two-way repeated-measures (rm) ANOVA with cortical depth (deep, middle, and superficial) and orientation contrast (θ = 90°, 15°, and 0°) as within-subject factors was performed on the normalized foreground response . A significant effect of orientation contrast θ was found in all three brain regions (V1: F2, 38 = 7.080, p = 0.002; V2: F2, 38 = 4.763, p = 0.014; IPS: F2, 38 = 4.673, p = 0.015), indicating stronger responses to the foreground with larger orientation contrast (i.e., 90° compared to 15° and 0° conditions). However, CBV responses to the 15° foreground were slightly reduced compared to the 0° condition. This is due to the partial volume effect of background suppression (S7 Fig). Similar effects of background suppression were found for the 90° and 15° conditions. Therefore, to avoid the partial volume effect of background suppression, the saliency-sensitive fMRI response was calculated as the difference between the fMRI responses to the 90° foreground and the 15° foreground conditions (Fig 3A, lower).

The amplitude of differed significantly across cortical depth in V1, V2, and IPS (V1: F2, 38 = 9.696, p = 0.002; V2: F2, 38 = 9.833, p = 0.001; IPS: F2, 38 = 5.421, p = 0.008). A significant interaction effect was found between depth and ROI, indicating that the effect of the cortical depth on the depended on the ROIs (F4, 76 = 6.052, p = 0.002). In V1, the saliency-sensitive response was strongest in the superficial layers (S versus D: t19 = 3.408, p = 0.003; S versus M: t19 = 3.660, p = 0.002). In V2, it was stronger in the superficial and middle layers than in the deep layers (S versus D: t19 = 3.232, p = 0.004; M versus D: t19 = 4.582, p < 0.001). In IPS, was strongest in the middle layers (M versus D: t19 = 2.659, p = 0.015; M versus S: t19 = 2.983, p = 0.008). Saliency-sensitive responses calculated by the original CBV responses showed a similar pattern of results (S8 Fig). The laminar profiles of the response difference between the 90° and 0° conditions, , were also qualitatively similar (S9 Fig). To account for differences in signal-to-noise ratio (SNR) across cortical layers in the observed laminar profile, we normalized the saliency-sensitive responses by the SNR at each cortical depth. The normalized responses still exhibited similar laminar profiles (S10 Fig). In Fig 3B, the activation map in a representative participant clearly showed the strongest saliency sensitivity in the superficial layers of V1, in the middle and superficial layers of V2, and in the middle layers of IPS, consistent with the laminar profiles for a V1 origin of saliency in Fig 3A. To better illustrate the V1 saliency map across cortical depths, saliency-sensitive responses were shown on the cortical surface in the right hemisphere of the representative subject (Fig 3C, left). The saliency maps in the image space were reconstructed using a population receptive field (pRF) model, shown in the right panels of Fig 3C. Saliency signals were strongest in the superficial layer of V1 at the location of foreground bars. These findings are consistent with the V1 hypothesis in Fig 1, in which the saliency signals initially emerge by horizontal interactions in the superficial layers of V1 and subsequently feed forward to the middle layers of IPS.

Behavioral sensitivity to saliency correlated with the saliency-sensitive fMRI response in the superficial layers of V1

To investigate whether the fMRI responses can predict the behavioral sensitivities to the foreground bars, we calculated Pearson’s correlations between the behavioral sensitivity to saliency and the saliency-sensitive fMRI response . For each brain region, the cortical depth with the strongest saliency-sensitive fMRI response was selected. showed a significant correlation with in the superficial layers of V1 (Fig 4A, r = 0.542, p = 0.014), but not with those in the superficial layers of V2 (r = −0.218, p = 0.356) or the middle layers of IPS (r = −0.203, p = 0.390) (see correlations with all layers in S11 Fig). The correlation in V1 remained significant after family-wise error (FWE) correction by permutation test (Fig 4B). The correlation results supported that the saliency-sensitive fMRI response in the superficial layers of V1 can predict the behavioral sensitivity to saliency. Although the sample size (N = 20) of this study is not sufficient for an exploratory whole-brain correlation analysis with behavioral performance, as highlighted by [51], it provides enough power for hypothesis-driven analyses focused on a limited number of brain regions [18].

thumbnail
Fig 4. Behavioral-fMRI correlations and functional connectivity. (A) Behavioral sensitivity to saliency , i.e., the difference in luminance contrast sensitivity to detect the two different foregrounds, is correlated with fMRI saliency-sensitive response in the superficial layers of V1, giving a significant Pearson’s correlation coefficient r. Each circle represents one participant. (B) Pearson’s correlation coefficients r in the superficial layers of V1 and V2, and in the middle layer of IPS. The dashed line indicates the significance threshold after family-wise error (FWE) correction by a permutation test. (C) The feedforward and feedback pathways of saliency signals between V1 and IPS, assessed by the layer-specific generalized psychophysiological interaction (gPPI) analysis of functional connectivity. (D) Functional connectivity difference (90° vs. 15°) of the two pathways in C. Each gray dot represents a single participant. Error bars represent the standard errors of the mean across participants. **, *** indicates p < 0.01, p < 0.001. The data underlying this Figure can be found in data/main_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.g004

Layer-dependent effective connectivity revealed saliency transmission from V1 to IPS

A generalized psychophysiological interaction (gPPI) analysis was conducted to examine layer-dependent effective connectivity of saliency signals between V1 and IPS. For the feedforward connectivity , the superficial layer of V1 was used as the seed ROI, and the middle layer of IPS served as the target ROI. For the feedback connectivity , the seed ROI was the deep layer of IPS, and the target ROI was the deep layer of V1 (Fig 4C). The gPPI results revealed a significant connectivity difference between the 90° and 15° foreground conditions for the feedforward connection from V1 superficial to IPS middle (t19 = 4.348, p < 0.001, Holm corrected). In contrast, no significant difference was observed for the feedback connection from IPS deep to V1 deep (t19 = 2.041, p = 0.055) (Fig 4D, see S12A Fig for the original connectivity and for each θ). A paired t test showed that the feedforward connectivity difference was significantly stronger than the feedback connectivity difference (t19 = 3.173, p = 0.005). Due to the low SNR of the VASO signal, here we used the BOLD signals for the gPPI analysis. The gPPI results using VASO and BOLD signals were similar (S12B Fig), both showing a significant feedforward connection from V1 to IPS. The connectivity differences across all cortical depths are shown in S12B Fig, indicating that the middle layer of IPS received significant feedforward input from V1. The layer-dependent connectivity results further support the hypothesis that the saliency signal propagates along the feedforward pathway from V1 to IPS.

Fixation stability was not affected by the presence of foreground bars

Since participants were performing a fixation task in the fMRI experiment, the fixation stability was unlikely to be affected by the presence of the task-irrelevant foreground. To confirm that participants could maintain fixation, we monitored the eye movements of 12 participants in a control behavioral experiment using the same stimuli and fixation task as in the fMRI experiment. Fixation stability was assessed using the bivariate contour ellipse area (BCEA) of 95% fixations [52] (S3A Fig). BCEA results showed no significant difference among the three θ conditions (F2, 22 = 0.291, p = 0.750, BF10 = 0.227, moderate evidence for H0: no difference across θ conditions) (S3B Fig). The distributions of gaze positions below the center of the fixation cross also showed no significant difference among the three θ conditions (F2, 22 = 2.197, p = 0.135, BF10 = 0.736, weak evidence for H0) (S3C Fig), suggesting that participants maintained good fixation when the foreground bars were presented in the lower visual field. Therefore, our findings were unlikely to be due to unstable fixation.

Discussion

Using 7T VASO-CBV fMRI at submillimeter resolution, we investigated layer-dependent responses to salient visual input in the early visual cortex and the parietal cortex of human participants, and the functional connectivity between these brain regions to elucidate how saliency signals are propagated. Saliency-sensitive responses were strongest in the superficial layers of V1, and in the middle layers of IPS. Meanwhile, behavioral sensitivity to saliency correlated significantly with the fMRI saliency-sensitive responses in the superficial layers of V1, but not with those in the middle layers of IPS. Furthermore, layer-dependent effective connectivity revealed significant feedforward transmission of saliency signals from V1 to IPS. These findings support the hypothesis that a bottom-up saliency map is created by contextual influences mediated by horizontal connections, which are most abundant in the superficial layers of V1, and that the saliency signals are subsequently propagated to other brain regions, such as IPS and SC, to guide attention and eye movements. Finally, the saliency signal observed in the superficial layers of V1 is unlikely to originate from the colliculus–pulvinar pathway, as this would have resulted in a similar laminar pattern in IPS (Fig 1).

While the V1SH is supported by substantial behavioral evidence with features dimensions in orientation, color, motion directions, eye of origin, and their combinations [12,5357], neuroscience evidence remain limited and controversial [15,17,18,22]. With a large brain coverage and a submillimeter resolution, the laminar fMRI results in our study provide important mesoscale evidence for the cortical microcircuitry underlying the saliency computations for bottom-up attentional attraction in humans. The laminar response patterns from multiple brain regions (Fig 3A and 3B) and the layer-dependent connectivity results (Fig 4C and 4D) are consistent with a canonical feedforward pathway from V1 to IPS [45,46]. Since the intracortical horizontal connections in V1 are most abundant in the superficial layers [5860], and since contextual influences mediated by these connections are key mechanisms underlying the saliency computation in the V1SH [8], our finding that the strongest saliency-related signal is in the superficial layers of V1 strengthens the evidence for V1 as the source of the saliency signals.

While the previous studies cannot rule out the SC as a potential source for saliency computation, our findings provide evidence against this possibility. If the saliency map was generated in the SC, the saliency signals should be stronger in the middle layers than in the superficial layers of V1 (Fig 1), since the thalamic input mediated from the SC via the tecto-thalamo-cortical pathway is strongest in the middle layers of V1 [48,49]. Our laminar results (Fig 3) are not consistent with this prediction. Moreover, our findings are aligned with recent evidence that visual responses in the SC of awake primates depend critically on signals routed through the LGN and V1 [61]: deactivation of the LGN abolished SC responses to visual stimuli. It is therefore more likely that the saliency signals are generated in V1 and then projected to the superficial layers of the SC to guide attention and eye movements [19,62].

In contrast to the results reported by [18], we observed significant saliency-sensitive signals in IPS (Fig 3Aand 3B). This discrepancy may be attributed to the higher spatial resolution of our 7T fMRI data, which allowed better isolation of fMRI responses to the foreground from those to the background bars. Another possible reason is the visibility of the texture stimuli, which were made invisible by backward masking in Zhang and colleagues [18]. By making the stimuli invisible, the previous study largely eliminated conscious awareness-related feedback from a high-level cortex such as the IPS, while the saliency signals (as verified by their behavioral cueing effects) were still observed in V1. However, as a consequence, the feedforward propagation of the saliency signals from V1 to IPS cannot be illustrated due to the lack of the saliency signals in the high-level region. Overcoming the limitations of the previous study, our study represents important advances in demonstrating the neural origin and recipients of saliency signals. We reveal the V1 origin and the feedforward propagation of saliency signals from V1 to IPS, by allowing the manifestation of the saliency signals in the parietal cortex and by analyzing layer-specific response and connectivity. Furthermore, the laminar response pattern arising from horizontal contextual influences in the superficial layers of V1 provides critical support for the proposed mechanisms of saliency computation in the V1SH.

Considering the slow acquisition speed of the VASO sequence, and to minimize top–down anticipatory effects, we employed a block design in which a salient foreground region was presented in the lower-left or lower-right visual field in a random order and at random intervals within a 30-s block. In principle, with such a design, we cannot exclude the possibility that the responses to ipsilateral foreground stimuli were also in the ROI-averaged response of IPS. However, our IPS ROIs were primarily located in the posterior IPS (S13 Fig), which exhibits strong lateralization and clear retinotopic organization [63]. Therefore, the saliency-sensitive responses in the IPS should be predominantly driven by the contralateral foreground. Importantly, our conclusion depends on the laminar pattern rather than the retinotopic specificity of saliency signals in the IPS. Nevertheless, an event-related design would allow for measuring fMRI responses to individual foreground events. Achieving this will require further development of high-resolution fMRI methods with faster acquisition speed.

Although this study only investigated the neural mechanisms of saliency based on orientation contrast, previous behavioral studies suggest that V1 also computes saliency from other low-level features such as color, motion direction, eye of origin, and their combinations [12,53,55,57,64]. Thus, our findings are likely to generalize to saliency by contrast in all the basic feature dimensions. Although saliency-related signals have also been observed in higher brain regions [6567], it remains unclear whether these signals are inherited from V1 or computed within these brain regions themselves. Meanwhile, saliency signals computed by V1 should be the fastest for rapid deployment of bottom-up attentional shifts. To make the deployment rapid, the direct anatomical projection from V1 to SC should be favored compared to indirect projections from V1 to SC via higher cortical areas such as the parietal and frontal cortices. Meanwhile, when bottom-up saliency is combined with top–down factors to guide attention [2], in cooperation or in competition, the feedforward connectivity such as the ones investigated in this study, should play important roles.

In summary, mesoscale evidence at the level of cortical layers resolves a longstanding controversy regarding the neural origin of saliency signals. Our results demonstrate that visual saliency is first generated by intracortical mechanisms in the superficial layers of V1 and subsequently propagated to other attentional control regions. These findings provide direct support for V1SH, and reinforce the motivation for a new and evidence-backed vision framework [68,69], which hypothesized that the attentional bottleneck starts from V1’s output to downstream areas. These findings not only shed light on an early locus of neural and computational mechanisms of visual attentional selection, but also provide valuable guidance for the development of biologically inspired artificial visual systems.

Methods and materials

Ethics statement

Experimental protocols were approved by the Institutional Review Board of the Institute of Biophysics, Chinese Academy of Sciences (No. 2012-IBP-011). All participants gave written informed consent before the experiments.

Participants

Twenty healthy volunteers (9 females, aged 22–42 years) participated in the psychophysical and fMRI experiments. The sample size was determined based on the large effect (Cohen’s f > 0.4) of orientation contrast on V1 responses in a previous study [18], as calculated by G*Power [70]. A sample size of 20 participants was sufficient to achieve 90% statistical power for detecting a large effect in a within-subjects ANOVA. Twelve volunteers (5 females, 22–30 years of age) participated in the control movement with eye movement recordings, 6 of them also participated in the fMRI experiment. All participants had normal or corrected-to-normal vision.

Stimuli and procedures

Visual stimuli were generated in MATLAB (Mathworks) with Psychophysics Toolbox extension [71,72] on a Windows 10 operating system computer. Stimuli and procedures were illustrated in Fig 2A and 2C. Each texture stimulus (12.5 by 6.5 degrees of visual angle) consisted of 17 × 9 bars of 0.5° × 0.05° in visual angle presented in the lower visual field on a gray background, with 0.75° spacing between the bars. In the foreground present conditions, all bars were identical except for a foreground of 2 × 2 bars titled either 90° or 15° (orientation contrast θ) relative to the background bars. The foreground bars were located at 4.3° of eccentricity either to the lower-left or lower-right of the fixation. In the uniform texture condition, all bars were uniformly oriented. The orientation of background bars was randomly chosen from 0° to 180°.

Psychophysical experiment.

Visual stimuli were presented on a Cambridge Research System Display++ LCD monitor (32 inches, 1,920 × 1,080 pixels, 120-Hz refresh rate) at a viewing distance of 100 cm. Subjects’ heads were stabilized using a chin rest.

A bright fixation cross (73 cd/m2) was shown at the center of the screen. Participants were informed to keep fixation throughout the experiment. A trial began after a button press. After 1 s of fixation, the texture stimulus was presented for 200 ms, followed by another 1 s of fixation for the participant to respond (Fig 2A) before the next trial began. Participants were instructed to make 2-alternatives-forced-choice response by pressing one of two buttons to indicate whether the foreground appeared in the lower left or lower right visual field. Bar luminance (ranged from 43.8 to 78.9 cd/m2) was adjusted by a 3-down-1-up staircase procedure to determine the participant’s contrast sensitivity for detecting the foreground. The luminance contrast of the bars from the uniform gray background (background luminance = 43.8 cd/m2) was defined by the Michaelson contrast . The initial contrast of the staircase procedure was 0.2863 and changed by 15% in each step. Two blocks of trials (120 trials each) were collected for each orientation contrast condition. Each block consisted of two independent staircases (60 trials each) that were randomly intermixed across trials. Thus, a total of four 60-trial staircases were collected for each condition. A psychometric curve was fitted by a Weibull function relating the performance in each condition to the contrast of the bars. The contrast detection threshold was defined as the contrast level at 80% accuracy, and then the contrast sensitivity was defined as .

FMRI experiment.

Stimuli were presented with an MRI safe projector (1,024 × 768 pixels @ 60 Hz) on a translucent screen placed behind the head coil. Participants viewed the stimuli through a mirror mounted inside the head coil at a viewing distance of 69 cm from the screen. Visual stimuli were identical as those in the psychophysical experiment except that the luminance of the bars and the uniform background were = 87.5 cd/m2 and = 0.4 cd/m2 (max and min luminance of the projector, a high contrast texture stimulus was used to maximize the SNR of fMRI signals to the visual stimuli), respectively. The purpose of using a dark background was to enhance activations to the texture stimuli. Texture stimuli with different orientation contrast with the background bars (θ = 90°, 15°, or 0°) were presented in separated 30-s blocks, interleaved with 15-s fixation periods (Fig 2C). Each fMRI run lasting 270 s, with 6 blocks of stimuli per run (2 blocks for each θ). In each block, there were 60 texture stimuli, each lasting 200 ms followed by an interval which was randomly 200, 300, or 400 ms. The foreground bars were randomly presented in the lower-left or lower-right visual field. Subjects were instructed to maintain fixation and count the number of subtle luminance changes (about 25% dimming) in the fixation cross. The number of luminance reductions varied across runs and was reported at the end of each run, followed by feedback. A total of 9 runs were collected for each participant. The order of stimulus conditions was balanced across runs and participants.

Two localizer runs were used for each participant to localize the retinotopic ROIs for the foreground bars. Each run consisted of eight 30-s stimulus blocks, interleaved with 15-s fixation intervals (S4 Fig). In each stimulus block, colored naturalistic stimuli (same size as the foreground in the main experiment) were presented at 4 images per second (116.7 ms presentation interleaved with 133.3 ms fixation), at one of the foreground locations. Participants were instructed to maintain fixation and to count fixation changes as in the main experiment.

Eye tracking experiment.

Eye movements were recorded using an SR Research Eyelink 1000 Plus eye tracker. The centroids of the left eye’s pupil were sampled at 1,000 Hz. Visual stimuli were presented on a 27 inches LCD monitor at 1,920 × 1,080 resolution and 60 Hz refresh rate, at a viewing distance of 100 cm. Participants stabilized their heads using a chin rest. The texture stimuli and fixation task were identical to those employed in the fMRI experiment. Eye gaze positions were preprocessed in the following steps: eye blinks removal (200 ms before and after the eye blinks) with linear interpolation of removed data, linear detrend, baseline correction per stimulus block. A 95% BCEA was used to quantify the distribution of fixation samples [52].

MRI data acquisition

MRI data were collected on a 7T scanner (Siemens MAGNETOM) with a 32-channel receive single-channel transmit head coil (Nova Medical) in the Beijing MRI Center for Brain Research. The gradient coil had a maximum amplitude of 70 mT/m, 200 µs minimum gradient rise time, and 200 T/m/s maximum slew rate. Participants used a bite bar to reduce head motion. Functional data of blood nulled and not-nulled images were acquired using a SS-SI-VASO sequence with 3D-EPI readout [50,73] (0.82-mm isotropic voxels, 26 oblique-coronal slices, FOV = 133 × 177 mm2, paired-TR = 5,020 ms, TE = 25 ms, TI1 = 1,744 ms, FA = 26°, bandwidth = 1,064 Hz/pixel, Partial Fourier = 6/8 with 8 POCS iterations, GRAPPA = 3 with FLASH reference). The 3D slab of fMRI acquisition was placed in oblique-coronal orientation (S1C Fig). Images with reversed phase and read directions were acquired for susceptibility distortion correction. T1-weighted anatomical images were acquired using an MP2RAGE sequence [74] (0.7-mm isotropic voxels, FOV = 224 × 224 mm2, 256 sagittal slices, TE = 3.05 ms, TR = 4,000 ms, TI1 = 750 ms, FA = 4°, TI2 = 2,500 ms, FA = 5°, bandwidth = 240 Hz/pixel, phase and slice partial Fourier = 7/8, GRAPPA = 3).

MRI data analysis

Preprocessing of functional data.

The preprocessing of fMRI data was performed using AFNI [75], LAYNII [76] and custom Python code. The following steps were applied for blood-nulled and not-nulled volumes separately: EPI image distortion correction with blip-up/down nonlinear warping method, rigid-body correction of head motion, spatially up-sampled by a factor of 2 (3dResample in AFNI) [77]. Since the blood-nulled and not-nulled volumes were acquired alternately, the 3D timeseries were up-sampled by a factor of 2 (3dUpsample in AFNI) and shifted by TR/2 for BOLD correction. The VASO or CBV-weighted volume was calculated by dividing the nulled by not-nulled volumes (LN_BOCO in LAYNII). To minimize image blur, all spatial transformations were combined and applied to the functional images in one interpolation (3dAllineate in AFNI with sinc method). After per run scaling as percent signal change, a general linear model (GLM) with a canonical HRF (BLOCK4 in AFNI) was used to estimate the VASO and BOLD signal change from baseline. The fMRI signals in all cortical depths were fitted equally well by convolving the stimulus regressor with the canonical HRF (S14 Fig). Thus, the GLM analysis should not introduce bias across cortical depth. Head motions and low-order drifts were included as regressors of no interest in the GLM.

Surface segmentation.

The T1-w anatomical volume was aligned to the mean EPI image after motion correction, and segmented into WM, gray matter (GM), and CSF compartments using FreeSurfer with the “high-res” option [78]. FreeSurfer’s segmentation results were visually inspected and manually edited to remove dura mater and sinus, etc., ensuring accurate surface segmentation (S1A Fig). High-density surface mesh was generated by a factor of 4 to better align with the up-sampled volume grid [79]. The cortical depth profiles were constructed using the equi-volume model [80]. Two equi-volume intermediate surfaces between the WM and pial surfaces were generated using mripy (https://github.com/herrlich10/mripy), dividing the GM into three equi-volume layer compartments (S1A Fig). For each voxel, the layer weight (volume percentage among WM, CSF, and the three-layer compartments) were calculated and subsequently used in a spatial regression approach to unmix layer activity [81]. Specifically, We applied a general linear model (GLM) to estimate the response associated with each cortical layer: Y = X × B + U, where Y is an n × 1 vector containing voxel responses within the ROI (n denotes the total number of voxels), X is an n × 5 layer weight matrix representing the proportion of each voxel’s volume assigned to the five compartments (three cortical layers, WM, and CSF), and B is a 5 × 1 vector of the estimated layer-specific responses. The residual term U is an n × 1 vector capturing deviations from the least-squares fit. This unmixing approach helps to minimize correlations in activity across different cortical layers.

ROI definition.

Anatomical ROIs of the early visual areas (V1–V2) were defined on the cortical surface by a 7T retinotopic atlas of the Human Connectome Project using Neuropythy tools [82,83]. The anatomical ROI for IPS was taken from the Wang15 atlas [84]. Clusters of voxels with significant positive activation in the localizer (p < 0.05 uncorrected, cluster size ≥ 120 upsampled voxels) within the anatomical boundary of IPS were selected as the foreground ROIs (Fig 2D). Three participants (five hemispheres) did not show significant clusters at the threshold. In these cases, we slightly relaxed the cluster-defining threshold until two clusters could be identified for each hemisphere. The foreground ROIs in IPS consisted of 2–3 clusters per hemisphere (2.625 ± 0.807 on average), mainly located in the posterior IPS (S13B Fig). Accordingly, surround ROIs in V1 and V2 were acquired by subtracting the foreground ROI from vertices with significant activation to the texture stimuli (p < 0.05, uncorrected). Surface ROIs were then projected to the EPI volume to select voxels in a column-wise manner. Finally, the numbers of (up-sampled) voxels corresponding to different layers of the V1, V2, and IPS were: V1 deep: 352.6 ± 41.0 (Mean ± SEM across participants); V1 middle: 327.0 ± 39.6; V1 superficial: 294.2 ± 36.2; V2 deep: 371.6 ± 49.3; V2 middle: 341.8 ± 44.7; V2 superficial: 310.7 ± 41.4; IPS deep: 1816.7 ± 460.9; IPS middle: 1748.5 ± 449.1; IPS superficial: 1630.0 ± 435.9.

Visual field map reconstruction.

The visual field map of saliency-related signals was reconstructed for each cortical depth using the saliency-dependent (θ90θ15) CBV responses and the pRF maps from the Benson atlas [83,85]. To account for individual differences in the pRF maps, the pRF locations were linearly transformed by aligning the peak of localizer activation to the foreground location. Specifically, angle and eccentricity parameters ( and ) were obtained for the cortical vertex showing the maximum localizer activation, separately for each hemisphere. The corrected pRF parameters were defined as:

Here, the polar angle of 127.88° and eccentricity of 4.28° correspond to the screen location of foreground bars. (0°–180° within each hemisphere) and are the original pRF parameters provided by the Benson atlas [83].

The visual field map was reconstructed by multiplying the each node’s CBV response with its corrected pRF map and then summed across all nodes [86]. Finally, the reconstructed map was normalized (divided) by the maximum response across all cortical depths, and then averaged across participants.

Layer-dependent effective connectivity.

A gPPI method was used to investigate the effective connectivity across cortical depths in V1 and IPS [87]. CBV timeseries of the foreground ROI were averaged within each cortical depth of V1 and IPS, and then normalized by z-score. Due to the uncertainty about CBV HRF, the PPI term was calculated by the dot product of the stimulus regressor (boxcar function, shifted by one TR due to the HRF delay) and the seed timecourse without deconvolution [88,89]. The GLM includes the seed region time course, stimulus regressors, PPI terms of the three θ conditions, along with baseline regressors and head motion parameters. Saliency-dependent functional connectivity was defined as the difference between the PPI beta weights of the θ90 and θ15 conditions. To ensure retinotopic correspondence between V1 and IPS activity, gPPI analysis was performed within each hemisphere, and then results from the two hemispheres were averaged.

Statistical analysis

Two-way rm ANOVA with cortical depths (deep, middle, and superficial) and θ (90°, 15°, and 0°) as within-subject factors was performed on the ROI-averaged CBV responses. All data met the sphericity assumption of ANOVA. In cases where the sphericity assumption was violated, p values were adjusted using the Greenhouse-Geisser correction. One-way rm ANOVA was performed to evaluate the difference in saliency-related signals across cortical depths, followed paired t tests if there was a significant difference. No FWE correction is needed for paired t tests followed by a significant one-way ANOVA with three levels [90]. To examine whether the laminar profiles differed across cortical regions, a two-way rms ANOVA was conducted on the saliency-related signals, with cortical depth and ROI (V1, V2, IPS) as within-subject factors.

Pearson’s r was calculated to assess the correlation between the fMRI saliency sensitivity and the behavioral saliency sensitivity . Data were assessed for multivariate normality using Shapiro–Wilk test (all p > 0.05). A permutation test was performed to correct for the FWE across ROIs. In each permutation, Pearson’s r was calculated after shuffling the correspondence between fMRI signal and perceptual sensitivity across participants. The largest r value was taken and the permutation was repeated to generate a null distribution, from which the FWE was derived for the original r values.

Supporting information

S1 Fig. Cortical depth segmentation and scanning region.

(A) Normalized cortical depth map overlayed on the T1w anatomical image in a representative participant. The equi-volume depth at 0 and 1 correspond to the WM and Pial surfaces, respectively. (B) VASO activations (90° + 15° + 0°, p < 0.001 uncorrected) overlayed on the mean VASO image. Green and yellow lines indicate the WM surface, while the blue and pink lines denote the pial surface. (C) The green box indicates the 3D slab of VASO fMRI acquisition.

https://doi.org/10.1371/journal.pbio.3003159.s001

(TIF)

S2 Fig. Subjects’ performance of the fMRI saliency experiment task.

Accuracy represented the proportion of the runs that the participants gave the correct numbers of fixation changes. Deviation represented the difference between the giving numbers and the real numbers divided by the real numbers. The data underlying this Figure can be found in data/S2_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s002

(TIF)

S3 Fig. Results of eye tracking experiment.

(A) The group-averaged heat maps of gaze density in the entire session. (B) The bivariate contour ellipse area (BCEA) of fixation distribution showed no significant difference across θ conditions (F2,22 = 0.291, p = 0.750, BF10 = 0.227, moderate evidence for H0: no difference across θ conditions). Error bars represent the standard deviation of the means. (C) The percentages of the gaze positions (gaze density) below the center of the fixation point showed no significant difference across θ conditions (F2,22 = 2.197, p = 0.135, BF10 = 0.736, weak evidence for H0). The dash lines indicate the region of the gaze positions selected for analysis. The data underlying this Figure can be found in data/S3_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s003

(TIF)

S4 Fig. Stimulus and procedure of the localizer runs.

Naturalistic stimuli were presented at four images per second in the lower-left or lower-right quadrants in separate stimulus blocks, interleaved with 15-s fixation periods. The size and location of localizer stimuli matched the foreground region in the main experiment.

https://doi.org/10.1371/journal.pbio.3003159.s004

(TIF)

S5 Fig. Normalized BOLD responses in the foreground ROIs of V1, V2, and IPS.

Top panel: BOLD response in different depths of V1, V2, and IPS in 90°, 15°, and 0° orientation contrast conditions; Bottom panel: Calculated from top panel, the response difference between 90° and 15° foregrounds. Error bars represent the standard deviation of the mean. *, ** and *** indicate p < 0.05, p < 0.01, p < 0.001. The data underlying this Figure can be found in data/S5_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s005

(TIF)

S6 Fig. Laminar profile of CBV responses extended to into WM and CSF.

Top panel: CBV responses ( in percent signal change) to the orientation foregrounds in V1, V2, and IPS. Bottom panel: CBV response difference between the 90° and 15° foreground conditions (). Error bars represent SEM. WM: white matter, GM: gray matter, CSF: cerebrospinal fluid.

https://doi.org/10.1371/journal.pbio.3003159.s006

(TIF)

S7 Fig. Normalized CBV responses in the background ROIs of V1 and V2.

Upper: A significant effect of orientation contrast (θ = 90°, 15°, and 0°) was found in V1 (F2,38 = 3.700, p = 0.034) and a similar trend in V2, suggesting weaker background activity in the 90° and 15° conditions compared to the 0° or the uniform texture condition. No significant difference was found between the two θ conditions (F1,19 = 0.365, p = 0.553, BF10 = 5.601 × 10−11). Lower: The suppression effect was calculated as the response difference between the mean of 90° and 15° conditions and the 0° condition ((90° + 15°)/2 − 0°). A significant suppression effect was found only in the superficial depth of V1 (t19 = −2.877, p = 0.023, Holm corrected across cortical depths). These results suggest a suppression effect of background activity in the superficial layers of V1, independent with the orientation contrast between the foreground and the background bars. Each gray dot represents one participant. Error bars indicate SEM. * indicates p < 0.05. D, M, S indicate deep, middle, and superficial depth, respectively. The data underlying this Figure can be found in data/S7_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s007

(TIF)

S8 Fig. Unnormalized (or original) CBV responses in the foreground ROIs.

Conventions are identical as in Fig 3A. The data underlying this Figure can be found in data/S8_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s008

(TIF)

S9 Fig. Normalized CBV response difference in the foreground ROIs between 90° and 0° orientation contrast conditions.

Similar laminar profile with Fig 3A bottom panel. Error bars represent the standard error of the mean. * p < 0.05, + p < 0.1. The data underlying this Figure can be found in data/S9_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s009

(TIF)

S10 Fig. SNRs across cortical depths and the depth-dependent saliency signals normalized by SNR.

(A) The tSNR and CNR of CBV response across cortical depth in the foreground ROIs. tSNR: temporal signal-to-noise ratio, mean signal divided by the standard deviation. CNR: contrast-to-noise ratio, signal difference between stimulus and fixation periods divided by the standard deviation. (B) Saliency-sensitive responses normalized by tSNR. (C) Saliency-sensitive responses normalized by CNR. Error bars represent the standard error of the mean. *, **, and *** indicate p < 0.05, p < 0.01, and p < 0.001, respectively. The data underlying this Figure can be found in data/S10_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s010

(TIF)

S11 Fig. Pearson’s correlation between and across three cortical depths in V1, V2, and IPS.

Each circle represents one participant.

https://doi.org/10.1371/journal.pbio.3003159.s011

(TIF)

S12 Fig. Original connectivity beta values and connectivity matrix.

(A) Beta values of gPPI terms for the 90° and 15° conditions. Each gray dot represents one participant. Error bars represent the standard errors of the mean across participants. One sample t tests indicate significant positive connectivity in 90° of the feedforward pathways, 90° and 15° of the feedback pathways (feedforward 90°: t19 = 5.853, p < 0.001; feedforward 15°: t19 = 1.253, p = 0.225; feedback 90°: t19 = 4.350, p < 0.001; feedback 15°: t19 = 3.734, p = 0.001). Paired t tests indicate significant connectivity difference only in the feedforward pathway (feedforward 90° vs. 15°: t19 = 4.384, p < 0.001; feedback 90° vs. 15°: t19 = 2.041, p = 0.055). A two-way rm ANOVA shows a significant interaction between pathways (feedforward and feedback) and θ conditions (90° and 15°) (F1,19 = 10.067, p = 0.005). (B) The gPPI connectivity matrix across three cortical depths in V1 and IPS. Columns and rows correspond to the seed and target ROIs, respectively. The color scale indicates the beta difference of interaction terms (beta(90°) − beta(15°)). * p < 0.05, ** p < 0.01, *** p < 0.001, uncorrected. The data underlying this Figure can be found in data/S12_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s012

(TIF)

S13 Fig. The lateralization of localizer activations in IPS and the ROI distributions in IPS subregions.

(A) Localizer activations to the left- or right-side stimuli in the IPS of a representative participant. Dashed lines mark the borders of the IPS. (B) Group-averaged ROI volume in IPS subregions. Error bars represent SEM across participants. Each gray dot represents data from one participant. The data underlying this Figure can be found in data/S13_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

(A) Localizer activations to the left- or right-side stimuli in the IPS of a representative participant. Dashed lines mark the borders of the IPS. (B) Group-averaged ROI volume in IPS subregions. Error bars represent SEM across participants. Each gray dot represents data from one participant. The data underlying this Figure can be found in data/S13_data.xlsx at https://www.scidb.cn/doi/10.1101/2025.04.10.648136.

https://doi.org/10.1371/journal.pbio.3003159.s013

(TIF)

S14 Fig. VASO and BOLD response timecourses in deep, middle, and superficial ROIs of the foreground in V1, and the fitted response with GLM using a canonical HRF (BLOCK4 in AFNI).

Error bars represent SEM across participants.

https://doi.org/10.1371/journal.pbio.3003159.s014

(TIF)

References

  1. 1. Baluch F, Itti L. Mechanisms of top-down attention. Trends Neurosci. 2011;34(4):210–24. pmid:21439656
  2. 2. Bisley JW, Goldberg ME. Attention, intention, and priority in the parietal lobe. Annu Rev Neurosci. 2010;33:1–21. pmid:20192813
  3. 3. Buschman TJ, Kastner S. From behavior to neural dynamics: an integrated theory of attention. Neuron. 2015;88(1):127–44. pmid:26447577
  4. 4. Hopfinger JB, Buonocore MH, Mangun GR. The neural mechanisms of top-down attentional control. Nat Neurosci. 2000;3(3):284–91. pmid:10700262
  5. 5. Moore T, Zirnsak M. Neural mechanisms of selective visual attention. Annu Rev Psychol. 2017;68:47–72. pmid:28051934
  6. 6. Itti L, Koch C, Niebur E. A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Machine Intell. 1998;20(11):1254–9.
  7. 7. Koch C, Ullman S. Shifts in selective visual attention: towards the underlying neural circuitry. Hum Neurobiol. 1985;4(4):219–27. pmid:3836989
  8. 8. Li Z. A saliency map in primary visual cortex. Trends Cogn Sci. 2002;6(1):9–16. pmid:11849610
  9. 9. Treisman AM, Gelade G. A feature-integration theory of attention. Cogn Psychol. 1980;12(1):97–136. pmid:7351125
  10. 10. Itti L, Koch C. A saliency-based search mechanism for overt and covert shifts of visual attention. Vision Res. 2000;40(10–12):1489–506. pmid:10788654
  11. 11. Itti L, Koch C. Computational modelling of visual attention. Nat Rev Neurosci. 2001;2(3):194–203. pmid:11256080
  12. 12. Zhaoping L. Attention capture by eye of origin singletons even without awareness—a hallmark of a bottom-up saliency map in the primary visual cortex. J Vis. 2008;8(5):1.1-18. pmid:18842072
  13. 13. Moore T, Armstrong KM. Selective gating of visual signals by microstimulation of frontal cortex. Nature. 2003;421(6921):370–3. pmid:12540901
  14. 14. Thompson KG, Bichot NP. A visual salience map in the primate frontal eye field. Prog Brain Res. 2005;147:251–62. pmid:15581711
  15. 15. Chen X, Zirnsak M, Vega GM, Govil E, Lomber SG, Moore T. Parietal cortex regulates visual salience and salience-driven behavior. Neuron. 2020;106(1):177-187.e4. pmid:32048996
  16. 16. Mazer JA, Gallant JL. Goal-related activity in V4 during free viewing visual search. Evidence for a ventral stream visual salience map. Neuron. 2003;40(6):1241–50. pmid:14687556
  17. 17. Yan Y, Zhaoping L, Li W. Bottom-up saliency and top-down learning in the primary visual cortex of monkeys. Proc Natl Acad Sci U S A. 2018;115(41):10499–504. pmid:30254154
  18. 18. Zhang X, Zhaoping L, Zhou T, Fang F. Neural activities in v1 create a bottom-up saliency map. Neuron. 2012;73(1):183–92. pmid:22243756
  19. 19. Shipp S. The brain circuitry of attention. Trends Cogn Sci. 2004;8(5):223–30. pmid:15120681
  20. 20. Fecteau JH, Munoz DP. Salience, relevance, and firing: a priority map for target selection. Trends Cogn Sci. 2006;10(8):382–90. pmid:16843702
  21. 21. White BJ, Berg DJ, Kan JY, Marino RA, Itti L, Munoz DP. Superior colliculus neurons encode a visual saliency map during free viewing of natural dynamic video. Nat Commun. 2017;8:14263. pmid:28117340
  22. 22. White BJ, Kan JY, Levy R, Itti L, Munoz DP. Superior colliculus encodes visual saliency before the primary visual cortex. Proc Natl Acad Sci U S A. 2017;114(35):9451–6. pmid:28808026
  23. 23. Bahrami B, Lavie N, Rees G. Attentional load modulates responses of human primary visual cortex to invisible stimuli. Curr Biol. 2007;17(6):509–13. pmid:17346967
  24. 24. Kanai R, Tsuchiya N, Verstraten FAJ. The scope and limits of top-down attention in unconscious visual processing. Curr Biol. 2006;16(23):2332–6. pmid:17141615
  25. 25. Kelly SP, Gomez-Ramirez M, Foxe JJ. Spatial attention modulates initial afferent activity in human primary visual cortex. Cereb Cortex. 2008;18(11):2629–36. pmid:18321874
  26. 26. Rauss KS, Pourtois G, Vuilleumier P, Schwartz S. Attentional load modifies early activity in human primary visual cortex. Hum Brain Mapp. 2009;30(5):1723–33. pmid:18711710
  27. 27. Rauss K, Pourtois G, Vuilleumier P, Schwartz S. Effects of attentional load on early visual processing depend on stimulus timing. Hum Brain Mapp. 2012;33(1):63–74. pmid:21438076
  28. 28. Geng JJ, Mangun GR. Anterior intraparietal sulcus is sensitive to bottom-up attention driven by stimulus salience. J Cogn Neurosci. 2009;21(8):1584–601. pmid:18752405
  29. 29. Gottlieb JP, Kusunoki M, Goldberg ME. The representation of visual salience in monkey parietal cortex. Nature. 1998;391(6666):481–4. pmid:9461214
  30. 30. Cheng K. Exploration of human visual cortex using high spatial resolution functional magnetic resonance imaging. Neuroimage. 2018;164:4–9. pmid:27845253
  31. 31. Huber L, Finn ES, Handwerker DA, Bönstrup M, Glen DR, Kashyap S, et al. Sub-millimeter fMRI reveals multiple topographical digit representations that form action maps in human motor cortex. Neuroimage. 2020;208:116463. pmid:31862526
  32. 32. Vizioli L, Huber L, Yacoub E. Laminar and columnar imaging at UHF: considerations for mesoscopic-scale imaging with fMRI. Advances in Magnetic Resonance Technology and Applications. Elsevier. 2023. p. 387–405.
  33. 33. de Hollander G, van der Zwaag W, Qian C, Zhang P, Knapen T. Ultra-high field fMRI reveals origins of feedforward and feedback activity within laminae of human ocular dominance columns. Neuroimage. 2021;228:117683. pmid:33385565
  34. 34. Lawrence SJ, Norris DG, de Lange FP. Dissociable laminar profiles of concurrent bottom-up and top-down modulation in the human visual cortex. Elife. 2019;8:e44422. pmid:31063127
  35. 35. Liu C, Guo F, Qian C, Zhang Z, Sun K, Wang DJ, et al. Layer-dependent multiplicative effects of spatial attention on contrast responses in human early visual cortex. Prog Neurobiol. 2021;207:101897. pmid:32818495
  36. 36. van Mourik T, Koopmans PJ, Bains LJ, Norris DG, Jehee JFM. Investigation of layer-specific BOLD signal in the human visual cortex during visual attention. Aperture Neuro. 2023;3.
  37. 37. Fukuda M, Poplawsky AJ, Kim S-G. Time-dependent spatial specificity of high-resolution fMRI: insights into mesoscopic neurovascular coupling. Philos Trans R Soc Lond B Biol Sci. 2021;376(1815):20190623. pmid:33190606
  38. 38. Huber L, Uludağ K, Möller HE. Non-BOLD contrast for laminar fMRI in humans: CBF, CBV, and CMRO2. Neuroimage. 2019;197:742–60. pmid:28736310
  39. 39. Kay K, Jamison KW, Vizioli L, Zhang R, Margalit E, Ugurbil K. A critical assessment of data quality and venous effects in sub-millimeter fMRI. Neuroimage. 2019;189:847–69. pmid:30731246
  40. 40. Uludag K, Havlicek M. Determining laminar neuronal activity from BOLD fMRI using a generative model. Prog Neurobiol. 2021;207:102055. pmid:33930519
  41. 41. Lu H, Golay X, Pekar JJ, Van Zijl PCM. Functional magnetic resonance imaging based on changes in vascular space occupancy. Magn Reson Med. 2003;50(2):263–74. pmid:12876702
  42. 42. Lu H, Patel S, Luo F, Li S-J, Hillard CJ, Ward BD, et al. Spatial correlations of laminar BOLD and CBV responses to rat whisker stimulation with neuronal activity localized by Fos expression. Magn Reson Med. 2004;52(5):1060–8. pmid:15508149
  43. 43. Finn ES, Huber L, Jangraw DC, Molfese PJ, Bandettini PA. Layer-dependent activity in human prefrontal cortex during working memory. Nat Neurosci. 2019;22(10):1687–95. pmid:31551596
  44. 44. Huber L, Handwerker DA, Jangraw DC, Chen G, Hall A, Stüber C, et al. High-Resolution CBV-fMRI allows mapping of laminar activity and connectivity of cortical input and output in human M1. Neuron. 2017;96(6):1253-1263.e7. pmid:29224727
  45. 45. Felleman DJ, Van Essen DC. Distributed hierarchical processing in the primate cerebral cortex. Cerebral Cortex. 1991;1(1):1–47.
  46. 46. Markov NT, Vezoli J, Chameau P, Falchier A, Quilodran R, Huissoud C, et al. Anatomy of hierarchy: feedforward and feedback pathways in macaque visual cortex. J Comp Neurol. 2014;522(1):225–59. pmid:23983048
  47. 47. Rockland KS, Pandya DN. Laminar origins and terminations of cortical connections of the occipital lobe in the rhesus monkey. Brain Res. 1979;179(1):3–20. pmid:116716
  48. 48. Shipp S. Structure and function of the cerebral cortex. Curr Biol. 2007;17(12):R443-9. pmid:17580069
  49. 49. Shipp S. The functional logic of cortico-pulvinar connections. Philos Trans R Soc Lond B Biol Sci. 2003;358(1438):1605–24. pmid:14561322
  50. 50. Huber L, Ivanov D, Krieger SN, Streicher MN, Mildner T, Poser BA, et al. Slab-selective, BOLD-corrected VASO at 7 Tesla provides measures of cerebral blood volume reactivity with high signal-to-noise ratio. Magn Reson Med. 2014;72(1):137–48. pmid:23963641
  51. 51. Marek S, Tervo-Clemmens B, Calabro FJ, Montez DF, Kay BP, Hatoum AS, et al. Reproducible brain-wide association studies require thousands of individuals. Nature. 2022;603(7902):654–60. pmid:35296861
  52. 52. Steinman RM. Effect of target size, luminance, and color on monocular fixation*. J Opt Soc Am. 1965;55(9):1158.
  53. 53. Koene AR, Zhaoping L. Feature-specific interactions in salience from combined feature contrasts: evidence for a bottom-up saliency map in V1. J Vis. 2007;7(7):6.1-14. pmid:17685802
  54. 54. Zhaoping L. Gaze capture by eye-of-origin singletons: interdependence with awareness. J Vis. 2012;12(2):17. pmid:22344346
  55. 55. Zhaoping L, May KA. Psychophysical tests of the hypothesis of a bottom-up saliency map in primary visual cortex. PLoS Comput Biol. 2007;3(4):e62. pmid:17411335
  56. 56. Zhaoping L, Zhe L. Properties of V1 neurons tuned to conjunctions of visual features: application of the V1 saliency hypothesis to visual search behavior. PLoS One. 2012;7(6):e36223. pmid:22719829
  57. 57. Zhaoping L, Zhe L. Primary visual cortex as a saliency map: a parameter-free prediction and its test by behavioral data. PLoS Comput Biol. 2015;11(10):e1004375. pmid:26441341
  58. 58. Gilbert CD, Wiesel TN. Clustered intrinsic connections in cat visual cortex. J Neurosci. 1983;3(5):1116–33. pmid:6188819
  59. 59. Rockland KS, Lund JS. Intrinsic laminar lattice connections in primate visual cortex. J Comp Neurol. 1983;216(3):303–18. pmid:6306066
  60. 60. Angelucci A, Bijanzadeh M, Nurminen L, Federer F, Merlin S, Bressloff PC. Circuits and mechanisms for surround modulation in visual cortex. Annu Rev Neurosci. 2017;40:425–51. pmid:28471714
  61. 61. Katz LN, Yu G, Krauzlis RJ. Visual activity in primate superior colliculus depends on visual cortex. Journal of Vision. 2024;24(10):744.
  62. 62. May PJ. The mammalian superior colliculus: laminar structure and connections. Prog Brain Res. 2006;151:321–78. pmid:16221594
  63. 63. Swisher JD, Halko MA, Merabet LB, McMains SA, Somers DC. Visual topography of human intraparietal sulcus. J Neurosci. 2007;27(20):5326–37. pmid:17507555
  64. 64. Zhaoping L, Guyader N, Lewis A. Relative contributions of 2D and 3D cues in a texture segmentation task, implications for the roles of striate and extrastriate cortex in attentional selection. J Vis. 2009;9(11):20.1-22. pmid:20053083
  65. 65. Bogler C, Bode S, Haynes J-D. Decoding successive computational stages of saliency processing. Curr Biol. 2011;21(19):1667–71. pmid:21962709
  66. 66. Poltoratski S, Ling S, McCormack D, Tong F. Characterizing the effects of feature salience and top-down attention in the early visual system. J Neurophysiol. 2017;118(1):564–73. pmid:28381491
  67. 67. Thayer DD, Sprague TC. Feature-specific salience maps in human cortex. J Neurosci. 2023;43(50):8785–800. pmid:37907257
  68. 68. Zhaoping L. A new framework for understanding vision from the perspective of the primary visual cortex. Curr Opin Neurobiol. 2019;58:1–10. pmid:31271931
  69. 69. Zhaoping L. Testing the top-down feedback in the central visual field using the reversed depth illusion. iScience. 2025;28(4):112223. pmid:40241755
  70. 70. Faul F, Erdfelder E, Buchner A, Lang A-G. Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses. Behav Res Methods. 2009;41(4):1149–60. pmid:19897823
  71. 71. Brainard DH. The psychophysics toolbox. Spat Vis. 1997;10(4):433–6. pmid:9176952
  72. 72. Pelli DG. The VideoToolbox software for visual psychophysics: transforming numbers into movies. Spatial Vis. 1997;10(4):437–42.
  73. 73. Poser BA, Koopmans PJ, Witzel T, Wald LL, Barth M. Three dimensional echo-planar imaging at 7 Tesla. Neuroimage. 2010;51(1):261–6. pmid:20139009
  74. 74. Marques JP, Kober T, Krueger G, van der Zwaag W, Van de Moortele P-F, Gruetter R. MP2RAGE, a self bias-field corrected sequence for improved segmentation and T1-mapping at high field. Neuroimage. 2010;49(2):1271–81. pmid:19819338
  75. 75. Cox RW. AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. Comput Biomed Res. 1996;29(3):162–73. pmid:8812068
  76. 76. Huber LR, Poser BA, Bandettini PA, Arora K, Wagstyl K, Cho S, et al. LayNii: a software suite for layer-fMRI. Neuroimage. 2021;237:118091. pmid:33991698
  77. 77. Wang J, Nasr S, Roe AW, Polimeni JR. Critical factors in achieving fine-scale functional MRI: removing sources of inadvertent spatial smoothing. Hum Brain Mapp. 2022;43(11):3311–31. pmid:35417073
  78. 78. Fischl B. FreeSurfer. Neuroimage. 2012;62(2):774–81. pmid:22248573
  79. 79. Polimeni JR, Renvall V, Zaretskaya N, Fischl B. Analysis strategies for high-resolution UHF-fMRI data. Neuroimage. 2018;168:296–320. pmid:28461062
  80. 80. Waehnert MD, Dinse J, Weiss M, Streicher MN, Waehnert P, Geyer S, et al. Anatomically motivated modeling of cortical laminae. Neuroimage. 2014;93 Pt 2:210–20. pmid:23603284
  81. 81. Kok P, Bains LJ, van Mourik T, Norris DG, de Lange FP. Selective activation of the deep layers of the human primary visual cortex by top-down feedback. Curr Biol. 2016;26(3):371–6. pmid:26832438
  82. 82. Benson NC, Butt OH, Brainard DH, Aguirre GK. Correction of distortion in flattened representations of the cortical surface allows prediction of V1-V3 functional organization from anatomy. PLoS Comput Biol. 2014;10(3):e1003538. pmid:24676149
  83. 83. Benson NC, Jamison KW, Arcaro MJ, Vu AT, Glasser MF, Coalson TS, et al. The Human Connectome Project 7 Tesla retinotopy dataset: description and population receptive field analysis. J Vis. 2018;18(13):23. pmid:30593068
  84. 84. Wang L, Mruczek REB, Arcaro MJ, Kastner S. Probabilistic maps of visual topography in human cortex. Cereb Cortex. 2015;25(10):3911–31. pmid:25452571
  85. 85. Dumoulin SO, Wandell BA. Population receptive field estimates in human visual cortex. Neuroimage. 2008;39(2):647–60. pmid:17977024
  86. 86. Mo C, He D, Fang F. Attention priority map of face images in human early visual cortex. J Neurosci. 2018;38(1):149–57. pmid:29133433
  87. 87. McLaren DG, Ries ML, Xu G, Johnson SC. A generalized form of context-dependent psychophysiological interactions (gPPI): a comparison to standard approaches. Neuroimage. 2012;61(4):1277–86. pmid:22484411
  88. 88. O’Reilly JX, Woolrich MW, Behrens TEJ, Smith SM, Johansen-Berg H. Tools of the trade: psychophysiological interactions and functional connectivity. Soc Cogn Affect Neurosci. 2012;7(5):604–9. pmid:22569188
  89. 89. Sharoh D, van Mourik T, Bains LJ, Segaert K, Weber K, Hagoort P, et al. Laminar specific fMRI reveals directed interactions in distributed networks during language processing. Proc Natl Acad Sci U S A. 2019;116(42):21185–90. pmid:31570628
  90. 90. Levin JR, Serlin RC, Seaman MA. A controlled, powerful multiple-comparison strategy for several situations. Psychol Bull. 1994;115(1):153–9.