The Effects of Prediction on the Perception for Own-Race and Other-Race Faces

Human beings do not passively perceive important social features about others such as race and age in social interactions. Instead, it is proposed that humans might continuously generate predictions about these social features based on prior similar experiences. Pre-awareness of racial information conveyed by others' faces enables individuals to act in “culturally appropriate” ways, which is useful for interpersonal relations in different ethnicity groups. However, little is known about the effects of prediction on the perception for own-race and other-race faces. Here, we addressed this issue using high temporal resolution event-related potential techniques. In total, data from 24 participants (13 women and 11 men) were analyzed. It was found that the N170 amplitudes elicited by other-race faces, but not own-race faces, were significantly smaller in the predictable condition compared to the unpredictable condition, reflecting a switch to holistic processing of other-race faces when those faces were predictable. In this respect, top-down prediction about face race might contribute to the elimination of the other-race effect (one face recognition impairment). Furthermore, smaller P300 amplitudes were observed for the predictable than for unpredictable conditions, which suggested that the prediction of race reduced the neural responses of human brains.


Introduction
In recent years, one of the topics of interest within face processing research has been the perception of own-race and other-race faces [1]. The popularity of this topic may be because of the fact that it is the key for understanding the controversial and complex phenomena such as the origin and development of racial prejudices.
Regarding electrophysiological studies on the perception of face race, a wealth of data is available. It has been shown in an old/new recognition paradigm that no effects of face race were observed on the amplitudes of P100 [2][3][4]. P100 component is assumed to reflect early visual processing and is sensitive to basic visual attributes of the stimuli such as luminance and contrast. Following P100, a negative deflection peaking between 130 and 190 ms over occipito-temporal sites is usually observed in experiments using faces as stimuli [5,6]. This so-called N170 is larger for faces compared to other stimulus categories [5]. Although some reports did not detect effects of face race on N170 amplitudes [4,7], several other studies reported more negative amplitudes for other-race relative to own-race faces [8,9]. These discrepancies are probably due to differences in experimental task and stimuli. In addition, race effects were observed in P300, a component responding to motivationally-significant events [10].
To date, none of the previous studies, however, has directly investigated the prediction effect on face race. Considered from the viewpoint of proactive brain [11,12], human beings do not passively perceive important social features about others such as race and age, but rather continuously generate a set of predictions about them. Given that top-down predictions allow our brains to quickly interpret sensory information by deducing plausible interpretations from noisy and ambiguous data [13][14][15][16], it is likely that the prediction about race exerts an influence on the subsequent recognition of face stimuli.
Considerable research has indicated that own-race faces are perceived in a configural/holistic manner while other-race faces involve mostly feature-based processing [3,17]. This might be explained by the fact that individuals were experts at distinguishing between own-race faces but relatively in-experts (novices) at distinguishing between other-race faces [17]. A recent study concluded that prediction could facilitate the perception of visual stimuli that involved mostly feature-based processing [18], suggesting that individuals might employ expertise-like skills to perceive other-race faces when those faces were predictable. It should be noted that individuals acquire expertise-like skills via a mechanism other than accruing actual experience discriminating between otherrace faces [19]. In this instance, configural processing is assumed to be increased for predictable other-race faces, which may result in accurate encoding of those faces. In the cross-cultural literature, it was commonly thought that decreased N170 amplitudes reflected the configural processing of faces [2,3]. Thus, we would predict to find that the N170 amplitudes elicited by Caucasian (other-race) faces, but not Chinese (own-race) faces were significantly smaller in the predictable condition compared to the unpredictable condition.
In addition, some other studies [20,21] suggested that pre-awareness of incoming events facilitated reallocating of cognitive resources as well as prepreparation of behavioral coping strategies. Importantly, if that is true, individuals were able to better cope with potential dangers in the environment [21], which implied that less motivated attention might be allocated to faces with a predictable onset than to faces without a predictable onset. We therefore expected to observe smaller P300 amplitudes in the predictable condition.
In the present work, we explored, for the first time to our knowledge, the effects of prediction on the perception for own-race and other-race faces. To do so, we adopted a variation of the S1-S2 paradigm where participants' perception was biased by a face [21,22]. Previous researchers (e.g., [23]) employed the S1-S2 paradigm to investigate the perceptual bias for own-race faces. This bias has been also observed by using a similar task in which an image of a target face was briefly presented and then the perceiver should later select which of two images matches the target [24,25].

Participants
Twenty-four Chinese volunteers (13 women and 11 men; mean age 5 20.44, SD51.21) took part in the experiment and reported no direct contact with members of other-racial (Caucasian) groups. In the city where the study was carried out, 95% of the population was Chinese. All participants were righthanded, with normal or corrected-to-normal vision, and none suffered from affective, psychiatric or neurological disorders. One participant was excluded due to the high level of trait anxiety.

Ethics statement
The ethics committee of Southwest University of China approved this study and the recruitment of participants. Written informed consent was obtained from all participants prior to conducting formal experiment. The experimental procedure was conducted in accordance with the Helsinki guidelines as per the World Health Organization [26].

Materials
Materials consisted of 8 house and 108 face images (4 cue faces and 104 target faces). Each face portrayed a neutral expression (mean valence: 4.59¡0.84, [9point scale]). Half of the face images were Caucasian faces, and the other half were Chinese ones. Caucasian faces (50% female) were taken from the MUG Facial Expression database [27], whereas Chinese faces (50% female) were selected from the Chinese Facial Affective Picture System database [28]. The mean attractiveness for Caucasian and Chinese faces was comparable (Caucasian: 3.54¡0.84, Chinese: 3.74¡0.53; F(1, 103) 52.17; P50.144, [9-point scale]). All images were identical in size, background, spatial frequency, contrast and brightness. The viewing angle of each image was 2.863.7˚, with a screen of 72 pixels per inch. The experiment comprised three blocks, yielding a total of 256 trials. The onset sequence of the four conditions was randomized across the trials.

Procedure
Participants were seated comfortably 90 cm from the computer screen and were instructed to try their best to avoid head movements and eye blinks. On each trial of the experiment, a black fixation cross was first shown for 500 ms, after which the cross disappeared and the screen remained empty for 500-1000 ms. Next, either a blank screen (unpredictable condition) or a cue face (predictable condition, indicating the race of the faces to be seen) was depicted for 500 ms. After 500-1000 ms, the first target face (S1) appeared for 300 ms. Following the S1, a masked stimulus (one house image) was presented for 100 ms and subsequently the screen remained empty for 500-1000 ms. At last, the second target face (S2) was presented until a response was made [29,30]. Participants were asked to judge whether the second target face (S2) had the same or a different identity to the first target face (S1). A schematic illustration of the procedure is shown in Figure 1 (A) and (B). Before the experiment, participants performed 10 practice trials to familiarize themselves with the experimental procedure. Previous studies reported that results using the null cue (a visual stimulus without informative about the valence of the stimulus) as the unpredictable condition were similar to those using the no cue (a blank white screen) [31].

Electrophysiological recording and analysis
The electroencephalography (EEG) was recorded by a BrainAmps system (Brain Products, Munchen, Germany) from 64 scalp sites according to the 10-20 system positions with a reference at FCz [2,32]. A common average reference was recalculated. The vertical electrooculogram (EOG) was recorded with electrodes placed below the right eye and the horizontal EOG was recorded from the right orbital rim. Electrode impedance was maintained below 5 kV. Both the EEG and EOG activities were amplified using a DC-100 Hz bandpass and continuously sampled at 500 Hz/channel. The mean and range of trial numbers after artifact rejection in each condition are listed in Table 1.
For ERP analysis, EEG activities for correct responses in each condition were separately averaged. On the basis of previous studies [33][34][35], the P100 component (80-140 ms) was determined over O1/O2 and PO3/PO4 electrodes, and the N170 component (130-190 ms) was determined over P5/P6, P7/P8, and PO7/PO8 electrodes. Peak amplitudes and latencies of these two components were subjected to repeated-measure ANOVAs with prediction (predictable vs. unpredictable), stimulus race (Caucasian vs. Chinese) and hemisphere (right vs. left) as within-participant factors. Furthermore, the P300 (300-500 ms) was observed and quantified as mean amplitudes at CPz, Pz, POz, CP1/CP2, P1/P2, and PO3/PO4 electrodes [36]. The time window for P300 was chosen on the basis of previous experience with experiments on face recognition [37]. Mean amplitudes of this component were entered into a 26263 ANOVA with the within-participant factors prediction, stimulus race and hemisphere (right, middle vs. left). The ERP data were analyzed off-line with BrainVision Analyzer (Brain Products). All degrees of freedom for the F-ratio were corrected according to the Greenhouse-Geisser method.
We finally carried out correlation analyses to assess the relationship between the behavioral magnitude of the facilitated (predictable) effect and the electrophysiological magnitude of the reduced neural responses. The index of the magnitude of facilitated effect was calculated by subtracting the accuracy reaction times in predictable condition to those in unpredictable condition for own-race and other-race faces of each participant. Similarly, the electrophysiological magnitude of the reduced neural responses was obtained by computing the differences of ERP amplitudes between predictable and unpredictable conditions.

Behavioral and electrophysiological results
As demonstrated in Figure 5

Discussion
In the present study, we employed a facial discrimination task where perception was biased by one cue to investigate how top-down prediction influenced the own-race and other-race face processing. Behaviorally, we reported a facilitated effect whereby participants responded faster to predictable than unpredictable faces. Crucially, the current electrophysiological results showed that the N170 amplitudes elicited by other-race faces, but not own-race faces, were found to be significantly smaller in the predictable condition compared to the unpredictable condition. Moreover, smaller amplitudes were observed for the P300 component in the predictable condition, regardless of the races of the faces. Finally, when participants viewed own-race faces, the magnitude of the facilitated effect on accuracy RTs were positively correlated with the magnitude of the reduced neural responses on N170 component.
In the current study, both own-race and other-race faces were processed faster in the predictable condition than the unpredictable condition, which indicates that prior prediction of race speeded up the subsequent perception of face stimuli. This facilitation might be due to the fact that the prediction allowed our brains to quickly deduce plausible interpretations from noisy and ambiguous data, thereby reducing the amount of possible representations of faces that need to be considered [38][39][40]. However, no significant differences between recognizing own-race and other-race faces were observed at a behavioral level. The lack of behavioral evidence of the other-race effect might be due to the limited sample size [41].
As reported by Stahl, Wiese, and Schweinberger [3], other-race faces elicited larger N170 amplitudes and longer latencies, suggesting more feature-based processing for other-race compared with own-race faces. The present data showed that the N170 amplitudes elicited by other-race faces were significantly smaller in the predictable condition compared to the unpredictable condition, a result which might reflect a switch to holistic processing of other-race faces when those faces are predictable.
According to the predictive coding framework [42], predictive encoding strategies in human brain were assumed to be parsimonious (concise). In other words, the brain did not need to devote abundant cognitive resources to maintaining the same information at different levels of the processing hierarchy when it continuously predicted the immediately relevant future. In this way, the resources needed for representing perceptual input are minimized [42,43]. Given that other-race faces were perceived in a feature-based manner that was not parsimonious processing [18,44], it could be the case that individuals employ one parsimonious processing manner (a configural/holistic processing style) to identify other-race faces when they continuously predict the races of those faces based on prior similar experiences.
Another potential explanation for the switch to holistic processing is the success in allocating more processing resources for individuating information of otherrace faces. It has previously been shown that individuals devote more resources to race-specific information of other-race faces than their individuating information, leading to an inferior recognition performance with other-race faces [45]. However, it seems plausible to suggest that more attentional resources might be allocated to individuating information of other-race faces when the races of those faces are predictable, because the race-specific information of other-race faces has been encoded before the actual occurrences of those faces [18,19]. As noted explicitly by Hugenberg and colleagues [19], when engaging in the individuating information of face stimuli, observers might employ expertise-like skills to perform perceptual processing. It is therefore possible that predictable other-race faces are perceived in a configural/holistic manner.
Since lower P300 amplitudes were associated with reduced motivated attention [46,47], the present results demonstrating smaller P300 amplitudes in the predictable condition might indicate that less resources or attention was needed to process faces with a predictable onset than to faces without a predictable onset. More precisely, prediction of race reduces the neural responses of human brains. A sharpening account of prediction [48] would suggest that top-down prediction about face race increases the gain of prediction error neurons encoding the predictable stimulus feature, leading to a quick resolution of prediction error when the input matches the prediction. This suggests that a large prediction error will not ensue, which may result in a decrease in neural activity.
In this study, both N170 and P300 were modulated by top-down prediction about face race, exhibiting smaller amplitudes for the predictable faces. However, the P100 component was not sensitive to this factor, as demonstrated by the absence of a significant main effect of prediction. This may reflect that prediction starts to influence the perception of own-race and other-race faces after those basic visual analyses reflected in the P100 have been completed.
Interestingly, there was a significant correlation between the behavioral magnitude of the facilitated effect and the N170 magnitude of the reduced neural responses when participants viewed own-race faces, suggesting that while topdown prediction reduced the neural responses of human brains, it improved the recognition efficiency at the behavioral level. This phenomenon may be described more simply as ''less is more'' [40]. However, the correlation was non significant when observers were presented with other-race faces, which was consistent with the work by Vizioli and colleagues [4]. In their study, Western Caucasian subjects, who viewed African-American and Eastern-Asian faces, showed no significant correlation between the magnitude of the face inversion effect (FIE) on the electrophysiological data (N170 amplitudes elicited by inverted minus those elicited by upright faces) and the magnitude of the FIE on the behavioral data.

Conclusions
Although a wealth of research has examined the race effects of human faces, there is no study that directly investigates the influences of top-down prediction on the own-race and other-race face processing. In the present study, both own-race and other-race faces were processed faster in the predictable condition than the unpredictable condition, indicating that prediction of race speeded up the recognition of human faces. Electrophysiological results in the N170 time range showed that the amplitudes elicited by other-race faces were significantly smaller in the predictable condition than the unpredictable condition. This result reflected a switch to holistic processing of other-race faces when those faces are predictable, which may contribute to the elimination of the other-race effect (one face recognition impairment). There were also smaller P300 amplitudes in the predictable condition, implying that prediction of race dampened brain responses to human faces.