The linear discriminant analysis (LDA) method is a classical and commonly utilized technique for dimensionality reduction and classification in brain-computer interface (BCI) systems. Being a first-order discriminator, LDA is usually preceded by the feature extraction of electroencephalogram (EEG) signals, as multi-density EEG data are of second order. In this study, an analytic bilinear classification method which inherits and extends LDA is proposed. This method considers 2-dimentional EEG signals as the feature input and performs classification using the optimized complex-valued bilinear projections. Without being transformed into frequency domain, the complex-valued bilinear projections essentially spatially and temporally modulate the phases and magnitudes of slow event-related potentials (ERPs) elicited by distinct brain states in the sense that they become more separable. The results show that the proposed method has demonstrated its discriminating capability in the development of a rapid image triage (RIT) system, which is a challenging variant of BCIs due to the fast presentation speed and consequently overlapping of ERPs.
Citation: Yu K, AI-Nashash H, Thakor N, Li X (2014) The Analytic Bilinear Discrimination of Single-Trial EEG Signals in Rapid Image Triage. PLoS ONE 9(6): e100097. https://doi.org/10.1371/journal.pone.0100097
Editor: Pedro A. Valdes-Sosa, Cuban Neuroscience Center, Cuba
Received: December 19, 2013; Accepted: May 20, 2014; Published: June 16, 2014
Copyright: © 2014 Yu et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The authors have no support or funding to report.
Competing interests: The authors have declared that no competing interests exist.
A rapid development of brain-computer interface (BCI) related techniques has been seen in the past years. BCI system utilizing electroencephalogram (EEG) provides a shortcut of communication channel between the human brain and an external device, without conventional human’s physical response. Therefore, BCIs could be the goodwill for physically disabled patients as the promising neuroprosthetics solutions –. In addition, due to the advances in computation and communication, BCIs enable the new concept of gaming  and augment people’s performance in some applications, one of which is prioritizing images from an image pool. Fast search of target images (objects) in large-volume imagery, e.g. aerial imagery, has come to a bottleneck. That is, limit number of skillful image analysts cannot handle the increasing volume of imagery in the conventional way. Recently, the rapid image triage (RIT) technique which leverages human vision, split-second judgement capability and machine learning for EEG signal processing, has proven to be a promising solution by researchers –. It can be applied in various applications such as satellite image analysis and image retrieval task.
In one type of RIT, a large-scale imagery is chopped into a number of images of smaller sizes. These images are then presented to an image analyst in a sequential order at a fast speed, which is called rapid serial visual presentation (RSVP) paradigm , , . During the RIT, amongst the images, some contain objects that are perceived as target objects, and hence are required to be identified for further detailed analysis . In contrast, other images being irrelevant to the searching task are considered as nontargets to be disregarded. The occurrence of targets is so rare and infrequent that the searching process will induce the oddball effect , . That is, the unique event-related potentials (ERPs) measurable on the scalp will be elicited by a recognized target image. Among these unique target ERPs, the P300 which is a prominent positive voltage deflection peaking around 300 ms after the onset of the target is the major component differing from nontarget ERPs. Hence, the backbone of RIT system is to detect, identify and dissociate these two types of ERPs.
Since ERPs are usually overwhelmed by noise such as background EEG, it is necessary to resort to various signal processing methods to improve the signal-to-noise ratio (SNR). Some of the methods are spatial decomposition based, such as principal component analysis (PCA) , independent component analysis (ICA)  and common spatial pattern analysis (CSP) . Compared to PCA and ICA which extract uncorrelated/independent components, CSP is naturally more suitable for the binary classification task, as it extremizes the ratio of temporal variance of one condition over the other condition. It has been successfully applied in abundant BCI applications, including the motor imagery , , vowel speech imagery  and RIT . In order to compensate the deficiency due to the temporal invariance, a number of CSP variants have been introduced. There are CSP variants that make use of spectral filters in conjunction with spatial filters, such as common spatio-spectral pattern (CSSP) , the common sparse spectral spatial pattern (CSSSP)  and spectrally weighted common spatial patterns (SPEC-CSP) . There are also methods looking for spatio-temporal projections instead of pure spatial filters , , . Being different from spectral filtering and temporal filtering, the recently proposed CSP variant, namely analytic common spatial patterns (ACSP), emphasizes on the modulation of the phases of EEG signals , . It is accomplished by performing the spatial filtering in the complex-valued space, where the phase information is still preserved.
The phase of EEG signals is of rich unexplored implications. For instance, it has been suggested that the trial-by-trial variability of performance could be partially attributed to the fluctuation of the phase of ongoing oscillations, and the measurement of EEG phase might be useful for the prediction of perceptual and attentional variability . The pre-stimulus EEG phase was claimed to affect the magnitude of the following auditory ERPs . Moreover, phase is also an important property in steady-state visual evoked potentials (SSVEPs) . In the context of RIT, the linkage between ERPs and phase is not apparent. However, the modulation of phase can change the morphology of ERPs in terms of magnitude and latency. And the RSVP paradigm resembles the setting of SSVEP as images are shown at specific frequency. However, it is noteworthy that spatial phase modulation proposed by ACSP, which has shown superior in classification problems such as oscillatory EEG  and SSVEP , may not be optimal in the scenario of RIT. In RIT, the target ERPs are slow potentials and can be overlapping with a number of nontarget ERPs due to the presentation speed. Therefore, a spatio-temporal phase modulation can be more useful in RIT.
In this paper, a method namely analytic bilinear discriminant analysis (ABDA) is proposed to address the spatio-temporal modulation and classification in RIT. The proposed ABDA method belongs to the linear discriminant analysis (LDA) variants, which is a traditional dimension reduction and classification method for low-dimensional feature vector. It finds the projection that maximizes the ratio of between-scatter matrix over within-scatter matrix. However, LDA becomes insufficient for handling 2D images as well as high-density EEG signals . Therefore, 2D-LDA was proposed to be adapted for 2D matrix, which derives a set of orthonormal projections . On the other hand, bilinear discriminant analysis (BDA)  and 2-dimensional linear discriminant analysis (2DLDA)  extend LDA by iteratively optimizing bilinear projections instead of one LDA projection. There are also trilinear (or even higher dimensional) methods, such as parallel factor analysis (PARAFAC)  and general tensor discriminant analysis (GTDA) . These methods usually require the transformation of EEG signals into the frequency domain. Moreover, recent advances attempt to address problems like the limited sample size and doubtful distribution assumption. For instance, a variant of LDA namely enhanced Bayesian LDA (EBLDA), enlarges the sample size by incorporating unlabeled data with high probability into labeled data to refine the classification . In addition, Z-LDA adaptively adjusts decision boundary to accommodate the heteroscedastic signal distribution through z-score . Compared to other methods, the uniqueness of the proposed ABDA method mainly resides in: 1) exploiting the complex-valued bilinear projections, i.e. spatial and temporal projections in a complex-valued space-time domain; 2) The phases of slow ERPs are both spatially and temporally modulated, which is useful in the context of discriminating ERPs that are overlapping; 3) the spatio-temporal phase modulation demonstrated by ABDA accounts for the spatio-temporal propagation of slow ERPs. The method evaluation is conducted in the context of RIT experiments by comparing it with several competitive methods.
2.1 Analytic Presentation
For a real-valued signal, the corresponding analytic signal is presented as(1)where the imaginary part, , is the Hilbert transform of . The Hilbert transform of can be given by(2)where stands for the Cauchy principal value. The effect of Hilbert transform is to shift the phases of both negative frequency components and positive frequency components of a signal, but in different directions, i.e. and, respectively. In addition, introduces another phase shift of to. The ultimate effect is that, the negative frequency components of the analytic signal are shifted above 0 Hz. In other words, contains only positive frequency components. It is worth noting that the phases of positive frequency components of are the equivalent to the counterparts of .
2.2 Objective Function
Given the band-pass filtered EEG epochs and (channeltime) under two conditions (“1” for target condition and “2” for nontarget condition), the corresponding analytic presentations are denoted as and , respectively. There are two bilinear projections, i.e. complex-valued spatial projection and temporal projection . The within-class scatter and between-class scatter of analytic EEG epochs after temporal projecting using can be given as(3)where and stands for the epoch and the mean matrix under condition , respectively. And represents conjugate transpose operator.
It has been shown that there is no analytical solution to a real-valued biquadratic equation , , which is a similar case for a complex-valued biquadratic equation like (4). However, there exists a sub-optimal solution, using the iterative learning.
Suppose that is already given, e.g. is initialized to be an identity matrix in this work. Then and become also known and (4) can be solved by calculating the derivation by the complex-valued . Specifically, (4) can be rewritten as(5)In order to maximize , and shall be set to zero. Since and are complex conjugate transpose of each other, only one of them, e.g. , needs to be calculated:(6)Let be zero, (6) can be further simplified to
On the other hand, the within-class scatter and between-class scatter of analytic EEG epochs after spatial projecting using can be expressed as
By inserting (9) into (10) and inserting (3) into (4), it can be shown that (10) and (4) are actually equivalent. Therefore, after is obtained according to (8), (10) can be used to derive similarly by letting be zero, which is
It is noteworthy that, as , and are all complex-valued, the projected data, i.e. , is also a complex value, which however cannot be directly used to get decision boundary. Here, is decomposed to a 2-element vector , the elements of which are the real part and imaginary part of . The classical LDA method is used to find a projection that separates from .
Unlike the LDA which handles the -dimension to one-dimension discrimination issue , the proposed ABDA tackles the -dimensional classification problem. The classifier makes use of bilinear projections, LDA projection and the bias, as indicated in(12)where denotes the transpose operator. Moreover, and represent the real part and imaginary part of , respectively.
Generally, the bias shall be chosen such that the posteriors in the projected dimension of two conditions will be equal . Appropriate selection of could be important if the sample size of one condition is significantly different from that of the other, which is the unbalanced classification problem.
RIT experiments were conducted under the approval by the National University of Singapore Institutional Review Board (NUS-IRB). After providing written consent forms for participating in the experiment, 22 healthy participants, all right-handed, with normal or corrected-to-normal sight, completed the RIT experiments.
3.1 Experiment Design
The experiment included one training session and one testing session. In each session, aerial images were sequentially presented to a participant, following the standard RSVP paradigm , ,  (see Figure 1). Each image lasted for 150 ms on the centre of the screen and then was replaced by the next one. There was a temporary break between every 50 images. The duration of the break was self-controlled by the participant but was caped to 10 seconds. These aerial images were of 400×400 pixels. A small amount of them (approximately 72) containing objects of interest were defined as targets, while others (over 4400) were considered as nontargets. The participant was informed that he/she should neglect nontargets but was obliged to immediately respond to the appearing targets by pressing a button.
3.2 Acquisition and Preprocessing
For every participant, 62-channel EEG signals were collected at 250 Hz, using an ANT amplifier (ANT B.B., Enschede, Netherlands). EEG signals were referenced to linked ears and grounded to the forehead. The 4th order Butterworth filter was adopted, with the pass-band from 1 Hz to 25 Hz. The filtered signals were segmented into epochs, the time window of which starts from the onset of each image to 500 ms after the onset.
In RIT experiments, sometimes there could be a few bad channels (malfunctioning channels) and these bad channels might deteriorate the performance. Hence, ahead of training the classifier using data collected in training session, bad channels were automatically identified, which would be excluded from both training data and testing data. This was accomplished by monitoring each channel across all epochs in training session. For instance, if the absolute difference between the maximum value and the minimum value (or the mean value) is significantly large for a particular channel over 30% of total epochs, this channel would be labeled as the bad channel.
It is worth noting that the removal of identified bad channels was based on training data. However, there could be more bad channels in testing session. Thus, an additional measure was introduced. That is, every epoch was examined whether there were any suspected abnormal channels. These suspected channels would be replaced by spherical spline interpolation using neighboring functioning channels .
With complex-valued bilinear projections, ABDA is assumed to be able to modulate the phases and the magnitudes of signals in the manner that target ERPs and nontarget ERPs become more differentiable in the context of RIT. This assumption was verified by comparing the proposed ABDA with CSP, ACSP and BDA which omits the phase modulation. All methods were applied to the EEG data collected from 22 participants in RIT experiments on a single-trial basis. The classifiers were derived using target epochs and nontarget epochs of training data, and the single-trial classification results for comparison were obtained from testing data. It is noteworthy that 4 features were extracted by CSP/ACSP using the most discriminative filters that had been derived and these features were fed to the conventional FLD classifier. The performance measure adopted was the balanced accuracy (BA) , which accommodates the unbalanced sample sizes between targets and nontargets. BA is defined as.(13)
Results and Discussion
The classification results are shown in Table 1. It can be seen that ABDA outperformed other methods for 18 out of 22 participants. The average BA achieved by ABDA was close to 90% and was 5.9% higher than CSP, 3.9% higher than ACSP and 2.5% higher than BDA, respectively. The better performance of ABDA over BDA was statistically significant in paired t-test, with p-value<0.05, which however is not significant (p-value = 0.017) in the t-test with Bonferroni correction which is conservative. Moreover, ABDA significantly surpassed CSP and ACSP, respectively, with p-value<0.001 in both paired t-test and t-test with Bonferroni correction. On the other hand, though BDA significantly outperformed CSP (p-value<0.01), its advantage over ACSP was insignificant (p-value>0.15) in t-test with Bonferroni correction.
According to (4) and (10), mi and are the desirable projections that maximize the objective function. Although the derivations of and were complex-valued calculation, the ratios obtained during every iteration were real values (see Figure 2), as and ( and ) were semi-definite matrices. In Figure 2, it can be seen that for all the 22 participants, initially increased and would quickly converge to a constant value after several iteration steps. This indicates that the iterative learning was useful, and there always existed a pair of complex-valued bilinear projections which fulfilled the objective function. And most importantly, these projections could be consistently and reliably obtained for all participants.
The obtained ABDA spatial and temporal projections contained real parts and imaginary parts. According to Euler’s formula, a complex value has a corresponding complex exponential function consisting of two variables, i.e. magnitude and phase. In particular, the normalized spatial projection for P22 was plotted and compared to the counterpart, i.e. the BDA spatial projection, in Figure 3. It can be seen that the phases of BDA spatial projection were binary. That is, they could only be either 0° or 180°, indicating positive or negative sign, respectively. In contrast, there was more flexibility in the ABDA spatial projection. As manifested in Figure 3, the ABDA phases ranged from −180° to 180°. This freedom allowed a delicate modulation of the phases of temporal signals in each channel. The phase modulation is very useful, as it can change the morphology of signals, including amplitude and latency, to improve classification. For instance, the P300, a major signature in this RIT task, propagates from frontal to parietal areas on the scalp . Such latency differences among these spatially distributed EEG channels could be estimated and utilized for denosing . It can also be applied to other EEG signals such as steady-state visual evoked potentials (SSVEPs) . Modulating phases of every individual channel led to the increase in the number of channels that were important for discrimination. As can be seen in Figure 3, there were much more channels of high magnitude (high weight) for ABDA in comparison to BDA. In addition, these critical channels were broadly spread such as midline, which is in accordance with the fact that P300 is measurable widely on the scalp.
By Euler’s identity, a complex value could be interpreted as a combination of a magnitude component and a phase component.
The modulation offered by ABDA has a noticeable consequence which can be observed in Figure 4. After being spatial projected, both real part and imaginary part of target ERPs showed a more prominent P300 component, as compared to that in the scenario of BDA. For nontarget ERPs, the real part and imaginary part were weaker than BDA, although the difference seemed to be less significant as those in Figure 4A. Therefore, the enlarged difference between target ERPs and nontarget ERPs in Figure 4 might imply that target condition and nontarget condition became more separable by the ABDA method. However, the overall performance is determined not only by the spatial projection but also the temporal projection.
(a) shows the projected signals of target condition. (b) shows the projected signals of nontarget condition. ABDA-real is the real part of the projected signals, while ABDA-imag is the imaginary part of the projected signals.
The normalized temporal projections of BDA and ABDA were plotted in Figure 5. Both BDA and ABDA temporal projections appeared to contain high frequency components. It may be due to the fact that, the temporal resolution (time point) was higher than the spatial resolution (the number of electrodes) and a regularization term was not adopted during the iteration learning. From the perspective of the waveform, BDA imposed heavier weights on the first half of the time window, i.e. between 0 ms and 350 ms. These weights could be meaningful. As can be seen in Figure 4A, the projected target ERPs (blue line) peaked at 350 ms, which matched the corresponding weights in Figure 5, indicating a linkage between BDA temporal projection and BDA spatial projection. On the other hand, the ABDA method focused mainly on the late stage of the target ERPs, i.e. 400 ms, and the waveform looked very clean before 200 ms. This is in line with the work of Gerson et al. , where the prominent discriminating activities were observed after 350 ms. It also followed the peak of spatially projected signals in Figure 4A, in particular the ABDA-imag (green line).
ABDA-real is the real part of the ABDA temporal projection, while ABDA-imag is the imaginary part of the ABDA temporal projection.
With the temporal projections as shown in Figure 5, the projected spatial topographies of two conditions were obtained in Figure 6. For both methods, the magnitude difference between target condition and nontarget condition were apparent, suggesting a strong discriminating capability of the temporal projection. With respect to the comparison between BDA and ABDA, there was a similar observation in Figure 3. That is, ABDA seemed to exploit a larger region for classification, from frontal, central to parietal and occipital, which was demonstrated by the magnitude mappings of BDA and ABDA under target condition. Furthermore, it is interesting to note that the phase mapping of ABDA under target condition partially showed the gradual propagation pattern of target ERPs, e.g. P300. It is known that the latency of P300 is shorter over frontal areas and longer over parietal areas . At the first glance, it seemed that there was a noticeable phase gap between frontal areas (dark red) and central areas (dark red). However, it is noteworthy that phase is periodic and 180° is equivalent to −180°. Therefore, the phase difference between the frontal and the central areas was actually small. In general, it could be said that the color-coded phases of ABDA in Figure 6 progressively changed from dark red, dark blue to yellow color along the scalp, resembling the P300 latency changes. The phase mapping in Figure 6 indicated the phase difference between the signals in EEG channels in a quantitative manner. Unlike ABDA, BDA did not account for the ERP propagation. For instance, in Figure 6 under target condition, all phases in the frontal and central areas were 180° (negative), and the rest were 0° (positive).
By Euler’s identity, a complex value could be interpreted as a combination of a magnitude component and a phase component.
Since there is an inherent relation between the iteratively optimized spatial projection and temporal projection, evaluating the spatial projection and temporal projection in a separate way may be insufficient for viewing the big picture. Figure 7 illustrates the combined effect of bilinear projections. Given the ERPs (see the first row in Figure 7), the element-wise product for ABDA was calculated using.(14)where stands for the conjugate function and represents the element-wise product multiplication operator. The formula for BDA was simpler:
The first row shows the target ERPs and nontarget ERPs along all the 62 channels, and are under the scale of [−5 5]. The other three rows are the element-wise products of ERPs (or the corresponding analytic representation) and the spatio-temporal projection. The scale is [−0.1 0.1]. ABDA-real is the real part of the ABDA element-wise products, while ABDA-imag is the imaginary part of the ABDA element-wise products.
(15)It is worth noting that the summation of all the elements of is equivalent to in (12). According to the of ABDA and of BDA in Figure 7, it can be seen that ABDA mainly relied on the late stage of target ERPs, whilst the early ERP components were favored by BDA. Moreover, a larger number of channels and time points were ‘highlighted’ by ABDA to distinguish target condition from nontarget condition. On the other hand, BDA depended on relatively limited spatio-temporal signal segments. Additionally, there is a kind of ‘texture’ at the first row of Figure 7, which can be attributed to the propagating process of ERPs on the scalp. Such a texture is also observable in the of BDA (the second row), which however, became absent in the of ABDA. The absence of this texture should be the result of the phase and magnitude modulation introduced by the complex-valued bilinear projections, which counteracted the latency differences among channels.
In this study, ABDA, the analytic bilinear discriminant analysis, a linear discriminant analysis originated method, is proposed and has been applied to the development of the RIT system. The results showed that, without transforming into frequency domain, the ABDA method is capable of modulating the phases and magnitudes of slow ERP signals that are overlapping with other ERPs, using the coupling of complex-valued spatial projection and complex-valued temporal projection. The complex-valued bilinear projections accommodated the spatio-temporal phase variations of ERPs, and consequently enabled a better usage of high-density EEG measurement to perform the classification task. With the ABDA, the RIT tests have showed an average accuracy increase of 2.5% over that with the BDA method and also outperformed CSP and ACSP.
Conceived and designed the experiments: KY XL. Performed the experiments: KY. Analyzed the data: KY. Contributed reagents/materials/analysis tools: KY XL. Wrote the paper: KY HAI NT.
- 1. Leuthardt EC, Schalk G, Moran D, Ojemann JG (2006) The emerging world of motor neuroprosthetics: a neurosurgical perspective. Neurosurgery 59: 1–14.
- 2. Hochberg LR, Serruya MD, Friehs GM, Mukand JA, Saleh M, et al. (2006) Neuronal ensemble control of prosthetic devices by a human with tetraplegia. Nature 442: 164–171.
- 3. Leuthardt EC, Schalk G, Wolpaw JR, Ojemann JG, Moran DW (2004) A brain–computer interface using electrocorticographic signals in humans. Journal of neural engineering 1: 63.
- 4. Finke A, Lenhardt A, Ritter H (2009) The MindGame: a P300-based brain–computer interface game. Neural Networks 22: 1329–1333.
- 5. Gerson AD, Parra LC, Sajda P (2006) Cortically coupled computer vision for rapid image search. IEEE Trans Neural Syst Rehabil Eng 14: 174–179.
- 6. Hughes G, Mathan S, Yeung N (2012) EEG indices of reward motivation and target detectability in a rapid visual detection task. NeuroImage.
- 7. Huang Y, Erdogmus D, Pavel M, Mathan S, Hild KE (2011) A framework for rapid visual image search using single-trial brain evoked responses. Neurocomputing 74: 2041–2051.
- 8. Yu K, Shen K, Shao S, Ng WC, Li X (2012) Bilinear common spatial pattern for single-trial ERP-based rapid serial visual presentation triage. J Neural Eng 9: 046013.
- 9. Yu K, Shen K, Shao S, Ng WC, Kwok K, et al. (2011) Common spatio-temporal pattern for single-trial detection of event-related potential in rapid serial visual presentation triage. Biomedical Engineering, IEEE Transaction on 58: 2513–2520.
- 10. Sajda P, Pohlmeyer E, Jun W, Parra LC, Christoforou C, et al. (2010) In a Blink of an Eye and a Switch of a Transistor: Cortically Coupled Computer Vision. Proceedings of the IEEE 98: 462–478.
- 11. Pohlmeyer EA, Wang J, Jangraw DC, Lou B, Chang S-F, et al. (2011) Closing the loop in cortically-coupled computer vision: a brain–computer interface for searching image databases. Journal of neural engineering 8: 036025.
- 12. Bernat E, Shevrin H, Snodgrass M (2001) Subliminal visual oddball stimuli evoke a P300 component. Clinical neurophysiology 112: 159–171.
- 13. Polich J, Criado JR (2006) Neuropsychology and neuropharmacology of P3a and P3b. International Journal of Psychophysiology 60: 172–185.
- 14. Lagerlund TD, Sharbrough FW, Busacker NE (1997) Spatial filtering of multichannel electroencephalographic recordings through principal component analysis by singular value decomposition. Journal of Clinical Neurophysiology 14: 73–82.
- 15. Makeig S, Bell AJ, Jung T-P, Sejnowski TJ (1996) Independent component analysis of electroencephalographic data. Advances in neural information processing systems: 145–151.
- 16. Ramoser H, Müller-Gerking J, Pfurtscheller G (2000) Optimal spatial filtering of single trial EEG during imagined hand movement. Rehabilitation Engineering, IEEE Transaction on 8: 441–446.
- 17. Samek W, Vidaurre C, Müller KR, Kawanabe M (2012) Stationary common spatial patterns for brain-computer interfacing. J Neural Eng 9: 026013.
- 18. Blankertz B, Tomioka R, Lemm S, Kawanabe M, Müller KR (2008) Optimizing Spatial filters for Robust EEG Single-Trial Analysis. IEEE Signal Processing Magazine 25: 41–56.
- 19. DaSalla CS, Kambara H, Sato M, Koike Y (2009) Single-trial classification of vowel speech imagery using common spatial patterns. Neural Networks 22: 1334–1339.
- 20. Yu K, Shen K, Shao S, Ng WC, Kwok K, et al. (2012) A spatio-temporal filtering approach to denoising of single-trial ERP in rapid image triage. J Neurosci Methods 204: 288–295.
- 21. Lemm S, Blankertz B, Curio G, Müller KR (2005) Spatio-spectral filters for improving the classification of single trial EEG. IEEE Trans Biomed Eng 52: 1541–1548.
- 22. Dornhege G, Blankertz B, Krauledat M, Losch F, Curio G, et al. (2006) Combined optimization of spatial and temporal filters for improving brain-computer interfacing. Biomedical Engineering, IEEE Transaction on 53: 2274–2281.
- 23. Tomioka R, Dornhege G, Nolte G, Blankertz B, Aihara K, et al.. (2006) Spectrally Weighted Common Spatial Pattern Algorithm for Single Trial EEG Classification. Dept. Math. Eng., Univ. Tokyo.
- 24. Yu K, Wang Y, Shen K, Li X (2013) The Synergy between Complex Channel-Specific FIR Filter and Spatial Filter for Single-Trial EEG Classification. PLoS ONE 8: e76923.
- 25. Falzon O, Camilleri KP, Muscat J (2010) Complex-valued spatial filters for task discrimination. Conf Proc IEEE Eng Med Biol Soc 2010: 4707–4710.
- 26. Falzon O, Camilleri K, Muscat J (2012) Complex-valued spatial filters for SSVEP-based BCIs with phase coding. IEEE Trans Biomed Eng 59: 2486–2495.
- 27. VanRullen R, Busch N, Drewes J, Dubois J (2011) Ongoing EEG phase as a trial-by-trial predictor of perceptual and attentional variability. Frontiers in psychology 2.
- 28. Kruglikov SY, Schiff SJ (2003) Interplay of electroencephalogram phase and auditory-evoked neural activity. The Journal of neuroscience 23: 10122–10127.
- 29. Jia C, Gao X, Hong B, Gao S (2011) Frequency and Phase Mixed Coding in SSVEP-Based Brain–Computer Interface. Biomedical Engineering, IEEE Transactions on 58: 200–206.
- 30. Lu H, Plataniotis KN, Venetsanopoulos AN (2011) A survey of multilinear subspace learning for tensor data. Pattern Recognition 44: 1540–1551.
- 31. Li M, Yuan B (2005) 2D-LDA: A statistical linear discriminant analysis for image matrix. Pattern Recognition Letters 26: 527–532.
- 32. Visani M, Garcia C, Jolion JM (2005) Normalized radial basis function networks and bilinear discriminant analysis for face recognition; 15–16 Sept. 342–347.
- 33. Li J, Janardan R, Li Q (2004) Two-dimensional linear discriminant analysis. Advances in Neural Information Processing Systems 17: 1569–1576.
- 34. Miwakeichi F, Martinez-Montes E, Valdés-Sosa PA, Nishiyama N, Mizuhara H, et al. (2004) Decomposing EEG data into space-time-frequency components using parallel factor analysis. NeuroImage 22: 1035–1045.
- 35. Li J, Zhang L, Tao D, Sun H, Zhao Q (2009) A prior neurophysiologic knowledge free tensor-based scheme for single trial EEG classification. Neural Systems and Rehabilitation Engineering, IEEE Transactions on 17: 107–115.
- 36. Xu P, Yang P, Lei X, Yao D (2011) An Enhanced Probabilistic LDA for Multi-Class Brain Computer Interface. PLoS ONE 6: e14634.
- 37. Zhang R, Xu P, Guo L, Zhang Y, Li P, et al. (2013) Z-Score Linear Discriminant Analysis for EEG Based Brain-Computer Interfaces. PLoS ONE 8: e74433.
- 38. Duda RO, Hart PE, Stork DG (2012) Pattern classification: Wiley-interscience.
- 39. McCleery JP, Surtees AD, Graham KA, Richards JE, Apperly IA (2011) The neural and cognitive time course of theory of mind. The Journal of Neuroscience 31: 12849–12854.
- 40. Polich J, Margala C (1997) P300 and probability: comparison of oddball and single-stimulus paradigms. International journal of psychophysiology: official journal of the International Organization of Psychophysiology 25: 169.
- 41. Falzon O, Camilleri KP, Muscat J (2012) The analytic common spatial patterns method for EEG-based BCI data. Journal of Neural Engineering 9: 045009.
- 42. Mertens R, Polich J (1997) P300 from a single-stimulus paradigm: passive versus active tasks and stimulus modality. Electroencephalography and Clinical Neurophysiology/Evoked Potentials Section 104: 488–497.