A high-performance seizure detection algorithm based on Discrete Wavelet Transform (DWT) and EEG

Duo Chen; Suiren Wan; Jing Xiang; Forrest Sheng Bao

doi:10.1371/journal.pone.0173138

Abstract

In the past decade, Discrete Wavelet Transform (DWT), a powerful time-frequency tool, has been widely used in computer-aided signal analysis of epileptic electroencephalography (EEG), such as the detection of seizures. One of the important hurdles in the applications of DWT is the settings of DWT, which are chosen empirically or arbitrarily in previous works. The objective of this study aimed to develop a framework for automatically searching the optimal DWT settings to improve accuracy and to reduce computational cost of seizure detection. To address this, we developed a method to decompose EEG data into 7 commonly used wavelet families, to the maximum theoretical level of each mother wavelet. Wavelets and decomposition levels providing the highest accuracy in each wavelet family were then searched in an exhaustive selection of frequency bands, which showed optimal accuracy and low computational cost. The selection of frequency bands and features removed approximately 40% of redundancies. The developed algorithm achieved promising performance on two well-tested EEG datasets (accuracy >90% for both datasets). The experimental results of the developed method have demonstrated that the settings of DWT affect its performance on seizure detection substantially. Compared with existing seizure detection methods based on wavelet, the new approach is more accurate and transferable among datasets.

Citation: Chen D, Wan S, Xiang J, Bao FS (2017) A high-performance seizure detection algorithm based on Discrete Wavelet Transform (DWT) and EEG. PLoS ONE 12(3): e0173138. https://doi.org/10.1371/journal.pone.0173138

Editor: Chun Kee Chung, Seoul National University College of Medicine, REPUBLIC OF KOREA

Received: July 1, 2016; Accepted: February 15, 2017; Published: March 9, 2017

Copyright: © 2017 Chen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Approximately 50 million people worldwide have epilepsy, making it one of the most common neurological diseases globally [1]. Epilepsy is characterized by recurring seizures caused by abnormal discharges in the brain [2]. Electroencephalogram (EEG), a technology directly records electrical activities from the brain, is an important data resource in epilepsy diagnostic tasks, such as, seizure detection [3, 4], spike detection [5, 6] and localization of epileptic foci [7, 8]. In clinical practice, long-term EEG recording up to a few days, is usually required. Therefore, many computer-aided solutions have been developed to assist neurologists. Combining signal processing and machine learning, most of those approaches model the problem as classification of signals, such as epileptic vs. healthy for epilepsy diagnosis [9, 10], ictal (on seizure) vs. inter-ictal for seizure onset detection [11, 12], etc. The most common classification problem is seizure detection, where seizure and non-seizure EEG segments of patients need to be identified [6].

Applying Discrete Wavelet Transform (DWT) on epilepsy-related EEG signal classification is gaining ground in recent years. The main advantage of DWT is that the resolution of time and frequency in DWT can be adapted to the frequency content of the examined patterns, thus leading to an optimal time-frequency resolution across all frequency ranges [13, 14]. This superiority makes DWT especially suitable for the analysis of non-stationary signal, such as EEG [6, 15].

Though DWT has shown promising results on seizure detection [6, 11, 16], it is still an open question regarding how to utilize the full potential of DWT to improve the accuracy and reliability of EEG analysis. Meanwhile, some methods only show promising results for selected patients, the reliability and reproducibility of the results have been questioned when being tested on other EEG datasets [17].

To establish a high-performance seizure detection algorithm based on DWT, the present study proposed a generalized computer-aided EEG analysis method to achieve the optimal seizure detection accuracy with low computational cost. Our method automatically searched the optimal combination of four factors, including, mother wavelet, decomposition level, frequency band, and DWT coefficient feature. These factors may affect the performance of DWT in seizure detection.

To test the performance of our method, we used EEG dataset from CHB-MIT (MIT) and dataset from University of Bonn (UBonn). Empirical results show that: 1) mother wavelet does not influence seizure detection results significantly; 2) seizure detection accuracy is very sensitive to decomposition level if the features of seizure/non-seizure EEGs showing significant difference in several frequency bands; 3) many frequency bands and DWT coefficient features are redundant causing accuracy reduction and unnecessary high computational cost.

Our seizure detection method achieved the accuracies of 92.30% and 99.33%, on MIT dataset and UBonn dataset, respectively. Compared with other seizure detection methods based on DWT, our approach attained the highest accuracy and the best robustness. The main innovation and contribution of the present study is the establishment of a guideline for constructing a high-performance seizure detection algorithm with high accuracy and low computational cost based on DWT and EEG.

Method

EEG datasets

We formulate the problem of seizure detection as classifying multi-channel EEG recordings (seizure and non-seizure). Some previous methods have shown promising results for selected patients; however, they achieved poor performance on other EEG datasets [17]. Considering this, we tested our algorithm on two EEG datasets to check its reliability. These two datasets have been used widely during the past few years [18, 19]. To demonstrate the advantages of our method, it was rational to compare our method with existing wavelet-based algorithms by using these well-recognized datasets. All computational experiments are run on a server with 32-core AMD CPU (1400MHz) using Matlab 2013a (MathWorks, Natick, Massachusetts, U.S.A).

MIT dataset.

The first dataset in this work was collected at the Children’s Hospital Boston, Massachusetts (MIT), consists of EEG recordings from pediatric subjects with intractable seizures. Subjects were monitored for up to several days following withdrawal of anti-seizure medication in order to characterize their seizures and assess their candidacy for surgical intervention [20]. Recordings were collected from 22 subjects (5 males, ages 3–22; and 17 females, ages 1.5–19). The International 10–20 system of EEG electrode positions and nomenclature was used for these recordings. More details about the dataset can be found from [18] and http://www.physionet.org/pn6/chbmit/.

EEG recordings in all channels from seizure start to end (ictal) were considered as “seizure”; EEG recordings out of the period of “seizure” were considered as “non-seizure”. Therefore, seizure detection could be further transformed into a signal classification problem: classifying seizure and non-seizure EEG signals from simultaneously recorded multi-channel EEG signals.

The EEG signals were sampled at 256Hz and digitally filtered by a 48th-order FIR high-pass filter (hamming window) with the cutoff frequency at 0.5Hz to remove low-frequency artifacts. The 256Hz sampling rate is large enough to cover general human EEG rhythms (bandwidths), including, δ(< 4Hz), θ(4 − 7Hz), α(8 − 15Hz), β(16 − 31Hz) and γ(> 31Hz). In this work, 13846 EEG segments were chosen from 18 cases (several subjects having much shorter seizure recordings than others were abandoned to keep the data balance), each segment lasts 20 seconds. Each subject provides the same number of seizure and non-seizure segments [21]. In total, 38.46h seizure EEG and 38.46h non-seizure EEG are used. The EEG segment selection is shown in Fig 1 which gives a 520 seconds EEG recording of a single channel. The 20 second non-overlapping window slides from left to right. When the slide windows falls into a seizure onset area (between the two read lines), the segment was selected as “seizure”. Otherwise, the segment was treated as “non-seizure”. Segments shorter than 20 second were discarded here.

Download:

Fig 1. Seizure and non-seizure EEG segments from MIT dataset.

A 20-second’ window slides across the long-time EEG. If the window goes into a period of seizure, this segment is marked as “seizure”, otherwise, “non-seizure”.

https://doi.org/10.1371/journal.pone.0173138.g001

UBonn dataset.

The second dataset in this work was from University of Bonn (UBonn) [19]. The dataset had five sets denoted A∼E, each containing 100 single channel EEG segments of 23.6-sec duration with a sampling rate of 173.61 Hz. These segments were selected and cut out from continuous multichannel EEG recordings after visual inspection for artifacts. The scalp EEG signals were digitally filtered using a 48th-order FIR high-pass filter (hamming window) with the cutoff frequency at 0.5Hz.

For seizure detection, sets C, D were treated as “non-seizure” while set E was treated as “seizure”. In this study, we focused on seizure detection for patients. Sets C, D originated from EEG archive of presurgical diagnosis. Segments in set D were recorded from within the epileptogenic zone, and those in set C from the hippocampal formation of the opposite hemisphere of the brain. Sets C and D contained only activity measured during seizure-free intervals while set E only contains seizure activity.

Framework

The framework of our seizure detection method based on wavelet is shown in Fig 2. Our algorithm was constructed by two main selection blocks, a Wavelet-Level Selection and a Band-Feature Selection. Long period of seizure and non-seizure EEGs were used, artifact contaminated EEGs were included. This high-performance algorithm was a completely automatic process. DWT was used to construct a feature vector for each EEG segment. A support vector machine (SVM) classifier [22] would learn to distinguish the feature vectors of seizure and non-seizure EEGs, automatically. Details inside were introduced in the following subsections.

Download:

Fig 2. Framework of our method based on wavelet.

The full algorithm can be divided into two parts. The Wavelet-Level Selection and the Band-Feature Selection. For each mother wavelet, one EEG segment is decomposed to the highest theoretical level for later feature extraction. For each wavelet family, only the mother wavelet and corresponding decomposition level, which produce the highest classification accuracy, is retained for Band-Feature Selection. In Band-Feature Selection, the features in certain bands leading to the highest accuracy are used to construct the final prediction model.

https://doi.org/10.1371/journal.pone.0173138.g002

Discrete wavelet transform

DWT played a significant role in our algorithm. A wavelet is a quickly vanishing oscillating function localized both in frequency and time domains. In continuous wavelet analysis, the signal is decomposed into scaled and translated versions (ψ_a,b(t)) of a single function ψ(t) called mother wavelet: (1) where a and b are the scale and translation parameters, respectively, with and a ≠ 0. The discrete wavelet transform (DWT) [23] was obtained by discretizing the parameters a and b. In its most common form, the DWT employs a dyadic sampling with parameters a and b based on powers of two: a = 2^j and b = k2^j, with . By substituting in Eq 1, we obtained the dyadic wavelets: (2) Of note, DWT could be written as (3) where d_j,k are known as wavelet coefficients at level j and location k [24]. These coefficients were used to construct the feature vector of each EEG segment in seizure detection.

Wavelet family & wavelet member

In wavelet-based digital signal processing (DSP), selecting a suitable mother wavelet [23] is always the first step. Various mother wavelets supply different DWT coefficients on the same EEG segment leading to different detection results. In this work, 7 commonly used wavelet families were tested, including, Biorthogonal (bior), Coiflets (coif), Daubechies (db), Reverse biorthogonal (rbio), Symlets (sym), Discrete Meyer (dmey), and Haar (Haar) [6]. Fifty-four family members (mother wavelets) totally contained in these families are shown in Table 1.

Download:

Table 1. Fifty-four Mother Wavelets.

https://doi.org/10.1371/journal.pone.0173138.t001

It is worth noting that in clinical practice, testing all wavelets is impractical and unnecessary. In addition, sometimes mother wavelets should be chosen according to the properties of patient EEG recordings. Heuristics for selecting mother wavelets are discussed in a later section.

Decomposition level

Decomposition level is an important parameter of DWT. Each level in DWT corresponds to a specific frequency band. More levels of decomposition provide more detailed depictions of the signal, but may produce feature redundancy leading to accuracy reduction and computational cost increasing (sometimes exponentially, e.g., when using RBF kernel SVM [25] as the classifier).

The maximum level L of decomposition level is jointly determined by the signal and the mother wavelet to satisfy the condition: (4) where N is the signal size and F is the filter size [26]. Each EEG segment has 5120 samples and 4097 samples, respectively, in MIT dataset and UBonn dataset. The corresponding maximum decomposition level of each wavelet in these two datasets are given in the following section.

Frequency band

In DWT, each decomposition level corresponds to a certain frequency band. Supposing the raw EEG data would fall in frequency band (a, b), according to Mallat algorithm [27], at level n, the approximation frequency band would be: (5) the detail frequency band is (6) Fig 3 illustrates the frequency bands covered by each level of decomposition on MIT and UBonn datasets, given the frequency range (0.5, 128)Hz for MIT and (0.5, 86.8)Hz for UBonn. In this figure, the detail band and approximation band on the i^th decomposition level are denoted as d_i and a_i (i = 1, 2, … 7), respectively. As to be discussed later, wavelet coefficients of several bands, as shown with red annotations in the figure, construct the feature vector for each EEG segment.

Download:

Fig 3. Examples of 7-level decomposition and corresponding frequency bands.

(A) On MIT dataset. (B) On UBonn dataset. The EEG signals are decomposed into several frequency bands. d_i is the detail band while a_i (i = 1, 2, … 7) is the approximation band. All detail bands and the last approximation band might be used for feature extraction.

https://doi.org/10.1371/journal.pone.0173138.g003

In clinical practice, EEG is typically described in terms of rhythmic activity, which means in DWT-based EEG analysis; a specific frequency band corresponds to a certain EEG rhythm. “Seizure” and “non-seizure” EEG segments might have significant difference in certain frequency bands. EEG segments could be classified accurately by features from these bands. However, some frequency bands should be abandoned since features from these bands caused redundancy and accuracy reduction. This issue is considered in later Band-Feature Selection to improve accuracy and reduce feature vector redundancy.

Coefficient feature

Choosing suitable features that can best represent the characteristics of the EEG signals is important for EEG classification [11]. DWT coefficient features from several frequency bands construct the feature vector of one EEG signal segment.

In this study, the DWT coefficients of an EEG segment in each band were calculated according to Eq 3. Seven commonly used wavelet features in wavelet-based EEG signal processing and two statistical features constructed the feature vector of each EEG segment. These features are indicated in Table 2.

Download:

Table 2. Features in Each Band.

https://doi.org/10.1371/journal.pone.0173138.t002

Classification

Seizure detection is formulated into a binary classification problem on two kinds of EEG segments, “seizure” and “non-seizure”. SVM with RBF kernel was used as the classifier. Here we briefly go over the concepts of binary classification and SVM. SVM is a supervised learning algorithm that can be used for binary classification. A SVM constructs an optimal hyperplane as a decision surface such that the margin of separation between the two classes in the data is maximized. Support vectors refer to a small subset of the training observations that are used as support for the optimal location of the decision surface. Only the support vectors chosen from the training data are required to construct the decision surface. Details of SVM and binary classification could be found from previous work [22].

To assess the performance of our approach, especially its ability to overcome individual difference, we used leave-one-subject-out cross-validation on MIT dataset. Each time, only one subject’s data was used as the test set while all others’ data as the training set. Mixing one subject’s data in both training and test sets might give the algorithm prior knowledge and cause false high accuracy. Hence, leave-one-subject-out cross-validation was a fair evaluation scheme to truly reveal the robustness of the classifier on overcoming the individual difference. Since UBonn dataset did not separate the data from different patients, 10-fold cross validation was used instead of leave-one-subject-out.

In this paper, “seizure” EEG segments were considered as “positive” while “non-seizure” segments were considered as “negative”. Therefore, the classifier had 4 possible outcomes [6]:

True positive (TP);
False positive (FP);
True negative (TN);
False negative (FN).

As listed in Table 3, five values in the confusion matrix are employed to evaluate the algorithm performance, including, Accuracy, Sensitivity, Specificity, Positive Predictive Value (PPV), and Negative Predictive Value (NPV).

Download:

Table 3. Confusion Matrix.

https://doi.org/10.1371/journal.pone.0173138.t003

Wavelet-level selection

Mother wavelet and decomposition level are two factors that affect the performance of DWT in digital signal processing. By appropriately selecting the mother wavelet and decomposition level, DWT could accurately interpret the characteristics of the original EEG segment. Considering this, Wavelet-Level Selection was used for exploring the performance of each wavelet with all possible decomposition levels.

Wavelet-Level Selection was done as follows. Supposing a mother wavelet whose maximum decomposition level was j, DWT could divide the EEG segment into several bands with the number from 2 (1 detail bands and 1 approximation band) to j + 1 (j detail bands and 1 approximation band). For each mother wavelet and corresponding decomposition level, features across all the frequency bands constructed the feature vector for each EEG segment. In each wavelet family, the mother wavelet and related decomposition level leading to the highest seizure detection accuracy would be selected for later analysis. For each combination of wavelet and decomposition level, a cross-validation was performed by SVM.

Band-feature selection

EEG is typically described in terms of rhythmic activity making different frequency bands in DWT corresponding to various EEG rhythms. In a certain EEG dataset, seizure and non-seizure EEG segments might provide a significant difference of rhythmic activity in the specific frequency band(s). Similar to frequency bands, in a certain EEG dataset, some features might help to distinguish seizure and non-seizure EEG segments while other features only generated data redundancy. Considering this, we framed a Band-Feature Selection, which explored the band(s) and the feature(s) that most precisely classified seizure and non-seizure EEG signals with low computational cost.

The Band-Feature Selection was done as follows. Given a mother wavelet whose best decomposition level of an EEG segment was j, DWT would divide the frequency range from 0Hz to half of the sampling rate into j + 1 bands (j detail bands and 1 approximation band). Hence, there were combinations of bands. If each band had m features, there were combinations of features. As a result, for this mother wavelet and corresponding decomposition level, we had a total of combinations of bands and features. For each combination of band(s) and feature(s), a cross-validation was performed by SVM.

Results

Results of Wavelet-Level Selection showed the effect of mother wavelet and decomposition level on DWT-based seizure detection. The results of Band-Feature Selection enabled us to improve seizure detection accuracy and remove feature redundancy.

Wavelet-level selection

Both datasets, using suitable wavelet and decomposition level, provide promising seizure detection accuracy (results are summarized in Table 4). On MIT dataset, decomposition level affects the accuracy substantially regardless of the mother wavelets. On UBonn dataset, all wavelets could achieve high accuracy (above 95%) at low decomposition level (less than 2). The results on these two datasets were discussed separately.

Download:

Table 4. Wavelet member, maximum decomposition level, best decomposition level and corresponding accuracy.

https://doi.org/10.1371/journal.pone.0173138.t004

On MIT dataset, the best member of each family and its optimal decomposition level (i.e., the level that yielded to the highest accuracy) were used for Band-Feature Selection. On UBonn dataset, since high accuracy could be achieved at low decomposition level, the wavelet and the lowest decomposition level that achieved accuracy above 95% were of interest in Band-Feature Selection. In a certain wavelet family, if several wavelets achieved accuracies above 95% at the same decomposition level, the one having the smallest vanishing moment were selected.