Research on epilepsy detection methods based on interpretable features and machine learning

Yongxin Sun; Xiaojuan Chen; Xinghua Zhang; Xiaohui Cai

doi:10.1371/journal.pone.0344164

Abstract

Epilepsy is a prevalent neurological condition that impacts a significant number of individuals worldwide. Patients’ physical and mental health, as well as their daily activities, are significantly affected by seizures, necessitating prompt diagnosis and treatment. The automatic detection of epilepsy using electroencephalogram (EEG) signals has been a significant area of research. Nevertheless, the majority of current methods are based on intricate feature engineering processes that require the extraction and selection of a large number of features to identify the most discriminative feature sets. This results in a high level of algorithmic complexity, inadequate robustness, and inadequate interpretability, which complicates the provision of theoretical support to clinicians. This paper proposes a pathophysiology-driven, interpretable machine learning algorithm to address the limitations of current EEG-based epilepsy detection methods, which include poor interpretability and complex feature engineering. We developed a low-dimensional, interpretable feature combination consisting of only five features and systematically validated its discriminative capability across various epilepsy phases by innovatively integrating electrophysiological markers of epileptic seizures with nonlinear dynamical properties. In the binary classification of seizure versus non-seizure EEG segments, the XGB classifier achieved the highest accuracy of 98.73% and an F1 score of 98.57%. Classification accuracy for interictal versus ictal periods reached 95.33%, with an F1 score of 95.27%. In the challenging ternary classification task encompassing preictal, interictal, and ictal periods, the model achieved a respectable accuracy of 86.3% and an F1 score of 85.79%. Cross-database validation yielded a maximum accuracy of 82.17% and an F1 score of 81.99%, confirming the proposed features’ robust generalization capability and transformative potential. This feature set exhibits outstanding and stable performance across all models, as demonstrated by evaluations across two public datasets using five machine learning classifiers. In addition, SHAP values quantified the contribution of each feature to predictions, thereby providing a transparent decision-making rationale that substantially improves the algorithm’s interpretability and clinical utility.

Citation: Sun Y, Chen X, Zhang X, Cai X (2026) Research on epilepsy detection methods based on interpretable features and machine learning. PLoS One 21(3): e0344164. https://doi.org/10.1371/journal.pone.0344164

Editor: Teppei Matsubara, Athinoula A Martinos Center for Biomedical Imaging, UNITED STATES OF AMERICA

Received: October 2, 2025; Accepted: February 17, 2026; Published: March 9, 2026

Copyright: © 2026 Sun et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: study1: the Bonn University (BONN) EEG dataset for epilepsy, is a publicly available electroencephalogram (EEG) dataset widely used in epilepsy research. Accessed at typical repository URL, e.g., https://repositori.upf.edu/handle/10230/42894 study2: the New Delhi Sleep Center EEG dataset, is a publicly available electroencephalography (EEG) dataset primarily used for sleep and neurological disorder studies. Accessed via a public repository such as https://www.researchgate.net/publication/308719109_EEG_Epilepsy_Datasets.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

1. Introduction

Epilepsy is a prevalent neurological condition affecting approximately 70 million individuals worldwide, spanning all age groups [1]. The onset of epilepsy results from transient neurological dysfunction caused by abnormal neuronal discharges, characterized by suddenness, irregularity, and recurrence. Prolonged epilepsy can lead to memory loss and cognitive decline, significantly impacting patients’ daily lives and mental health [2]. However, according to the World Health Organization, it is estimated that 70% of epilepsy patients can achieve seizure freedom if accurately diagnosed and treated. Therefore, the treatment of epilepsy relies on the precise detection and diagnosis of the condition, underscoring its critical importance.

In 1924, Hans Berger, a German neuropsychiatrist, pioneered the recording of the human electroencephalogram (EEG) using precision instruments, thereby establishing a firm foundation for subsequent neuroscience research and opening new avenues. This was made possible by the advancement of science and technology. The International League Against Epilepsy (ILAE) confirmed the significant changes in human EEG during seizures in 1981, thereby providing an essential and objective basis for the diagnosis of epilepsy, based on a number of clinical observations and experimental studies. The objective premise for the diagnosis of epilepsy is provided by the significant changes in human brain electrical signals that occur during a seizure. Epileptic EEG signals can be classified into various stages, including preictal, ictal, and interictal, based on the duration of the seizure. Preictal period: The initial few minutes to tens of minutes of a seizure. The timely detection and identification of EEG signals during this period can enable the prediction of epilepsy and provide the patient with the necessary time to prepare for any additional psychological and physical injuries that may result from epileptic seizures. Ictal period: The period of epileptic seizures. Interictal period: The asymptomatic interval between two clinical epileptic seizures is termed the interictal period. During this phase, the patient’s clinical symptoms have resolved, though their electroencephalogram may still reveal abnormal discharges [3]. The formulation of a treatment plan for epilepsy is facilitated by the precise identification of the aforementioned periods.

Currently, physicians collect patients’ electroencephalogram (EEG) signals by standardizing the placement of multiple electrodes on the patient’s head. The resulting EEG data is typically large in volume [4], which makes the analysis time-consuming and inefficient, relying heavily on the subjective judgment of the physician without objective evidence. Therefore, automated epilepsy detection based on machine learning has become a primary research direction in epilepsy detection [5–7]. Machine learning-based automated epilepsy detection methods generally include steps such as EEG signal preprocessing, feature extraction, feature selection, and classifier design [8]. Among these, feature extraction is a critical link that directly affects the performance of the classifier. However, existing feature extraction methods often struggle to comprehensively represent the complex characteristics of epileptic EEG signals when extracting a single type of feature, limiting further improvements in detection performance. On the other hand, extracting a large number of features requires feature selection for filtering, which increases complexity and introduces bias, lacking generalizability. Furthermore, traditional EEG analysis methods still face limitations in the interpretability and effectiveness verification of features [9], making it difficult to fully mine the discriminative features of EEG signals and explain their intrinsic association with seizures, which is not referenced by physicians. To address this issue, this paper conducts a systematic study on the effectiveness and interpretability of features related to epileptic EEG signals, proposing a set of features that comprehensively represent the neurophysiological patterns of epileptic seizures for the purpose of epilepsy detection. Additionally, the effectiveness and generalizability of the proposed features are verified from the perspectives of model performance and interpretability. The main contributions of this paper are as follows:

We propose a framework for interpretable features that integrates clinical a priori knowledge. In contrast to conventional approaches that depend on high-dimensional, intricate features, we systematically developed a low-dimensional feature set consisting of only five features grounded in the pathophysiological principles of epilepsy. The features exhibit computational efficiency and demonstrate clear clinical significance, thereby enhancing the interpretability of results.
We confirmed the outstanding generalization and robustness of the proposed feature combination: Comprehensive evaluations were performed on two public datasets utilizing five different machine learning classifiers. Experimental results indicate exceptional and consistent performance across all models, confirming the inherent discriminative power and robustness of the feature combination, independent of specific models, thereby ensuring reliability for clinical deployment.
Developed a comprehensive interpretability framework encompassing features and models. In addition to providing high-accuracy models, we assessed the predictive decisions of the optimal XGBoost model through SHAP analysis. This method elucidates the contribution and directionality of each feature to the final classification outcome, transforming the model’s decisions into evidence that clinicians can understand. This offers essential theoretical backing for supplementary diagnosis.

2. Related work

The core principle of epilepsy detection lies in the in-depth analysis of the significant differences between electroencephalogram (EEG) signals during seizures and those in a normal state. These differences are primarily reflected in multiple dimensions such as frequency, amplitude, and waveform complexity. EEG signals during epileptic seizures typically exhibit characteristic patterns such as abnormally high-amplitude spikes, sharp waves, polyspikes, and rhythmic discharges, whereas normal EEG signals display relatively stable physiological rhythms. The reasonable extraction of EEG signal features is fundamental to epilepsy detection. Existing research has explored various feature extraction methods, including time-domain analysis, frequency-domain analysis, time-frequency analysis, and nonlinear dynamics analysis [10–12].

Slimen et al. [13] calculated the number of spikes in EEG signals during three states: interictal, preictal, and ictal. The results indicated that this metric gradually increases with the progression of the epileptic seizure. Furthermore, the average spike rate during the preictal period showed a significant statistical difference compared to the interictal period.

Chen et al. [14] proposed an algorithm for seizure detection based on the time-frequency domain and nonlinear features. They extracted mixed features, including Approximate Entropy (ApEn), Fuzzy Entropy (FuzzyEn), Sample Entropy (SampEn), and Standard Deviation (STD), from the subbands decomposed by Discrete Wavelet Transform (DWT) to form a feature set. After selecting features using the Random Forest algorithm, they employed three classifiers for classification, achieving impressive detection results on both the BONN dataset and the New Delhi dataset. Al-Hadeethi et al. [15] constructed feature vectors using a set of statistical features, implemented a two-level feature selection strategy for feature ranking, and employed an AB-BP-NN network for classification. Qureshi et al. [16] proposed a fuzzy-based model for epilepsy seizure detection, which integrates feature extraction, feature selection, and a fuzzy classifier to accomplish the task of epilepsy detection. Wang et al. [17] proposed an epilepsy detection network combining a convolutional neural network (CNN) and a long short-term memory (LSTM) network to extract EEG signal features from multiple perspectives to improve the accuracy of recognition. Xin Xu et al. [18] proposed a method based on nonlinear features of electroencephalogram (EEG) signals and Gradient Boosting Decision Trees (GBDT) to identify seizure and non-seizure periods. Through ten-fold cross-validation, an average recognition accuracy of 91.76% was achieved. Zhuohan Wang et al. [19] utilized Continuous Wavelet Transform (CWT) to generate phase spectrum and power spectrum of electroencephalogram (EEG), which were then fed into the designed branches of Convolutional Neural Network (CNN) and Vision Transformer (ViT) to extract more discriminative EEG features, achieving an accuracy rate of 98.45%. Hany F. Atlam et al. [20] proposed a hybrid feature selection method that combines Principal Component Analysis (PCA) and Discrete Wavelet Transform (DWT) to extract both time-domain and frequency-domain features. This method, when integrated with Support Vector Machine (SVM), achieved an accuracy of 97.03% in epilepsy detection. Uddipan Hazarika et al. [21] utilized the Hurst exponent and discrete wavelet transform methods for feature extraction, determining the effectiveness of the features. They validated their findings using various machine learning algorithms, achieving satisfactory identification results. Hussain et al. [22] extracted Hurst exponent (HFD) and Katz fractal dimension features using adaptive rate Discrete Wavelet Transform (DWT), and implemented epilepsy detection in combination with Support Vector Machine (SVM). Hadiyoso et al. [23] achieved classification recognition of EEG signals at different times of epilepsy by selecting a feature set composed of spectral entropy, Katz, and Sevick fractal dimensions, in conjunction with a Naive Bayes (NB) classifier. Shiqi Liu et al. [24] applied correlation coefficient and distance correlation feature selection methods to effectively implement epilepsy detection across different datasets using the combination of STD and FD-NL with bagged tree classifiers.

Although the aforementioned feature extraction-based methods for epilepsy detection have achieved good detection performance, they often require the extraction of a large number of features and the use of specific feature selection methods to identify effective features. This not only increases the complexity of the detection task but also results in poor interpretability and generalization ability. Considering the accuracy of epilepsy detection, computational complexity, and the acceptance by physicians, this paper proposes a set of interpretable and highly generalizable features. When combined with machine learning algorithms, these features demonstrate superior results in recognizing different epilepsy phases across two public datasets.

3. Materials and methods

Fig 1 illustrates the workflow of the proposed epilepsy seizure detection system architecture. The following sections provide a detailed discussion of each module.

Download:

Fig 1. Workflow diagram of the epilepsy seizure detection system architecture.

https://doi.org/10.1371/journal.pone.0344164.g001

3.1. Data sets

This study utilized two publicly available datasets, the Bonn University (BONN) EEG dataset for epilepsy and the New Delhi Sleep Center (NDSC) EEG dataset for epilepsy.

The Bonn Epilepsy EEG Dataset is a commonly used benchmark dataset for distinguishing normal from abnormal brain signals [25]. This dataset comprises five subsets labeled A, B, C, D, and E. Subset A and B contain scalp EEG data from healthy subjects; subsets C and D contain intracranial EEG data from epilepsy patients during non-seizure periods; while Subset E contains intracranial EEG recordings acquired during epileptic seizures. Each subset comprises 100 single-channel EEG segments, each lasting 23.6 seconds with a sampling frequency of 173.6 Hz. This study designates these five subsets as A, B, C, D, and E, respectively. This dataset has been extensively utilized in numerous EEG signal analysis studies.

The NDSC dataset was used to classify preictal, interictal, and ictal EEG data [26]. Data were collected from ten epilepsy patients at the Hauz Khas Neuro and Sleep Center in New Delhi. During acquisition, gold-plated scalp EEG electrodes were placed according to the 10–20 electrode placement scheme, and data were recorded at a 200 Hz sampling rate using the Grass Telefactor Comet AS40 amplification system. The collected epileptic signals were filtered within the 0.5 to 70 Hz frequency band, then segmented and categorized into three phases: preictal, interictal, and ictal. This process yielded EEG signal data for each epileptic phase required for the study. Folder names denote the seizure phase, with each folder containing 50.mat files storing EEG time-series signals. Each.mat file contains 5.12 seconds of continuous EEG signal comprising 1024 time-series data samples. Fig 2 presents the visualization of EEG signals corresponding to three distinct categories of epilepsy seizures. The horizontal axis represents the number of samples of the EEG signals, and the vertical axis represents the sample value.

Download:

Fig 2. Visualization of EEG signals for three types of epilepsy.

https://doi.org/10.1371/journal.pone.0344164.g002

3.2. Pretreatment

Electroencephalographic (EEG) signals are weak physiological electrical signals that are highly susceptible to external noise contamination during the acquisition process. This results in the desired EEG signals being mixed with various artifacts, which can obscure some of the information carried by the EEG signals and negatively impact subsequent analytical research. To reduce noise in the EEG signals, this study employs a method that combines Ensemble Empirical Mode Decomposition (EEMD) and Time-Frequency Peak Filtering (TFPF) for filtering the EEG signals [27], in order to enhance signal quality and improve the accuracy of subsequent feature extraction. To enhance the dataset and reduce potential information loss at segment boundaries, the continuous EEG signals were divided using a sliding window of 256 data points, with a 50% overlap between successive frames.

3.3. Feature extraction

3.3.1. Pathological features (PLF).

In the electroencephalogram (EEG) signals of epilepsy, spike and sharp waves are two typical pathological waveforms closely associated with seizures. Spike waves usually show transient discharges of high amplitude, while sharp waves are longer in duration and lower in amplitude. The appearance of these two waveforms often signals abnormal firing activity of neurons in the brain. Therefore, extracting and analyzing the characteristics of spike and sharp waves is important for epilepsy detection.

According to the medical descriptions of spikes and sharp waves in the literature [28], spike waves are the most fundamental form of paroxysmal electroencephalographic activity, also known as transient electroencephalography, with a duration of 20–70 ms (14.5 to 50 Hz) and an amplitude exceeding 50 µV. In contrast, spikes have a time limit of 70–200 ms (5 to 14.7 Hz), with an amplitude ranging from approximately 100–200 µV. Based on the definitions above, we propose a method for extracting energy characteristics of pathological waveforms. This involves first extracting all peak information for each sample, and then calculating the corresponding wave energy based on the constraints of duration, amplitude, frequency, and other factors defined. The specific process for extracting pathological features is illustrated in Algorithm 1.

Algorithm 1. Spike-and-sharp wave energy extraction algorithm.

Input:

EEG sample x

wave-path: {x_t,i, y_t,i}_i=1,2

Amplitude range: {x_a,i,y_a,i}_i=1,2

Output:

Wave energy: {E_i}_i=1,2

1. For i = 1–2 do

2. Use the function find-peaks to find all local maximum points in X as a set of candidate peaks for class i^th epileptic wave: P_i = {p_1i,p_2i,...,p_mi}←find-peaks(x,i)

3. Select candidate peaks where wave-path and amplitude fall within the range: {x_t,i, y_t,i} and {x_a,i,y_a,i};

4. ← P_i ^{wath−path∈{x}_t,i^,y_t,i^{},amplitude range∈{x}_a,i^,y_a,i^}

5. End for

6. Return E_i

Where i = 1 and 2, they correspond to spike waves and sharp waves, respectively. The wave-path and amplitude range are defined by their parameters, where denote the lower and upper limits of the wave path within the specified interval, and indicate the lower and upper limits of the amplitude within the specified interval, respectively.

3.3.2. Nonlinear dynamical features (NDF).

(1). Permutation Entropy (PE): Entropy is an important indicator for measuring the degree of disorder in a system; the higher the disorder, the greater the corresponding entropy value. In recent years, an increasing number of studies have demonstrated that the brain is a highly complex chaotic system, and electroencephalographic (EEG) signals, as a typical nonlinear time series signal, exhibit significant nonlinear dynamic characteristics. Consequently, researchers have begun to widely adopt nonlinear dynamic methods for analyzing and modeling EEG signals. Among these, entropy-based features have been extensively applied in various EEG-related tasks such as sleep stage identification, fatigue state monitoring, and seizure detection and prediction, due to their effectiveness in reflecting the complexity and uncertainty of signals. Commonly used entropy measures in EEG analysis include Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (FuzzyEn), and Permutation Entropy (PE). Compared to other types of entropy, Permutation Entropy has distinct advantages in terms of computational efficiency, resistance to noise interference, and data length requirements [29]. Therefore, this paper selects Permutation Entropy as the primary method for extracting nonlinear features from EEG signals to better characterize the dynamic changes of the signals.

For the original EEG sequence , let the embedding dimension be and the delay time be . By performing phase space reconstruction on the original signal, we obtain M reconstruction vectors, where . These reconstruction components can be arranged into matrix, as shown in Equation (1).

(1)

For the reconstructed component , the elements of the component are rearranged in ascending order, as expressed by the following equation:

(2)

In this context, represents the index of each element in the reconstructed components. Therefore, for each reconstructed component after sorting, the corresponding arrangement of indices can be obtained, and different arrangements are recorded as new symbol sequences.

(3)

In the expression , there are at most m! possible permutation results, denoted as . The probability of each symbol sequence appearing is denoted as . Therefore, the permutation entropy of the time series is defined as follows:

(4)

(2). Higuchi fractal dimension (HFD): By constructing multiple sub-sequences with varying time intervals, the slope of the logarithmic length-scale curve is used to estimate the fractal dimension. Unlike traditional methods for calculating fractal dimensions, HFD directly analyzes the temporal structure of the signal, demonstrating remarkable performance in non-stationary biological signals, such as electroencephalograms (EEG). The calculation steps are as follows:
1. Let the scale factor be , and construct sets of different time scale sequences for EEG signals with the following expressions:

(5)

Normalize each subsequence to get the following equation:

(6)

The slope of the fitted logarithm of the normalized length of the subsequence with respect to the scale is the HFD value with the following formula:

(7)

(3). Dynamic Feature Increment (DFI): Dynamic feature increment refers to the first-order difference operation applied to the original feature sequence to capture the trend of changes in feature values between adjacent time windows. By normalizing, taking the absolute value, and summing the first-order differences of each feature, this method effectively enhances the sensitivity of features to state transitions without compromising the physical significance of the original features. By taking the absolute value and summing the first-order differences of the extracted pathological features, entropy features, and classification features, the dynamic intensity index of the electroencephalogram (EEG) signal is calculated. Its mathematical expression is as follows:

(8)

In this equation, , , and represent the normalized values of the first-order differences of the corresponding features, respectively.

3.4. Machine learning based classifier algorithm

To evaluate the effectiveness of the extracted features, several classical classification algorithms are used for the experiments in this paper, including K Nearest Neighbor Algorithm (KNN), Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Extreme Gradient Boosting (XGB). These classifiers have been chosen due to their excellent performances in a wide range of classification tasks and their good adaptability to different types of data distributions.

KNN algorithm is a nonparametric instance-based learning method. It calculates the distance between a new sample and each sample in the training set to find the nearest K neighbors and classifies them according to most of their categories. The KNN algorithm has a simple structure, but it shows better classification effect when the boundary between categories is more complex or irregular.

SVM is a powerful supervised learning algorithm, whose core idea is to find an optimal hyperplane so that the samples of different categories can be separated as much as possible, and the maximum classification interval is maintained. SVM performs well in high-dimensional spaces and is especially suitable for the binary classification problem where there are obvious classification boundaries.

RF is a classification method based on integrated learning, which improves the generalization ability of the model and reduces the overfitting problem by constructing multiple decision trees and combining their prediction results. The method is particularly suitable for dealing with high-dimensional datasets and can achieve high classification accuracy by aggregating the prediction results of individual trees.

DT is a basic but intuitive classification model. It performs classification by dividing the feature space into subsets based on feature values. Although Decision Tree is prone to overfitting when used alone, it provides a good benchmark for comparison with more sophisticated integration methods.

XGB is a sophisticated supervised learning algorithm derived from the gradient boosting framework. The basic principle is to combine multiple weak classifiers to create a high-performance predictive model, while improving the model’s performance by iteratively minimizing a specified loss function. The approach integrates regularization terms in the optimization process, efficiently managing model complexity and reducing overfitting, therefore improving the model’s generalization capacity. Due to its computational efficiency, exceptional scalability, and robust capacity to manage high-dimensional structured data, XGB has been extensively utilized in numerous biomedical signal categorization applications.

3.5. SHAP-based interpretability analysis approach

Shapley Additive exPlanations (SHAP) is a model interpretation method based on the principle of Shapley Value in cooperative game theory, which was first systematically proposed by Lundberg and Lee in 2017. The core idea of the method is to achieve a transparent explanation of the decision-making process of machine learning models by quantifying the marginal contribution of each feature to the model prediction. The advantage of the SHAP method is that it combines both model independence and theoretical rigor, and is able to provide a consistent and interpretable assessment of the importance of features at both the global and local levels. In the study of machine learning model interpretability, the SHAP method solves the problem of inconsistency in interpretation that may exist in traditional feature importance methods through a rigorous mathematical framework, and provides a reliable interpretation tool for the decision-making mechanism of complex models.

For a given input sample, the SHAP value of each feature reflects the average marginal contribution of that feature in all possible combinations with other features. Assuming that the input sample is and the set of features is , for the feature of the sample, the computation of its SHAP value follows the definition of the Shapley value in cooperative game theory, which is mathematically expressed as follows:

(9)

Where is the trained model, is the subset of features that does not contain feature , is the number of elements in the subset of features, and is the total number of all features.

By performing a weighted average of all possible feature subsets, the Shapley value of the feature is ultimately obtained. This process ensures the fairness and completeness of feature contribution assessment. When the prediction model yields a result of for the sample, and the model’s baseline value is , the following equation holds:

(10)

The application of SHAP methods in the task of epilepsy detection based on electroencephalogram (EEG) signals is of great clinical importance. Assuming that the set of EEG features includes various types, such as time domain, frequency domain, and nonlinear dynamic features, it is often difficult for traditional methods to accurately assess the relative importance of each feature to the classification results. SHAP analysis allows quantification of the contribution of each EEG feature to status epilepticus detection. This fine-grained analysis of feature contributions can help to reveal the neurophysiological mechanisms behind epileptic seizures, while also providing interpretable decision support for clinical diagnosis.

4. Experimental results and discussion

4.1. Evaluation indicators

This paper uses accuracy, precision, recall, sensitivity, and F1 score as evaluation metrics for epilepsy detection. First, it is essential to clarify four basic concepts: let TP denote the number of electroencephalogram (EEG) signals classified as epileptic and correctly predicted as epileptic samples; TN represents the number of EEG signals that are non-epileptic and correctly predicted as non-epileptic samples; FP indicates the number of EEG signals that are non-epileptic but incorrectly predicted as epileptic; FN signifies the number of EEG signals that are epileptic but incorrectly predicted as non-epileptic.

Accuracy (Acc): It refers to the ratio of the number of correctly predicted samples to the total number of samples, used to evaluate the basic performance of the algorithm classification. It is the most common metric for model classification evaluation, represented as follows:

(11)

Specificity (Spe): It refers to the proportion of non-epileptic electroencephalograms (EEGs) that are correctly predicted by the model as non-epileptic EEGs. It is used to evaluate the differences in the characteristics of various EEG states extracted by the model, reflecting the model’s ability to avoid false positives, represented as follows:

(12)

Sensitivity (Sen): It refers to the ratio of epileptic signals correctly predicted by the model as epileptic, used to evaluate the algorithm’s sensitivity to epileptic electroencephalogram signals, represented as follows:

(13)

Precision (Pre): It refers to the probability that samples predicted as having epilepsy are indeed actual cases of epilepsy. It is used to measure the severity of false positives, represented as follows:

(14)

F1 Score: It refers to the harmonic mean of precision and sensitivity, taking into account both false positives and false negatives. In this paper, some datasets are imbalanced, and this metric reflects the model’s performance better than accuracy. It is represented as:

(15)

The dataset for this study involved multiple classification tasks. We use macro-averaging to extend the accuracy (Acc), precision (Pre), sensitivity (Sen), and F1 scores from the binary classification task to the multi-category task. Specifically, we compute each evaluation metric separately for each category and then take the average of the evaluation metrics across all categories. This approach can effectively mitigate the impact of data imbalance across different categories on performance, and thus more objectively reflect the model’s recognition performance on different data types.

4.2. Feature validity analysis

Spike energy (SpikeE), sharp energy (SharpE), Higuchi fractal dimension (HFD), and permutation entropy (PE) were calculated for each signal fragment. In order to perform a preliminary a priori analysis of the feature distributions, we used a univariate Box-and-Whisker Plot to visualize the above features for different categories. The Box-and-Whisker Plot can visualize the concentration trend and the degree of dispersion of the data, and effectively identify potential outliers. In the plot, the distribution includes the maximum, minimum, mean, and outliers of the data, and the outliers in the distribution are indicated by circles.

In the box plot analysis of the BONN epilepsy EEG dataset, we concentrated on the disparities in feature distribution between epileptic and non-epileptic states. Fig 3 shows the statistically significant differences in the distribution of each feature between the two groups. It is very clear that the spike waves energy and sharp waves energy are the most discriminative features for distinguishing between epileptic and non-epileptic states, which is consistent with the clinical basis for epilepsy diagnosis. Regarding nonlinear dynamic characteristics, the median HFD in the non-epileptic state was elevated, with a somewhat concentrated data distribution, signifying that the complexity of the EEG signals was maintained at a high and steady level during this condition. Conversely, the median of HFD in the seizure phase group was markedly diminished, exhibiting a comprehensive downward change in distribution. The pattern in the disparities of PE values between the two groups mirrored that of HFD while offering supplementary insights into signal regularity. The median PE in the seizure phase group was much lower than in the non-seizure phase group, and its data distribution displayed comparatively less variability. The decrease in both HFD and PE, two nonlinear dynamic characteristics, generally signifies a reduction in signal complexity, which corresponds with the pathophysiological mechanism of epileptic seizures, where abnormal synchronous discharges from numerous neurons result in simplified and regularized patterns of brain electrical activity.

Download:

Fig 3. Box plot of the BONN dataset features.

https://doi.org/10.1371/journal.pone.0344164.g003

Unlike the BONN dataset, the epileptic EEG signals from the NDSC have been systematically classified by researchers into three discrete phases: preictal, interictal, and ictal. This dataset enables the investigation of the dynamic progression of brain activity before epileptic episodes. In the box-plot analysis of this dataset, we concentrated on the disparities in feature distributions across the different epileptic EEG phases As shown in Fig 4, the extracted four types of features exhibit statistically significant differences between different EEG states, reflecting good discriminative ability.

Download:

Fig 4. Box plot of features of epilepsy dataset from NDSC.

https://doi.org/10.1371/journal.pone.0344164.g004

Among the two complexity-related features, Higuchi fractal dimension (HFD) and permutation entropy (PE), the interictal state has the highest overall feature values, and the distribution is more centralized and less discrete; while after entering the preictal state, both types of features show a significant downward trend, and the inter-individual differences increase, showing a larger degree of discretization. This phenomenon may reflect the fact that the EEG activity in the pre-seizure stage is in the dynamic process of transition from the stable state to the seizure state, and the closer the data points are to the seizure, the more significant the change of the features, while the signals far away from the seizure stage show a relatively slow change trend, suggesting that the potential value of these two features in the prediction of epileptic seizures.

In terms of pathological characteristics, the BONN dataset exhibits the same features, with higher spike energy and sharp wave energy distribution during the seizure phase. In this dataset, we focus on changes prior to the seizure onset, as shown in the Fig 4. The pathological wave energy in the preictal stage showed a significant upward trend, suggesting that abnormal neural activities at the subclinical level may have begun to appear in the brain at this stage. Through the characterization of different EEG states in the NDSC epilepsy EEG dataset, the changing pattern of electrodynamic features before and after the seizure can be clearly identified. Compared with the analysis of the BONN dataset, the NDSC dataset distinctly captures the evolution of preictal feature distributions, which may offer new insights and could inform the theoretical basis for early epilepsy detection and warning.

Through the box plots we also found that the discretization degree of each type of feature showed significant differences between different categories. This distributional instability may affect the robustness and discriminative ability of the features in the classification task. In order to compensate for the limitations of the original features in portraying state transitions, and to further explore the dynamic evolution of EEG signals in the time dimension, the dynamic feature incremental features proposed in this paper can enrich the feature representation system without destroying the physical meaning of the original features, and also provide more discriminative input variables for the design of the subsequent classifier, which can potentially improve the model’s ability to distinguish between different EEG states. It also provides more discriminative input variables for the design of subsequent classifiers, which may improve the model’s ability to distinguish different EEG states.

4.3. Machine learning classification results and analysis

To thoroughly assess the efficacy of the suggested strategy, we developed specific categorization tasks suited to the attributes of each dataset. The Bonn dataset’s main objective was the binary classification of seizure and non-seizure EEG segments, encompassing three comparative experiments: AB vs E, CD vs E, and ABCD vs E. For the NDSC dataset, which offers more detailed temporal annotations, we undertook three specialized tasks: 1) Binary classification of interictal vs ictal phases; 2) Binary classification of preictal vs ictal phases; 3) A more complex multi-class classification encompassing preictal, interictal, and ictal states.

The primary objective of this study is to verify the interpretability of the proposed model and the efficacy of the methodology. Consequently, no parameter optimization was conducted on the classifiers to guarantee consistency and impartiality in the detection results. The primary parameter configurations for the classifiers employed in this investigation are as follows: The radial basis function kernel was selected for SVM, and C was set to 1. The Max_depth was set to 10 for DT, RF, and XGB, and the RF and XGB models included 100 estimators. The KNN classifier was configured with five neighbors. A 10-fold cross-validation technique, stratified at the recording level, was employed to avert data leakage and maintain the integrity of the evaluation. The comprehensive procedure is outlined as follows: Initially, all recordings were divided into 10 segments. In each cross-validation iteration, 9 segments were used for training while 1 segment was retained for testing. Features were retrieved from the windows created from the recordings in the current training fold. Subsequently, normalization parameters were derived exclusively from the characteristics of the training fold and implemented on the features of the matching test fold. The procedure was executed 10 times, and the concluding performance metrics are shown as the mean of the results from the 10 test folds.

Table 1 illustrates the efficacy of the BONN dataset in seizure and non-seizure tasks, highlighting the advantages and applicability of various feature types across different control groups. Pathological characteristics (the energy of spike waves and sharp waves) exhibited remarkable performance across all control groups, attaining accuracies of 96.57%, 94.11%, and 96.84%, respectively, with sensitivities of 96.04%, 93.87%, and 93.1%, demonstrating robust discriminative capability for epileptic conditions. This corresponds with clinical cognition: typical abnormal discharge patterns in epileptic electroencephalograms, such as spike waves and sharp waves, accurately indicate heightened cortical excitability due to their energy fluctuations, rendering them crucial for differentiating between normal and epileptic seizure states. The nonlinear dynamics attribute enhanced the efficacy of the majority of models in the “CD vs E” and “ABCD vs E” classification tasks. This may be ascribed to the fact that, despite the CD control group being in a non-epileptic state, its data were obtained from the intracranial lesion region of epileptic patients, and the EEG signals may still exhibit pathological characteristics. The nonlinear dynamics characteristic can enhance the pathogenic symptoms related to complexity when differentiating from the epileptic state. In the three classification tasks, the integration of the two features yielded the most significant enhancements relative to individual features, achieving ideal accuracy rates of 10.98%, 3.24%, and 7.38%, respectively, along with F1 scores of 12.62%, 4.11%, and 12.25%. The dynamic incremental feature captures nuanced fluctuations in different states at a precise level, enhancing detection performance for both seizure and non-seizure conditions. The integration of pathological waveforms, nonlinear dynamics, and dynamic incremental characteristics markedly enhanced the model’s classification efficacy in the majority of instances.

Download:

Table 1. Classification tasks for the BONN epilepsy EEG dataset.

https://doi.org/10.1371/journal.pone.0344164.t001

The results of the multi-tasks classification experiments based on the NDSC epilepsy EEG dataset are shown in Table 2, where different feature types show significant differences in different stages of epilepsy detection. Pathological characteristics demonstrate exceptional efficacy in differentiating interictal from ictal states, as well as preictal from ictal states, with XGB model accuracies of 98.25% and 95.33%, respectively. This indicates that such features adeptly encapsulate the energy fluctuations of abnormal cerebral discharges during seizures, exhibiting a high level of sensitivity and specificity in state discrimination. In the nuanced classification task of preictal, interictal, and ictal states, effective differentiation is challenging when solely depending on pathological or nonlinear kinetic features. The accuracy of XGB is merely 67.2% and 51.3%, indicating the restricted efficacy of these features in identifying subtle state transitions. The effective integration of the two feature types significantly enhances detection performance, with the XGB model achieving accuracies of 17.86% and 33.76%, demonstrating that the fused features can elucidate the nuanced variations in the complexity of the neural network. Moreover, the incremental dynamical features enhance the model’s robustness and generalization capacity across most tasks; specifically, the XGB model attained improvements of 1.08%, 0.66%, and 1.24% in accuracy across three classification tasks when employing the three feature fusion methods. The accuracy has been enhanced to 98.25%, 95.33%, and 86.3%. The F1 scores were enhanced to 98.23%, 95.27%, and 85.79%, respectively. They demonstrate their pivotal significance in encapsulating the trajectory of state evolution.

Download:

Table 2. Classification tasks for the NDSC epilepsy EEG dataset.

https://doi.org/10.1371/journal.pone.0344164.t002

To provide a more intuitive view of the robustness of the proposed model, Fig 5 presents bar charts depicting the classification performance of different classifiers. Panel (a) illustrates the performance comparison of various classifiers on three classification tasks within the BONN dataset, while panel (b) shows the performance comparison of different classifiers on three classification tasks within the epileptic EEG dataset from NDSC.

Download:

Fig 5. Performance comparison of different classifiers in the epilepsy detection task.

(a) Three detection tasks on the BONN dataset; (b) Three detection tasks on the NDSC dataset.

https://doi.org/10.1371/journal.pone.0344164.g005

The bar charts comparing the detection performance of the two datasets show that the performance of the different classifiers is highly similar. In the three classification tasks of the BONN dataset, the standard deviation range of each evaluation metric on different classifiers is 0.01–0.03; in the corresponding tasks of the NDSC dataset, the standard deviation range is slightly larger, 0.03–0.05. This smaller standard deviation range adequately suggests that the difference in performance between different classification models is not statistically significant. This finding further validates that the feature extraction method proposed in this study has excellent discriminative ability in the epilepsy detection task, and its effectiveness is minimally affected by the significant influence of the specific classifier selection, with good generalization ability.

Combining the classification tasks of the above two datasets, some common patterns can be found, i.e., pathological features are the most discriminative for seizure status, which are consistent with clinical diagnostic criteria and highly interpretable. Nonlinear features enhance the ability to perceive subtle neural activity changes, especially in the preictal period, which is consistent with the fact that the brain has already shown subclinical manifestations prior to the somatization response, and is a very effective features, while the dynamic incremental features provide evolutionary information in the time dimension. The combination of the three constructs a multi-level and multi-angle epilepsy detection feature system from the perspective of static and dynamic features, and from the perspective of multidomain fusion, providing a methodological basis for realizing a high-precision and high-robust automatic identification system. It is fully proven that our proposed features are highly competitive.

To thoroughly analyze the classification error distribution characteristics of the proposed method and validate its classification robustness and feature discriminative power, this study selected the most challenging classification tasks from two datasets: the ABCD vs E control group on the BONN dataset and the preictal (P) vs interictal (I) vs ictal (S) control group on the NDSC dataset. The confusion matrices for the top three performing models are analyzed, as shown in Fig 6. The rows and columns of the matrix correspond to true labels and predicted labels, respectively. All classifiers exhibit highly consistent patterns in distinguishing non-seizure (ABCD) from seizure (E) periods. The non-seizure correct recognition rate consistently remains above 96.52%, with only a small number misclassified as seizure. The seizure correct recognition rate also stably maintains above 92.49%, and the misclassification rate for both categories remains consistently low. For the NDSC dataset, the confusion matrix reveals that all classifiers maintain high recognition accuracy for seizure periods and equally stable recognition for interictal periods. The primary source of error significantly stems from misclassifications of samples in the preictal and interictal phases. This aligns perfectly with clinical reality, demonstrating both the features’ precise capture of physiological differences across phases and their interpretability.

Download:

Fig 6. Confusion matrix of models in two datasets detection tasks.

(a)~(c) results of different classifiers on ABCD-E classification task; (d)~(f) results of different classifiers on interictal vs preictal vs ictal.

https://doi.org/10.1371/journal.pone.0344164.g006

4.4. Feature interpretability analysis

Interpretability makes models transparent and trustworthy, and the interpretability provided by these techniques helps to trust the decisions made by automated models. Among them SHAP provides a way to analyze the contribution of each feature to the total model prediction. From the experiments, it can be seen that the XGB classifier achieves the highest classification performance across all six tasks. Therefore, we plotted the XGB based SHAP plot.

Fig 7 shows the SHAP plots for the three classification tasks of the BONN dataset and NDSC. SHAP value analysis results indicate that across all epilepsy vs. non-seizure classification tasks in both datasets, two pathological features consistently emerged as the most critical indicators for distinguishing epileptic seizures, demonstrating exceptional discriminatory power. Nonlinear dynamical features characterize different states of EEG signals from a physiological perspective, such as complexity and regularity, playing a particularly crucial role in the most challenging three-classification task: distinguishing between preictal, interictal, and ictal periods. These features effectively reflect the randomness and complexity of EEG signals, providing key quantitative evidence for epilepsy detection. Additionally, the Dynamic Feature Increment (DFI) offers valuable supplementary information across all tasks. SHAP interpretability analysis corroborates the machine learning-based detection results.

Download:

Fig 7. SHAP plot analysis of the BONN dataset and NDSC dataset.

(a) BONN dataset (b) NDSC dataset, I stands for interictal, P for preictal, and S for ictal.

https://doi.org/10.1371/journal.pone.0344164.g007

To further illustrate the strong reliability of our interpretability analysis, we concurrently plotted feature importance analysis based on XBG classification. Fig 8 illustrates that, with the exception of the CD vs E control group, the feature importance rankings for all other classification tasks are in complete concordance with those derived from the SHAP-based interpretability study. Within the CD versus E control group, sharp waves energy is just slightly superior to spikes wave energy, exhibiting low variation in the SHAP analysis. This repeatedly illustrates that pathological characteristics exert complete dominance in both seizure and non-seizure classification tests. Collectively, these findings not only validate the representational power of the selected features but also confirm that the effective integration of pathological and nonlinear dynamical features significantly enhances the overall performance of epilepsy detection.

Download:

Fig 8. Feature importance analysis of epilepsy dataset from BONN and NDSC.

(a) BONN dataset (b) NDSC dataset, I stands for Interictal, P for Preictal, and S for Ictal.

https://doi.org/10.1371/journal.pone.0344164.g008

4.5. Results and analysis of cross-subject experiments

This work conducted focused cross-dataset validation studies to assess the generalization capability of the proposed feature combination and model. To enhance the uniformity of EEG signal attributes, the ABCD versus E control group from the BONN dataset and the Interictal versus Ictal control group from the NDSC dataset were chosen, employing the “single dataset training and another dataset testing” methodology for cross-dataset experimentation. The findings are presented in Table 3. Utilizing the BONN dataset with an expanded sample size for training and the NDSC dataset for testing, the XGB model achieves a classification accuracy of 82.17% an F1 score of 81.14%, while maintaining a balance in sensitivity and specificity. Utilizing the NDSC dataset with a reduced sample size for training resulted in a notable loss in the model’s metrics on the BONN dataset: the accuracy decreased to 77.17%, and the F1 score fell to 77.13%. Despite the absence of substantial divergence among the criteria, the total performance was marginally inferior to the previous one. Although model performance decreased following cross-dataset transfer relative to validation inside a single dataset, it remained substantially superior to random guessing. This outcome demonstrates that the feature combination suggested in this study can proficiently identify the shared discriminative characteristics of EEG signals during ictal and interictal phases, remaining largely unaffected by variations in dataset origins and patient demographics, thereby possessing notable cross-data applicability. It also confirms that the model is not too tailored to the unique characteristics of a singular dataset.

Download:

Table 3. Results of the cross-dataset epileptic seizure detection.

https://doi.org/10.1371/journal.pone.0344164.t003

4.6. Comparative experimental results and analysis

In order to validate the effectiveness of the proposed method in this study, comparative experiments are conducted between the proposed model and the latest epilepsy detection methods, which include feature extraction-based machine learning methods, feature extraction-based deep learning methods, and end-to-end deep learning-based methods, aiming to highlight the advancement of the proposed model while making a comprehensive comparison (Table 4).

Download:

Table 4. Comparison of the latest methods on the BONN dataset.

https://doi.org/10.1371/journal.pone.0344164.t004

From the table, we can find that the existing epilepsy detection methods for feature extraction and machine learning have a large number of selected features, Chen et al extracted 13 traditional features from different wavelet subbands, Al-Hadeethi et al extracted 10 basic features, and Qureshi et al. even extracted 14 features, and these methods need to rely on the feature selection methods to screen the extracted features with effective features to achieve better detection performance, too many features will bring information redundancy, after feature screening will increase the complexity of the detection model, and the proposed features are mostly based on the statistical properties of the data, and the medical interpretability is poor. With the rapid advancement of deep learning technology, numerous scholars have applied it to epilepsy detection research. For instance, Wang et al. proposed an end-to-end detection method combining CNN and LSTM, achieving 98.3% accuracy in the AB vs E control group of the BONN dataset; Jiang et al. achieved 97.14% accuracy in the CD vs. E control group using a TSK fuzzy classifier. However, despite their superior performance, deep learning models inherently suffer from complex structures and poor interpretability. The method proposed in this paper utilizes only five physiologically interpretable features, achieving excellent performance across different classifiers without requiring feature selection. Notably, across all detection tasks based on the XGB classifier, accuracy ranged from 97.47% to 98.73%, representing a maximum improvement of 1.06% over traditional machine learning models and 8.73% over deep learning detection models. The proposed epilepsy detection model breaks the conventional paradigm that high performance detection must rely on deep learning or complex feature engineering, providing a more transparent foundation for clinical decision-making tasks.

5. Conclusions

This study tackles the shortcomings of conventional epileptic EEG detection techniques, including inadequate model interpretability and restricted generalization ability, by introducing an interpretable low-dimensional feature amalgamation and a machine learning framework grounded in physiological mechanisms, thus offering a novel methodology for automated epilepsy diagnosis. This study’s feature combination, based on the physiological properties of epileptic convulsions, encompasses pathogenic features, nonlinear dynamic features, and their dynamic increments. This method not only corresponds with the pathophysiological nature of EEG signals but also mitigates model redundancy resulting from intricate features. This feature combination exhibits stable and consistent classification performance across multiple public datasets using five widely utilized classifiers: SVM, RF, DT, KNN, and XGB. This indicates robust discriminative power and broad applicability, showing no significant dependence on specific models, thereby offering flexibility for model selection in various application contexts. This study employed the Shap interpretability analysis approach to examine the decision-making logic of the model, elucidate the contribution weights of each feature across several seizure stages of epilepsy, and align the model’s predictive outcomes with clinical pathological understanding. The cross-dataset validation results indicate that while model performance is slightly diminished relative to a single dataset, it sustains a high overall detection rate, confirming the robustness of the feature combination and the model’s cross-domain adaptability, thereby offering dependable support for clinical applications.

The study possesses specific limitations. The free datasets included in this research exhibit limited sample sizes and are primarily comprised of brief EEG segments, lacking validation against extensive, continuous, long-term EEG data. Subsequent research should assess model efficacy using extensive long-term EEG datasets. The data segmentation strategy necessitates optimization. The fixed-length segmentation method utilized in this study may induce border effects between physiological states and does not adequately address the dynamic, time-varying properties of EEG signals. Shorter segments may hinder the detection of low-frequency features, whereas longer segments could hide the nuances of localized aberrant discharges. Future research should investigate adaptive segmentation techniques initiated by physiological events to improve the specificity and precision of feature extraction.

References

1. Epilepsy. World Health Organization. 2024. Available from: https://www.who.int/news-room/fact-sheets/detail/epilepsy
2. Spagnoli C, Fusco C, Pisani F. Rett syndrome spectrum in monogenic developmental-epileptic encephalopathies and epilepsies: a review. Genes (Basel). 2021;12(8):1157. pmid:34440332
- View Article
- PubMed/NCBI
- Google Scholar
3. Aslam MH, Usman SM, Khalid S, Anwar A, Alroobaea R, Hussain S, et al. Classification of EEG signals for prediction of epileptic seizures. Appl Sci. 2022;12(14):7251.
- View Article
- Google Scholar
4. Zazzaro G, Cuomo S, Martone A, Montaquila RV, Toraldo G, Pavone L. EEG signal analysis for epileptic seizures detection by applying data mining techniques. Internet of Things. 2021;14:100048.
- View Article
- Google Scholar
5. Molina E, Sarmiento Torres CE, Salazar-Cabrera R, López DM, Vargas-Cañas R. Intelligent telehealth system to support epilepsy diagnosis. J Multidiscip Healthc. 2020;13:433–45. pmid:32523350
- View Article
- PubMed/NCBI
- Google Scholar
6. Ahmad I, Wang X, Javeed D, Kumar P, Samuel OW, Chen S. A hybrid deep learning approach for epileptic seizure detection in EEG signals. IEEE J Biomed Health Inform. 2026;30(2):1019–29. pmid:37037252
- View Article
- PubMed/NCBI
- Google Scholar
7. Chen Y, Ou Z, Zhou D, Wu X. Advancements and challenges of artificial intelligence-assisted electroencephalography in epilepsy management. J Clin Med. 2025;14(12):4270. pmid:40566015
- View Article
- PubMed/NCBI
- Google Scholar
8. Rana N, Thakur T, Jain S. Smart seizure detection system: machine learning based model in healthcare IoT. Curr Aging Sci. 2025;18(1):29–38. pmid:38706349
- View Article
- PubMed/NCBI
- Google Scholar
9. S A, S SK, Mohan N, KP S. EEG based automated detection of seizure using machine learning approach and traditional features. Exp Syst Appl. 2024;251:123991.
- View Article
- Google Scholar
10. Dedeo M, Garg M. Early detection of pediatric seizures in the high gamma band. IEEE Access. 2021;9:85209–16.
- View Article
- Google Scholar
11. Viana PF, Remvig LS, Duun-Henriksen J, Glasstetter M, Dümpelmann M, Nurse ES, et al. Signal quality and power spectrum analysis of remote ultra long-term subcutaneous EEG. Cold Spring Harbor Laboratory; 2021. https://doi.org/10.1101/2021.04.15.21255388
12. Bai L, Litscher G, Li X. Epileptic seizure detection using machine learning: a systematic review and meta-analysis. Brain Sci. 2025;15(6):634.
- View Article
- Google Scholar
13. Slimen IB, Boubchir L, Seddik H. Epileptic seizure prediction based on EEG spikes detection of ictal-preictal states. J Biomed Res. 2020;34(3):162–9. pmid:32561696
- View Article
- PubMed/NCBI
- Google Scholar
14. Chen W, Wang Y, Ren Y, Jiang H, Du G, Zhang J, et al. An automated detection of epileptic seizures EEG using CNN classifier based on feature fusion with high accuracy. BMC Med Inform Decis Mak. 2023;23(1):96. pmid:37217878
- View Article
- PubMed/NCBI
- Google Scholar
15. Al-Hadeethi H, Abdulla S, Diykh M, Green JH. Determinant of covariance matrix model coupled with AdaBoost classification algorithm for EEG seizure detection. Diagnostics (Basel). 2021;12(1):74. pmid:35054242
- View Article
- PubMed/NCBI
- Google Scholar
16. Aayesha , Bilal Qureshi M, Afzaal M, Shuaib Qureshi M, Gwak J. Fuzzy-based automatic epileptic seizure detection framework. Computers, Materials & Continua. 2022;70(3):5601–30.
- View Article
- Google Scholar
17. Wang X, Wang Y, Liu D, Wang Y, Wang Z. Automated recognition of epilepsy from EEG signals using a combining space-time algorithm of CNN-LSTM. Sci Rep. 2023;13(1):14876. pmid:37684278
- View Article
- PubMed/NCBI
- Google Scholar
18. Xu X, Lin M, Xu T. Epilepsy seizures prediction based on nonlinear features of EEG signal and gradient boosting decision tree. Int J Environ Res Public Health. 2022;19(18):11326. pmid:36141613
- View Article
- PubMed/NCBI
- Google Scholar
19. Wang Z, Hu Y, Xin Q, Jin G, Zhao Y, Zhou W, et al. EEG-based seizure detection using dual-branch CNN-ViT network integrating phase and power spectrograms. Brain Sci. 2025;15(5):509. pmid:40426681
- View Article
- PubMed/NCBI
- Google Scholar
20. Atlam HF, Aderibigbe GE, Nadeem MS. Effective epileptic seizure detection with hybrid feature selection and SMOTE-based data balancing using SVM classifier. Appl Sci. 2025;15(9):4690.
- View Article
- Google Scholar
21. Hazarika U, Borah BB, Roy S, Saikia MJ. Automated detection of aberrant episodes in epileptic conditions: leveraging EEG and machine learning algorithms. Bioengineering (Basel). 2025;12(4):355. pmid:40281715
- View Article
- PubMed/NCBI
- Google Scholar
22. Fawad Hussain S, Mian Qaisar S. Epileptic seizure classification using level-crossing EEG sampling and ensemble of sub-problems classifier. Exp Syst Appl. 2022;191:116356.
- View Article
- Google Scholar
23. Epileptic electroencephalogram classification using relative wavelet sub-band energy and wavelet entropy. IJE. 2021;34(1).
- View Article
- Google Scholar
24. Liu S, Zhou Y, Yang X, Wang X, Yin J. A robust automatic epilepsy seizure detection algorithm based on interpretable features and machine learning. Electronics. 2024;13(14):2727.
- View Article
- Google Scholar
25. Andrzejak RG, Lehnertz K, Mormann F, Rieke C, David P, Elger CE. Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: dependence on recording region and brain state. Phys Rev E Stat Nonlin Soft Matter Phys. 2001;64(6 Pt 1):061907. pmid:11736210
- View Article
- PubMed/NCBI
- Google Scholar
26. Swami P, Panigrahi BK, Nara S, Bhatia M, Gandhi T. EEG Epilepsy Datasets. 2016. https://doi.org/10.13140/RG.2.2.14280.32006
27. Chen X, Sun Y. A method to denoise the epileptic EEG by EEMD and TFPF. 2021 International Conference on Electronic Information Engineering and Computer Science (EIECS). Changchun, China: IEEE; 2021. pp. 197–200. https://doi.org/10.1109/eiecs53707.2021.9587941
28. Ding S, Zhu H, Jia W, Su C. A survey on feature extraction for pattern recognition. Artif Intell Rev. 2011;37(3):169–80.
- View Article
- Google Scholar
29. Bandt C, Pompe B. Permutation entropy: a natural complexity measure for time series. Phys Rev Lett. 2002;88(17):174102. pmid:12005759
- View Article
- PubMed/NCBI
- Google Scholar
30. Swami P, Gandhi TK, Panigrahi BK, Tripathi M, Anand S. A novel robust diagnostic model to detect seizures in electroencephalography. Exp Syst Appl. 2016;56:116–30.
- View Article
- Google Scholar
31. Kaya Y, Uyar M, Tekin R, Yıldırım S. 1D-local binary pattern based feature extraction for classification of epileptic EEG signals. Appl Math Comput. 2014;243:209–19.
- View Article
- Google Scholar
32. Jiang YL, Weng JF, Shen Q, Hu WJ, Zhang XT. TSK fuzzy classifier based on enhanced deep feature for epilepsy EEG signal recognition. Control Decision. 2023;38:171–80.
- View Article
- Google Scholar
33. Kumar Y, Dewal ML, Anand RS. Epileptic seizure detection using DWT based fuzzy approximate entropy and support vector machine. Neurocomputing. 2014;133:271–9.
- View Article
- Google Scholar

[ref1] 1. Epilepsy. World Health Organization. 2024. Available from: https://www.who.int/news-room/fact-sheets/detail/epilepsy

[ref2] 2. Spagnoli C, Fusco C, Pisani F. Rett syndrome spectrum in monogenic developmental-epileptic encephalopathies and epilepsies: a review. Genes (Basel). 2021;12(8):1157. pmid:34440332
View Article
PubMed/NCBI
Google Scholar

[3] View Article

[4] PubMed/NCBI

[5] Google Scholar

[ref3] 3. Aslam MH, Usman SM, Khalid S, Anwar A, Alroobaea R, Hussain S, et al. Classification of EEG signals for prediction of epileptic seizures. Appl Sci. 2022;12(14):7251.
View Article
Google Scholar

[7] View Article

[8] Google Scholar

[ref4] 4. Zazzaro G, Cuomo S, Martone A, Montaquila RV, Toraldo G, Pavone L. EEG signal analysis for epileptic seizures detection by applying data mining techniques. Internet of Things. 2021;14:100048.
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref5] 5. Molina E, Sarmiento Torres CE, Salazar-Cabrera R, López DM, Vargas-Cañas R. Intelligent telehealth system to support epilepsy diagnosis. J Multidiscip Healthc. 2020;13:433–45. pmid:32523350
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref6] 6. Ahmad I, Wang X, Javeed D, Kumar P, Samuel OW, Chen S. A hybrid deep learning approach for epileptic seizure detection in EEG signals. IEEE J Biomed Health Inform. 2026;30(2):1019–29. pmid:37037252
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref7] 7. Chen Y, Ou Z, Zhou D, Wu X. Advancements and challenges of artificial intelligence-assisted electroencephalography in epilepsy management. J Clin Med. 2025;14(12):4270. pmid:40566015
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref8] 8. Rana N, Thakur T, Jain S. Smart seizure detection system: machine learning based model in healthcare IoT. Curr Aging Sci. 2025;18(1):29–38. pmid:38706349
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref9] 9. S A, S SK, Mohan N, KP S. EEG based automated detection of seizure using machine learning approach and traditional features. Exp Syst Appl. 2024;251:123991.
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref10] 10. Dedeo M, Garg M. Early detection of pediatric seizures in the high gamma band. IEEE Access. 2021;9:85209–16.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref11] 11. Viana PF, Remvig LS, Duun-Henriksen J, Glasstetter M, Dümpelmann M, Nurse ES, et al. Signal quality and power spectrum analysis of remote ultra long-term subcutaneous EEG. Cold Spring Harbor Laboratory; 2021. https://doi.org/10.1101/2021.04.15.21255388

[ref12] 12. Bai L, Litscher G, Li X. Epileptic seizure detection using machine learning: a systematic review and meta-analysis. Brain Sci. 2025;15(6):634.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref13] 13. Slimen IB, Boubchir L, Seddik H. Epileptic seizure prediction based on EEG spikes detection of ictal-preictal states. J Biomed Res. 2020;34(3):162–9. pmid:32561696
View Article
PubMed/NCBI
Google Scholar

[39] View Article

[40] PubMed/NCBI

[41] Google Scholar

[ref14] 14. Chen W, Wang Y, Ren Y, Jiang H, Du G, Zhang J, et al. An automated detection of epileptic seizures EEG using CNN classifier based on feature fusion with high accuracy. BMC Med Inform Decis Mak. 2023;23(1):96. pmid:37217878
View Article
PubMed/NCBI
Google Scholar

[43] View Article

[44] PubMed/NCBI

[45] Google Scholar

[ref15] 15. Al-Hadeethi H, Abdulla S, Diykh M, Green JH. Determinant of covariance matrix model coupled with AdaBoost classification algorithm for EEG seizure detection. Diagnostics (Basel). 2021;12(1):74. pmid:35054242
View Article
PubMed/NCBI
Google Scholar

[47] View Article

[48] PubMed/NCBI

[49] Google Scholar

[ref16] 16. Aayesha , Bilal Qureshi M, Afzaal M, Shuaib Qureshi M, Gwak J. Fuzzy-based automatic epileptic seizure detection framework. Computers, Materials & Continua. 2022;70(3):5601–30.
View Article
Google Scholar

[51] View Article

[52] Google Scholar

[ref17] 17. Wang X, Wang Y, Liu D, Wang Y, Wang Z. Automated recognition of epilepsy from EEG signals using a combining space-time algorithm of CNN-LSTM. Sci Rep. 2023;13(1):14876. pmid:37684278
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref18] 18. Xu X, Lin M, Xu T. Epilepsy seizures prediction based on nonlinear features of EEG signal and gradient boosting decision tree. Int J Environ Res Public Health. 2022;19(18):11326. pmid:36141613
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref19] 19. Wang Z, Hu Y, Xin Q, Jin G, Zhao Y, Zhou W, et al. EEG-based seizure detection using dual-branch CNN-ViT network integrating phase and power spectrograms. Brain Sci. 2025;15(5):509. pmid:40426681
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref20] 20. Atlam HF, Aderibigbe GE, Nadeem MS. Effective epileptic seizure detection with hybrid feature selection and SMOTE-based data balancing using SVM classifier. Appl Sci. 2025;15(9):4690.
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref21] 21. Hazarika U, Borah BB, Roy S, Saikia MJ. Automated detection of aberrant episodes in epileptic conditions: leveraging EEG and machine learning algorithms. Bioengineering (Basel). 2025;12(4):355. pmid:40281715
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref22] 22. Fawad Hussain S, Mian Qaisar S. Epileptic seizure classification using level-crossing EEG sampling and ensemble of sub-problems classifier. Exp Syst Appl. 2022;191:116356.
View Article
Google Scholar

[73] View Article

[74] Google Scholar

[ref23] 23. Epileptic electroencephalogram classification using relative wavelet sub-band energy and wavelet entropy. IJE. 2021;34(1).
View Article
Google Scholar

[76] View Article

[77] Google Scholar

[ref24] 24. Liu S, Zhou Y, Yang X, Wang X, Yin J. A robust automatic epilepsy seizure detection algorithm based on interpretable features and machine learning. Electronics. 2024;13(14):2727.
View Article
Google Scholar

[79] View Article

[80] Google Scholar

[ref25] 25. Andrzejak RG, Lehnertz K, Mormann F, Rieke C, David P, Elger CE. Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: dependence on recording region and brain state. Phys Rev E Stat Nonlin Soft Matter Phys. 2001;64(6 Pt 1):061907. pmid:11736210
View Article
PubMed/NCBI
Google Scholar

[82] View Article

[83] PubMed/NCBI

[84] Google Scholar

[ref26] 26. Swami P, Panigrahi BK, Nara S, Bhatia M, Gandhi T. EEG Epilepsy Datasets. 2016. https://doi.org/10.13140/RG.2.2.14280.32006

[ref27] 27. Chen X, Sun Y. A method to denoise the epileptic EEG by EEMD and TFPF. 2021 International Conference on Electronic Information Engineering and Computer Science (EIECS). Changchun, China: IEEE; 2021. pp. 197–200. https://doi.org/10.1109/eiecs53707.2021.9587941

[ref28] 28. Ding S, Zhu H, Jia W, Su C. A survey on feature extraction for pattern recognition. Artif Intell Rev. 2011;37(3):169–80.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

[ref29] 29. Bandt C, Pompe B. Permutation entropy: a natural complexity measure for time series. Phys Rev Lett. 2002;88(17):174102. pmid:12005759
View Article
PubMed/NCBI
Google Scholar

[91] View Article

[92] PubMed/NCBI

[93] Google Scholar

[ref30] 30. Swami P, Gandhi TK, Panigrahi BK, Tripathi M, Anand S. A novel robust diagnostic model to detect seizures in electroencephalography. Exp Syst Appl. 2016;56:116–30.
View Article
Google Scholar

[95] View Article

[96] Google Scholar

[ref31] 31. Kaya Y, Uyar M, Tekin R, Yıldırım S. 1D-local binary pattern based feature extraction for classification of epileptic EEG signals. Appl Math Comput. 2014;243:209–19.
View Article
Google Scholar

[98] View Article

[99] Google Scholar

[ref32] 32. Jiang YL, Weng JF, Shen Q, Hu WJ, Zhang XT. TSK fuzzy classifier based on enhanced deep feature for epilepsy EEG signal recognition. Control Decision. 2023;38:171–80.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref33] 33. Kumar Y, Dewal ML, Anand RS. Epileptic seizure detection using DWT based fuzzy approximate entropy and support vector machine. Neurocomputing. 2014;133:271–9.
View Article
Google Scholar

[104] View Article

[105] Google Scholar

Figures

Abstract

1. Introduction

2. Related work

3. Materials and methods

3.1. Data sets

3.2. Pretreatment

3.3. Feature extraction

3.3.1. Pathological features (PLF).

3.3.2. Nonlinear dynamical features (NDF).

3.4. Machine learning based classifier algorithm

3.5. SHAP-based interpretability analysis approach

4. Experimental results and discussion

4.1. Evaluation indicators

4.2. Feature validity analysis

4.3. Machine learning classification results and analysis

4.4. Feature interpretability analysis

4.5. Results and analysis of cross-subject experiments

4.6. Comparative experimental results and analysis

5. Conclusions

References