Abstract
The peripheral nervous system plays a crucial role in facilitating communication between biological systems. However, decoding neural signals from peripheral nerve recordings remains a challenge due to their complex spatiotemporal patterns. In this study, we propose a graph-based learning approach to more effectively capture temporal and spatial information for classifying neural signal patterns. Unlike previous work, our method incorporates the physical geometry of the nerve cuff, addressing the underrepresented relationships between electrodes. We used a publicly available dataset consisting of neural recordings from eight Long-Evans rats, obtained using a 56-channel nerve cuff electrode. We constructed graphs where each node represents the time series recorded from an electrode, and edges correspond to the distances between electrodes along the surface of the nerve cuff (e.g., geodesic distance). We employed a leave-one-out strategy to evaluate the generalizability of the approach. We further evaluated the within-rat performance of the model by training on two folds and testing on the remaining fold of each rat’s data. In generalizability evaluation, we achieved a mean F1 score of 65.03%, representing a 17.74% improvement over the previous study, and in within-rat testing, we achieved a mean F1 score of 77.50%, representing a 3.14% increase. These findings highlight the value of incorporating the recording geometry into model design, particularly in this small dataset setting, where explicit spatial priors help compensate for limited training examples and improve decoding performance.
Citation: Ji RQ, Dousty M, Koh RGL, Sejdić E (2026) Enhancing generalizability in classification of peripheral neural recordings with graph neural network. PLoS One 21(4): e0345204. https://doi.org/10.1371/journal.pone.0345204
Editor: Luca Citi, University of Essex, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND
Received: August 29, 2025; Accepted: March 3, 2026; Published: April 17, 2026
Copyright: © 2026 Ji et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data used in this study were from a publicly available dataset available at Borealis, U of T Dataverse: https://doi.org/10.5683/SP3/JRZDDR.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The nervous system is a fundamental component of biological function, serving as the primary communication network within the body [1]. It is responsible for transmitting electrical and chemical signals that regulate movement, sensation, and autonomic processes [2,3]. The peripheral nervous system facilitates bidirectional communication between the brain and the rest of the body [4,5]. As such, peripheral neural signals encode crucial information about sensory inputs and motor commands, making their accurate interpretation essential for various biomedical engineering applications, and the diagnosis of neurologic disorders [6–8].
Nevertheless, obtaining reliable and selective recordings in the peripheral nervous system is challenging [9]. Recent studies have utilized multi-contact electrodes [10–12] to capture both the temporal and spatial information of neural recordings in Long-Evans rats. Convolutional neural networks (CNNs) have demonstrated significant capability in analyzing physiological data [13–16]. Koh et al. proposed a CNN-based approach to classify three afferent activities in peripheral neural recordings from Long-Evans rats: dorsiflexion, plantarflexion, and pricking [12]. These movements are fundamental to locomotion, balance, and pain perception [17]. Thus, classifying these three types of neural activity is critical for understanding motor control mechanisms. Such classification has direct implications in neuroprosthetics, rehabilitation, and sensory feedback systems for amputees or patients with neuromuscular disorders [17,18].
CNNs operate through receptive fields and therefore capture the structural layout of multichannel neural recordings, enabling them to model local spatial dependencies through weight sharing [19]. Transformers, based on self-attention, lack the strong architectural inductive biases of convolutional networks, such as limited receptive fields and translational invariance, that can be beneficial for spatially structured inputs [20]. Although Transformers can model long-range dependencies [21], they require substantially more data to learn useful representations in the absence of such biases, and they do not inherently encode the geometric relationships present in nerve cuff recordings. Similarly, other neural network architectures commonly applied to peripheral neural signal analysis, as well as classic machine learning classifiers, do not encode the circumferential arrangement of electrodes around the nerve, meaning that important physiological dimensions of electrode placement, such as the true inter-electrode distances and the way neural signals propagate across adjacent contacts, remain unmodeled. In contrast, graph neural networks (GNNs) offer greater flexibility in representing irregular topologies, as they enable a more adaptive encoding of both temporal and spatial relationships in neural data [22]. Unlike CNNs, which require large amounts of data to learn spatial filters on fixed grid structures, GNNs incorporate structural priors through graph connectivity, enabling more effective biosignal analysis and improved generalization in data-limited settings [23–27]. Previous work has also shown that combining temporal and spatial information yields the highest performance [28]; thus, the main motivation for using GNNs in our context is their ability to encode the data’s temporal and structural relationships simultaneously.
For instance, electrodes located on opposite sides of a nerve cuff may record similar physiological activity, yet this relationship cannot be effectively captured using conventional CNN or Transformer architectures. By modeling these electrode connections as edges in a graph, GNNs naturally integrate this spatial context alongside temporal dynamics.
In this paper, we propose a graph-based learning approach for classifying peripheral neural signals. Our key contributions are
- 1) Geometry-aware graph construction: We model nerve cuff electrodes as graph nodes and define edges based on geodesic distances to encode both the physical arrangement of electrodes and functional similarities between neural signals. We examine how varying graph connectivity affects model performance and conduct ablation studies to verify that the proposed graph-based approach is the primary contributor to the performance gains. In the ablation studies, we also compare feature extraction from neural recordings using an LSTM module versus a 1D CNN module.
- 2) Improved generalizability in small-data regimes: Our approach achieves higher classification accuracy and better generalization, outperforming the CNN baseline from previous studies and highlighting the importance of encoding information with graphs in smaller datasets.
Materials and methods
Data description & preprocessing
Neural recordings previously collected from the sciatic nerve of nine Long-Evans rats using a 56-contact nerve cuff electrode were used [11]. In this study, we excluded one rat due to degradation of the plantarflexion signal, resulting in a dataset comprising eight rats. The 56-channel nerve cuff electrode consists of 7 rings of 8 contacts each, evenly distributed over the length of the electrode. The recordings were acquired at a sampling frequency of 30 kHz with a neural data acquisition board (RHD2000, Intan Technologies, USA) [11]. Three afferent activities, dorsiflexion, plantarflexion, and pricking, were manually performed to evoke neural activity in the recording. Naturally evoked compound action potentials (nCAPs), produced by proprioceptive or mechano-sensory afferent activity in response to physiological limb movements or mechanical stimulation, were detected and used to construct spatiotemporal signatures for each activity. Each spatiotemporal signature is a matrix in which rows represent the neural activity of individual channels over time, and columns capture signals at specific time points across channels, resulting in a matrix of size 56 by 100 (number of contacts by time samples) [11,12,29]. These spatiotemporal signatures were then constructed into graph structures, which were fed into our proposed model for classification of the three activities. Representing the data as graphs allows us to explicitly encode spatial relationships between electrodes, which may capture physiologically meaningful similarities that conventional matrix-based approaches overlook. The number of samples for each rat is presented in Table 1.
Graphs & adjacency matrices
A graph can be represented mathematically as an ordered pair G = (V, E), where V represents the nodes, and E represents the edges connecting pairs of nodes [30]. The graph can be expressed through an adjacency matrix A, where Aij indicates the presence of an edge between nodes i and j. If only the existence of edges is considered, the graph is unweighted, with Aij = 1 if an edge exists between i and j, and Aij = 0 otherwise. In weighted graphs, non-zero values of Aij also represent the strength or significance of the connection [31]. GNNs are deep learning architectures designed to operate directly on graph-structured data. Unlike conventional models that assume regularly structured 1D, 2D, or 3D inputs, GNNs leverage both node features and the connectivity defined by edges to learn representations that capture the underlying topology [22]. Through iterative message passing, each node aggregates information from its neighbors, enabling the model to integrate both local and global structural patterns in the data.
In this work, we represent the electrode contact points (i.e., channels) as nodes and define the edges based on the geodesic distance relationships between them. The nerve cuff consists of multiple electrodes arranged circumferentially around the nerve, so electrodes positioned on opposite sides may capture similar neural activity due to their proximity to the same underlying fibers. The geodesic distance measures the shortest path along a curved surface rather than the straight-line Euclidean distance [32]. This distinction is particularly important for nerve cuff electrodes, which are positioned on a cylindrical surface around the nerve. While Euclidean distance may treat two electrodes on opposite sides of the cuff as far apart, geodesic distance captures their true proximity along the nerve’s surface, providing a more physiologically meaningful representation of spatial relationships.
Given the structure of the nerve cuff electrode, this distance is computed using Equation 1 below, in which x and y represent the row and column indices (i.e., electrode coordinates) of the electrode channels in the nerve cuff array, respectively:

d(i, j) = sqrt( (x_i − x_j)² + min(|y_i − y_j|, 8 − |y_i − y_j|)² ).    (Equation 1)

The term (x_i − x_j)² captures the vertical distance between electrodes, while min(|y_i − y_j|, 8 − |y_i − y_j|) accounts for the wrap-around nature of the circular arrangement of eight contacts per ring in the horizontal direction, as shown in Fig 1.
By utilizing geodesic distance, electrodes that are closer to each other on the cuff are more likely to be directly connected in the graph, representing stronger spatial relationships. This connectivity structure enables the model to learn signal propagation patterns along the electrodes, rather than being limited to local information as in other methods, thereby better capturing spatial correlations and neural dynamics within the recordings. With this approach, we constructed a weighted adjacency matrix of size 56 by 56, where the weights decay with the geodesic distance between electrode positions according to a kernel of the form Aij = exp(−d(i, j)² / α²), as shown in Equation 2 [33].
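As a concrete illustration, the geodesic distance and the resulting weighted adjacency matrix can be computed as below. This is a minimal sketch assuming 7 rings of 8 contacts and a Gaussian-type kernel exp(−d²/α²) for the edge weights; the exact kernel used in Equation 2 may differ in detail.

```python
import numpy as np

N_RINGS, N_CONTACTS = 7, 8   # 56-channel cuff: 7 rings of 8 contacts each
ALPHA = 2.0                  # distance-scaling parameter (alpha)

def geodesic_distance(i, j):
    """Shortest path between channels i and j along the cuff surface.

    Rows index the ring (longitudinal position); columns index the
    contact within a ring, which wraps around the circumference.
    """
    xi, yi = divmod(i, N_CONTACTS)
    xj, yj = divmod(j, N_CONTACTS)
    dy = abs(yi - yj)
    dy = min(dy, N_CONTACTS - dy)        # wrap-around in the circular direction
    return np.hypot(xi - xj, dy)

def weighted_adjacency(alpha=ALPHA):
    """56 x 56 weighted adjacency; weights decay with geodesic distance."""
    n = N_RINGS * N_CONTACTS
    d = np.array([[geodesic_distance(i, j) for j in range(n)] for i in range(n)])
    A = np.exp(-(d ** 2) / alpha ** 2)
    np.fill_diagonal(A, 0.0)             # no self-loops at this stage
    return A
```

Note how contacts in the same ring that sit on opposite ends of the column index (e.g., columns 0 and 7) come out as immediate neighbours, which a straight-line distance on the unrolled grid would miss.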
To refine the graph topology, we use the distance-scaling parameter α and the number of nearest neighbours, k, as tunable hyperparameters. Specifically, for each node, edges are first formed only with its k nearest neighbouring electrodes based on spatial distance, and the corresponding edge weights are assigned using a distance-based weighting function parametrized by α. With lower values of α, only electrodes that are immediately adjacent on the nerve cuff have substantial edge weights, leading to a sharper decay in edge weights and emphasizing local interactions. Larger α values result in high weights even for electrodes located further apart along the cuff, allowing the model to integrate global information from distant regions of the nerve. By restricting connectivity to the k nearest neighbours, the graph remains sparse and message passing focuses on the most relevant spatial relationships, while α controls the strength of these connections. This combined tuning mechanism enables flexible control of the graph structure, balancing the trade-off between local specificity and global connectivity and ultimately improving the model’s ability to capture informative patterns from the neural data.
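The k-nearest-neighbour sparsification described above can be sketched as follows; `knn_sparsify` is a hypothetical helper operating on any dense weighted adjacency matrix (higher weight = spatially closer electrode):

```python
import numpy as np

def knn_sparsify(A, k=5):
    """Keep, for each node, only the edges to its k strongest neighbours.

    A is a dense weighted adjacency matrix with zero diagonal. The result
    is symmetrized so the graph stays undirected: an edge survives if
    either endpoint selected it among its top-k.
    """
    n = A.shape[0]
    S = np.zeros_like(A)
    for i in range(n):
        nbrs = np.argsort(A[i])[-k:]     # indices of the k largest weights
        S[i, nbrs] = A[i, nbrs]
    return np.maximum(S, S.T)
```

Symmetrizing with a maximum (rather than intersecting the two selections) keeps the graph connected even when the neighbour relation is not mutual.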
Model architecture
The model architecture is implemented with a hybrid approach, incorporating both sequential and graph-based learning paradigms. The full model architecture is shown in Fig 2. Following the graph formulation introduced above, each neural recording is represented as a graph G = (V, E), where each node corresponds to an electrode contact and each edge (i, j) ∈ E encodes the spatial relationship between electrodes. The graph structure is represented by a weighted adjacency matrix A, where each non-zero entry Aij denotes the strength of the connection between nodes i and j, as defined by the geodesic distance-based graph construction procedure.
We first extract temporal representations from the neural signals using a Long Short-Term Memory (LSTM) layer with 256 units. The resulting hidden representations serve as node feature vectors h_i for each node i ∈ V. These node features, together with the weighted adjacency matrix A, are then processed by an edge convolution layer followed by a general graph convolution layer.
The edge convolution layer follows a message-passing formulation in which node features are updated by aggregating information from the neighboring nodes N(i), explicitly incorporating edge information through the corresponding adjacency weights Aij. In this work, edge features are defined directly by these weighted adjacency values, which encode spatial proximity between electrode contacts. This allows adaptive feature learning that captures spatial dependencies expressed in adjacency matrices [34]. An edge convolution update can be represented by Equation 3:

h_i^(l+1) = Σ_{j ∈ N(i)} φ( h_i^(l), h_j^(l), Aij ),    (Equation 3)

where h_i^(l) denotes the feature vector of node i at layer l, and h_j^(l) denotes the feature vector of a neighboring node j ∈ N(i). The term Aij corresponds to the weighted adjacency value between nodes i and j, which represents the edge feature and encodes the spatial relationship between electrode contacts based on the distance-based graph construction. The function φ is a learnable mapping implemented as a multilayer perceptron that combines node features and edge information to generate messages from neighboring nodes.
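A minimal NumPy sketch of this update is shown below, with two simplifying assumptions: the aggregation over neighbours is a plain sum, and φ is a single ReLU-activated linear map rather than the full multilayer perceptron used in the model.

```python
import numpy as np

def edge_conv(H, A, W, b):
    """One edge-convolution update (sketch of Equation 3).

    For each node i, a message phi(h_i, h_j, A_ij) is computed from every
    neighbour j with A_ij > 0 and the messages are summed. Here phi is a
    single linear map over the concatenation [h_i, h_j, A_ij], followed
    by a ReLU, for illustration only.
    """
    n, d = H.shape
    H_new = np.zeros((n, W.shape[1]))
    for i in range(n):
        for j in np.nonzero(A[i])[0]:
            m = np.concatenate([H[i], H[j], [A[i, j]]])  # node + edge features
            H_new[i] += np.maximum(m @ W + b, 0.0)       # ReLU inside phi
    return H_new
```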
General graph convolutions then aggregate node features based on neighborhood information, enabling the model to learn feature dependencies across the graph structure, as shown in Equation 4 [35]:

H^(l+1) = σ( D̃^(−1/2) Ã D̃^(−1/2) H^(l) W^(l) ).    (Equation 4)

Here, H^(l) represents the matrix of node features at layer l, Ã = A + I is the adjacency matrix with added self-loops, and D̃ is the corresponding degree matrix. The matrix W^(l) contains learnable weights, and σ denotes a nonlinear activation function. This formulation enables the model to propagate and integrate information across the graph while preserving its underlying structure. Together, these convolutional layers ensure that both local and global structural properties of the graph input are effectively processed.
In this model, an edge convolution layer with 32 units and a general graph convolution layer with 128 units were used, with an L2 regularizer of magnitude 5 × 10−3 to improve generalizability. The final feature representation undergoes global average pooling before being passed through fully connected layers with rectified linear unit (ReLU) activations. Finally, a softmax layer outputs the classification probabilities. During training, a batch size of 1024 and a learning rate of 0.001 were used. A 20% dropout rate was applied to the dense layers to enhance regularization and reduce overfitting.
Prior to training, we applied data augmentation in the form of low-amplitude Gaussian noise, which serves as a physiologically meaningful perturbation for neural recordings. Gaussian noise injection acts as a regularizer by simulating naturally occurring variability in peripheral nerve signals—such as background neural activity, electrode impedance fluctuations, and minor recording noise—while preserving the underlying spatiotemporal structure of the compound action potentials. This approach helps improve model robustness and generalizability without altering the temporal dynamics or spatial relationships that are essential for accurate decoding.
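A simple version of this augmentation step might look like the following, where the noise scale `sigma_frac` is an illustrative value rather than the one used in our experiments:

```python
import numpy as np

def augment_with_noise(signatures, sigma_frac=0.05, seed=0):
    """Add low-amplitude Gaussian noise to spatiotemporal signatures.

    signatures: array of shape (n_samples, 56, 100). The noise standard
    deviation is a fraction (sigma_frac) of each sample's own standard
    deviation, so the perturbation stays small relative to the signal
    and preserves its spatiotemporal structure.
    """
    rng = np.random.default_rng(seed)
    scale = sigma_frac * signatures.std(axis=(1, 2), keepdims=True)
    return signatures + rng.standard_normal(signatures.shape) * scale
```

Scaling the noise per sample, rather than globally, keeps the perturbation proportional to each recording's amplitude, which varies across rats and activities.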
Training & evaluation
Hyperparameter tuning.
We tuned the model hyperparameters using data from two randomly selected rats (namely, Rats 2 and 10), which served as a validation set for hyperparameter optimization. The training hyperparameters, including the learning rate (1e-4 to 1e-2), batch size (128 to 1024), and L2 regularization coefficient (1e-5 to 5e-3), were systematically tuned based on validation performance. In addition, architectural parameters such as the number of hidden units (128 to 512), dropout rates (0.1 to 0.8), and overall layer configurations were selected through the same validation-based tuning procedure to balance model capacity and generalization. Once the optimal hyperparameters were determined, these rats were reincorporated into the training and evaluation process to maximize data utilization. In addition, we systematically varied key graph construction parameters, namely the number of neighbors and the distance-scaling parameter (α) in the adjacency matrix, to analyze their impact on model performance. Model development and training were carried out in Python using the TensorFlow deep learning framework.
Evaluation.
Model performance was evaluated using both cross-subject and within-subject strategies. For generalizability across rats, we employed an eight-fold leave-one-out cross-validation approach, where data from seven rats were used for training and the remaining rat was held out for testing. This design allowed us to rigorously assess across-subject generalization, addressing a limitation of prior CNN-based studies that focused only on within-subject performance [12]. Performance was quantified using test accuracy and the macro-averaged F1 score, which is more reliable for imbalanced datasets. To ensure comparability with previous work, we also conducted within-subject evaluations by performing a cross-validation in which two folds were used for training and the remaining fold was used for testing. This process was repeated such that each fold served as the test set once.
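The leave-one-rat-out protocol can be expressed as a simple generator; `rat_ids` stands in for whatever identifiers index the eight subjects:

```python
def leave_one_out_folds(rat_ids):
    """Yield (train_ids, test_id) pairs for leave-one-rat-out evaluation.

    Each rat serves as the held-out test subject exactly once, and the
    model is retrained from scratch on the remaining rats, so no data
    from the test subject ever enters training.
    """
    for test_id in rat_ids:
        train_ids = [r for r in rat_ids if r != test_id]
        yield train_ids, test_id
```

Splitting at the subject level, rather than shuffling individual samples, is what makes this an across-subject generalization test: samples from the same rat are highly correlated and would otherwise leak between train and test sets.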
Ablation studies
To evaluate the contribution of specific architectural and structural design choices in our proposed framework, we also conducted three sets of ablation studies. These studies were aimed at disentangling the effects of (1) temporal modeling via LSTM and (2) graph construction based on geodesic distances. To ensure that the changes in performance were not solely attributable to the use of LSTM, we replaced the it with a 1D CNN module. This design choice also aligns more closely with the architecture used by the previous study [11,12], thereby enabling a more direct comparison with prior work. In addition to the proposed geodesic graph, we constructed graphs based on Euclidean distances between electrode contacts to evaluate whether preserving the true surface geometry of the nerve cuff provides advantages over simpler spatial proximity measures. Euclidean distances were computed directly from the two-dimensional spatial coordinates of the electrode contacts, without accounting for the curved surface geometry of the nerve cuff. Furthermore, we constructed a random graph baseline in which edges were assigned randomly between nodes. This setup preserved the graph’s sparsity while removing the physiological prior embedded in the geodesic topology, allowing us to observe the performance of the model under different spatial constraints.
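The random-graph baseline can be constructed as below; the helper is hypothetical but preserves the edge count (and hence the sparsity) of the reference graph while discarding its spatial structure:

```python
import numpy as np

def random_graph(n, n_edges, seed=0):
    """Random-graph ablation baseline.

    Places n_edges undirected, unit-weight edges between n nodes with
    endpoints drawn uniformly at random, matching the sparsity of the
    geodesic graph while removing its physiological prior.
    """
    rng = np.random.default_rng(seed)
    A = np.zeros((n, n))
    added = 0
    while added < n_edges:
        i, j = rng.integers(0, n, size=2)
        if i != j and A[i, j] == 0:       # skip self-loops and duplicates
            A[i, j] = A[j, i] = 1.0
            added += 1
    return A
```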
Results
Generalizability performance
Table 2 presents the classification accuracies (%) and macro-averaged F1 scores for each individual rat when used as the test set, as well as the mean and standard deviation across all rats. The first row shows the performance of the baseline CNN model reimplemented from the previous study [12], and the subsequent rows show the graph-based model proposed in this work. For the graph-based model, we compared how varying the number of neighbouring nodes changes the performance at a fixed α of 2. This allows us to better examine the effect of the number of neighbouring nodes on classification performance. For the graph-based models, we have grayed out the performance scores for Rats 2 and 10, as these subjects were used as a validation set during hyperparameter tuning. Accordingly, their results were excluded from the calculation of the mean ± standard deviation. The scores are nonetheless reported to provide additional context regarding their individual performance. Overall, the graph-based approaches resulted in an improved mean classification accuracy compared to the CNN model, which achieved 54.00 ± 5.21% (* beside the mean ± standard deviation indicates a significant difference from the baseline model performance, p < 0.05, computed using the t-test). The graph-based approach with connectivity defined by four or five neighbors showed a statistically significant improvement over the CNN model in terms of F1-score, and among the graph learning models evaluated, the graph constructed with five neighbors was the highest-performing, outperforming the baseline CNN by 14.32% in mean classification accuracy and 17.74% in F1 score. Fig 3 illustrates the graph connectivity with five neighbors, which is the optimal connectivity found.
To systematically analyze the effect of graph construction hyperparameters, we present a comprehensive heatmap of averaged F1-scores across all rats while varying the number of neighbors and α (Fig 4). This visualization highlights how model performance changes with different parameter combinations and demonstrates that the chosen hyperparameters (α of 2 and five neighbors) achieve the highest classification accuracy. We then performed ablation studies with these chosen hyperparameters to evaluate model generalizability, with results summarized in Table 3. Fig 5a and 5b further illustrate the graph connectivity constructed using Euclidean distance and random assignments, respectively. Compared with the geodesic-distance-based connectivity shown in Fig 3, these alternative graph structures exhibit markedly different topologies.
(a) Euclidean distance–based graph. (b) Randomly constructed graph.
Substituting the LSTM branch with a CNN did not provide significant performance gains, and our model still outperformed prior CNN-based work. Interestingly, when using a random graph in place of the geodesic-distance-based adjacency matrix, the accuracy remained close to that of the CNN baseline. This observation likely reflects the limited generalization capability of the CNN baseline in the across-subject setting. CNNs rely on fixed, grid-based receptive fields and tend to learn subject-specific spatial patterns that do not transfer well across rats. As a result, both the CNN and the random-graph model lack an explicit inductive bias that enforces physiologically meaningful spatial relationships between electrodes, leading to similar generalization performance. In contrast, incorporating anatomically informed graph connectivity enables the GNN to leverage consistent spatial organization across subjects, resulting in substantially improved performance.
Within-rat performance
We then evaluated the within-rat performance to establish a direct comparison with the previous study (note that the results differ slightly because one rat was removed in this study) [12]. The accuracies and F1-scores from the model proposed in the previous study, the model proposed in this study, and the ablation studies are summarized in Table 4. The graph-based approach achieved a 1.92% improvement in accuracy and a 3.14% improvement in F1-score compared to the CNN baseline. Paired t-test analysis revealed that, although the geodesic-distance-based graph model consistently outperformed the CNN baseline, the observed improvements were not statistically significant (p > 0.05). In contrast, both the random graph and Euclidean-distance-based graph ablations resulted in significantly lower performance compared to the geodesic graph, indicating that preserving the physiologically meaningful geodesic structure is critical and represents the most effective choice for graph construction.
Discussion
The proposed approach in this work showed the significance and effectiveness of using graphs to encode information from neural recordings.
The results demonstrated that the proposed graph-based learning approach outperforms conventional CNNs in classifying peripheral neural signals, both in across-subject generalization and within-subject evaluations. Compared to the reimplemented CNN baseline from [12], the graph-based model achieved substantial improvements in mean accuracy and F1 score (Tables 2 and 4), highlighting the advantage of explicitly incorporating spatial relationships between electrodes into the learning process. These gains were consistent across most test subjects, indicating robustness to subject-specific variability, a key challenge in neural decoding. It should be noted that the reported performance differences also reflect the exclusion of one rat from the analysis, which accounts for slight deviations from previously reported values [12].
Ablation studies (Tables 3 and 4) confirm that the performance advantage is not solely attributable to the LSTM module, as replacing it with a 1D CNN yielded comparable results. In contrast, removing the geodesic-distance-based graph construction and replacing it with random connectivity led to a sharp performance drop, underscoring the importance of physiologically meaningful graph structures. Using Euclidean-distance-based graphs also resulted in a noticeable reduction in performance compared to geodesic connectivity, although it consistently outperformed the random graph baseline. This pattern suggests that incorporating spatial proximity alone is beneficial, but Euclidean distances do not reflect the meaningful geometry of the nerve cuff, as they ignore the circumferential arrangement of electrodes and the way neural signals propagate along the nerve surface. In contrast, geodesic distances capture both the physical layout and the physiologically relevant pathways through which activity spreads across adjacent contacts, leading to superior performance. By modeling electrode positions using geodesic distances, the graph-based approach leverages both local and global spatial dependencies, enabling more informative spatiotemporal feature extraction than CNNs, which primarily capture local spatial patterns.
Our analysis of connectivity hyperparameters reveals that graphs constructed with more than three neighbors outperformed those with fewer, with peak performance achieved at five neighbors and an α of 2. This configuration appears to optimally balance local specificity and global context. The distance-scaling parameter α plays a key role in modulating the model’s sensitivity to spatial distance: higher α reduces the decay of edge weights with distance, allowing for more global integration when many neighbors are retained, while lower α enforces stronger locality. This interaction suggests that careful tuning of both parameters can maximize the model’s ability to capture meaningful spatial relationships while avoiding overfitting to noise.
From a physiological perspective, the geodesic-based connectivity preserves the true spatial organization of electrodes on the nerve cuff, reflecting how neural signals propagate through the peripheral nervous system. Electrodes positioned close together are more likely to record correlated activity, while distant contacts often provide complementary, non-redundant information. Capturing these relationships is crucial for improving model performance and interpretability.
Overall, graph-based approaches not only outperformed traditional methods such as CNNs in classification performance but also hold great potential for clinical explainability. One of the advantages of GNNs is their ability to provide deeper insight into the model’s decision-making process. For instance, future studies may examine the learned weights in the GNN layers to identify the most influential nodes (e.g., electrodes) and edges (e.g., connections) that contribute to the final classification decision. By investigating node importance, we can determine which specific electrode channels contribute most to distinguishing different neural patterns, providing valuable information about the distribution of neural activity. Similarly, analyzing edge importance can reveal how different electrodes interact and contribute to temporal dynamics, helping us understand how neural signals propagate across the nerve. Additionally, the graph-based approach has proven to be more generalizable, allowing for the incorporation of data from multiple subjects rather than relying solely on a single animal model for training. This supports consistent model performance across different subjects and enhances real-world applicability.
Future studies should explore different strategies for defining graph connectivity, such as adaptive thresholding based on statistical dependencies between channels or dynamic graphs that incorporate learnable edges. Subgraph-based approaches could be investigated to focus on the most informative nodes and edges, potentially improving computational efficiency and model interpretability [36]. Additionally, self-supervised learning techniques, such as contrastive learning or graph autoencoders, could be explored to enhance the ability of the model to extract meaningful representations without relying on a large amount of labeled data. Prior studies have shown that graph-based approaches are particularly effective in low-data regimes, as they leverage relational inductive biases to learn richer representations compared to grid-based methods [37,38]. Our findings align with this evidence, suggesting that integrating self-supervised objectives could further strengthen performance under limited data availability. A notable limitation of the present study is the relatively small dataset, consisting of recordings from only eight rats. Although statistical testing demonstrated significant performance differences between the graph-based models and the CNN baseline, the small sample size inherently limits the strength of these conclusions. This constraint reflects the practical challenges of collecting nCAPs in vivo, an experimental process that is resource-intensive and time-consuming. As larger datasets become available, future work should evaluate graph-based methods on more extensive cohorts to further validate generalizability and robustness.
While this study focuses on rat peripheral nerve recordings, the proposed graph-based framework is well positioned for translation to human applications. Human peripheral neural signals are characterized by greater anatomical variability, differences in nerve size, and increased signal heterogeneity arising from subject-specific physiology, electrode placement, and clinical noise sources. By explicitly modeling inter-electrode relationships rather than relying on fixed grid-based assumptions, the graph formulation provides a flexible representation that can naturally adapt to these sources of variability.
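The geodesic-distance edge weighting that underlies this framework can be sketched by treating the cuff as a cylinder: unrolled, the surface distance between two contacts is the Euclidean distance in (arc-length, axial) coordinates, taking the shorter way around the circumference. All dimensions and the contact layout below are illustrative assumptions, not the actual cuff specification.

```python
import numpy as np

def cuff_geodesic_adjacency(n_rings=7, n_around=8, radius=0.5, ring_gap=0.75, sigma=1.0):
    """Gaussian-kernel adjacency from geodesic distances on a cylindrical cuff.

    Models n_rings x n_around contacts (56 by default) on a cylinder with the
    given radius and axial spacing between rings (illustrative values only).
    """
    circumference = 2.0 * np.pi * radius
    # (arc position, axial position) of each contact on the unrolled surface
    ring_idx, around_idx = np.divmod(np.arange(n_rings * n_around), n_around)
    arc = around_idx * (circumference / n_around)
    axial = ring_idx * ring_gap

    d_arc = np.abs(arc[:, None] - arc[None, :])
    d_arc = np.minimum(d_arc, circumference - d_arc)  # wrap the shorter way around
    geo = np.hypot(d_arc, axial[:, None] - axial[None, :])  # geodesic distances

    adj = np.exp(-(geo ** 2) / (2.0 * sigma ** 2))  # closer contacts -> heavier edges
    np.fill_diagonal(adj, 0.0)                      # no self-loops
    return adj

A = cuff_geodesic_adjacency()
```

The Gaussian kernel maps distances to edge weights in (0, 1), so nearby contacts exchange more information during message passing than distant ones.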
Conclusion
We explored a graph-based approach for classifying three afferent activities using neural recordings obtained from Long-Evans rats with a 56-channel nerve cuff electrode. The GNN model effectively captured temporal patterns through nodal features, and by constructing weighted graph adjacency matrices from geodesic distances, it extracted more informative temporal and spatial features, outperforming the CNN model. Under the leave-one-out strategy, the GNN model outperformed the CNN model by 14.32% in mean classification accuracy and by 17.74% in macro-averaged F1 score; in the within-rat evaluation, the proposed model achieved improvements of 1.92% in accuracy and 3.14% in F1 score. In summary, our findings highlight the potential of graph-based models for decoding neural signals by effectively utilizing both temporal and spatial relationships. A particularly promising future direction is the interpretability of learned graph representations, which may provide insight into how individual electrodes and their interactions contribute to classification decisions. Analyzing node- and edge-level importance within the learned graphs could help reveal physiologically meaningful patterns of neural activity and improve transparency for neuroscience and clinical applications. Future work can further refine graph-based approaches to enhance classification accuracy and interpretability and expand their applicability to other neural decoding tasks.
References
- 1. Vizi ES, Kiss JP, Lendvai B. Nonsynaptic communication in the central nervous system. Neurochem Int. 2004;45(4):443–51. pmid:15186910
- 2. Oosting PH. Signal transmission in the nervous system. Rep Prog Phys. 1979;42(9):1479–532.
- 3. Hildebrand JG. Analysis of chemical signals by nervous systems. Proc Natl Acad Sci U S A. 1995;92(1):67–74. pmid:7816849
- 4. Kamimura D, Tanaka Y, Hasebe R, Murakami M. Bidirectional communication between neural and immune systems. Int Immunol. 2020;32(11):693–701. pmid:31875424
- 5. Townsend KL. One Nervous System: Critical Links Between Central and Peripheral Nervous System Health and Implications for Obesity and Diabetes. Diabetes. 2024;73(12):1967–75. pmid:39401394
- 6. Thakor NV, Wang Q, Greenwald E. Bidirectional peripheral nerve interface and applications. Annu Int Conf IEEE Eng Med Biol Soc. 2016;2016:6327–30. pmid:28269696
- 7. Petrini FM. Interfacing the peripheral nervous system: towards the development of a bidirectional neural communication. 2015.
- 8. Varho T, Jääskeläinen S, Tolonen U, Sonninen P, Vainionpää L, Aula P, et al. Central and peripheral nervous system dysfunction in the clinical variation of Salla disease. Neurology. 2000;55(1):99–104. pmid:10891913
- 9. Koh RGL, Zariffa J, Jabban L, Yen S-C, Donaldson N, Metcalfe BW. Tutorial: a guide to techniques for analysing recordings from the peripheral nervous system. J Neural Eng. 2022;19(4):10.1088/1741-2552/ac7d74. pmid:35772397
- 10. Larson CE, Meng E. A review for the peripheral nerve interface designer. J Neurosci Methods. 2020;332:108523. pmid:31743684
- 11. Koh RGL, Nachman AI, Zariffa J. Classification of naturally evoked compound action potentials in peripheral nerve spatiotemporal recordings. Sci Rep. 2019;9(1):11145. pmid:31366940
- 12. Koh RGL, Balas M, Nachman AI, Zariffa J. Selective peripheral nerve recordings from nerve cuff electrodes using convolutional neural networks. J Neural Eng. 2020;17(1):016042. pmid:31581142
- 13. Dousty M, Fleet DJ, Zariffa J. Personalized Video-Based Hand Taxonomy Using Egocentric Video in the Wild. IEEE J Biomed Health Inform. 2025;29(9):6214–25. pmid:39527414
- 14. Dousty M, Fleet DJ, Zariffa J. Hand Grasp Classification in Egocentric Video After Cervical Spinal Cord Injury. IEEE J Biomed Health Inform. 2024;28(2):645–54. pmid:37093722
- 15. Riek NT, Akcakaya M, Bouzid Z, Gokhale T, Helman SM, Kraevsky K, et al. ECG-SMART-NET: A Deep Learning Architecture for Precise ECG Diagnosis of Occlusion Myocardial Infarction. IEEE Trans Biomed Eng. 2025;72(12):3613–20. pmid:40418608
- 16. Somani S, Russak AJ, Richter F, Zhao S, Vaid A, Chaudhry F, et al. Deep learning and the electrocardiogram: review of the current state-of-the-art. Europace. 2021;23(8):1179–91. pmid:33564873
- 17. Mueller MJ, Minor SD, Schaaf JA, Strube MJ, Sahrmann SA. Relationship of plantar-flexor peak torque and dorsiflexion range of motion to kinetic variables during walking. Phys Ther. 1995;75(8):684–93. pmid:7644572
- 18. Micera S, Navarro X. Bidirectional interfaces with the peripheral nervous system. Int Rev Neurobiol. 2009;86:23–38. pmid:19607988
- 19. Romero DW, Knigge DM, Gu A, Bekkers EJ, Gavves E, Tomczak JM. Towards a general purpose CNN for long range dependencies in ND. arXiv preprint. 2022. https://doi.org/10.48550/arXiv.2206.03398
- 20. Nerella S, Bandyopadhyay S, Zhang J, Contreras M, Siegel S, Bumin A, et al. Transformers and large language models in healthcare: A review. Artif Intell Med. 2024;154:102900. pmid:38878555
- 21. Lin T, Wang Y, Liu X, Qiu X. A survey of transformers. AI Open. 2022;3:111–32.
- 22. Zhou J, Cui G, Hu S, Zhang Z, Yang C, Liu Z, et al. Graph neural networks: A review of methods and applications. AI Open. 2020;1:57–81.
- 23. Demir A, Koike-Akino T, Wang Y, Haruna M, Erdogmus D. EEG-GNN: Graph Neural Networks for Classification of Electroencephalogram (EEG) Signals. Annu Int Conf IEEE Eng Med Biol Soc. 2021;2021:1061–7. pmid:34891471
- 24. Tang S, Dunnmon JA, Saab K, Zhang X, Huang Q, Dubost F. Self-supervised graph neural networks for improved electroencephalographic seizure analysis. arXiv preprint. 2021. https://doi.org/10.48550/arXiv.2104.08336
- 25. Li R, Yuan X, Radfar M, Marendy P, Ni W, O’Brien TJ, et al. Graph Signal Processing, Graph Neural Network and Graph Learning on Biological Data: A Systematic Review. IEEE Rev Biomed Eng. 2023;16:109–35. pmid:34699368
- 26. Atoar Rahman SM, Ibrahim Khalil M, Zhou H, Guo Y, Ding Z, Gao X, et al. Advancement in Graph Neural Networks for EEG Signal Analysis and Application: A Review. IEEE Access. 2025;13:50167–87.
- 27. Tang S, Dunnmon JA, Qu L, Saab KK, Baykaner T, Lee-Messer C. Modeling multivariate biosignals with graph neural networks and structured state space models. In: Conference on Health, Inference, and Learning, 2023. p. 50–71.
- 28. Wang Z, Wang Y, Zhang J, Hu C, Yin Z, Song Y. Spatial-Temporal Feature Fusion Neural Network for EEG-Based Emotion Recognition. IEEE Trans Instrum Meas. 2022;71:1–12.
- 29. Koh RGL, Nachman AI, Zariffa J. Use of spatiotemporal templates for pathway discrimination in peripheral nerve recordings: a simulation study. J Neural Eng. 2017;14(1):016013. pmid:28000616
- 30. Hamilton WL. Graph representation learning. Morgan & Claypool Publishers; 2020.
- 31. Kovalenko A, Pozdnyakov V, Makarov I. Graph Neural Networks With Trainable Adjacency Matrices for Fault Diagnosis on Multivariate Sensor Data. IEEE Access. 2024;12:152860–72.
- 32. Whiteley N, Gray A, Rubin-Delanchy P. Matrix factorisation and the interpretation of geodesic distance. Advances in Neural Information Processing Systems. 2021;34:24–38.
- 33. Agarwal R, Aziz A, Krishnan AS, Challa A, Danda S. ESW Edge Weights: Ensemble Stochastic Watershed Edge Weights for Hyperspectral Image Classification. IEEE Geosci Remote Sensing Lett. 2022;19:1–5.
- 34. Coupeau P, Fasquel J-B, Dinomais M. On the relevance of edge-conditioned convolution for GNN-based semantic image segmentation using spatial relationships. In: 2022 Eleventh International Conference on Image Processing Theory, Tools and Applications (IPTA), 2022. p. 1–6. https://doi.org/10.1109/ipta54936.2022.9784143
- 35. Gama F, Marques AG, Leus G, Ribeiro A. Convolutional Graph Neural Networks. In: 2019 53rd Asilomar Conference on Signals, Systems, and Computers, 2019. p. 452–6. https://doi.org/10.1109/ieeeconf44664.2019.9048767
- 36. Zhao L, Jin W, Akoglu L, Shah N. From stars to subgraphs: uplifting any GNN with local structure awareness. arXiv preprint. 2021. https://doi.org/10.48550/arXiv.2110.03753
- 37. Pappu A, Paige B. Making graph neural networks worth it for low-data molecular machine learning. arXiv preprint. 2020. https://doi.org/10.48550/arXiv.2011.12203
- 38. Agarwal S, Dubey T, Gupta S, Bedathur S. A transfer framework for enhancing temporal graph learning in data-scarce settings. arXiv preprint. 2025. https://arxiv.org/abs/2503.00852