Hyperspectral anomaly detection leveraging spatial attention and right-shifted spectral energy

Ruhan A; Quanxue Gao; Xiaoni Zhang; Wenwen Feng; Siti Khadijah Ali

doi:10.1371/journal.pone.0330640

Abstract

In this research, we have proposed a novel anomaly detection algorithm for processing hyperspectral images (HSIs), called the Graph Attention Network–Beta Wavelet Graph Neural Network-based Hyperspectral Anomaly Detection (GAN–BWGNN HAD). This algorithm treats each pixel as a node in a graph, where edges represent pixel correlations and node attributes correspond to spectral features. The algorithm integrates spatial and spectral information, utilizing graph neural networks to identify nonlinear relationships within the image, thereby enhancing anomaly detection precision. The K-nearest neighbor (KNN) algorithm facilitates the creation of edges between pixels, enabling the incorporation of distant pixels and improving resilience to noise and local irregularities. The GAN component incorporates an adaptive attention mechanism to dynamically prioritize relevant spatial features. The BWGNN component employs beta wavelets as a localized bandpass filter, effectively identifying spectral anomalies by addressing the right-shifted spectral energy phenomenon. Furthermore, the utilization of beta wavelets obviates the necessity for computationally intensive Laplacian matrix decompositions, thereby enhancing processing efficiency. This approach effectively integrates spatial and spectral information, providing a more accurate and efficient solution for hyperspectral anomaly detection. Experiments on six real-world hyperspectral datasets and one simulated dataset demonstrate the superior performance of our proposed method. It consistently achieved high Area Under the Curve (AUC) values (e.g., 0.9986 on AVIRIS-II, 0.9961 on abu-beach-2, 0.9982 on abu-urban-3, 0.9999 on Salinas-simulate, 0.9872 on Cri), significantly outperforming state-of-the-art methods. The proposed method also exhibited sub-second detection times (0.20–0.28 s) on most datasets, significantly faster than traditional methods (achieving a speedup of 100 to 500 times) and deep learning models (achieving a speedup of 6 to 8 times).

Citation: A R, Gao Q, Zhang X, Feng W, Ali SK (2025) Hyperspectral anomaly detection leveraging spatial attention and right-shifted spectral energy. PLoS One 20(9): e0330640. https://doi.org/10.1371/journal.pone.0330640

Editor: Panos Liatsis, Khalifa University of Science and Technology, UNITED ARAB EMIRATES

Received: January 14, 2025; Accepted: August 5, 2025; Published: September 4, 2025

Copyright: © 2025 A et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data may be found in the following public repository: https://earthexplorer.usgs.gov/.

Funding: This research was funded by the Xi’an Peihua University Research Institutions and Innovation Team Special Project under Grant PHJT2406. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Introduction

Hyperspectral remote sensing captures detailed spectral information across hundreds of contiguous narrow bands, typically spanning the visible to infrared spectrum (400–2500 nm). This generates a three-dimensional (3D) hyperspectral image (HSI) data cube, where each pixel contains a high-resolution spectral signature, enabling fine-grained material identification beyond the capabilities of traditional RGB or multispectral imaging. These capabilities make HSIs invaluable for diverse applications including mineral exploration, precision agriculture, environmental monitoring, and military surveillance, where detecting rare or unexpected targets—known as hyperspectral anomaly detection (HAD)—is frequently a critical task.

Hyperspectral anomaly detection (HAD) techniques can be broadly categorized into four main types: statistical models, collaborative representations, low-rank and sparse decomposition, and deep learning [1–12]. Statistical models, such as the RX algorithm [13], detect anomalies by computing the Mahalanobis distance between a test pixel and the background mean. Variants like global RX (GRX) and local RX (LRX) improve detection accuracy by estimating background statistics using the entire image or local regions. However, these methods are computationally intensive, sensitive to data distribution assumptions, and struggle with non-Gaussian distributed data [14]. They also rely heavily on surrounding pixel data, making them susceptible to noise and anomalies. Recent advances like Information Entropy Estimation based on Point-Set Topology (IEEPST) [15] and the Chessboard Topology-based Anomaly Detection (CTAD) algorithm [16] address some of these limitations. IEEPST maps hyperspectral image data into ordered topological spaces, revealing the data’s mathematical-statistical properties and addressing data-model discrepancies. CTAD decomposes hyperspectral images using a chessboard-shaped framework to extract deep-level features, quantify differences between anomalies and background, and highlight spectral trends. These methods, free from specific model assumptions, adaptively learn data features, achieving robust and generalizable anomaly detection results. However, they still face challenges in efficiently integrating spatial and spectral information, handling non-Gaussian data, and being resilient to noise and anomalies.

The collaborative representation detection (CRD) algorithm is a classic anomaly detection algorithm proposed by Li and Du. This algorithm relies on the linear representation of background pixels using surrounding data [17]. Establishing dual windows and calculating background information for each pixel is computationally expensive for large-scale hyper spectral data. Moreover, CRD utilizing linear representation methods may struggle to adequately manage and characterize nonlinear relationships.

Low-rank and sparse decomposition techniques [18,19], such as the Hyperspectral Anomaly Detection via Generalized Shrinkage Mappings (HADGSM) [20], have been proposed to capture spectral correlations, spatial smoothness, and local geometrical structures in hyperspectral images. These methods leverage nonconvex penalties for group sparsity, l₀ gradient, and low-rankness to enhance detection accuracy and efficiency. In a recent study, the Hyperspectral Simultaneous Anomaly Detection and Denoising (HyADD) framework [21] integrates anomaly detection and denoising in a single framework, leveraging spatial-spectral gradient domain-based smoothness and subspace domain-based low-rankness to improve detection performance while removing additive noise. HyADD uses an adaptive dictionary construction to enhance the robustness of anomaly detection and noise removal. However, these approaches often require complex optimization processes that can be computationally intensive, particularly for high-dimensional data. Additionally, while they effectively model linear relationships, they may struggle to fully capture the nonlinear relationships that are common in complex hyperspectral scenes. These limitations highlight the need for a method that can efficiently handle high-dimensional data and nonlinear relationships without compromising computational efficiency.

Deep learning methods have shown promise in hyperspectral anomaly detection [22,23]. Transferred deep convolutional neural networks (CNND) [24] classify pixel pairs using labeled reference data, accurately localizing anomalies. However, CNND relies heavily on surrounding pixels, making it susceptible to noise and local irregularities. Self-supervised networks such as the blind-block reconstruction network (BockNet) [25] and the pixel-shuffle downsampling blind-spot reconstruction network (PDBSNet) [26] address this issue by introducing blind-spot architectures to reduce the impact of anomalies on reconstruction results. The sliding dual-window-inspired reconstruction network (DirectNet) [27] further improves anomaly suppression by reconstructing central pixels using only outer window pixels. The nonlocal and local feature-coupled self-supervised network (NL2Net) [28] integrates local and nonlocal feature extraction to capture fine-grained spatial-spectral details and model long-range dependencies. Additionally, an improved center block masked convolution enhances the network’s focus on surrounding background features, enabling precise background reconstruction and superior anomaly separation. More recently, the global feature-injected blind-spot network (PUNNet) [29] incorporates patch-shuffle downsampling and nonlinear activation-free network (NAFNet) blocks with dilated convolution to capture both local and global spatial-spectral features, achieving superior detection performance by reliably reconstructing the background while weakening the expression of anomalous features. Another innovative approach, the frequency-to-spectrum mapping generative adversarial network (FTSGAN) [30], maps the original spectra to the fractional Fourier domain to enhance the separability of backgrounds and anomalies, using a semisupervised learning strategy to prevent the model from focusing on numerical equivalence between input and output, thereby improving anomaly detection accuracy. Despite these advancements, existing deep learning methods often struggle to fully exploit the spatial-spectral relationships in hyperspectral images and may be computationally intensive, particularly for high-dimensional data. Moreover, while these methods have shown significant improvements in anomaly detection, they often rely on complex network architectures that may not be easily interpretable or adaptable to different types of hyperspectral data. There is a need for a method that can efficiently integrate spatial and spectral information, handle high-dimensional data, and provide a more interpretable and adaptable framework for anomaly detection.

Graph neural networks (GNNs) have recently been utilized to analyze hyperspectral images, predominantly for classification purposes rather than for anomaly detection [31–34]. Traditional methods use surrounding pixels to discriminate anomalous targets, which can be easily affected by anomalous targets and noise. Anomalous targets may appear at any position in the image and may even span the entire image. Furthermore, nonlinear relationships frequently occur in scenes characterized by intricate spatial distributions and numerous ground objects. GNNs proficiently address non-linear relationships by integrating both distant and adjacent pixels through the graph’s topological structure, thereby enhancing the precision of anomaly detection. We propose a graph attention network (GAN) combined with a beta wavelet graph neural network (BWGNN) for hyperspectral anomaly detection, wherein each pixel in the hyperspectral image is regarded as a node, the relationships between pixels are depicted as edges, and the spectral characteristics of each pixel are represented as node attributes. By leveraging both spatial and spectral information, the algorithm provides a comprehensive analysis of the hyperspectral data. The contributions of this article are summarized as follows:

HSIs are graph represented with spatial and spectral information. GNNs use node - to - node learning to handle the nonlinear relationships in images, enhancing the detection of anomalous targets. The k - nearest neighbor algorithm is used to establish node edges by calculating pixel value distances, including distant pixels in detection and reducing the impact of noise and nearby anomalies on performance. The GAN–BWGNN HAD HAD algorithm treats each pixel as a graph node, leveraging both spatial and spectral information in HSIs for more accurate anomaly detection.
The GAN component in the algorithm features an adaptive attention mechanism. This mechanism dynamically weights the importance of neighboring nodes, enabling the algorithm to focus on relevant spatial features and improve sensitivity to spatially localized anomalies.
The BWGNN component creates a localized bandpass filter using beta wavelets to capture the right - shifted spectral energy phenomenon, distinguishing anomalous pixels from the background based on spectral signatures. Moreover, the proposed algorithm uses a beta wavelet as the spectral filter, avoiding the computational complexities of Laplace matrix decomposition and multiplication, thus enhancing hyperspectral data processing efficiency.

The remainder of this paper is organized as follows: The proposed algorithm section details the proposed GAN-BWGNN HAD methodology, including hyperspectral graph construction, spatial modeling via graph attention networks, and spectral anomaly detection using beta wavelet transforms. The Experiments section presents comprehensive experiments on seven benchmark datasets, evaluating detection performance, computational efficiency, parameter sensitivity, and ablation studies. Finally, the Conclusions section concludes the work and discusses future research directions.

Proposed algorithm

Graph construction of hyperspectral images

In this paper, the 3D hyperspectral data cube is set as , is the number of pixels, d is the number of spectral bands, and is the spectral vector of a pixel in . The hyperspectral image is represented by the graph data structure, where the graph is defined as , taking each pixel as a node , is a set of nodes, with the number of nodes being denoted as N, is a set of edges, and indicates that node and node are associated having an edge in between. The KNN algorithm is used to establish the edges between nodes. For each node, the pixel value disparity is computed between that node and other nodes. k-closest nodes are investigated, an edge is established among them, and the adjacency matrix is constructed. In most hyperspectral abnormal target detection algorithms, this is usually conducted using the surrounding pixels. However, noise and abnormal targets in surrounding pixels influence the detection performance. Using the KNN algorithm, the remote pixel can be involved in the detection, avoiding the interference of noise and abnormal targets in the neighborhood. is the feature matrix, and the spectral vector of each pixel is used as a feature of the graph. The hyperspectral image is characterized by the graph data structure, which includes both the spatial information and spectral information of the hyperspectral image. Considering to be the adjacency matrix and , the degree matrix, one would have . The Laplacian matrix of the graph is defined as , and symmetric normalized Laplacian is , where is an identity matrix. Eigenvalue decomposition for is performed as follows: where are the eigenvalues, the corresponding eigenvector , is an orthogonal matrix. Except for and , the threshold is arbitrarily selected to divide the eigenvalues into low frequencies and high frequencies . Assuming that is a signal on , the Fourier transform of is obtained as follows: . The pseudocode of the algorithm is shown in Algorithm 1.

Algorithm 1. Graph construction of hyperspectral images.

Require

1: 3D hyperspectral data cube

2: Number of nearest neighbors k

Ensure

3: Graph , where V is the set of nodes, E is the

set of edges, and X is the feature matrix.

4: Flatten the 3D data cube X into a 2D matrix , where

.

5: Treat each pixel as a node .

6: Use KNN algorithm to compute edges E between nodes based on

pixel values.

7: Construct adjacency matrix A and degree matrix D.

8: Compute Laplacian matrix L = D−A.

9: return Graph

Graph attention network

GANs [35] leverage the attention mechanism to dynamically assign importance to the nodes’ neighbors during feature aggregation, allowing a more nuanced representation of graph-structured data and adaptation to the specificities of each node’s local neighborhood. The core idea behind GANs is to compute attention coefficients that reflect the importance of each neighbor’s features to a node. These coefficients are calculated as follows:

Linear Transformation: First, a shared linear transformation, parameterized by a weight matrix , is applied to every node’s features , where is the size of the new feature space, and F is the size of the original feature space. This transformation is described using:

(1)

This step maps the node features into a space where the attention mechanism can be applied more efficiently.

Pairwise Attention Scores: For each pair of nodes, an attention mechanism computes a raw attention score e_ij that indicates the importance of node j’s features to the node i, described as follows:

(2)

A common choice for the attention mechanism a is a single-layer feed forward neural network, parameterized by a weight vector , and by applying the LeakyReLU nonlinearity:

(3)

where denotes concatenation.

Normalization of Attention Scores: The raw scores are then normalized across all choices of j using the softmax function to facilitate straightforward comparison:

(4)

where denotes the set of neighbors of the node i and represents the normalized attention coefficient that quantifies the importance of node j’s features to node i.

Feature Update: Finally, the node features are updated by computing a weighted combination of the neighbors’ features using the normalized attention coefficients:

(5)

where σ denotes an activation function, and is the updated feature vector of the node i.

Through these mechanisms, GANs adaptively focus on the most relevant parts of the graph structure for each node, leading to more effective learning from graph-structured data.

Right-shift phenomenon

For any , the k-th low-frequency energy ratio as the accumulated energy distribution in the first k eigenvalues, is described as follows [36]:

(6)

A larger indicates that a larger part of the energy corresponds to the first k eigenvalues. The calculation of the spectral energy rate depends on Laplacian matrix decomposition; however, matrix decomposition and multiplication increase computational complexity. To improve computational efficiency, a high-frequency area is introduced [36]. It is assumed that the low-frequency energy ratio curve f(t) is defined as , where and . The area between f(t) and is defined as the high-frequency area:

(7)

The calculation of avoids Laplacian matrix decomposition and reduces computational complexity. In the work of [36], the authors proposed the idea of right-shift phenomenon. The presence of anomalies leads to a right shift in spectral energy, indicating that the spectral energy distribution is concentrated less at low frequencies and more at high frequencies. monotonically increases with the anomaly degree; therefore, can be used to represent the right-shift phenomenon and to measure the influence of the anomalous target in the spectral domain.

GAN–BWGNN HAD

Because the graph spectral energy shows the right-shift phenomenon in the spectral domain, the graph signal can be projected in the spectral domain via a Fourier transform. Nonetheless, the Fourier transform relies on the eigendecomposition of the Laplacian matrix, and the eigenvector is not a sparse matrix. Therefore, the calculation cost is exceedingly high regardless of whether eigendecomposition or eigenvector multiplication is performed. Moreover, the Fourier transform encompasses elements from the entire spectral domain, which does not possess adequate locality to accurately represent the right-shift phenomenon. The graph wavelet transform can overcome the limitations of the Fourier transform. This way, the Fourier transform basis can be replaced with the wavelet transform basis. A set of wavelets as the basis can be expressed as follows: , where is described as

(8)

where is the eigenvector of the Laplacian matrix, is a kernel function, , s is a scale coefficient that describes different scales of the wavelet base, . The wavelet transform of the graph signal x based on is described as follows:

(9)

According to Parseval’s theorem, the kernel function must satisfy the wavelet admissibility condition [37]:

(10)

Here, g_si(w) is a bandpass filter; . The wavelet function rapidly decays as it approaches 0 and infinity; therefore, the wavelet transform has good locality.

The graph wavelet transform projects the graph signal into the frequency domain via a Fourier transform and then filters the signal in the frequency domain via the wavelet function. Finally, the filtering result is projected back into the original domain via the Fourier transform. Owing to the right-shift phenomenon, the presence of anomalous targets leads to the concentration of spectral energy at high frequencies. Therefore, the selection of filters is important for capturing anomalous target signals. GNNs are usually low-pass filters or adaptive filters, which cannot capture the right-shift phenomenon. Furthermore, the kernel function selects a polynomial function or uses polynomial function approximation to avoid high computational complexity due to Laplacian matrix decomposition. Therefore, the beta distribution is selected as the graph wavelet base. The beta function is denoted by

(11)

The probability density function of beta distribution is described as follows:

(12)

where , and is a constant. As the eigenvalues of the normalized graph Laplacian L satisfy , we adjust the probability density function of the beta distribution as follows:

(13)

where , is a polynomial, and Beta wavelet can be written as

(14)

Through recursive computation of the powers of L, such as and , the polynomial kernel can be efficiently implemented. This approach avoids the high computational cost that would otherwise be incurred by explicit eigendecomposition. Let a + b = C be a constant,the Beta wavelet transform is constructed using a set of Beta wavelets with the same order:

(15)

In this equation, is a low-pass filter, and the rest are bandpass filters of different scales. Besides, according to the Parseval theorem, when a > 1, the kernel function satisfies the wavelet admissibility condition:

(16)

Herein, we propose a novel HAD algorithm that combines the GAN and BWGNN to enhance detection performance. The algorithm works in two stages: spatial-domain processing and frequency-domain processing.

In the spatial-domain processing stage, the algorithm employs the GAN to capture the spatial information in the hyperspectral images. GANs utilize an attention mechanism to assign different weights to each pixel and its neighbors, effectively extracting local spatial features. This step enhances the model’s understanding of the spatial context, laying the foundation for subsequent frequency-domain processing. The pseudocode of the algorithm is shown in Algorithm 2.

Algorithm 2. Spatial domain processing using graph attention network.

Thus, during the frequency-domain processing phase, the algorithm employs the BWGNN to examine the spectral attributes of hyperspectral images. Due to the right-shift phenomenon in spectral energy observed in hyperspectral data, the BWGNN employs the beta wavelet basis as a filter, which serves as an efficient and localized bandpass filter, particularly adept at capturing the right-shift in spectral energy. By processing in the frequency domain, the algorithm can more accurately identify and locate anomalous targets.

Overall, this combined approach of spatial and frequency domain processing provides a powerful tool for anomaly detection in hyperspectral images. It leverages the spatial information processing capabilities of the GAN and the frequency-domain analysis strengths of the BWGNN, enabling the algorithm to identify anomalies more effectively and thereby improving detection accuracy and efficiency. The network propagation process is described as follows [38]:

(17)

(18)

(19)

(20)

(21)

is a wavelet obtained from Eq. 15, is a multilayer perceptron, and AGG is an aggregation function, such as summation or concatenation. The signal features enter the multilayer perceptron and are filtered using different wavelets in parallel. The filtering results are aggregated as S and transmitted to the perceptron of the next layer. The sigmoid function is used as the network activation function. The network output is the anomaly probability p_i of the pixels. The weighted cross-entropy loss is used for the training:

(22)

where γ is the ratio of anomaly labels (y_i = 1) to normal labels (y_i = 0). The pseudocode of the algorithm is presented in Algorithm 3, and the flow chart of the proposed GAN–BWGNN HAD algorithm is shown in Fig 1.

Download:

Fig 1. Flowchart of the GAN–BWGNN HAD algorithm.

https://doi.org/10.1371/journal.pone.0330640.g001

Algorithm 3. Frequency domain processing using beta wavelet graph neural network (BWGNN).

Require

1: Updated node features

2: Beta wavelet parameters

Ensure

3: Anomaly probability map

4: Define Beta wavelet basis as per Equation 9.

5: for each node do

6: Apply Beta wavelet filter to node features: .

7: Aggregate filtered features across all wavelet scales: .

8: Compute anomaly probability: .

9: end for

10: return Anomaly probability map P

Experiments

The experiments were conducted on six real hyperspectral datasets and one simulated dataset. The hardware configuration consisted of an E5-2680v3 CPU, 128 GiB RAM, and an NVIDIA GeForce RTX 3060 GPU, with software implementation utilizing CUDA 11.6, PyTorch 1.12.0, and Python 3.7.

Hyperspectral images were modeled as homogeneous graphs based on pixel-level relationships. For the GABW-GNN HAD algorithm, model training was conducted using the Adam optimizer, with a learning rate of 0.01. The dimension of the network’s hidden layer was set to 128, and the maximum number of epochs was set to 200. The training ratio was set to . The order C in the beta wavelet was set to 2, whereas k in the KNN algorithm was set to 10. In GABW-GNN HAD, AGG denotes concatenation. The evaluation framework compared the proposed method against three conventional detection algorithms (GRX [13], LRX [13], and CRD [17]) and four advanced deep learning approaches (PDBSNet [26], BockNet [25], DirectNet [27], and NL2Net [28]). All comparative methods employed parameter settings consistent with their original publications to ensure fair benchmarking.

Quantitative assessment included AUC scores, 2D-ROC curves analyzing , , and relationships, 3D-ROC surface visualization, detection color maps for spatial anomaly localization, background-anomaly separation boxplots, and runtime comparisons across all methodologies.

Datasets

The datasets used in this study are publicly available and widely utilized in the field of hyperspectral anomaly detection. These datasets, including AVIRIS-I, AVIRIS-II, abu-beach-2, abu-urban-3, Cri, and GrandIsle, are real-world datasets that generally come with pre-labeled information. The labeled data are typically provided by the dataset creators based on known targets in real-world scenes (e.g., airplanes, vehicles, rocks) or through simulated embedding of anomalies. The Salinas-simulate dataset, which is a simulated dataset, has anomalies generated using a linear mixing model, and their locations are explicitly labeled.

1) AVIRIS-I: Captured by the NASA Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) sensor, this dataset has a spatial resolution of 3.5 m per pixel and contains 224 spectral bands covering a wavelength range of 370 to 2510 nm.After eliminating bands affected by low signal-to-noise ratio, sensor issues, and water vapor absorption, 189 bands remained. A sub-image (120 120 pixels) was extracted from the upper-left corner of the original 400×400 pixel image. The dataset features diverse ground objects, including three airplanes, represented by 58 pixels as anomalies.
2) AVIRIS-II: This dataset was obtained from the San Diego Naval Airport, with a spatial resolution of 3.5 m per pixel. A 100×100 pixel sub-image was selected for the experiment. The scene contains various ground objects and a clear spectral distinction between the targets (three airplanes, occupying 134 pixels) and the background, with the target having a well-defined morphology.
3) abu-beach-2: Part of the Airport–Beach–Urban (ABU) dataset [39]. This scene was captured from the beach area in San Diego using the AVIRIS sensor. The dataset has a spatial resolution of 7.5 m per pixel and consists of 100×100 pixels. After removing noise-affected bands, 193 spectral bands remained. The anomalies in this scene were identified as fishing areas.
4) abu-urban-3: Another scene from the Airport-Beach-Urban dataset [39]. This urban scene was captured over Gainesville using the AVIRIS sensor. It has a resolution of 3.5 m per pixel and contains 191 spectral bands. The 100×100 pixel scene includes diverse land covers, such as buildings, roads, vegetation, and vehicles, with the vehicles being identified as anomalies.
5) Salinas-simulate: This dataset was created by embedding anomalous targets into the real Salinas dataset. Six square-shaped anomalous areas were arranged in reverse order, in close proximity to one another. The anomaly spectral signature was generated using a linear mixing model.
6) Cri: Captured by the Nuance Cri hyperspectral sensor [40], this large-scale dataset has a size of 400×400 pixels and includes 46 spectral bands from 650 to 1100 nm. Ten rock anomalies, represented by 2216 pixels, were identified in the scene.
7) Grand Isle: This dataset was captured over Grand Isle, Louisiana, USA, using NASA’s Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) sensor. It has a spatial resolution of 4.4 m per pixel and contains 224 spectral bands spanning the 366-2496 nm wavelength range.The scene (300×480 pixels) features islands, seawater, and artificial platforms in the Gulf of Mexico. The anomalies comprise artificial structures, predominantly oil/gas platforms situated offshore.

Detection results

In the following Table 1, the AUC values of the GABW-GNN HAD are compared with those of eight other methods.

Download:

Table 1. Comparison of AUC values.

https://doi.org/10.1371/journal.pone.0330640.t001

The comparative experimental results indicated that for the AVIRIS-I dataset, the LRX and CRD algorithms achieved optimal results with dual window settings of (31, 13) and (17, 15), respectively. As evident from Table 1, the CRD algorithm performs the best, achieving an AUC value of 0.9968. In comparison, the proposed algorithm achieves the second-best performance with an AUC value of 0.9937, showing only a marginal difference of 0.0031 from the CRD algorithm. The slightly lower AUC of the proposed method on the AVIRIS-I dataset compared to CRD can be attributed to the inherent characteristics of the dataset and algorithmic trade-offs. The AVIRIS-I scene contains three compact anomalies (aircrafts, 58 pixels) with high spectral contrast against a homogeneous background. CRD, a local linear collaborative representation method, excels in such scenarios by leveraging the strong correlation between anomalies and their immediate spatial neighbors. In contrast, proposed graph-based approach constructs edges via KNN, which introduces non-local pixel associations. While this global connectivity enhances resilience to noise and irregular backgrounds, it may dilute the localized discriminative features critical for detecting spatially compact anomalies. This suggests that the algorithm’s strength in capturing global spatial-spectral dependencies may trade off sensitivity to highly localized targets. As shown in Fig 2, the proposed algorithm outperforms other methods in the 2D-ROC curve of (P_D, P_F). Its curve lies above the others across most P_F values, indicating higher detection probability (P_D) at lower false alarm rates (P_F). The CRD algorithm performs the second best, while the other algorithms show relatively inferior performance. Based on the PR curve, the proposed methodexhibits sub-optimal yet robust performance: while slightly underperforming CRD (light pink curve) at specific points (e.g., a P_D gap of 0.15 at ), it significantly outperforms other baseline methods across the P_F range, demonstrating superior reliability in balancing detection accuracy and false alarm suppression. As shown in Fig 2, which presents the 2D-ROC () curve, the curves of PDBSNet, BockNet, DirectNet, and NL2Net are overlapping, indicating similar detection performance. The proposed algorithm demonstrates outstanding performance in balancing detection probability and threshold robustness, particularly exhibiting competitive advantages in low to medium threshold ranges. The 2D-ROC () curve in Fig 2 demonstrates that the proposed method maintains a significantly lower false positive rate (p_F) across the entire threshold range (τ) compared to other methods, indicating its advantage in false alarm control. In Fig 3, the detection results for various algorithms are visualized via color maps. Both the GRX and LRX algorithms failed to detect the anomalies in the image. The DirectNet algorithm succeeded in pinpointing the locations of three airplanes; however, their shapes were not clearly defined. The NL2Net algorithm successfully detected the shapes and positions of the three airplanes, but it did not effectively suppress the background. CRD, PDBSNet, and BockNet accurately identified the shapes and locations of the three airplanes. Nevertheless, they lacked sufficient prominence to distinguish themselves from the background, primarily because the algorithms failed to sufficiently enhance the contrast between the anomalies and the surrounding environment. The proposed algorithm generated distinct and vivid representations of the three airplanes, resulting in the highest accuracy in identification compared to the other techniques. Nonetheless, it inaccurately categorized a limited quantity of background pixels as anomalies. As shown in the boxplot (Fig 4), the proposed algorithm exhibits the largest separation between background and anomaly targets (maximum mean difference) compared to other methods, with both data distributions being highly compact (narrow box ranges and minimal outliers).

Download:

Fig 2. Detection accuracy evaluation for the AVIRIS-I dataset.

https://doi.org/10.1371/journal.pone.0330640.g002

Download:

Fig 3. Detection maps obtained using different algorithms for the AVIRIS-I dataset.

https://doi.org/10.1371/journal.pone.0330640.g003

Download:

Fig 4. Background-anomaly separation in AVIRIS-I dataset.

https://doi.org/10.1371/journal.pone.0330640.g004

Experimental results demonstrated that for the AVIRIS-II dataset, the LRX and CRD algorithms achieved optimal performance with dual window settings of (35, 13) and (21, 19), respectively. As shown in Table 1, the proposed method exhibited the highest AUC value, surpassing all competing algorithms. Fig 5 presents 2D and 3D ROC curves comparing the performance of different methods for hyperspectral anomaly detection. In the 2D ROC curve (P_D,P_F), the proposed method’s curve was closest to the top-left corner, indicating the highest detection rate at equivalent false alarm rates, thereby outperforming other methods. In the 2D ROC curve (P_D,τ), the proposed method demonstrated superior performance under stringent conditions, with a rapid increase in detection rate at high thresholds (). Additionally, in the 2D ROC curve (P_F, τ), the proposed method maintained the lowest false alarm rate. The 3D ROC curve further highlighted the proposed method’s comprehensive advantages in balancing detection rate, false alarm rate, and threshold adaptability. Overall, the proposed method demonstrated the best performance in terms of detection rate, false alarm rate, and threshold optimization. The visualization of detection results using color maps in Fig 6 demonstrated the performance differences among various algorithms. The LRX algorithm failed to identify any anomalies in the scene. While GRX, CRD, PDBSNet, BockNet, and DirectNet were able to detect the locations of the three airplanes, they did not fully outline their shapes. NL2Net exhibited limited background suppression. In contrast, the proposed algorithm achieved the highest detection accuracy, accurately identifying the positions and shapes of the three airplanes while clearly distinguishing them from the background. Despite the presence of background noise, the proposed method demonstrated superior performance in hyperspectral anomaly detection compared to other algorithms. As shown in the boxplot (Fig 7), the proposed algorithm exhibits the largest gap between background and anomaly targets compared to other algorithms.

Download:

Fig 5. Detection accuracy evaluation for the AVIRIS-II dataset.

https://doi.org/10.1371/journal.pone.0330640.g005

Download:

Fig 6. Detection maps obtained using different algorithms for the AVIRIS-II dataset.

https://doi.org/10.1371/journal.pone.0330640.g006

Download:

Fig 7. Background-anomaly separation in AVIRIS-II dataset.

https://doi.org/10.1371/journal.pone.0330640.g007

Experimental results demonstrated that for the abu-beach-2 dataset, the LRX and CRD algorithms achieved optimal performance with dual window settings of (5, 3) and (7, 3), respectively. As shown in Table 1, the proposed algorithm achieved the highest AUC value, indicating its superior performance over comparative methods. Fig 8 demonstrated the performance comparison of different methods for hyperspectral anomaly detection using 2D and 3D ROC curves. In the 2D ROC curve (P_D,P_F), the proposed method achieved the highest detection rate at equivalent false alarm rates, as its curve was closest to the top-left corner, outperforming other methods. In the 2D ROC curve (P_D,τ), the proposed method exhibited superior performance under strict conditions, with a rapid increase in detection rate at high thresholds (). Additionally, in the 2D ROC curve (P_F,τ), the proposed method maintained the lowest false alarm rate, indicating strong background suppression. The 3D ROC curve further highlighted the proposed method’s comprehensive advantages in balancing detection rate, false alarm rate, and threshold adaptability. Overall, the proposed method demonstrated the best performance in terms of detection rate, false alarm rate, and threshold optimization. Fig 9 presents the detection color maps for various algorithms. The GRX algorithm successfully detected all anomalies but exhibited limited background suppression. PDBSNet, DirectNet, and NL2Net failed to identify all anomalies. While LRX, CRD, and BockNet were able to detect the positions of anomalies, the detected targets lacked clarity. The proposed method distinctly differentiated the anomalous objects from the background with remarkable brightness and clarity, despite a minor number of background pixels being erroneously labeled as anomalies. Fig 10 presents box plots comparing the separation of background and anomaly targets across different algorithms. The results show that NL2Net and PDBSNet achieved the highest separation, followed closely by the proposed method.

Download:

Fig 8. Detection accuracy evaluation for the abu-beach-2 dataset.

https://doi.org/10.1371/journal.pone.0330640.g008

Download:

Fig 9. Detection maps obtained using different algorithms for the abu-beach-2 dataset.

https://doi.org/10.1371/journal.pone.0330640.g009

Download:

Fig 10. Background-anomaly separation in abu-beach-2 dataset.

https://doi.org/10.1371/journal.pone.0330640.g010

Experimental results demonstrated that for the abu-urban-3 dataset, the LRX and CRD algorithms achieved optimal performance with dual window settings of (35, 33) and (9, 3), respectively. As demonstrated in Table 1, the proposed algorithm achieved the highest AUC value, highlighting its superior detection capability. Fig 11 demonstrated the performance comparison of different methods for hyperspectral anomaly detection using 2D and 3D ROC curves. In the 2D ROC curve (P_D,P_F), the proposed method’s curve is closest to the top-left corner, indicating superior detection performance compared to other methods. In the 2D ROC curve (P_D,τ), the proposed method achieves a rapid increase in detection rate at high thresholds (), demonstrating strong performance under strict conditions. In the 2D ROC curve (P_F,τ), the proposed method maintains the lowest false alarm rate across varying thresholds. The 3D ROC curve further highlights the proposed method’s comprehensive advantages in balancing detection rate, false alarm rate, and threshold adaptability. Overall, the proposed method outperforms other algorithms in terms of detection rate, false alarm rate, and threshold optimization. The anomaly detection results visualized via color maps in Fig 12 showed that the LRX algorithm failed to identify any anomalies within the scene. Algorithms such as GRX, CRD, PDBSNet, and NL2Net exhibited limited background suppression, leaving residual noise in the results. BockNet and DirectNet were able to detect anomalies but lacked clarity in outlining their shapes. In contrast, the proposed algorithm accurately identified the positions and shapes of all anomalies with higher clarity, though it had minor misclassifications. As shown in the boxplot (Fig 13), the proposed algorithm exhibits the largest gap between background and anomaly targets compared to other algorithms.

Download:

Fig 11. Detection accuracy evaluation for the abu-urban-3 dataset.

https://doi.org/10.1371/journal.pone.0330640.g011

Download:

Fig 12. Detection maps obtained using different algorithms for the abu-urban-3 dataset.

https://doi.org/10.1371/journal.pone.0330640.g012

Download:

Fig 13. Background-anomaly separation in abu-urban-3 dataset.

https://doi.org/10.1371/journal.pone.0330640.g013

In experiments on the Salinas-simulate dataset, the optimal dual-window configurations for LRX and CRD were determined as (25, 23) and (15, 3), respectively. As shown in Table 1, the proposed algorithm achieved the highest AUC value, surpassing all baseline methods and validating its enhanced detection accuracy. Fig 14 demonstrated the performance comparison of different methods for hyperspectral anomaly detection using 2D and 3D ROC curves. In the 2D ROC curve (P_D,P_F), the proposed method appears as a straight line consistently positioned at the top (), demonstrating its ability to achieve near-perfect detection performance across all P_F ranges and significantly outperforming the other comparative methods. In the 2D ROC curve (P_D,τ), the proposed algorithm forms a near-straight line at the top () across the logarithmic threshold range ( to 10⁰), demonstrating its superior and stable detection probability under compared to other methods. Additionally, in the 2D ROC curve (P_F,τ), the proposed method maintains the lowest false alarm rate across varying thresholds, underscoring its robustness. The 3D ROC curve provides a comprehensive view, further emphasizing the proposed method’s balanced performance in detection rate, false alarm rate, and threshold adaptability. Fig 15 displays the anomaly detection results on the Salinas-simulate dataset. While LRX, CRD, and DirectNet failed to detect anomalies entirely, GRX and NL2Net exhibited limited separability between anomalies and background regions. PDBSNet and BockNet successfully identified all anomalous targets, though with partial background interference. In contrast, the proposed algorithm precisely localized both the positions and geometric contours of anomalies, effectively enhancing their visibility against the background with minimal interference. As shown in the boxplot (Fig 16), the proposed algorithm exhibits the largest separation between background and anomaly targets compared to other methods, with both data distributions being highly compact.

Download:

Fig 14. Detection accuracy evaluation for the Salinas-simulate dataset.

https://doi.org/10.1371/journal.pone.0330640.g014

Download:

Fig 15. Detection maps obtained by different algorithms for the Salinas-simulate dataset.

https://doi.org/10.1371/journal.pone.0330640.g015

Download:

Fig 16. Background-anomaly separation in Salinas-simulate dataset.

https://doi.org/10.1371/journal.pone.0330640.g016

In experiments on the GrandIsle dataset, the optimal dual-window configurations for LRX and CRD were both determined as (11, 9). As shown in Table 1, The proposed method achieved an AUC of 0.9966, closely approaching BockNet’s peak performance (0.9989) with a marginal gap of merely 0.0023, demonstrating near-state-of-the-art detection capability. Fig 17 demonstrated the performance comparison of different methods for hyperspectral anomaly detection using 2D and 3D ROC curves. In the 2D ROC curve (P_D,P_F), the BockNet algorithm achieves the highest detection rate at equivalent false alarm rates, with its curve closest to the top-left corner. The proposed method follows closely, indicating strong performance. In the 2D ROC curve (P_D,τ), BockNet shows a significant increase in detection rate at high thresholds (), while the proposed method also demonstrates a considerable rise, highlighting its effectiveness under stringent conditions. In the 2D ROC curve (P_F,τ), the proposed method maintains a lower false alarm rate across varying thresholds. The 3D ROC curve provides a comprehensive view, further emphasizing the proposed method’s balanced performance in detection rate, false alarm rate, and threshold adaptability, placing it as the second-best performer overall. Color maps for the anomaly detection displayed in Fig 18 revealed that the GRX, LRX, CRD,and DirectNet algorithms failed to detect anomalous targets. The PDBSNet, BockNet, and DirectNet algorithms mistakenly classified islands as anomalous targets. In contrast, the proposed algorithm precisely detected the positions and shapes of the anomalous targets, with the targets appearing bright and distinct, albeit with some minor noise. As shown in the boxplot (Fig 19), the proposed algorithm exhibits the largest gap between background and anomaly targets compared to other algorithms.

Download:

Fig 17. Detection accuracy evaluation for the GrandIsle dataset.

https://doi.org/10.1371/journal.pone.0330640.g017

Download:

Fig 18. Detection maps obtained using different algorithms for the Grand Isle dataset.

https://doi.org/10.1371/journal.pone.0330640.g018

Download:

Fig 19. Background-anomaly separation in GrandIsle dataset.

https://doi.org/10.1371/journal.pone.0330640.g019

In experiments on the Cri dataset, the optimal dual-window configurations for LRX and CRD were determined as (25, 21) and (17, 13), respectively. As shown in Table 1, the proposed algorithm achieved the highest AUC value among all methods. Fig 20 demonstrated the performance comparison of different methods for hyperspectral anomaly detection using 2D and 3D ROC curves. In the 2D ROC curve (P_D,P_F), the proposed method achieves the highest detection rate at equivalent false alarm rates, with its curve closest to the top-left corner. In the 2D ROC curve (P_D,τ), the proposed method shows a significant increase in detection rate at high thresholds (), highlighting its effectiveness under stringent conditions. In the 2D ROC curve (P_F,τ), the proposed method maintains the lowest false alarm rate across varying thresholds. The 3D ROC curve further emphasizes the proposed method’s balanced performance in detection rate, false alarm rate, and threshold adaptability. The anomaly detection results for the Cri dataset, visualized in Fig 21, reveal distinct performance variations among methods. While GRX, LRX, BockNet, and DirectNet partially localized anomalies, they failed to delineate target shapes clearly. CRD and NL2Net suffered from inadequate background suppression. In contrast, PDBSNet and the proposed algorithm precisely identified all anomalies in both location and morphology. Notably, the proposed method enhanced target-background contrast by sharply highlighting detected anomalies (rendered as bright regions), despite minor residual noise. As shown in the boxplot (Fig 22), the proposed algorithm exhibits the largest gap between background and anomaly targets compared to other algorithms.

Download:

Fig 20. Detection accuracy evaluation for the Cri dataset.

https://doi.org/10.1371/journal.pone.0330640.g020

Download:

Fig 21. Detection maps obtained using different algorithms for the Cri dataset.

https://doi.org/10.1371/journal.pone.0330640.g021

Download:

Fig 22. Background-anomaly separation in Cri dataset.

https://doi.org/10.1371/journal.pone.0330640.g022

Computational complexity analysis

The computational complexities of the algorithms are as follows:

The GRX algorithm has a complexity of .

The LRX algorithm has a complexity of , where and are the sizes of the outer and inner windows, respectively.

The CRD algorithm has a complexity of .

The computational complexity of PDBSNet is determined by its core components. For a hyperspectral image of size , the pixel-shuffle downsampling (PD) operation has time complexity with stride factor s. The dilated blind-spot network (DBSN) employs N dilated convolution layers, each requiring computations for kernels. The space complexity is dominated by intermediate feature maps at . Combined with the PD-inverse operation (), the overall time complexity becomes , where the quadratic terms in L and N represent the primary computational bottlenecks.

The computational complexity of BockNet is primarily dominated by its simplified U-Net architecture, which involves multiple convolutional and pooling layers. For an input hyperspectral image of size , the encoder and decoder components introduce operations, where K represents the number of convolutional layers. The guard window mechanism requires rotating and fusing four directional feature maps, contributing an additional term. Overall, the complexity scales as , driven by the convolutional operations and linear in the number of spectral bands L and quadratic in spatial dimensions R.

The computational complexity of DirectNet can be analyzed as follows. Given an input hyperspectral patch of size (where is the outer window size and L is the spectral bands), the network depth grows linearly with window size. Each ResNet block contains two convolutional layers, leading to an overall complexity of , where C represents the number of feature channels. This simplifies to since , making the complexity cubic in window size and linear in spectral dimensions.

The computational complexity of NL2Net is dominated by its dual-branch architecture: for an input HSI cube with spatial size and C spectral bands, the local feature extraction branch (LFEB) incurs O(HWk²CD) complexity via masked/dilated convolutions, while the nonlocal branch (NLFEB) reduces self-attention complexity from O((HW)²C) to O(HWG²C) through grid attention partitioning ( sub-regions). The pixel-shuffle downsampling with stride F further scales spatial dimensions by 1/F², yielding an overall complexity of .

Among them, the GAN–BWGNN HAD algorithm has the lowest computational complexity because the β wavelet in GAN–BWGNN HAD is a polynomial, avoiding the Laplacian matrix decomposition and matrix multiplication, hence having a computational complexity of . The algorithms runtimes are summarized in Table 2.

Download:

Table 2. Comparison of running time (seconds).

https://doi.org/10.1371/journal.pone.0330640.t002

Based on the runtime comparisons in Table 2, the proposed method achieves sub-second detection (0.20–0.28 s) on most datasets (AVIRIS-I/II, abu-beach-2/urban-3, Salinas-simulate), significantly outperforming traditional methods (LRX/CRD) by a speedup factor of 100–500 (e.g., AVIRIS-I: 0.24 s vs. LRX 144.93 s) and deep learning models (NL2Net/BockNet) by 6–8 (Salinas-simulate: 0.28 s vs. NL2Net 1.59 s). However, compared to PDBSNet, the proposed method exhibits marginally longer execution times on the AVIRIS-I (0.2408 s), AVIRIS-II (0.20479 s), abu-beach-2 (0.20179 s), and abu-urban-3 (0.20279 s) datasets, with an average increase of 0.04–0.05 s (corresponding to a 7.8%–10.5% difference) relative to PDBSNet’s execution times (0.2178 s, 0.1628 s, 0.1581 s, and 0.1558 s, respectively). Notably, on the Cri dataset, the proposed method maintains real-time detection with a runtime of 1.606 s, demonstrating a three-order-of-magnitude speedup over CRD (1655 s). These results highlight the efficiency and scalability of the proposed approach across diverse hyperspectral imaging scenarios.

Parameter analysis

1) Analysis of C:In the design framework of Beta Wavelets, the parameters p and q are constrained by p + q = C, where C denotes the order of the wavelet kernel. This constraint ensures that all generated Beta wavelet kernels are C-th order polynomials, systematically covering the full spectral range from low to high frequencies. Specifically, the set of wavelet kernels , generated by iterating , comprises filters with complementary frequency characteristics. For instance, functions as a low-pass filter primarily capturing low-frequency signals. As i increases, the central frequency of the filter shifts toward higher frequencies, exhibiting band-pass properties. At i = C, operates as a high-pass filter, focusing on high-frequency components. This multi-scale architecture enables flexible adaptation to anomaly features across diverse frequency bands.
Proposition 1 (Spectral Locality). Let p > 0, q > 0, and , where is a band-pass distribution. The mean and variance of X are:
When under the condition p = cq, it is shown that and , which can take any value in the interval .
To ensure p and q do not deviate significantly, p = cq is employed. Proposition 4 is derived from the properties of the Beta distribution. It demonstrates that can concentrate on any , indicating that Beta graph wavelets can be designed to target specific frequency bands for anomaly detection.
Proposition 2 (Spatial Locality). Let and be two nodes on a graph . The effect of a one-hot signal on node after the wavelet transform is denoted as . It is observed that is localized within -hops of node . Specifically, if the distance exceeds p + q, then equals zero.
The results of Propositions 4 and 5 suggest that increasing the value of C can enhance spectral locality, but this comes at the expense of reduced spatial locality. Conversely, decreasing C improves spatial locality while potentially degrading spectral locality. This highlights a fundamental trade-off between the two domains.
To tackle the common ”right-shift phenomenon” in spectral anomaly detection, where anomalies mainly appear in high-frequency bands, C is usually set to values for adequate high-frequency coverage.The parameter C is set by selecting the optimal value through experimental comparisons of AUC values across seven datasets, with . The experimental results are shown in Fig 23 below.
The experimental results indicate that the optimal values of parameter C for the datasets AVIRIS-I, AVIRIS-II, abu-beach-2, abu-urban-3, alinas-simulate, GrandIsle, and Cri are 2, 2, 2, 2, 3, 2, and 3, respectively.
2) Analysis of K: Since K controls the number of node connected to each node, that is, it determines the edges of the graph, different values of K will result in different graph structures. A too small value of K (such as K < 10) will make the model overly dependent on adjacent pixels, making it vulnerable to noise interference and difficult to capture long-distance correlations. A too large value of K such as K > 50) will introduce redundant long-distance pixels, diluting the abnormal signals and increasing the computational complexity. In order to determine the optimal parameter, this study conducted a grid search for K = 10, 20, 30, 40, and 50 on seven hyperspectral datasets and compared the AUC performance. The experiment is shown in Fig 24 below.
The experimental results indicate that the optimal values of parameter C for the datasets AVIRIS-I, AVIRIS-II, abu-beach-2, abu-urban-3, alinas-simulate, GrandIsle, and Cri are 40, 20, 30, 10, 20, 50, and 50, respectively.

Download:

Fig 23. Influence of C on the AUC on the seven datasets.

https://doi.org/10.1371/journal.pone.0330640.g023

Download:

Fig 24. Influence of K on the AUC on the seven datasets.

https://doi.org/10.1371/journal.pone.0330640.g024

Ablation study

Based on the ablation experimental results presented in Table 3, the proposed GAN-BWGNN algorithm demonstrates superior anomaly detection performance compared to the baseline BWGNN method (with the GAN module removed) across most hyperspectral datasets. Specifically, in typical scenarios such as abu-beach-2, abu-urban-3, and Grand Isle, the AUC values of GAN-BWGNN reach 0.9961, 0.9982, and 0.9966, respectively, representing relative improvements of 5.35%, 2.07%, and 2.22% over the baseline. This highlights the GAN module’s effectiveness in enhancing the model’s ability to discriminate spatial-spectral features between complex backgrounds and anomalous targets. Notably, both methods achieve saturated performance (0.9999) on the Salinas-simulate dataset, indicating lower detection difficulty in this simulated scenario, while only a 0.32% improvement is observed on the Cri dataset, potentially due to the strong spectral separability between anomalies and backgrounds in this scene. The experimental results conclusively validate that the Graph Attention Network significantly improves the robustness and generalization capability of hyperspectral anomaly detection through multi-scale neighborhood information fusion.

Download:

Table 3. AUC for ablation study.

https://doi.org/10.1371/journal.pone.0330640.t003

Conclusions

We introduced the GAN–BWGNN HAD algorithm, which leverages the spatial context through GANs and addresses the spectral characteristics of anomalies using a BWGNN detector. Experimental results on five real-world datasets and one synthetic dataset demonstrated that GAN–BWGNN HAD had superior detection performance compared to the existing methods. The combination of the GAN and BWGNN allows for the utilization of both spatial and spectral information, providing a comprehensive analysis of hyperspectral data. The attention mechanism of the GAN enhances the adaptability of the proposed method to various spatial contexts, whereas the beta wavelet-based spectral filtering via the BWGNN efficiently captures the right-shifted spectral energy associated with anomalies. Our findings show that the GAN–BWGNN HAD algorithm is a promising tool for HAD, offering improved accuracy and efficiency. Subsequent research will enhance this algorithm, apply it to additional remote sensing tasks, and investigate its potential integration with other sophisticated machine learning methodologies.

References

1. Zhang L, Lin F, Fu B. A joint model based on graph and deep learning for hyperspectral anomaly detection. Infrared Phys Technol. 2024;139:105335.
- View Article
- Google Scholar
2. Kumar B, Dikshit O, Gupta A, Singh MK. Feature extraction for hyperspectral image classification: A review. Int J Remote Sensing. 2020;41(16):6248–87.
- View Article
- Google Scholar
3. Duan P, Ghamisi P, Kang X, Rasti B, Li S, Gloaguen R. Fusion of dual spatial information for hyperspectral image classification. IEEE Trans Geosci Remote Sensing. 2021;59(9):7726–38.
- View Article
- Google Scholar
4. Duan P, Kang X, Li S, Ghamisi P. Multichannel pulse-coupled neural network-based hyperspectral image visualization. IEEE Trans Geosci Remote Sensing. 2020;58(4):2444–56.
- View Article
- Google Scholar
5. Qu B, Zheng X, Qian X, Lu X. Research progress on hyperspectral anomaly detection. Natl Remote Sensing Bull. 2024;28(1):42–54.
- View Article
- Google Scholar
6. Ma Z, Jia G, Schaepman ME, Zhao H. Uncertainty analysis for topographic correction of hyperspectral remote sensing images. Remote Sensing. 2020;12(4):705.
- View Article
- Google Scholar
7. Awasthi S, Varade D. Recent advances in the remote sensing of alpine snow: A review. GIScience Remote Sensing. 2021;58(6):852–88.
- View Article
- Google Scholar
8. Chi J, Kim H-C. Retrieval of daily sea ice thickness from AMSR2 passive microwave data using ensemble convolutional neural networks. GIScience Remote Sensing. 2021;58(6):812–30.
- View Article
- Google Scholar
9. Hartling S, Sagan V, Maimaitijiang M. Urban tree species classification using UAV-based multi-sensor data fusion and machine learning. GIScience Remote Sensing. 2021;58(8):1250–75.
- View Article
- Google Scholar
10. Lu B, Proctor C, He Y. Investigating different versions of PROSPECT and PROSAIL for estimating spectral and biophysical properties of photosynthetic and non-photosynthetic vegetation in mixed grasslands. GIScience Remote Sensing. 2021;58(3):354–71.
- View Article
- Google Scholar
11. Paz-Kagan T, Chang JG, Shoshany M, Sternberg M, Karnieli A. Assessment of plant species distribution and diversity along a climatic gradient from Mediterranean woodlands to semi-arid shrublands. GIScience Remote Sensing. 2021;58(6):929–53.
- View Article
- Google Scholar
12. Jamali A, Mahdianpari M, Brisco B, Granger J, Mohammadimanesh F, Salehi B. Deep Forest classifier for wetland mapping using the combination of Sentinel-1 and Sentinel-2 data. GIScience Remote Sensing. 2021;58(7):1072–89.
- View Article
- Google Scholar
13. Reed IS, Yu X. Adaptive multiple-band CFAR detection of an optical pattern with unknown spectral distribution. IEEE Trans Acoust Speech Signal Process. 1990;38(10):1760–70.
- View Article
- Google Scholar
14. Chang S, Du B, Zhang L. BASO: A background-anomaly component projection and separation optimized filter for anomaly detection in hyperspectral images. IEEE Trans Geosci Remote Sensing. 2018;56(7):3747–61.
- View Article
- Google Scholar
15. Sun X, Zhuang L, Gao L, Gao H, Sun X, Zhang B. Information entropy estimation based on point-set topology for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2024;62:1–15.
- View Article
- Google Scholar
16. Gao L, Sun X, Sun X, Zhuang L, Du Q, Zhang B. Hyperspectral anomaly detection based on chessboard topology. IEEE Trans Geosci Remote Sensing. 2023;61:1–16.
- View Article
- Google Scholar
17. Li W, Du Q. Collaborative representation for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2015;53(3):1463–74.
- View Article
- Google Scholar
18. Lin S, Cheng X, Zeng Y, Huo Y, Zhang M, Wang H. Low-rank and sparse representation inspired interpretable network for hyperspectral anomaly detection. IEEE Trans Instrum Meas. 2024;73:1–16.
- View Article
- Google Scholar
19. Xiao Q, Zhao L, Chen S, Li X. Hyperspectral anomaly detection via enhanced low-rank and smoothness fusion regularization plus saliency prior. IEEE J Sel Top Appl Earth Observ Remote Sensing. 2024;17:18987–9002.
- View Article
- Google Scholar
20. Ren L, Gao L, Wang M, Sun X, Chanussot J. HADGSM: A unified nonconvex framework for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2024;62:1–15.
- View Article
- Google Scholar
21. Wang M, Gao L, Ren L, Sun X, Chanussot J. Hyperspectral simultaneous anomaly detection and denoising: Insights from integrative perspective. IEEE J Sel Top Appl Earth Observ Remote Sensing. 2024;17:13966–80.
- View Article
- Google Scholar
22. Gao L, Wang D, Zhuang L, Sun X, Huang M, Plaza A. BS3LNet: A new blind-spot self-supervised learning network for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2023;61:1–18.
- View Article
- Google Scholar
23. Liu S, Peng L, Chang X, Wang Z, Wen G, Zhu C. Adaptive dual-domain learning for hyperspectral anomaly detection with state-space models. IEEE Trans Geosci Remote Sensing. 2025;63:1–19.
- View Article
- Google Scholar
24. Li W, Wu G, Du Q. Transferred deep learning for anomaly detection in hyperspectral imagery. IEEE Geosci Remote Sensing Lett. 2017;14(5):597–601.
- View Article
- Google Scholar
25. Wang D, Zhuang L, Gao L, Sun X, Huang M, Plaza A. BockNet: Blind-block reconstruction network with a guard window for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2023;61:1–16.
- View Article
- Google Scholar
26. Wang D, Zhuang L, Gao L, Sun X, Huang M, Plaza AJ. PDBSNet: Pixel-shuffle downsampling blind-spot reconstruction network for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2023;61:1–14.
- View Article
- Google Scholar
27. Wang D, Zhuang L, Gao L, Sun X, Zhao X, Plaza A. Sliding dual-window-inspired reconstruction network for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2024;62:1–15.
- View Article
- Google Scholar
28. Wang D, Ren L, Sun X, Gao L, Chanussot J. Nonlocal and local feature-coupled self-supervised network for hyperspectral anomaly detection. IEEE J Sel Top Appl Earth Observ Remote Sensing. 2025;18:6981–93.
- View Article
- Google Scholar
29. Wang D, Zhuang L, Gao L, Sun X, Zhao X. Global feature-injected blind-spot network for hyperspectral anomaly detection. IEEE Geosci Remote Sensing Lett. 2024;21:1–5.
- View Article
- Google Scholar
30. Wang D, Gao L, Qu Y, Sun X, Liao W. Frequency-to-spectrum mapping GAN for semisupervised hyperspectral anomaly detection. CAAI Trans on Intel Tech. 2023;8(4):1258–73.
- View Article
- Google Scholar
31. Hu H, Yao M, He F, Zhang F. Graph neural network via edge convolution for hyperspectral image classification. IEEE Geosci Remote Sensing Lett. 2022;19:1–5.
- View Article
- Google Scholar
32. Dong Y, Liu Q, Du B, Zhang L. Weighted feature fusion of convolutional neural network and graph attention network for hyperspectral image classification. IEEE Trans Image Process. 2022;31:1559–72. pmid:35077363
- View Article
- PubMed/NCBI
- Google Scholar
33. Hu H, Ding Y, He F, Zhang F, Zhao J, Yao M. Bi-kernel graph neural network with adaptive propagation mechanism for hyperspectral image classification. Remote Sensing. 2022;14(24):6224.
- View Article
- Google Scholar
34. Tu B, Wang Z, Ouyang H, Yang X, Li J, Plaza A. Hyperspectral anomaly detection using the spectral–spatial graph. IEEE Trans Geosci Remote Sensing. 2022;60:1–14.
- View Article
- Google Scholar
35. Veličković P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y. Graph attention networks. arXiv preprint arXiv:171010903; 2017.
36. Tang J, Li J, Gao Z, Li J. In: 2022.
37. Hammond DK, Vandergheynst P, Gribonval R. Wavelets on graphs via spectral graph theory. Appl Computat Harmon Anal. 2011;30(2):129–50.
- View Article
- Google Scholar
38. Kipf TN, Welling M. Semi-supervised classification with graph convolutional networks. arXiv preprint; 2016. https://doi.org/arXiv:160902907
39. Kang X, Zhang X, Li S, Li K, Li J, Benediktsson JA. Hyperspectral anomaly detection with attribute and edge-preserving filters. IEEE Trans Geosci Remote Sensing. 2017;55(10):5600–11.
- View Article
- Google Scholar
40. Zhang Y, Du B, Zhang L, Wang S. A low-rank and sparse matrix decomposition-based mahalanobis distance method for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2016;54(3):1376–89.
- View Article
- Google Scholar

[ref1] 1. Zhang L, Lin F, Fu B. A joint model based on graph and deep learning for hyperspectral anomaly detection. Infrared Phys Technol. 2024;139:105335.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Kumar B, Dikshit O, Gupta A, Singh MK. Feature extraction for hyperspectral image classification: A review. Int J Remote Sensing. 2020;41(16):6248–87.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Duan P, Ghamisi P, Kang X, Rasti B, Li S, Gloaguen R. Fusion of dual spatial information for hyperspectral image classification. IEEE Trans Geosci Remote Sensing. 2021;59(9):7726–38.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Duan P, Kang X, Li S, Ghamisi P. Multichannel pulse-coupled neural network-based hyperspectral image visualization. IEEE Trans Geosci Remote Sensing. 2020;58(4):2444–56.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref5] 5. Qu B, Zheng X, Qian X, Lu X. Research progress on hyperspectral anomaly detection. Natl Remote Sensing Bull. 2024;28(1):42–54.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref6] 6. Ma Z, Jia G, Schaepman ME, Zhao H. Uncertainty analysis for topographic correction of hyperspectral remote sensing images. Remote Sensing. 2020;12(4):705.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Awasthi S, Varade D. Recent advances in the remote sensing of alpine snow: A review. GIScience Remote Sensing. 2021;58(6):852–88.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref8] 8. Chi J, Kim H-C. Retrieval of daily sea ice thickness from AMSR2 passive microwave data using ensemble convolutional neural networks. GIScience Remote Sensing. 2021;58(6):812–30.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref9] 9. Hartling S, Sagan V, Maimaitijiang M. Urban tree species classification using UAV-based multi-sensor data fusion and machine learning. GIScience Remote Sensing. 2021;58(8):1250–75.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref10] 10. Lu B, Proctor C, He Y. Investigating different versions of PROSPECT and PROSAIL for estimating spectral and biophysical properties of photosynthetic and non-photosynthetic vegetation in mixed grasslands. GIScience Remote Sensing. 2021;58(3):354–71.
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref11] 11. Paz-Kagan T, Chang JG, Shoshany M, Sternberg M, Karnieli A. Assessment of plant species distribution and diversity along a climatic gradient from Mediterranean woodlands to semi-arid shrublands. GIScience Remote Sensing. 2021;58(6):929–53.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref12] 12. Jamali A, Mahdianpari M, Brisco B, Granger J, Mohammadimanesh F, Salehi B. Deep Forest classifier for wetland mapping using the combination of Sentinel-1 and Sentinel-2 data. GIScience Remote Sensing. 2021;58(7):1072–89.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref13] 13. Reed IS, Yu X. Adaptive multiple-band CFAR detection of an optical pattern with unknown spectral distribution. IEEE Trans Acoust Speech Signal Process. 1990;38(10):1760–70.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref14] 14. Chang S, Du B, Zhang L. BASO: A background-anomaly component projection and separation optimized filter for anomaly detection in hyperspectral images. IEEE Trans Geosci Remote Sensing. 2018;56(7):3747–61.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref15] 15. Sun X, Zhuang L, Gao L, Gao H, Sun X, Zhang B. Information entropy estimation based on point-set topology for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2024;62:1–15.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref16] 16. Gao L, Sun X, Sun X, Zhuang L, Du Q, Zhang B. Hyperspectral anomaly detection based on chessboard topology. IEEE Trans Geosci Remote Sensing. 2023;61:1–16.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref17] 17. Li W, Du Q. Collaborative representation for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2015;53(3):1463–74.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref18] 18. Lin S, Cheng X, Zeng Y, Huo Y, Zhang M, Wang H. Low-rank and sparse representation inspired interpretable network for hyperspectral anomaly detection. IEEE Trans Instrum Meas. 2024;73:1–16.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref19] 19. Xiao Q, Zhao L, Chen S, Li X. Hyperspectral anomaly detection via enhanced low-rank and smoothness fusion regularization plus saliency prior. IEEE J Sel Top Appl Earth Observ Remote Sensing. 2024;17:18987–9002.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref20] 20. Ren L, Gao L, Wang M, Sun X, Chanussot J. HADGSM: A unified nonconvex framework for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2024;62:1–15.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref21] 21. Wang M, Gao L, Ren L, Sun X, Chanussot J. Hyperspectral simultaneous anomaly detection and denoising: Insights from integrative perspective. IEEE J Sel Top Appl Earth Observ Remote Sensing. 2024;17:13966–80.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref22] 22. Gao L, Wang D, Zhuang L, Sun X, Huang M, Plaza A. BS3LNet: A new blind-spot self-supervised learning network for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2023;61:1–18.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref23] 23. Liu S, Peng L, Chang X, Wang Z, Wen G, Zhu C. Adaptive dual-domain learning for hyperspectral anomaly detection with state-space models. IEEE Trans Geosci Remote Sensing. 2025;63:1–19.
View Article
Google Scholar

[68] View Article

[69] Google Scholar

[ref24] 24. Li W, Wu G, Du Q. Transferred deep learning for anomaly detection in hyperspectral imagery. IEEE Geosci Remote Sensing Lett. 2017;14(5):597–601.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref25] 25. Wang D, Zhuang L, Gao L, Sun X, Huang M, Plaza A. BockNet: Blind-block reconstruction network with a guard window for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2023;61:1–16.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref26] 26. Wang D, Zhuang L, Gao L, Sun X, Huang M, Plaza AJ. PDBSNet: Pixel-shuffle downsampling blind-spot reconstruction network for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2023;61:1–14.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref27] 27. Wang D, Zhuang L, Gao L, Sun X, Zhao X, Plaza A. Sliding dual-window-inspired reconstruction network for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2024;62:1–15.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref28] 28. Wang D, Ren L, Sun X, Gao L, Chanussot J. Nonlocal and local feature-coupled self-supervised network for hyperspectral anomaly detection. IEEE J Sel Top Appl Earth Observ Remote Sensing. 2025;18:6981–93.
View Article
Google Scholar

[83] View Article

[84] Google Scholar

[ref29] 29. Wang D, Zhuang L, Gao L, Sun X, Zhao X. Global feature-injected blind-spot network for hyperspectral anomaly detection. IEEE Geosci Remote Sensing Lett. 2024;21:1–5.
View Article
Google Scholar

[86] View Article

[87] Google Scholar

[ref30] 30. Wang D, Gao L, Qu Y, Sun X, Liao W. Frequency-to-spectrum mapping GAN for semisupervised hyperspectral anomaly detection. CAAI Trans on Intel Tech. 2023;8(4):1258–73.
View Article
Google Scholar

[89] View Article

[90] Google Scholar

[ref31] 31. Hu H, Yao M, He F, Zhang F. Graph neural network via edge convolution for hyperspectral image classification. IEEE Geosci Remote Sensing Lett. 2022;19:1–5.
View Article
Google Scholar

[92] View Article

[93] Google Scholar

[ref32] 32. Dong Y, Liu Q, Du B, Zhang L. Weighted feature fusion of convolutional neural network and graph attention network for hyperspectral image classification. IEEE Trans Image Process. 2022;31:1559–72. pmid:35077363
View Article
PubMed/NCBI
Google Scholar

[95] View Article

[96] PubMed/NCBI

[97] Google Scholar

[ref33] 33. Hu H, Ding Y, He F, Zhang F, Zhao J, Yao M. Bi-kernel graph neural network with adaptive propagation mechanism for hyperspectral image classification. Remote Sensing. 2022;14(24):6224.
View Article
Google Scholar

[99] View Article

[100] Google Scholar

[ref34] 34. Tu B, Wang Z, Ouyang H, Yang X, Li J, Plaza A. Hyperspectral anomaly detection using the spectral–spatial graph. IEEE Trans Geosci Remote Sensing. 2022;60:1–14.
View Article
Google Scholar

[102] View Article

[103] Google Scholar

[ref35] 35. Veličković P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y. Graph attention networks. arXiv preprint arXiv:171010903; 2017.

[ref36] 36. Tang J, Li J, Gao Z, Li J. In: 2022.

[ref37] 37. Hammond DK, Vandergheynst P, Gribonval R. Wavelets on graphs via spectral graph theory. Appl Computat Harmon Anal. 2011;30(2):129–50.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref38] 38. Kipf TN, Welling M. Semi-supervised classification with graph convolutional networks. arXiv preprint; 2016. https://doi.org/arXiv:160902907

[ref39] 39. Kang X, Zhang X, Li S, Li K, Li J, Benediktsson JA. Hyperspectral anomaly detection with attribute and edge-preserving filters. IEEE Trans Geosci Remote Sensing. 2017;55(10):5600–11.
View Article
Google Scholar

[111] View Article

[112] Google Scholar

[ref40] 40. Zhang Y, Du B, Zhang L, Wang S. A low-rank and sparse matrix decomposition-based mahalanobis distance method for hyperspectral anomaly detection. IEEE Trans Geosci Remote Sensing. 2016;54(3):1376–89.
View Article
Google Scholar

[114] View Article

[115] Google Scholar

Figures

Abstract

Introduction

Proposed algorithm

Graph construction of hyperspectral images

Graph attention network

Right-shift phenomenon

GAN–BWGNN HAD

Experiments

Datasets

Detection results

Computational complexity analysis

Parameter analysis

Ablation study

Conclusions

References