SpaMWGDA: Identifying spatial domains of spatial transcriptomes using multi-view weighted fusion graph convolutional network and data augmentation

Lin Yuan; Boyuan Meng; Qingxiang Wang; Chunyu Hu; Cuihong Wang; De-Shuang Huang

doi:10.1371/journal.pcbi.1013667

Abstract

The rapid development of spatial transcriptomics (ST) has made it possible to effectively integrate gene expression and spatial information of cells and accurately identify spatial domains. A large number of deep learning (DL)-based methods have been proposed to perform spatial domain identification and achieved impressive results. However, these methods have some limitations. First, these methods rely on a fixed similarity metric and cannot fully utilize neighborhood information. Second, they cannot efficiently and adaptively integrate key information when fusing and reconstructing gene expression using purely additive methods. Finally, these methods ignore key nonlinear features and introduce noise during clustering. To address these limitations, we propose a novel DL model SpaMWGDA based on multi-view weighted fused graph convolutional network (GCN) and data augmentation. By modeling spatial information using different similarity metrics, the model is able to successfully capture comprehensive neighborhood information of the spot features. By combining data augmentation and contrastive learning, SpaMWGDA is able to learn key gene expressions. SpaMWGDA uses a multi-view GCN encoder to model the similarities between spatial information and gene features, and uses a view-level attention mechanism for weighted fusion to adaptively learn the dependencies between them and learn the key features of each view. Experimental results not only demonstrate that SpaMWGDA outperforms competing methods in spatial domain identification and trajectory inference but also show the ability of SpaMWGDA to analyse tissue structure and function. The source code for SpaMWGDA is available at https://github.com/nathanyl/SpaMWGDA .

Author summary

DL-based methods have some limitations. First, these methods rely on a fixed similarity metric and cannot fully utilize neighborhood information. Second, they cannot efficiently and adaptively integrate key information when fusing and reconstructing gene expression using purely additive methods. Finally, these methods ignore key nonlinear features and introduce noise during clustering. We propose a novel DL model SpaMWGDA based on multi-view weighted fused graph convolutional network (GCN) and data augmentation. SpaMWGDA uses a multi-view GCN encoder to model the similarities between spatial information and gene features, and uses a view-level attention mechanism for weighted fusion to adaptively learn the dependencies between them and learn the key features of each view. Experimental results not only demonstrate that SpaMWGDA outperforms competing methods in spatial domain identification and trajectory inference but also show the ability of SpaMWGDA to analyse tissue structure and function.

Citation: Yuan L, Meng B, Wang Q, Hu C, Wang C, Huang D-S (2025) SpaMWGDA: Identifying spatial domains of spatial transcriptomes using multi-view weighted fusion graph convolutional network and data augmentation. PLoS Comput Biol 21(11): e1013667. https://doi.org/10.1371/journal.pcbi.1013667

Editor: Guang-Zhong Wang, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, CHINA

Received: July 4, 2025; Accepted: October 27, 2025; Published: November 12, 2025

Copyright: © 2025 Yuan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The code and data of SpaMWGDA is available at https://github.com/nathanyl/SpaMWGDA.

Funding: LY is supported by the National Natural Science Foundation of China (Nos. 62472239, 62002189), the Shandong Provincial Natural Science Foundation (No. ZR2024MF011), the Ability Improvement Project of Science and Technology SMES in Shandong Province (2023TSGC0279), the Youth Innovation Team of Colleges and Universities in Shandong Province (2023KJ329). CHW is supported by the Cultivation Fund of the Second Hospital of Shandong University (No. 2023JX16), supported by the Shandong Medical Association Qilu medical special project (YKH2022K02112), and supported by the Shandong Province Key Research and Development Program-International Scientific and Technological Cooperation Project (2024KJHZ029). DSH is supported by grants from the National Natural Science Foundation of China, Nos. 62333018, 62372255, 62073231, partly supported by the Joint Project of National Natural Science Foundation of China and Russian Science Foundation (W2412087), and supported by the Natural Science Foundation of Guizhou Province, No. ZK2024ZD035, and supported by the Natural Science Foundation of Ningbo City under Grant No.2023J199, and supported by Key Research and Development (Digital Twin) Program of Ningbo City under Grant Nos.2023Z219, 2023Z226. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Spatial transcriptomics (ST) technologies [1–5] can pinpoint gene expression while maintaining the structural integrity of tissues, helping us to understand molecular communication and tissue structure. Spatial domains are areas with similar spatial gene expression distribution.

Precise identification of the spatial domain is a crucial step in ST data analysis. It helps to elucidate the complex relationships between gene expression, its spatial characteristics, and tissue functions, and also aids in understanding the distribution and interactions of cells within the tissue structure [6–8]. However, accurately identifying spatial domains remains challenging due to the frequent presence of loss and noise in the generated data. Many computational methods have been developed for spatial domain identification. The shallow learning algorithms (e.g., K-means, Louvain [9] and Leiden [10]) are often used in Scanpy [11] or Seurat [12] packages to form integrated analysis workflows. The spatial domains identified by these methods are usually discontinuous because they ignore the adjacent relationships between spatial domains [13]. Giotto [14] identified spatial domains by comparing the intrinsic gene expression patterns of neighboring cells using hidden Markov random field (HMRF). The method is computationally complex and difficult to deal with large-scale datasets. In addition, HMRF may have limitations in capturing nonlinear relationships between cells. BayesSpace [15], based on a fully Bayesian statistical approach, encouraged neighboring cells to belong to the same group via predefined spatial priors. However, BayesSpace fails to effectively utilize spatial coordinates, and the choice of spatial priors may affect the clustering results.

Recently, deep learning (DL)-based spatial clustering methods have emerged as powerful tools for spatial domain identification [16–24]. For example, stlearn [25] efficiently performed spatial domain identification by integrating gene expression, spatial location, and tissue morphology. spaGCN [26] employed an undirected weighted graph to represent spatial data dependencies and incorporated spatial location, histological images, and gene expression into the graph’s construction to identify spatial domains. DeepST [27] used a denoising autoencoder and a graph neural network (GNN) autoencoder to infer latent embeddings in enhanced ST data. However, noise and inaccurate spot relationships in histological images may lead to erroneous spatial domain identification results. To better utilize high-resolution ST data, researchers considered the spatial dependencies of gene expression by modeling the similarity between neighboring spots. For example, SEDR [28] used a deep autoencoder and a variational graph autoencoder to integrate transcriptomics data with relevant spatial information. STAGATE [29] employed a graph attention autoencoder to integrate spatial information and gene expression, learning latent representations to distinguish spatial domains. GraphST [30] combined GNNs with self-supervised contrastive learning to identify spatial domains by learning spot representations. Although these methods can identify spots as distinct regions, they cannot adaptively capture the interrelationships between gene expression and spatial information. Recently, multi-view graph convolutional network (GCN)-based methods have been proposed to learn the relationship between spatial location and gene expression more effectively [31]. For example, Spatial-MGCN [32] utilized a multi-view GCN encoder to identify spatial domains. STMGCN [33] introduced an unsupervised learning framework that learns view-specific spot representations and integrates them with an attention mechanism to generate the final representation of the spots. But multi-view convolutional networks encounter several challenges, including noise and inconsistent feature distributions across views, which can hinder effective fusion and compromise stability. Additionally, certain views may contribute minimally, and inappropriate fusion strategies may introduce redundancy, further affecting model performance.

Although these methods have achieved remarkable results, there are still some limitations. First, these methods rely on a fixed similarity metric and cannot fully utilize neighborhood information. Second, they cannot efficiently and adaptively integrate key information when fusing and reconstructing gene expression using purely additive methods. Finally, these methods ignore key nonlinear features and introduce noise during clustering [34,35].

In this article, we are interested in integrating multi-view weighted fusion graph convolutional network (GCN) and data augmentation to guide deep learning architecture to accurately identify spatial domains. Multi-view Graph Convolutional Networks enhance the analytical accuracy and generalization capabilities of spatial transcriptomics data by integrating data from multiple perspectives to augment the model’s expressive power. The weighted fusion attention mechanism dynamically adjusts feature weights to identify critical regions or genes, thereby improving prediction precision. Data augmentation enhances model robustness by generating new samples, particularly in scenarios with sparse or imbalanced data, further boosting generalization performance.

Here, we name the proposed spatial domain identification method as SpaMWGDA. The workflow of SpaMWGDA is presented in Fig 1. First, SpaMWGDA employs KNN and Radius to construct a neighborhood graph [36,37], which can help the model successfully capture the comprehensive neighborhood information of point features. Second, SpaMWGDA learns key feature representations from original and augmented gene expression data by constructing graph-embedded contrastive encoder [38,39]. Third, SpaMWGDA integrates a multi-view GCN encoder and a ZINB (Zero-Inflated Negative Binomial) decoder [40,41] to reconstruct the gene expression matrix. In multi-view GCN encoder, the weighted fusion attention mechanism dynamically adjusts the weight of each view, enabling the model to better adapt to the contribution of different views, thus improving model performance. The view-level attention mechanism can adaptively and effectively integrate information from multiple views, thereby improving the performance and robustness of the model. Finally, SpaMWGDA uses a spatial regularization constraint to train the model to cluster neighboring points in space and effectively separates spatially non-neighboring points. We compared the performance of SpaMWGDA with seven state-of-the-art methods on five datasets (three from 10 x Visium, one from Stereo-seq platform and one from Xenium). Experimental results not only indicate that SpaMWGDA outperforms seven state-of-the-art methods in spatial domain identification and trajectory inference but also demonstrate the ability of SpaMWGDA to analyse tissue structure and function.

Download:

Fig 1. Schematic overview of SpaMWGDA.

(A) Spatial neighborhood network construction module. (B) Gene expression enhancement module. (C) Multi-view weighted fusion GCN encoder. (D) ZINB decoder. (E) Spatial regularization constraint. (F) Downstream analysis.

https://doi.org/10.1371/journal.pcbi.1013667.g001

Results

Ablation experiments

To explore the contribution of key modules to the performance of SpaMWGDA, we constructed five variants of SpaMWGDA and performed ablation experiments on DLPFC datasets. The five variant models are: (i) (w/o)Radius; (ii)(w/o)KNN;(iii) (w/o)CL; (iv) (w/o)WFA and (v) (w/o)KNN and Radius. (w/o)Radius represents that SpaMWGDA only uses KNN (without Radius) to identify neighboring points. (w/o)KNN represents that SpaMWGDA only uses Radius (without KNN) to identify neighboring points. (w/o)CL represents that SpaMWGDA only use original gene expression to obtain feature representation without using contrastive learning. (w/o)WFA represents that SpaMWGDA uses equal weights instead of dynamic weights in the attention layer. (w/o)KNN and Radius represents that SpaMWGDA uses a distance-based similarity matrix to model spatial information.

As shown in Fig 2A, compared to SpaMWGDA, the ARI and NMI of (w/o)Radius, (w/o)KNN, (w/o)CL, (w/o)WFA and (w/o)KNN and Radius decreased by 16.1% and 16.2%, 17.7% and 17.6%, 21% and 20.6%, 16.1% and 19.1%, and 37.1% and 17.6%, respectively. The ARI and NMI of each module were listed in Table 1. The experimental results highlight the importance of constructing spatial graphs using different similarity metrics, contrastive learning using augmented gene features, and using weighted fusion attention mechanisms. These modules can help the model better identify the interaction between spatial information and genetic features, improve the accuracy of spatial domain identification, and ultimately improve model performance.

Download:

Table 1. The ARI and NMI of four variant models and SpaMWGDA.

https://doi.org/10.1371/journal.pcbi.1013667.t001

Download:

Fig 2. (A) Ablation experiment results.

(B) The performance of contrastive learning and weighted fusion attention on SpaMWGDA. (C) The results of SpaMWGDA and seven competing methods on the noisy DLPFC dataset.

https://doi.org/10.1371/journal.pcbi.1013667.g002

The performance of contrastive learning and weighted fusion attention on model

In order to further evaluate the impact of contrastive learning and weighted fusion attention on the model, we constructed four variants of SpaMWGDA: (i) CL- > PCA; (ii) CL- > AE [42], (iii) Attention- > Cross Attention; and (iv) Attention- > Soft Attention [43]. Principal component analysis (PCA) and autoencoder (AE) are widely used feature learning methods. CL- > PCA indicates using PCA instead of contrastive learning. and CL- > AE replaces the contrastive learning with autoencoder. The weighted fusion attention mechanism is replaced by the cross-attention mechanism and the soft attention mechanism respectively to evaluate the impact of the weighted fusion attention mechanism on SpaMWGDA.

We compared the performance of these four variants with SpaMWGDA on the DLPFC and human breast cancer datasets. As shown in the Fig 2B, SpaMWGDA outperforms these variants in terms of ARI and NMI. Contrastive learning enables SpaMWGDA to learn critical and discriminative nonlinear features. The adaptive property of the weighted fusion attention mechanism is particularly beneficial in integrating information from different views, helping to extract features that are critical for spatial clustering, thereby improving the performance of the model.

PCA is a linear dimensionality reduction method that cannot capture nonlinear relationships. Autoencoder is effective in learning latent representations, but may focus too much on global features during reconstruction and ignore subtle differences between cell types that are critical for spatial clustering. Cross attention mechanism uses multiple attentions, which increases computational complexity and may affect model performance. Soft attention mechanism assigns weights to input features to focus on important information. However, it may over-rely on global features and ignore local details.

The impact of noise on model robustness and the scalability of SpaMWGDA

To assess the impact of noise on the performance of SpaMWGDA, we constructed three kinds of random gaussian noise datasets based on the DLPFC dataset: (i) Gaussian Noise (10%); (ii) Gaussian Noise (20%); (iii) Gaussian Noise (30%). Gaussian Noise (10%), Gaussian Noise (20%), and Gaussian Noise (30%) respectively represent adding 10%, 20%, and 30% Gaussian noise to the original gene expression data, respectively. We compared the performance of SpaMWGDA with state-of-the- art methods (Scanpy, stlearn, SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN) on three noisy datasets. As shown in Fig 2C, although the performance of SpaMWGDA decreases due to the increase of noise, it still outperforms the competing methods. The results of SpaMWGDA and seven competing methods on the Gaussian Noise 10% DLPFC dataset were listed in Table 2, and the results of the remaining noise dataset were listed in S1 Table. These results showed that SpaMWGDA is robust to noise and its performance remains excellent even as noise increases.

Download:

Table 2. The ARI and NMI of SpaMWGDA and seven competing methods on Gaussian Noise 10% DLPFC dataset.

https://doi.org/10.1371/journal.pcbi.1013667.t002

We calculated the running time of SpaMWGDA under different data scales and spot counts. The results in Table 3 show the scalability of SpaMWGDA on large-scale datasets.

Download:

Table 3. Running time of SpaMWGDA under different data scales and spot counts.

https://doi.org/10.1371/journal.pcbi.1013667.t003

Performance comparison of methods for identifying spatial domain

To evaluate the performance of SpaMWGDA in spatial domain identification, we conducted a comparison with seven state-of-the-art methods: Scanpy, stlearn, SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN, using the 10x Visium DLPFC dataset, which contains 12 slices [44]. These seven methods include two shallow learning methods (Scanpy, stlearn) and five state-of-the-art DL-based methods (SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN).

As shown in Fig 3A, SpaMWGDA achieves the best clustering across these slices, with the highest mean ARI of 0.62 and the highest mean NMI of 0.68, surpassing Spatial- MGCN, SEDR, STAGATE and GraphST by 0.06 and 0.02, 0.1 and 0.01, 0.13 and 0.05, and 0.1 and 0.04, respectively. The mean ARI and mean NMI of other methods (Scanpy, stlearn and SpaGCN) are below 0.45 and 0.6, respectively. Detailed results for all 12 slices in the DLPFC dataset were shown in S1 Fig. It is important to note that while Scanpy exhibits minimal variation in ARI between slices, its median NMI is only 0.35. In contrast, the performance of Spatial-MGCN and STAGATE shows instability, with significant ARI variation across slices. The mean values of three histology-informed spatial clustering methods (stlearn and SpaGCN) are all below 0.45, which indicates that histological images may introduce noise that adversely affects clustering performance.

Download:

Fig 3. (A) The performance comparison of spatial domain identification of SpaMWGDA and seven state-of-the-art methods (Scanpy, stlearn, SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN) on DLPFC dataset.

(B) Comparison of ARI and NMI of SpaMWGDA with seven state-of-the-art methods (Scanpy, stlearn, SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN) on DLPFC slice 151508. (C) The performance comparison of spatial domain identification of SpaMWGDA and seven state-of-the-art methods (Scanpy, stlearn, SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN) on DLPFC slice 151508. (D) The performance comparison of SpaMWGDA and seven state-of-the-art methods on Human Pancreas dataset and FFPE Human Renal Cell Carcinoma dataset.

https://doi.org/10.1371/journal.pcbi.1013667.g003

To further validate the effectiveness of SpaMWGDA, we show the identification results of DLPFC slice 151508. As shown in Fig 3B, compared to competing methods, SpaMWGDA achieves a better fit with the ground truth, with ARI and NMI of 0.66 and 0.72 respectively, while the ARI and NMI of competing methods are all below 0.6 and 0.7 respectively. As shown in Fig 3C, Scanpy shows layer mixing, SEDR shows unclear cluster boundaries, Spatial-MGCN only achieves an ARI of 0.48, GraphST and STAGATE confuse spots from layer 3 and WM (White Matter), and both stlearn and SpaGCN fail to distinguish most DLPFC layers. In UMAP visualization analysis, the clustering divisions of Scanpy and SpaGCN are not clear. Although STAGATE, SEDR, GraphST, and stlearn achieve good clustering results, the hierarchical order is unclear. The embeddings of both SpaMWGDA and Spatial-MGCN clearly show the cortical development trajectories, and the UMAP map of SpaMWGDA not only reveals the different clustering divisions in each domain, but also highlights the clear sequential relationships between layers. These results indicate the excellent clustering performance of SpaMWGDA and highlight the advantage of SpaMWGDA in identifying spatial domains. In addition, we also compared the performance of SpaMWGDA and seven competing methods on the Human Pancreas dataset from 10x Visium HD and FFPE Human Renal Cell Carcinoma dataset from 10x Xenium. The results show that SpaMWGDA is better than other methods.

SpaMWGDA reveals the laminar structure of the olfactory bulb from Stereo-seq data and infers its biological functions

Stereo-seq technology provides highly detailed spatial and molecular data for spatial transcriptomics studies of the mouse olfactory bulb (MOB) [45], enabling the evaluation of models’ ability to explore biological functions. For comparative studies, we selected Scanpy, GraphST, SEDR, STAGATE, Spatial-MGCN, stlearn, and SpaGCN as benchmark methods to assess the spatial domain identification capability of SpaMWGDA.

As shown in Fig 4A, Scanpy does not clearly delineate the partitions, and GraphST and Spatial-MGCN confuse different regions. SEDR, stlearn, STAGATE, SpaGCN, and SpaMWGDA all accurately capture the hierarchical structure of the MOB. SEDR divides the region into seven layers. However, it fails to link the identified regions to any specific areas in the histological images and fails to detect the narrow glomerular layer. STAGATE, stlearn and SpaGCN segment the MOB into seven regions, most of which are accurately identified, but there is still considerable overlap between clusters. SpaMWGDA exploits a multi-view weighted fusion GCN encoder to deeply explore the intrinsic relationship between gene expression and spatial information, thereby obtaining more informative and discriminative latent representations.

Download:

Fig 4. (A) Comparison of the results of identifying the laminar structure of the olfactory bulb using SpaMWGDA and seven state-of-the-art methods (Scanpy, stlearn, SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN).

(B) Analysis results of differentially expressed genes (DEGs) between the coronal tissue layers of the mouse olfactory bulb identified by SpaMWGDA.

https://doi.org/10.1371/journal.pcbi.1013667.g004

We then performed a detailed analysis of the differentially expressed genes (DEGs) between the coronal tissue layers of the mouse olfactory bulb identified by SpaMWGDA. As shown in Fig 4B, the results revealed that the identified clusters were strongly correlated with the expression of well-known marker genes such as Apod and Cck [46]. The olfactory nerve layer (ONL) corresponds to the expression domain of the Apod gene, which is involved in encoding components of high-density lipoproteins, suggesting that the ONL may be linked to lipoprotein metabolism. The Cck gene encodes cholecystokinin (CCK), which modulates neuronal excitability and synaptic transmission and plays a key role in the processing and feedback regulation of olfactory signals [47,48]. CCK has an important influence on the glomerular layer (GL), which is responsible for the initial integration of olfactory information. CCK enhances the transmission and perception of olfactory signals by regulating the activity of neurons in this region. These findings not only demonstrate the ability of SpaMWGDA to handle tissue structures at different resolutions, but also indicate that SpaMWGDA can infer potential tissue functions from identified spatial domains, offering valuable insights for further exploration of unknown tissue structures and functions.

SpaMWGDA delivers detailed insights into tumor heterogeneity in human breast cancer

In this section, we analysed the human breast cancer dataset from 10x Visium platform, which contains 20 domains and 4 main tissue types: DCIS/LCIS, healthy tissues, IDC, and hypomalignant tumor margins. As shown in Fig 5A, SpaMWGDA achieves the highest ARI and NMI among all methods. Compared to competing methods, SpaMWGDA can consistently identify domains that align with manual annotations and accurately detect domains such as Healthy_1 and IDC_5, with smoother boundaries for each domain. In contrast, the boundaries of regions identified by Scanpy are highly irregular and contain a large amount of noise. Although Spatial-MGCN, SEDR, SpaGCN, stlearn, GraphST, and STAGATE identify more domains than Scanpy, these domains still have rough boundaries and outliers.

Download:

Fig 5. (A) The performance comparison of spatial domain identification of SpaMWGDA and seven state-of-the-art methods (Scanpy, stlearn, SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN) on human breast cancer dataset.

(B) Results of differentially expressed genes analysis among different clusters identified by SpaMWGDA. (C) Results of differentially expressed genes analysis between cluster 3 (IDC) and cluster 13 (DCIS/LCIS).

https://doi.org/10.1371/journal.pcbi.1013667.g005

We further validated the performance of SpaMWGDA in detecting cancer tissue heterogeneity using the human breast cancer dataset. We compared the expression of the top differentially expressed genes (DEGs) in clusters 4 (healthy), 13 (DCIS/LCIS), 3 (IDC), and 15 (tumor margins), and found significant heterogeneity among these clusters (Fig 5B). We also performed differential expression analysis (|logFoldChange| ≥ 2 and P-value < 0.05) between cluster 3 (IDC) and cluster 13 (DCIS/LCIS) to explore gene expression differences between IDC and DCIS/LCIS, and identified about 100 significant DEGs between these two clusters (Fig 5C). As shown in Fig 5C, among the significant differentially expressed genes (DEGs) identified, CRISP3 [49] plays a pivotal role in breast cancer by regulating tumor cell migration, invasion, and immune modulation. It contributes to the tumor microenvironment through inflammatory pathways and immune evasion mechanisms. Additionally, MGP (Matrix Gla Protein), a calcium-binding protein, is integral to extracellular matrix remodeling and mineralization, and its elevated expression is associated with tumor aggressiveness and adverse prognosis. According to the studies [27,50], we found that the upregulation of CRISP3, S100A13, and S100A16 in domain 3 suggests that this region possesses tumor invasiveness, metastatic potential, and an active inflammatory environment, thereby promoting tumor progression. In contrast, domain 13, with high expression of MGP, CPB1, and S100G, points to a region associated with calcification, ECM remodeling, and tumor progression, which may contribute to treatment resistance and poor clinical outcomes.

These findings highlight the critical roles of CRISP3 and MGP in breast cancer pathogenesis, positioning them as valuable prognostic biomarkers and potential therapeutic targets. The distinct molecular profiles observed in domains 3 and 13 underscore the regional heterogeneity of the tumor microenvironment and the need for region-specific therapeutic strategies. In summary, SpaMWGDA offers a more refined approach to dissect cancer tissue heterogeneity and enhances our understanding of ST data.

Discussion

In this paper, we proposed a novel deep learning model, SpaMWGDA, combined a multi-view weighted fused graph convolutional network and data augmentation for spatial domain identification of spatial transcriptome (ST) data. By leveraging multi-view weighted fusion and data enhancement, SpaMWGDA effectively learns the relationships between spatial information and gene expression. Experimental results demonstrated that SpaMWGDA outperforms competing methods in terms of clustering accuracy and identification of biologically relevant domains, and exhibits superior performance in spatial domain identification. Additionally, SpaMWGDA successfully evaluated ST data from various platforms with different spatial resolutions. Experiment results highlighted the importance of integrating different similarity metrics for exploring spatial information, as well as reconstructing gene expression through data enhancement and weighted fusion attention mechanism.

However, SpaMWGDA still has limitations. For instance, in experiments involving model noise injection, we can employ various noise addition methods such as signal-to-noise ratio (SNR) to validate the model's robustness from different perspectives. When considering the use of different similarity measures to construct spatial neighborhood graphs, we can experiment with alternative metrics, such as the cosine similarity measure based on vectors. Utilizing spatial multi-omics data [51] will enable the model to more accurately resolve the spatial domain, which is vital for inferring the biological functions of complex tissues in organisms. Therefore, spatial domain identification methods that effectively use spatial multi-omics data should be proposed in future work.

Materials and methods

Data preparation

To validate the performance of the model, we used five datasets from multiple platforms. As shown in Table 4, the first dataset is from 10 x Visium, with each slice containing five to seven manually annotated regions of the human dorsolateral prefrontal cortex (DLPFC) [44]. The second dataset is the ST dataset of human breast cancer from 10 x Visium, which contains 20 domains and four major morphological types: DCIS/LCIS (Ductal Carcinoma In Situ/Lobular Carcinoma In Situ), IDC (invasive ductal carcinoma), healthy tissues, and hypomalignant tumor margins [52]. The third dataset is a mouse olfactory bulb dataset obtained from Stereo-seq [53], which annotates the RMS (rostral migratory stream), GCL (granular cell layer), IPL (internal plexiform layer), MCL (mitral cell layer), EPL (external plexiform layer), and ONL (olfactory nerve layer). The fourth dataset is a Human Pancreas dataset from 10x Visium HD [54]. It offers spatial gene expression data from human pancreas tissue, enabling analysis of gene activity across different pancreatic regions. The fifth dataset is a FFPE Human Renal Cell Carcinoma dataset from 10x Xenium [55]. It provides high-resolution spatial transcriptomic data from formalin-fixed, paraffin-embedded (FFPE) human renal cell carcinoma tissue. It allows for the exploration of gene expression patterns within the tumor microenvironment, aiding in the study of tumor heterogeneity and cellular interactions in cancer.

Download:

Table 4. Overview of five ST datasets used in this study.

https://doi.org/10.1371/journal.pcbi.1013667.t004

SpaMWGDA takes gene expression and spatial location as input. To reduce technical noise, spots outside the main tissue areas were first removed. Subsequently, SCANPY [11] was used to filter genes with low-expression and low-variance, eliminate genes that were not expressed in less than 100 cells, and select top 3,000 highly variable genes (HVGs). Finally, the expression data of HVGs were normalized to the total expression level of each cell to 10,000. The formula can be defined as follows:

(1)

where represents the j-th gene expression value at the i-th point.

Experiment settings

For SpaMWGDA, we employed the learning rate of 0.001, the weight decay of 5e-4, and utilized the ADAM optimization algorithm. Additionally, to optimize the nearest neighbor search, we adopted the K-D-tree algorithm and tested various k values ranging from 1 to 20. Performance scores were calculated for each k value, enabling dynamic adjustment of k to enhance the quality of the neighborhood matrix. The radius r is set based on the data resolution. All experiments are repeated 10 times, and spatial domain recognition performance is evaluated using ARI and NMI. The average of 10 runs is taken to obtain a reasonable performance assessment.

The SpaMWGDA framework

SpaMWGDA extracts both gene expression and spatial location from ST data, leveraging a deep neural network architecture to identify spatial domains. As depicted in the Fig 1, SpaMWGDA consists of five key modules: (i) spatial neighborhood network construction module; (ii) gene expression enhancement module; (iii) multi-view weighted fusion GCN encoder; (iv) ZINB decoder; and (v) spatial regularization constraint.

In the spatial neighborhood network construction module, KNN and Radius similarity measures are used to create a normalized adjacency matrix, ensuring the model adapts to ST data with varying resolutions and fully utilizes neighborhood information. The gene expression enhancement module enhances gene expression data, yielding improved feature representations. A graph-based contrastive encoder is employed to learn effective representations of gene expression from original and enhanced features. This enhancement allows adaptive integration of key information when reconstructing gene expression. The multi-view weighted fusion GCN encoder extracts embeddings from gene expression data, spatial location data, and their combinations. These embeddings are adaptively fused via a view-level attention mechanism, which helps model minimize noise during clustering. Afterwards, the ZINB decoder reconstructs the feature matrix to capture the global information of the spatial expression spectrum. Finally, a spatial regularization constraint is incorporated into the learning process of the model to ensure that spatial neighborhood information is preserved and the inherent spatial structure of the data is maintained.

Spatial neighborhood networks construction

K-nearest neighbor (KNN) can effectively capture the local structure of data, while Radius helps to uncover the global structure of the data [56]. We construct two undirected neighborhood graphs using KNN and Radius respectively. Here V represent the set of points, and represent the set of edges in n-th graph. The adjacency matrix is defined as , N is the number of points. if points i and j are each other’s K-nearest neighbors, and otherwise 0. if points i and j are within a radius, and otherwise 0. We employ K-D tree algorithm to determine the best k value and set the radius r according to the data resolution. The normalized adjacency matrix is calculated as follows:

(2)

where , , n = 1,2, which is an adjacency matrix with additional self-connections, is a unit matrix.

Gene expression enhancement

1) Feature graph construction: SpaMWGDA calculates gene expression similarity using cosine distance. is graph of the gene expression matrix X, is the feature adjacency matrix, if point j is the nearest neighbor of point i, and otherwise 0.
2) Data augment: We perform contrastive learning [57] on gene expression to learn latent representations from both original and augmented data. SpaMWGDA takes as input and extracts spot features without changing graph structure. For contrastive learning, we generate corrupted neighborhood graph on the , which scrubs gene expression data while maintaining the original neighborhood graph topology.
3) Graph-embedded contrastive encoder: We develop a graph-embedded contrastive encoder to learn key feature representations. The model consists of three modules: graph convolutional encoder, graph deconvolutional decoder, and deep contrastive self-encoder. The process is summarized as follows:
- Graph convolutional encoder: We utilize a GCN as the encoder to extract relevant feature representations from both original and augmented data.

(3)

(4)

The normalized Laplacian matrix is denoted as , and D represents the degree matrix. The activation functions and for the l-th GCN encoder layer are applied to X and , respectively. The weight matrices of the encoder for X and are denoted as and , respectively. The bias terms for the VAE corresponding to X and are represented as and . In this model, we use the convolutional operator of GCN as the convolution operation. For convenience, we refer to the gene expression matrices X and as and , respectively.

Graph deconvolutional decoder: Although graph convolution is adept at capturing local feature, the resulting smoothing can adversely affect the quality of data reconstruction and compromise the learnable global features. We adopt a graph deconvolution network (GDN) as a decoder to alleviate the negative effects of graph convolution and learn feature representation more effectively.

(5)

(6)

where and are the activation functions of the k-th GDN decoder layer for X and , respectively. and denote the weight matrix of decoder for X and , respectively. and correspond to the bias term of VAE for X and , respectively. And the deconvolutional operator is the inverse function of .

Deep contrastive self-encoder: To effectively balance local and global information, we propose a deep contrastive learning strategy as a constraint for feature learning. This approach enhances SpaMWGDA’s ability to represent complex information. The loss of contrastive learning can be expressed as:

(7)

where and represent the feature representation on original graph and corrupted graph respectively, and controls the distance difference between positive and negative samples.

Multi-view weighted fusion GCN encoder

GCN is a powerful neural network that processes graph data by aggregating information from neighboring nodes, capturing dependencies, and generating embeddings [58]. We utilize a multi-view weighted fusion GCN encoder to extract important information [33]. This encoder consists of four main modules: spatial convolution, feature convolution, co-convolution, and weighted fusion attention mechanism.

1) Spatial convolution: A convolution operation is performed on to aggregate spatial information of the neighbors and multi-layer spatial convolutional network applies hierarchical propagation rules:

(8)

where , are weight matrices of l-th layer in spatial convolution, and initially . and are diagonal matrices of and respectively. , and the joint embedding is .

2) Feature convolution: To learn more comprehensive gene expression information from latent representations obtained by contrast learning, feature convolution is performed on and :

(9)

where is weight matrix of l-th layer, initially .

3) Co-convolution: We adopt a parameter sharing strategy to extract co-embedding of gene expression and spatial distribution:

(10)

where is weight matrix of l-th layer, and initially . is joint embedding. The normalization constraint is defined as follows:

(11)

Attention mechanism

In order to adaptively learn the importance of each latent embedding, we introduce a weighted attention mechanism. The final attention weight is obtained by calculating weighted sum of attention weights of each view.

(12)

where is the feature representation of each view, , , , , are weights and biases.

The weighted fusion module performs weighted fusion on the features of different views according to the weights to obtain an integrated feature representation. The weighted fusion is computed as follows:

(13)

The weighted merged features are accumulated into the final integrated feature representation.

ZINB decoder

The ZINB decoder is widely used to reconstruct gene expression matrix to capture the global information [59]. The latent low-dimensional representation Z is used as the input. Given gene expression data X of the ST data, are parameter matrices of zero-inflation parameter, mean, decoder output discretization, and bias vector, respectively. The decoder outputs the estimated values of three parameters.

(14)

Spatial regularization constraint

Spatially neighboring points should be close to each other, while spatially non-neighboring points should be far apart in the latent space [60–62]. Similarity information and spatial neighborhood information are used to compute the loss of spatial regularization constraint:

(15)

where represents the cosine similarity between embedding vector of i-th point and embedding vector of j-th point in learning-based potential representation H. represents the set of all neighboring nodes of point i, N represents the set of all non-neighboring nodes of point i, and represents the set of spatial neighbors of point i.

SpaMWGDA learns more informative and discriminative latent representations by maximizing the similarity of neighboring point pairs and minimizing the similarity of non-neighboring point pairs.

Evaluation strategies

We compared SpaMWGDA with Scanpy [11], stlearn [25], SpaGCN [26], SEDR [28], STAGATE [29], GraphST [30], and Spatial-MGCN [32] to test the performance of SpaMWGDA. These seven methods include two shallow learning algorithms (Scanpy, stlearn) and five state-of-the-art DL-based methods (SpaGCN, SEDR, STAGATE, GraphST, and Spatial-MGCN). Scanpy (2018) is a Python library that provides data processing, dimensionality reduction, and clustering tools that can be used to identify spatial domain. stlearn (2020) integrated gene expression, spatial data, and morphological features to efficiently identify spatial domains. SpaGCN (2021) combined gene expression and spatial data using GCN to identify spatial domains with consistent gene expression patterns. SEDR (2024) used a deep autoencoder to generate unsupervised spatial embeddings by learning gene representations and embedding spatial information with a variogram autoencoder. STAGATE (2022) fused spatial and gene expression information to identify spatial domains via an adaptive graph attention autoencoder. GraphST (2023) achieved spatial domain identification by integrating graph neural networks with self-supervised contrastive learning. Spatial-MGCN (2023) utilized a multi-view GCN encoder to identify spatial domains.

All benchmark methods were executed using their default parameters, with an equal number of clusters applied during the clustering process. Two widely used metrics (ARI [63] and NMI [64]) were used to evaluate the performance of each method.

Evaluation metrics

Adjusted Rand Index (ARI) is a measure of clustering similarity that adjusts the Rand Index (RI) for chance. It quantifies how well the predicted clusters align with the ground truth labels while accounting for random cluster assignments. The ARI is calculated as follows:

(16)

where TP is the number of true positives, TN is the number of true negatives, FP is the number of false positives, and FN is the number of false negatives.

Normalized Mutual Information (NMI) is a normalized version of Mutual Information (MI) that corrects for biases introduced by differing numbers of labels. It ensures that values range between 0 and 1, where 1 indicates perfect agreement between predicted clusters and ground truth labels, and 0 indicates no correlation. NMI is calculated as follows:

(17)

Let W represent the set of clustering results, and V represent the set of ground truth. Then and denote the probabilities of the ground truth w and the clustering label v, while represents the probabilities of w and v occurring simultaneously. and denote the entropy of the clustering results and the true labels, respectively, reflecting the uncertainty in each label set.

Supporting information

S1 Table. Experimental results of SpaMWGDA and seven competing methods on the noisy DLPFC dataset.

https://doi.org/10.1371/journal.pcbi.1013667.s001

(DOCX)

S1 Fig. Detailed results for all 12 slices in the DLPFC dataset.

https://doi.org/10.1371/journal.pcbi.1013667.s002

(TIF)

References

1. Huang Z, Luo S, Zhang Z, Wang Z, Zhou T, Zhang J. A Unified Probabilistic Framework for Modeling and Inferring Spatial Transcriptomic Data. CBIO. 2024;19(3):222–34.
- View Article
- Google Scholar
2. Wang H, Ding Y, Tang J, Guo F. Identification of membrane protein types via multivariate information fusion with Hilbert–Schmidt Independence Criterion. Neurocomputing. 2020;383:257–69.
- View Article
- Google Scholar
3. Cui Y, Wei L, Wang R, Ye X, Sakurai T. Identification of Spatial Domains, Spatially Variable Genes, and Genetic Association Studies of Alzheimer Disease with an Autoencoder-based Fuzzy Clustering Algorithm. CBIO. 2024;19(8):765–76.
- View Article
- Google Scholar
4. Rodriques SG, Stickels RR, Goeva A, Martin CA, Murray E, Vanderburg CR, et al. Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution. Science. 2019;363(6434):1463–7. pmid:30923225
- View Article
- PubMed/NCBI
- Google Scholar
5. Gulati GS, D’Silva JP, Liu Y, Wang L, Newman AM. Profiling cell identity and tissue architecture with single-cell and spatial transcriptomics. Nat Rev Mol Cell Biol. 2025;26(1):11–31. pmid:39169166
- View Article
- PubMed/NCBI
- Google Scholar
6. Wei L, He W, Malik A, Su R, Cui L, Manavalan B. Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework. Brief Bioinform. 2021;22(4):bbaa275. pmid:33152766
- View Article
- PubMed/NCBI
- Google Scholar
7. Wang Z, Ding H, Zou Q. Identifying cell types to interpret scRNA-seq data: how, why and more possibilities. Brief Funct Genomics. 2020;19(4):286–91. pmid:32232401
- View Article
- PubMed/NCBI
- Google Scholar
8. Qi R, Wu J, Guo F, Xu L, Zou Q. A spectral clustering with self-weighted multiple kernel learning method for single-cell RNA-seq data. Brief Bioinform. 2021;22(4):bbaa216. pmid:33003206
- View Article
- PubMed/NCBI
- Google Scholar
9. Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre EJ. Fast unfolding of communities in large networks. J Stat Mech. 2008;2008(10):P10008.
- View Article
- Google Scholar
10. Traag VA, Waltman L, Van Eck NJ. From Louvain to Leiden: guaranteeing well-connected communities. Scientometrics. 2019;9(1):1–12.
- View Article
- Google Scholar
11. Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19(1):15. pmid:29409532
- View Article
- PubMed/NCBI
- Google Scholar
12. Satija R, Farrell JA, Gennert D, Schier AF, Regev AJ. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015;33(5):495–502.
- View Article
- Google Scholar
13. Dai C, Jiang Y, Yin C, Su R, Zeng X, Zou Q, et al. scIMC: a platform for benchmarking comparison and visualization analysis of scRNA-seq data imputation methods. Nucleic Acids Res. 2022;50(9):4877–99. pmid:35524568
- View Article
- PubMed/NCBI
- Google Scholar
14. Dries R, Zhu Q, Dong R, Eng CHL, Li H, Liu K. Giotto: a toolbox for integrative analysis and visualization of spatial expression data. 2021;22:1–31.
15. Zhao E, Stone MR, Ren X, Guenthoer J, Smythe KS, Pulliam T, et al. Spatial transcriptomics at subspot resolution with BayesSpace. Nat Biotechnol. 2021;39(11):1375–84. pmid:34083791
- View Article
- PubMed/NCBI
- Google Scholar
16. Xu J, Lu C, Jin S, Meng Y, Fu X, Zeng X, et al. Deep learning-based cell-specific gene regulatory networks inferred from single-cell multiome data. Nucleic Acids Res. 2025;53(5):gkaf138. pmid:40037709
- View Article
- PubMed/NCBI
- Google Scholar
17. Li HL, Pang YH, Liu BJN. BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models. Nucleic Acids Research. 2021;49(22):e129-e.
- View Article
- Google Scholar
18. Liu J, Ma L, Ju F, Zhao C, Yu L. SpaCcLink: exploring downstream signaling regulations with graph attention network for systematic inference of spatial cell–cell communication. J Biol B. 2025;23(1):44.
- View Article
- Google Scholar
19. Huang Z, Guo X, Qin J, Gao L, Ju F, Zhao C, et al. Accurate RNA velocity estimation based on multibatch network reveals complex lineage in batch scRNA-seq data. BMC Biol. 2024;22(1):290. pmid:39696422
- View Article
- PubMed/NCBI
- Google Scholar
20. Zhao M, Li J, Liu X, Ma K, Tang J, Guo F. A gene regulatory network-aware graph learning method for cell identity annotation in single-cell RNA-seq data. Genome Res. 2024;34(7):1036–51. pmid:39134412
- View Article
- PubMed/NCBI
- Google Scholar
21. Zhang H-Q, Arif M, Thafar MA, Albaradei S, Cai P, Zhang Y, et al. PMPred-AE: a computational model for the detection and interpretation of pathological myopia based on artificial intelligence. Front Med (Lausanne). 2025;12:1529335. pmid:40182849
- View Article
- PubMed/NCBI
- Google Scholar
22. Wang J, Zou Q, Lin C. A comparison of deep learning-based pre-processing and clustering approaches for single-cell RNA sequencing data. Brief Bioinform. 2022;23(1):bbab345. pmid:34472590
- View Article
- PubMed/NCBI
- Google Scholar
23. Wang Y, Zhai Y, Ding Y, Zou Q. SBSM-Pro: support bio-sequence machine for proteins. Sci China Inf Sci. 2024;67(11).
- View Article
- Google Scholar
24. Liu T, Fang ZY, Li X, Zhang LN, Cao DS, Yin MZ. Graph deep learning enabled spatial domains identification for spatial transcriptomics. J Biol Inorg Chem. 2023;24(3):bbad146.
- View Article
- Google Scholar
25. Pham D, Tan X, Xu J, Grice LF, Lam PY, Raghubar A. stLearn: integrating spatial location, tissue morphology and gene expression to find cell types, cell-cell interactions and spatial trajectories within undissociated tissues. 2020;2020:31.125658.
26. Hu J, Li X, Coleman K, Schroeder A, Ma N, Irwin DJ, et al. SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Nat Methods. 2021;18(11):1342–51. pmid:34711970
- View Article
- PubMed/NCBI
- Google Scholar
27. Xu C, Jin X, Wei S, Wang P, Luo M, Xu Z, et al. DeepST: identifying spatial domains in spatial transcriptomics by deep learning. Nucleic Acids Res. 2022;50(22):e131. pmid:36250636
- View Article
- PubMed/NCBI
- Google Scholar
28. Xu H, Fu H, Long Y, Ang KS, Sethi R, Chong K, et al. Unsupervised spatially embedded deep representation of spatial transcriptomics. Genome Med. 2024;16(1):12. pmid:38217035
- View Article
- PubMed/NCBI
- Google Scholar
29. Dong K, Zhang S. Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder. Nat Commun. 2022;13(1):1739. pmid:35365632
- View Article
- PubMed/NCBI
- Google Scholar
30. Long Y, Ang KS, Li M, Chong KLK, Sethi R, Zhong C, et al. Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with GraphST. Nat Commun. 2023;14(1):1155. pmid:36859400
- View Article
- PubMed/NCBI
- Google Scholar
31. Pang C, Qiao J, Zeng X, Zou Q, Wei L. Deep Generative Models in De Novo Drug Molecule Generation. J Chem Inf Model. 2024;64(7):2174–94. pmid:37934070
- View Article
- PubMed/NCBI
- Google Scholar
32. Wang B, Luo J, Liu Y, Shi W, Xiong Z, Shen C, et al. Spatial-MGCN: a novel multi-view graph convolutional network for identifying spatial domains with attention mechanism. Brief Bioinform. 2023;24(5):bbad262. pmid:37466210
- View Article
- PubMed/NCBI
- Google Scholar
33. Shi X, Zhu J, Long Y, Liang C. Identifying spatial domains of spatially resolved transcriptomics via multi-view graph convolutional networks. Brief Bioinform. 2023;24(5):bbad278. pmid:37544658
- View Article
- PubMed/NCBI
- Google Scholar
34. Guo X, Huang Z, Ju F, Zhao C, Yu L. Highly Accurate Estimation of Cell Type Abundance in Bulk Tissues Based on Single-Cell Reference and Domain Adaptive Matching. Adv Sci (Weinh). 2024;11(7):e2306329. pmid:38072669
- View Article
- PubMed/NCBI
- Google Scholar
35. Qi R, Ma A, Ma Q, Zou Q. Clustering and classification methods for single-cell RNA-sequencing data. Brief Bioinform. 2020;21(4):1196–208. pmid:31271412
- View Article
- PubMed/NCBI
- Google Scholar
36. Cover T, Hart P. Nearest neighbor pattern classification. IEEE Trans Inform Theory. 1967;13(1):21–7.
- View Article
- Google Scholar
37. Bentley JL, Stanat DF, Williams EH Jr. The complexity of finding fixed-radius near neighbors. Information Processing Letters. 1977;6(6):209–12.
- View Article
- Google Scholar
38. Zeng Y, Yin R, Luo M, Chen J, Pan Z, Lu Y, et al. Identifying spatial domain by adapting transcriptomics with histology through contrastive learning. Brief Bioinform. 2023;24(2):bbad048. pmid:36781228
- View Article
- PubMed/NCBI
- Google Scholar
39. Qian Y, Zou Q, Zhao M, Liu Y, Guo F, Ding Y. scRNMF: An imputation method for single-cell RNA-seq data by robust and non-negative matrix factorization. PLoS Comput Biol. 2024;20(8):e1012339. pmid:39116191
- View Article
- PubMed/NCBI
- Google Scholar
40. Yu Z, Lu Y, Wang Y, Tang F, Wong K-C, Li X. ZINB-Based Graph Embedding Autoencoder for Single-Cell RNA-Seq Interpretations. AAAI. 2022;36(4):4671–9.
- View Article
- Google Scholar
41. Yuan H, Liu M, Qiu Y, Ching W-K, Zou Q. PLNMFG: Pseudo-label guided non-negative matrix factorization model with graph constraint for single-cell multi-omics data clustering. PLoS Comput Biol. 2025;21(8):e1013375. pmid:40825083
- View Article
- PubMed/NCBI
- Google Scholar
42. Li X, Huang W, Xu X, Zhang HY, Shi Q. Deciphering tissue heterogeneity from spatially resolved transcriptomics by the autoencoder-assisted graph convolutional neural network. Frontiers in Genetics. 2023;14:1202409.
- View Article
- Google Scholar
43. Si Z, Li H, Shang W, Zhao Y, Kong L, Long C, et al. SpaNCMG: improving spatial domains identification of spatial transcriptomics using neighborhood-complementary mixed-view graph convolutional network. Brief Bioinform. 2024;25(4):bbae259. pmid:38811360
- View Article
- PubMed/NCBI
- Google Scholar
44. Maynard KR, Collado-Torres L, Weber LM, Uytingco C, Barry BK, Williams SR, et al. Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex. Nat Neurosci. 2021;24(3):425–36. pmid:33558695
- View Article
- PubMed/NCBI
- Google Scholar
45. Jiang J, Wang N, Chen P, Zheng C, Wang B. Prediction of Protein Hotspots from Whole Protein Sequences by a Random Projection Ensemble System. Int J Mol Sci. 2017;18(7):1543. pmid:28718782
- View Article
- PubMed/NCBI
- Google Scholar
46. Kadowaki K, Sugimoto K, Yamaguchi F, Song T, Watanabe Y, Singh K, et al. Phosphohippolin expression in the rat central nervous system. Brain Res Mol Brain Res. 2004;125(1–2):105–12. pmid:15193427
- View Article
- PubMed/NCBI
- Google Scholar
47. Liu X, Liu S. Cholecystokinin selectively activates short axon cells to enhance inhibition of olfactory bulb output neurons. J Physiol. 2018;596(11):2185–207. pmid:29572837
- View Article
- PubMed/NCBI
- Google Scholar
48. Sun X, Liu X, Starr ER, Liu S. CCKergic Tufted Cells Differentially Drive Two Anatomically Segregated Inhibitory Circuits in the Mouse Olfactory Bulb. J Neurosci. 2020;40(32):6189–206. pmid:32605937
- View Article
- PubMed/NCBI
- Google Scholar
49. Wang Y, Sheng N, Xie Y, Chen S, Lu J, Zhang Z, et al. Low expression of CRISP3 predicts a favorable prognosis in patients with mammary carcinoma. J Cell Physiol. 2019;234(8):13629–38. pmid:30609035
- View Article
- PubMed/NCBI
- Google Scholar
50. Liang X, Liu P, Xue L, Chen B, Liu W, Shi W, et al. A multi-modality and multi-granularity collaborative learning framework for identifying spatial domains and spatially variable genes. Bioinformatics. 2024;40(10):btae607. pmid:39418177
- View Article
- PubMed/NCBI
- Google Scholar
51. Long Y, Ang KS, Sethi R, Liao S, Heng Y, van Olst L, et al. Deciphering spatial domains from spatial multi-omics with SpatialGlue. Nat Methods. 2024;21(9):1658–67. pmid:38907114
- View Article
- PubMed/NCBI
- Google Scholar
52. Buache E, Etique N, Alpy F, Stoll I, Muckensturm M, Reina-San-Martin B, et al. Deficiency in trefoil factor 1 (TFF1) increases tumorigenicity of human breast cancer cells and mammary tumor development in TFF1-knockout mice. Oncogene. 2011;30(29):3261–73. pmid:21358676
- View Article
- PubMed/NCBI
- Google Scholar
53. Chen A, Liao S, Cheng M, Ma K, Wu L, Lai Y, et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell. 2022;185(10):1777-1792.e21. pmid:35512705
- View Article
- PubMed/NCBI
- Google Scholar
54. Jennings RE, Berry AA, Strutt JP, Gerrard DT, Hanley NA. Human pancreas development. Development. 2015;142(18):3126–37. pmid:26395141
- View Article
- PubMed/NCBI
- Google Scholar
55. Eikrem OS, Strauss P, Beisland C, Scherer A, Landolt L, Flatberg A, et al. Development and confirmation of potential gene classifiers of human clear cell renal cell carcinoma using next-generation RNA sequencing. Scand J Urol. 2016;50(6):452–62. pmid:27739342
- View Article
- PubMed/NCBI
- Google Scholar
56. Cui L, Guo G, Ng MK, Zou Q, Qiu Y. GSTRPCA: irregular tensor singular value decomposition for single-cell multi-omics data clustering. Brief Bioinform. 2024;26(1):bbae649. pmid:39680741
- View Article
- PubMed/NCBI
- Google Scholar
57. Zhu Y, Xu Y, Yu F, Liu Q, Wu S, Wang L. Graph Contrastive Learning with Adaptive Augmentation. In: Proceedings of the Web Conference 2021, 2021. 2069–80.
- View Article
- Google Scholar
58. Kipf TN, Welling MJ. Semi-supervised classification with graph convolutional networks. 2016.
59. Gan Y, Huang X, Zou G, Zhou S, Guan J. Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network. Brief Bioinform. 2022;23(2):bbac018. pmid:35172334
- View Article
- PubMed/NCBI
- Google Scholar
60. Chen T, Kornblith S, Norouzi M, Hinton G. A simple framework for contrastive learning of visual representations. In: Proceedings of Machine Learning Research, 2020.
61. Stoltzfus CR, Filipek J, Gern BH, Olin BE, Leal JM, Wu Y. CytoMAP: a spatial analysis toolbox reveals features of myeloid cell organization in lymphoid tissues. 2020;31(3).
62. Yuan L, Xu Z, Meng B, Ye L. scAMZI: attention-based deep autoencoder with zero-inflated layer for clustering scRNA-seq data. BMC Genomics. 2025;26(1):350. pmid:40197174
- View Article
- PubMed/NCBI
- Google Scholar
63. Hu Y, Xiao K, Yang H, Liu X, Zhang C, Shi Q. Spatially contrastive variational autoencoder for deciphering tissue heterogeneity from spatially resolved transcriptomics. J Biol Informa Bio. 2024;25(2):bbae016.
- View Article
- Google Scholar
64. Estévez PA, Tesmer M, Perez CA, Zurada JMJ. Normalized mutual information feature selection. JITonn. 2009;20(2):189–201.
- View Article
- Google Scholar

[ref1] 1. Huang Z, Luo S, Zhang Z, Wang Z, Zhou T, Zhang J. A Unified Probabilistic Framework for Modeling and Inferring Spatial Transcriptomic Data. CBIO. 2024;19(3):222–34.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Wang H, Ding Y, Tang J, Guo F. Identification of membrane protein types via multivariate information fusion with Hilbert–Schmidt Independence Criterion. Neurocomputing. 2020;383:257–69.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Cui Y, Wei L, Wang R, Ye X, Sakurai T. Identification of Spatial Domains, Spatially Variable Genes, and Genetic Association Studies of Alzheimer Disease with an Autoencoder-based Fuzzy Clustering Algorithm. CBIO. 2024;19(8):765–76.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Rodriques SG, Stickels RR, Goeva A, Martin CA, Murray E, Vanderburg CR, et al. Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution. Science. 2019;363(6434):1463–7. pmid:30923225
View Article
PubMed/NCBI
Google Scholar

[11] View Article

[12] PubMed/NCBI

[13] Google Scholar

[ref5] 5. Gulati GS, D’Silva JP, Liu Y, Wang L, Newman AM. Profiling cell identity and tissue architecture with single-cell and spatial transcriptomics. Nat Rev Mol Cell Biol. 2025;26(1):11–31. pmid:39169166
View Article
PubMed/NCBI
Google Scholar

[15] View Article

[16] PubMed/NCBI

[17] Google Scholar

[ref6] 6. Wei L, He W, Malik A, Su R, Cui L, Manavalan B. Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework. Brief Bioinform. 2021;22(4):bbaa275. pmid:33152766
View Article
PubMed/NCBI
Google Scholar

[19] View Article

[20] PubMed/NCBI

[21] Google Scholar

[ref7] 7. Wang Z, Ding H, Zou Q. Identifying cell types to interpret scRNA-seq data: how, why and more possibilities. Brief Funct Genomics. 2020;19(4):286–91. pmid:32232401
View Article
PubMed/NCBI
Google Scholar

[23] View Article

[24] PubMed/NCBI

[25] Google Scholar

[ref8] 8. Qi R, Wu J, Guo F, Xu L, Zou Q. A spectral clustering with self-weighted multiple kernel learning method for single-cell RNA-seq data. Brief Bioinform. 2021;22(4):bbaa216. pmid:33003206
View Article
PubMed/NCBI
Google Scholar

[27] View Article

[28] PubMed/NCBI

[29] Google Scholar

[ref9] 9. Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre EJ. Fast unfolding of communities in large networks. J Stat Mech. 2008;2008(10):P10008.
View Article
Google Scholar

[31] View Article

[32] Google Scholar

[ref10] 10. Traag VA, Waltman L, Van Eck NJ. From Louvain to Leiden: guaranteeing well-connected communities. Scientometrics. 2019;9(1):1–12.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref11] 11. Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19(1):15. pmid:29409532
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref12] 12. Satija R, Farrell JA, Gennert D, Schier AF, Regev AJ. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015;33(5):495–502.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref13] 13. Dai C, Jiang Y, Yin C, Su R, Zeng X, Zou Q, et al. scIMC: a platform for benchmarking comparison and visualization analysis of scRNA-seq data imputation methods. Nucleic Acids Res. 2022;50(9):4877–99. pmid:35524568
View Article
PubMed/NCBI
Google Scholar

[44] View Article

[45] PubMed/NCBI

[46] Google Scholar

[ref14] 14. Dries R, Zhu Q, Dong R, Eng CHL, Li H, Liu K. Giotto: a toolbox for integrative analysis and visualization of spatial expression data. 2021;22:1–31.

[ref15] 15. Zhao E, Stone MR, Ren X, Guenthoer J, Smythe KS, Pulliam T, et al. Spatial transcriptomics at subspot resolution with BayesSpace. Nat Biotechnol. 2021;39(11):1375–84. pmid:34083791
View Article
PubMed/NCBI
Google Scholar

[49] View Article

[50] PubMed/NCBI

[51] Google Scholar

[ref16] 16. Xu J, Lu C, Jin S, Meng Y, Fu X, Zeng X, et al. Deep learning-based cell-specific gene regulatory networks inferred from single-cell multiome data. Nucleic Acids Res. 2025;53(5):gkaf138. pmid:40037709
View Article
PubMed/NCBI
Google Scholar

[53] View Article

[54] PubMed/NCBI

[55] Google Scholar

[ref17] 17. Li HL, Pang YH, Liu BJN. BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models. Nucleic Acids Research. 2021;49(22):e129-e.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref18] 18. Liu J, Ma L, Ju F, Zhao C, Yu L. SpaCcLink: exploring downstream signaling regulations with graph attention network for systematic inference of spatial cell–cell communication. J Biol B. 2025;23(1):44.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref19] 19. Huang Z, Guo X, Qin J, Gao L, Ju F, Zhao C, et al. Accurate RNA velocity estimation based on multibatch network reveals complex lineage in batch scRNA-seq data. BMC Biol. 2024;22(1):290. pmid:39696422
View Article
PubMed/NCBI
Google Scholar

[63] View Article

[64] PubMed/NCBI

[65] Google Scholar

[ref20] 20. Zhao M, Li J, Liu X, Ma K, Tang J, Guo F. A gene regulatory network-aware graph learning method for cell identity annotation in single-cell RNA-seq data. Genome Res. 2024;34(7):1036–51. pmid:39134412
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref21] 21. Zhang H-Q, Arif M, Thafar MA, Albaradei S, Cai P, Zhang Y, et al. PMPred-AE: a computational model for the detection and interpretation of pathological myopia based on artificial intelligence. Front Med (Lausanne). 2025;12:1529335. pmid:40182849
View Article
PubMed/NCBI
Google Scholar

[71] View Article

[72] PubMed/NCBI

[73] Google Scholar

[ref22] 22. Wang J, Zou Q, Lin C. A comparison of deep learning-based pre-processing and clustering approaches for single-cell RNA sequencing data. Brief Bioinform. 2022;23(1):bbab345. pmid:34472590
View Article
PubMed/NCBI
Google Scholar

[75] View Article

[76] PubMed/NCBI

[77] Google Scholar

[ref23] 23. Wang Y, Zhai Y, Ding Y, Zou Q. SBSM-Pro: support bio-sequence machine for proteins. Sci China Inf Sci. 2024;67(11).
View Article
Google Scholar

[79] View Article

[80] Google Scholar

[ref24] 24. Liu T, Fang ZY, Li X, Zhang LN, Cao DS, Yin MZ. Graph deep learning enabled spatial domains identification for spatial transcriptomics. J Biol Inorg Chem. 2023;24(3):bbad146.
View Article
Google Scholar

[82] View Article

[83] Google Scholar

[ref25] 25. Pham D, Tan X, Xu J, Grice LF, Lam PY, Raghubar A. stLearn: integrating spatial location, tissue morphology and gene expression to find cell types, cell-cell interactions and spatial trajectories within undissociated tissues. 2020;2020:31.125658.

[ref26] 26. Hu J, Li X, Coleman K, Schroeder A, Ma N, Irwin DJ, et al. SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Nat Methods. 2021;18(11):1342–51. pmid:34711970
View Article
PubMed/NCBI
Google Scholar

[86] View Article

[87] PubMed/NCBI

[88] Google Scholar

[ref27] 27. Xu C, Jin X, Wei S, Wang P, Luo M, Xu Z, et al. DeepST: identifying spatial domains in spatial transcriptomics by deep learning. Nucleic Acids Res. 2022;50(22):e131. pmid:36250636
View Article
PubMed/NCBI
Google Scholar

[90] View Article

[91] PubMed/NCBI

[92] Google Scholar

[ref28] 28. Xu H, Fu H, Long Y, Ang KS, Sethi R, Chong K, et al. Unsupervised spatially embedded deep representation of spatial transcriptomics. Genome Med. 2024;16(1):12. pmid:38217035
View Article
PubMed/NCBI
Google Scholar

[94] View Article

[95] PubMed/NCBI

[96] Google Scholar

[ref29] 29. Dong K, Zhang S. Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder. Nat Commun. 2022;13(1):1739. pmid:35365632
View Article
PubMed/NCBI
Google Scholar

[98] View Article

[99] PubMed/NCBI

[100] Google Scholar

[ref30] 30. Long Y, Ang KS, Li M, Chong KLK, Sethi R, Zhong C, et al. Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with GraphST. Nat Commun. 2023;14(1):1155. pmid:36859400
View Article
PubMed/NCBI
Google Scholar

[102] View Article

[103] PubMed/NCBI

[104] Google Scholar

[ref31] 31. Pang C, Qiao J, Zeng X, Zou Q, Wei L. Deep Generative Models in De Novo Drug Molecule Generation. J Chem Inf Model. 2024;64(7):2174–94. pmid:37934070
View Article
PubMed/NCBI
Google Scholar

[106] View Article

[107] PubMed/NCBI

[108] Google Scholar

[ref32] 32. Wang B, Luo J, Liu Y, Shi W, Xiong Z, Shen C, et al. Spatial-MGCN: a novel multi-view graph convolutional network for identifying spatial domains with attention mechanism. Brief Bioinform. 2023;24(5):bbad262. pmid:37466210
View Article
PubMed/NCBI
Google Scholar

[110] View Article

[111] PubMed/NCBI

[112] Google Scholar

[ref33] 33. Shi X, Zhu J, Long Y, Liang C. Identifying spatial domains of spatially resolved transcriptomics via multi-view graph convolutional networks. Brief Bioinform. 2023;24(5):bbad278. pmid:37544658
View Article
PubMed/NCBI
Google Scholar

[114] View Article

[115] PubMed/NCBI

[116] Google Scholar

[ref34] 34. Guo X, Huang Z, Ju F, Zhao C, Yu L. Highly Accurate Estimation of Cell Type Abundance in Bulk Tissues Based on Single-Cell Reference and Domain Adaptive Matching. Adv Sci (Weinh). 2024;11(7):e2306329. pmid:38072669
View Article
PubMed/NCBI
Google Scholar

[118] View Article

[119] PubMed/NCBI

[120] Google Scholar

[ref35] 35. Qi R, Ma A, Ma Q, Zou Q. Clustering and classification methods for single-cell RNA-sequencing data. Brief Bioinform. 2020;21(4):1196–208. pmid:31271412
View Article
PubMed/NCBI
Google Scholar

[122] View Article

[123] PubMed/NCBI

[124] Google Scholar

[ref36] 36. Cover T, Hart P. Nearest neighbor pattern classification. IEEE Trans Inform Theory. 1967;13(1):21–7.
View Article
Google Scholar

[126] View Article

[127] Google Scholar

[ref37] 37. Bentley JL, Stanat DF, Williams EH Jr. The complexity of finding fixed-radius near neighbors. Information Processing Letters. 1977;6(6):209–12.
View Article
Google Scholar

[129] View Article

[130] Google Scholar

[ref38] 38. Zeng Y, Yin R, Luo M, Chen J, Pan Z, Lu Y, et al. Identifying spatial domain by adapting transcriptomics with histology through contrastive learning. Brief Bioinform. 2023;24(2):bbad048. pmid:36781228
View Article
PubMed/NCBI
Google Scholar

[132] View Article

[133] PubMed/NCBI

[134] Google Scholar

[ref39] 39. Qian Y, Zou Q, Zhao M, Liu Y, Guo F, Ding Y. scRNMF: An imputation method for single-cell RNA-seq data by robust and non-negative matrix factorization. PLoS Comput Biol. 2024;20(8):e1012339. pmid:39116191
View Article
PubMed/NCBI
Google Scholar

[136] View Article

[137] PubMed/NCBI

[138] Google Scholar

[ref40] 40. Yu Z, Lu Y, Wang Y, Tang F, Wong K-C, Li X. ZINB-Based Graph Embedding Autoencoder for Single-Cell RNA-Seq Interpretations. AAAI. 2022;36(4):4671–9.
View Article
Google Scholar

[140] View Article

[141] Google Scholar

[ref41] 41. Yuan H, Liu M, Qiu Y, Ching W-K, Zou Q. PLNMFG: Pseudo-label guided non-negative matrix factorization model with graph constraint for single-cell multi-omics data clustering. PLoS Comput Biol. 2025;21(8):e1013375. pmid:40825083
View Article
PubMed/NCBI
Google Scholar

[143] View Article

[144] PubMed/NCBI

[145] Google Scholar

[ref42] 42. Li X, Huang W, Xu X, Zhang HY, Shi Q. Deciphering tissue heterogeneity from spatially resolved transcriptomics by the autoencoder-assisted graph convolutional neural network. Frontiers in Genetics. 2023;14:1202409.
View Article
Google Scholar

[147] View Article

[148] Google Scholar

[ref43] 43. Si Z, Li H, Shang W, Zhao Y, Kong L, Long C, et al. SpaNCMG: improving spatial domains identification of spatial transcriptomics using neighborhood-complementary mixed-view graph convolutional network. Brief Bioinform. 2024;25(4):bbae259. pmid:38811360
View Article
PubMed/NCBI
Google Scholar

[150] View Article

[151] PubMed/NCBI

[152] Google Scholar

[ref44] 44. Maynard KR, Collado-Torres L, Weber LM, Uytingco C, Barry BK, Williams SR, et al. Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex. Nat Neurosci. 2021;24(3):425–36. pmid:33558695
View Article
PubMed/NCBI
Google Scholar

[154] View Article

[155] PubMed/NCBI

[156] Google Scholar

[ref45] 45. Jiang J, Wang N, Chen P, Zheng C, Wang B. Prediction of Protein Hotspots from Whole Protein Sequences by a Random Projection Ensemble System. Int J Mol Sci. 2017;18(7):1543. pmid:28718782
View Article
PubMed/NCBI
Google Scholar

[158] View Article

[159] PubMed/NCBI

[160] Google Scholar

[ref46] 46. Kadowaki K, Sugimoto K, Yamaguchi F, Song T, Watanabe Y, Singh K, et al. Phosphohippolin expression in the rat central nervous system. Brain Res Mol Brain Res. 2004;125(1–2):105–12. pmid:15193427
View Article
PubMed/NCBI
Google Scholar

[162] View Article

[163] PubMed/NCBI

[164] Google Scholar

[ref47] 47. Liu X, Liu S. Cholecystokinin selectively activates short axon cells to enhance inhibition of olfactory bulb output neurons. J Physiol. 2018;596(11):2185–207. pmid:29572837
View Article
PubMed/NCBI
Google Scholar

[166] View Article

[167] PubMed/NCBI

[168] Google Scholar

[ref48] 48. Sun X, Liu X, Starr ER, Liu S. CCKergic Tufted Cells Differentially Drive Two Anatomically Segregated Inhibitory Circuits in the Mouse Olfactory Bulb. J Neurosci. 2020;40(32):6189–206. pmid:32605937
View Article
PubMed/NCBI
Google Scholar

[170] View Article

[171] PubMed/NCBI

[172] Google Scholar

[ref49] 49. Wang Y, Sheng N, Xie Y, Chen S, Lu J, Zhang Z, et al. Low expression of CRISP3 predicts a favorable prognosis in patients with mammary carcinoma. J Cell Physiol. 2019;234(8):13629–38. pmid:30609035
View Article
PubMed/NCBI
Google Scholar

[174] View Article

[175] PubMed/NCBI

[176] Google Scholar

[ref50] 50. Liang X, Liu P, Xue L, Chen B, Liu W, Shi W, et al. A multi-modality and multi-granularity collaborative learning framework for identifying spatial domains and spatially variable genes. Bioinformatics. 2024;40(10):btae607. pmid:39418177
View Article
PubMed/NCBI
Google Scholar

[178] View Article

[179] PubMed/NCBI

[180] Google Scholar

[ref51] 51. Long Y, Ang KS, Sethi R, Liao S, Heng Y, van Olst L, et al. Deciphering spatial domains from spatial multi-omics with SpatialGlue. Nat Methods. 2024;21(9):1658–67. pmid:38907114
View Article
PubMed/NCBI
Google Scholar

[182] View Article

[183] PubMed/NCBI

[184] Google Scholar

[ref52] 52. Buache E, Etique N, Alpy F, Stoll I, Muckensturm M, Reina-San-Martin B, et al. Deficiency in trefoil factor 1 (TFF1) increases tumorigenicity of human breast cancer cells and mammary tumor development in TFF1-knockout mice. Oncogene. 2011;30(29):3261–73. pmid:21358676
View Article
PubMed/NCBI
Google Scholar

[186] View Article

[187] PubMed/NCBI

[188] Google Scholar

[ref53] 53. Chen A, Liao S, Cheng M, Ma K, Wu L, Lai Y, et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell. 2022;185(10):1777-1792.e21. pmid:35512705
View Article
PubMed/NCBI
Google Scholar

[190] View Article

[191] PubMed/NCBI

[192] Google Scholar

[ref54] 54. Jennings RE, Berry AA, Strutt JP, Gerrard DT, Hanley NA. Human pancreas development. Development. 2015;142(18):3126–37. pmid:26395141
View Article
PubMed/NCBI
Google Scholar

[194] View Article

[195] PubMed/NCBI

[196] Google Scholar

[ref55] 55. Eikrem OS, Strauss P, Beisland C, Scherer A, Landolt L, Flatberg A, et al. Development and confirmation of potential gene classifiers of human clear cell renal cell carcinoma using next-generation RNA sequencing. Scand J Urol. 2016;50(6):452–62. pmid:27739342
View Article
PubMed/NCBI
Google Scholar

[198] View Article

[199] PubMed/NCBI

[200] Google Scholar

[ref56] 56. Cui L, Guo G, Ng MK, Zou Q, Qiu Y. GSTRPCA: irregular tensor singular value decomposition for single-cell multi-omics data clustering. Brief Bioinform. 2024;26(1):bbae649. pmid:39680741
View Article
PubMed/NCBI
Google Scholar

[202] View Article

[203] PubMed/NCBI

[204] Google Scholar

[ref57] 57. Zhu Y, Xu Y, Yu F, Liu Q, Wu S, Wang L. Graph Contrastive Learning with Adaptive Augmentation. In: Proceedings of the Web Conference 2021, 2021. 2069–80.
View Article
Google Scholar

[206] View Article

[207] Google Scholar

[ref58] 58. Kipf TN, Welling MJ. Semi-supervised classification with graph convolutional networks. 2016.

[ref59] 59. Gan Y, Huang X, Zou G, Zhou S, Guan J. Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network. Brief Bioinform. 2022;23(2):bbac018. pmid:35172334
View Article
PubMed/NCBI
Google Scholar

[210] View Article

[211] PubMed/NCBI

[212] Google Scholar

[ref60] 60. Chen T, Kornblith S, Norouzi M, Hinton G. A simple framework for contrastive learning of visual representations. In: Proceedings of Machine Learning Research, 2020.

[ref61] 61. Stoltzfus CR, Filipek J, Gern BH, Olin BE, Leal JM, Wu Y. CytoMAP: a spatial analysis toolbox reveals features of myeloid cell organization in lymphoid tissues. 2020;31(3).

[ref62] 62. Yuan L, Xu Z, Meng B, Ye L. scAMZI: attention-based deep autoencoder with zero-inflated layer for clustering scRNA-seq data. BMC Genomics. 2025;26(1):350. pmid:40197174
View Article
PubMed/NCBI
Google Scholar

[216] View Article

[217] PubMed/NCBI

[218] Google Scholar

[ref63] 63. Hu Y, Xiao K, Yang H, Liu X, Zhang C, Shi Q. Spatially contrastive variational autoencoder for deciphering tissue heterogeneity from spatially resolved transcriptomics. J Biol Informa Bio. 2024;25(2):bbae016.
View Article
Google Scholar

[220] View Article

[221] Google Scholar

[ref64] 64. Estévez PA, Tesmer M, Perez CA, Zurada JMJ. Normalized mutual information feature selection. JITonn. 2009;20(2):189–201.
View Article
Google Scholar

[223] View Article

[224] Google Scholar

SpaMWGDA: Identifying spatial domains of spatial transcriptomes using multi-view weighted fusion graph convolutional network and data augmentation

SpaMWGDA: Identifying spatial domains of spatial transcriptomes using multi-view weighted fusion graph convolutional network and data augmentation

Correction

Figures

Abstract

Author summary

Introduction

Results

Ablation experiments

The performance of contrastive learning and weighted fusion attention on model

The impact of noise on model robustness and the scalability of SpaMWGDA

Performance comparison of methods for identifying spatial domain

SpaMWGDA reveals the laminar structure of the olfactory bulb from Stereo-seq data and infers its biological functions

SpaMWGDA delivers detailed insights into tumor heterogeneity in human breast cancer

Discussion

Materials and methods

Data preparation

Experiment settings

The SpaMWGDA framework

Spatial neighborhood networks construction

Gene expression enhancement

Multi-view weighted fusion GCN encoder

Attention mechanism

ZINB decoder

Spatial regularization constraint

Evaluation strategies

Evaluation metrics

Supporting information

S1 Table. Experimental results of SpaMWGDA and seven competing methods on the noisy DLPFC dataset.

S1 Fig. Detailed results for all 12 slices in the DLPFC dataset.

References