Figures
Abstract
Motivation
The expression of circular RNAs (circRNAs) has been shown to be strongly correlated with drug sensitivity in human cells. However, experimental validation using wet-lab techniques is costly and inefficient, leaving a substantial portion of circRNA–drug sensitivity associations undiscovered. Therefore, improving the prediction efficiency of circRNA and sensitivity associations remains critical.
Methods
Here, we describe a method that integrates collaborative feature learning and graph structure learning to predict associations between circRNAs and drug sensitivity (CFGSCDSA). Specifically, collaborative learning integrated heterogeneous features from diverse data sources, thereby addressing the issue of data sparsity. Furthermore, graph structure learning with a confidence-guided pseudo-labeling strategy was employed to mitigate the detrimental effect of excessive negative samples. Results: Experimental evaluation revealed that CFGSCDSA attained superior performance compared to all competing models. Moreover, case studies provided further evidence of its capability to accurately predict both novel associations and new drug-related links.
Author summary
Circular RNAs (circRNAs) play important roles in regulating drug sensitivity, but identifying their associations with specific drugs through biological experiments is costly and time-consuming. Computational methods therefore provide an efficient alternative for uncovering these potential relationships. In this study, we propose CFGSCDSA, a predictive framework that integrates collaborative feature learning with graph structure learning to improve the accuracy of circRNA–drug sensitivity association prediction. The collaborative feature-learning module effectively merges heterogeneous information from multiple biological sources, addressing data sparsity and enrich feature representations. A graph structure learning strategy guided by confidence-based pseudo-labeling strategy is employed to refine graph topology and mitigate the negative impact of false-negative samples. Extensive experiments demonstrate that CFGSCDSA outperforms existing state-of-the-art methods across multiple evaluation metrics. Furthermore, case studies verify its capability to uncover both known and novel circRNA–drug sensitivity associations, highlighting its potential to facilitate the discovery of therapeutic biomarkers.
Citation: Zhang X, Zou Q, Wang C, Niu M (2026) CFGSCDSA: Predicting circRNA-drug sensitivity associations based on collaborative feature learning and graph structure learning. PLoS Comput Biol 22(3): e1014072. https://doi.org/10.1371/journal.pcbi.1014072
Editor: Peng Wei, The University of Texas MD Anderson Cancer Center, UNITED STATES OF AMERICA
Received: October 17, 2025; Accepted: February 26, 2026; Published: March 13, 2026
Copyright: © 2026 Zhang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All code written in support of this publication is publicly available at https://github.com/zhangxue2115/CFGSCDSA.
Funding: The work was supported by the Natural Science Foundation of China (No. 62231013 to CYW, 62473268 to MTN), National Key Research and Development Program of China (Grant no. 2024YFC3405902 to CYW), the Special Science Foundation of Quzhou (No. 2024D001 to QZ) and Shenzhen Science and Technology Program (Grant No. RCBS20231211090800004 to MTN). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Circular RNAs (circRNAs) are a class of non-coding RNAs generated through back-splicing, resulting in covalently closed circular structures [1–3]. Research has shown that circRNAs modulate drug resistance and sensitivity by influencing diverse biological processes, including DNA repair mechanisms, immune evasion, and tumor-promoting inflammation [4–6]. Huang et al. demonstrated that CircESRP1 regulates the CDKN1A pathway, contributing to increased cisplatin sensitivity in small cell lung cancer [7]. Wang et al. showed that circ-METRN contributes to the radio-resistance of glioblastoma cells by regulating DNA damage repair via the miR-4709-3p/GRB14/PDGFRα cascade [8]. Qu et al. demonstrated that circCDYL2 enhances sensitivity in nasopharyngeal carcinoma by modulating the expression of RAD51 mRNA [9]. Additionally, circNOP14 sensitizes hepatocellular carcinoma cells to radiation through its interaction with Ku70 [10], and circCCAR1 contributes to anti-PD-1 immunotherapy resistance through modulation of the miR-127-5p feedback loop [11,12]. Collectively, this evidence highlights a robust relationship between circRNAs and drug sensitivity. Thus, investigating this relationship may facilitate the identification of potential therapeutic targets for cancer in clinical settings [13,14]. Accordingly, an urgent need exists to explore circRNAs–drug sensitivity associations(CDSAs) to better understand their roles in therapeutic response.
Traditional biological methods for identifying CDSAs are often expensive, labor-intensive, and inefficient [15–17]. Consequently, computational approaches have emerged as valuable tools for uncovering potential links between circRNAs and drug sensitivity [18–20]. GATECDA incorporates host gene sequences of circRNAs and drug structural information to compute similarity matrices, and then leverages a graph attention autoencoder to obtain low-dimensional embeddings for prediction [21]. Based on diverse data sources of circRNAs and drugs, MNGACDA establishes a multimodal network and then utilizes a graph attention autoencoder to extract node-level information [22]. MAGSDMF leverages multi-attention and graph learning methods to dynamically capture feature representations, followed by stacked deep matrix factorization for further feature refinement, and ultimately integrates the learned features using a multi-channel attention mechanism [23]. DPMGCDA first constructs homogenous and heterogeneous graphs for circRNAs and drugs, then extracts both global and fused features, and finally uses a path-masked graph autoencoder to predict their associations [24]. SNMGCDA extracts features using a sparse autoencoder, non-negative matrix factorization, and a multi-head graph attention network, and integrates them for final prediction [25]. HMFHCL constructs heterogeneous networks at both global and local levels for circRNAs and drugs, and enhances feature representation by performing matrix factorization and hypergraph convolution [26]. MiGNN2CDS introduces instance-level learning, designs a pseudo-metapath generator and a bidirectional translation embedding projection to learn metapath representations, and employs a multi-scale attention network for prediction [27]. Although these models have achieved certain progress in predicting CDSAs, several limitations remain. First, they fail to fully exploit data heterogeneity, leading to insufficient interaction learning. Second, CDSA datasets are relatively sparse, which may weaken the representation capability of the models.
To effectively address these challenges, we propose CFGSCDSA, a novel framework that integrates collaborative feature learning with a confidence-guided pseudo-labeling based graph structure learning strategy. Unlike existing approaches, CFGSCDSA is specifically designed to jointly overcome the two major challenges of insufficient heterogeneous information utilization and dataset sparsity, thereby improving both interaction learning and topological representation robustness. The workflow of CFGSCDSA is illustrated in Fig 1. The CFGSCDSA model consists of the following three modules:
In step A, CFGSCDSA constructs multi-source similarity networks for circRNAs and drugs. In step B, the model employs collaborative feature learning strategy to comprehensively capture information from different data sources. In step C, graph structure learning with confidence-guided pseudo-labeling strategy is adopted to enhance the learning of topological features.
- (1). Multi-source similarity construction, which incorporates multiple circRNA and drug similarity networks to provide enriched biological and topological information;
- (2). Collaborative feature learning, which models and integrates differences among heterogeneous data sources, yielding more comprehensive representations and alleviating sparsity-induced overfitting;
- (3). Confidence-guided pseudo-labeling based graph structure learning, which optimizes graph topology, enhances structural representation learning, and mitigates the influence of latent false-negative associations.
2. Materials and methods
2.1. Datasets
The dataset used in this study was sourced from reference [21], and obtained from the circRic database, where drug sensitivity data were derived from the Genomics of Drug Sensitivity in Cancer (GDSC) database [28]. The circRic database systematically profiled circRNA expression based on data from the Cancer Cell Line Encyclopedia (CCLE) and statistically analyzed the associations between circRNA expression and drug response. For each circRNA, the Wilcoxon test was employed to identify drugs whose sensitivities were significantly associated with circRNA expression. Notably, circRNAs in this dataset are labeled using their host gene names.
The original dataset comprised 80,076 circRNA–drug sensitivity associations involving 404 circRNAs and 250 drugs. Following significance filtering using a false discovery rate (FDR) threshold of 0.05, the final dataset used in this work consisted of 4,134 associations between 271 circRNAs and 218 drugs. These associations were encoded into an association matrix Y, where denotes the presence of an association between circRNA i and drug j, and
denotes its absence.
These associations were encoded into an association matrix Y, where denotes an association and
otherwise.
2.2. Data pre-processing and encoding
To obtain comprehensive and high-quality initial features, we computed six similarity matrices, including ,
,
,
,
, and
. Table 1 summarizes all similarity matrices used in this study, including their corresponding entity types, data sources, and computational strategies.
CircRNA sequence similarity.
We adopted the similarity calculation method for circRNAs proposed by Deng et al. [21]. Specifically, we first retrieved host gene information of circRNAs from the National Center for Biotechnology Information (NCBI) database [29] and then computed the similarity between host genes using the Levenshtein distance, which served as the similarity measure between circRNAs [30–32].
Let and
denote the sequences of the host genes corresponding to circRNA i and circRNA j, respectively. The Levenshtein distance between them is computed via a dynamic programming matrix
, where each entry
represents the edit distance between the first p characters of
and the first q characters of
. The recurrence relation is defined as:
where the mismatch indicator function is defined as:
The complete edit distance between circRNA i and j is obtained at:
To normalize this distance and transform it into a similarity score, the circRNA sequence similarity matrix is defined as:
Drug structure similarity.
Analyzing the chemical structure of drugs enables the evaluation of their functional similarity. Specifically, SMILES structural data were obtained from the PubChem database [33], topological fingerprints were generated using RDKit, and structural similarity was computed using the Tanimoto coefficient.
Here, and
correspond to the topological fingerprint vectors of drugs i and j.
Entropy similarity of circRNA and drug.
The concept of information entropy was originally introduced by Shannon [34,35]. In this study, we employ entropy-based methods to independently compute the similarity between circRNAs and drugs. Taking circRNAs as an example, where denotes the drug set associated with circRNA i. For each drug
, the occurrence probability is defined as:
Where N represent the total number of known circRNA–drug associations, denotes the number of circRNAs known to be associated with drug
The information entropy of circRNA i is then calculated as:
The entropy-based similarity between circRNA i and circRNA j is defined as:
refers to the set of drugs that exhibit associations with both circRNA i and circRNA j.
The entropy similarity is then calculated using the same formula:
Gaussian kernel function similarity of circRNAs and drugs.
The Gaussian interaction profile (GIP) kernel similarity was incorporated to obtain more comprehensive features [36]. The GIP similarity matrix for circRNAs was determined as follows:
Here, denotes the bandwidth parameter for circRNAs, and n represents the total number of circRNAs.
refers to the column in the association matrix corresponding to circRNA i.
The GIP similarity matrix for drugs was determined as follows:
Here, denotes the bandwidth parameter for drugs, and m represents the total number of drugs.
refers to the row in the association matrix corresponding to drug i.
2.3. Network structure
2.3.1. Shared-constraint-driven collaborative feature learning.
To mitigate the weak interactions between heterogeneous sources, we adopted a shared-constraint-driven collaborative feature learning approach.
Feature Encoding: The similarity network was used as the input feature to the autoencoder. Specifically, the circRNA similarity network was encoded using a Graph Convolutional Network(GCN) to generate node representations,
where denotes the node features of
, and
denotes the encoder.
Shared Constraint: We determined shared constraints based on the similarity between nodes. In semi-supervised clustering, nodes with high similarity are referred to as “must-link” pairs. Specifically, Pearson correlation coefficients (PCC) were employed to compute pairwise node similarities :
In this context, and
refer to the feature representations of
and
within
, while
indicates the dimension of the feature space.
After obtaining the PCC values from all circRNA networks, a subset of entries is selected. The corresponding positions are set to 1 while the rest are set to 0, resulting in the “must-link” constraint matrix MLM.
All similarity attributes are subsequently utilized to jointly extract circRNA feature representations, and the shared of all “must-link” constraint matrix SMLM is defined as the shared constraint.
Specifically, denotes the operation that computes the intersection between sets.
This shared constraint, together with the low-dimensional features obtained from the first autoencoder, is then fed into the subsequent autoencoder.
Here, represents the decoder.
Loss Function Construction: We first constructed a loss function based on the original and the reconstructed features obtained through the encoder-decoder framework.
Here, k denotes the number of circRNA nodes.
Subsequently, we introduced a shared constraint loss to enforce similarity between feature representations. Specifically, for nodes and that form a must-link pair, their embeddings are required to be proximal in the feature space. The corresponding loss function is defined as:
where and
correspond to the features in the vector space
, while S indicates the number of shared constraint pairs.
Therefore, the overall loss function of the collaborative feature extraction module is defined as follows:
After applying collaborative feature learning, similarity features of circRNAs and drugs were obtained and then aggregated to form the feature matrix X. To integrate heterogeneous circRNA and drug features, we construct a block-structured feature matrix X. Zero matrices are inserted between circRNA and drug feature blocks to prevent unintended cross-type interactions. This block structure preserves the independence of the two biological entities while enabling unified feature representation learning.
2.3.2. Graph structure learning.
To reduce the influence of latent false-negative samples in the CDSA dataset, we incorporate a confidence-guided pseudo-labeling mechanism into graph structure learning. This design reduces the impact of low-confidence negative samples, preventing potentially false-negative pairs from dominating the graph structure learning process and yielding a more reliable circRNA–drug interaction topology.
Graph learner: Graph learners are capable of modeling graph structures. We adopted the FGP learner to model the adjacency matrix S as follows:
where σ is defined as the nonlinear activation function.
The adjacency matrix S is normalized and symmetrized to ensure consistency, resulting in .
Here, D denotes the degree matrix of S, and represents the activation function.
View Construction: To enhance feature representation, we adopted an inference and an optimization view as dual perspectives in the learning process. The refined view was constructed based on the original topological structure and the features extracted by the collaborative feature learning module.
The inference view is composed of an adjacency matrix
generated by the graph learner and feature representations derived from the collaborative feature learning module.
Data Augmentation: When two views exhibit high similarity, the learning of discriminative features becomes limited [37,38]. To mitigate this, we incorporated edge dropout and feature masking across the two views, thereby inducing structural variation and facilitating the extraction of richer semantic features.
Different feature masking probabilities are applied to the two views, where is used for the refinement view and
for the inference view.
and
denote the feature matrix of the inference view after feature masking, where
represents the transpose of the i-th row feature of X, and
and
are the masking vectors generated according to a Bernoulli distribution with the masking probability
and
.
Feature masking resulted in significantly different graph structures of the two views; thus, the same edge dropout probability was applied to both views. The adjacency matrices of the refined and inference views after edge dropout are defined as follows:
is the dropping vector generated from the Bernoulli distribution with the dropping probability
and
represent the adjacency matrices of the defined view and inference view after edge dropout, respectively. The augmented views are thus obtained as follows:
For both the refined view and the inference view, representations and
are obtained from the augmented views via an encoder, and then mapped to feature vectors
and
through Multilayer Perceptron (MLP). The process is defined as follows:
Here, f and g denote the parameters of the GCN encoder and the MLP layer, respectively.
Confidence-guided Pseudo-labeling: Inspired by the minimum-entropy principle in self-supervised learning [39], we design a confidence-guided pseudo-labeling strategy to provide reliable supervision for cross-view representation learning. It consists of three key steps. The first step involves normalizing the feature representations:
Then, compute the index of the dimension with the maximum value in the normalized feature vector.
Finally, the loss function is based on cross-entropy loss, as follows:
where means cross-entropy loss.
2.3.3. CDSA prediction.
The optimal graph structure was obtained through the semi-supervised learning module, after which a GCN was employed to predict CDSA associations, resulting in the prediction result matrix
.
The loss function employs mean squared error (MSE) loss to minimize the discrepancy between the predicted values and the ground truth.
The predicted value of the i-th sample in set is denoted by
, while the true value of the i-th sample in set A is represented by
.
3. Results and discussion
3.1. Performance evaluation
In the experiments, the ratio of positive to negative samples was set to 1:1. We used 5-fold and 10-fold cross-validation (CV) to evaluate the model’s predictive ability [40,41]. Under 5-fold CV, CFGSCDSA achieved an area under the curve (AUC) of 0.9354 and an area under the precision–recall curve (AUPR) of 0.9336, as illustrated in Fig 2. For 10-fold CV, the model attained an AUC of 0.9358 and an AUPR of 0.9341.
3.2. Comparison with other methods
To assess the effectiveness of CFGSCDSA, we compared it against seven representative state-of-the-art CDSA prediction methods: MHCDA [42], DHANMKF [43], DGATCCDA [44], MNGACDA [22], MNCLCDA [45], GATECDA [21], and MHGTCDA [46]. MHCDA leverages GCN and hypergraph convolution to capture local and global information. DHANMKF applies intra and inter-type attention mechanisms. DGATCCDA uses a DeepWalk-enhanced graph attention network for multimodal node embedding. MNGACDA employs a graph autoencoder with node-level attention. MNCLCDA integrates hybrid neighborhood GCN with graph contrastive learning. GATECDA extracts low-dimensional features using a graph attention autoencoder, and MHGTCDA combines an adaptive encoder with a multi-layer heterogeneous graph transformer for association prediction. For a fair comparison, all competing methods were implemented using the parameter settings recommended in their original publications.
We describe three evaluation metrics—AUC, AUPR, and accuracy. The results of 5-CV for the eight methods are shown in Fig 3A. CFGSCDSA outperformed GATECDA, MNCLCDA, and MNGACDA in terms of AUC by 5.08%, 2.7%, and 2.34%, respectively, and exceeds the second-best model by 1.35%, indicating that CFGSCDSA consistently outperformed existing CDSA prediction approaches in all metrics, thereby validating the effectiveness of the proposed model. While existing approaches employ different strategies to extract circRNA and drug features, they insufficiently consider the correlations within homogeneous similarity networks. Moreover, the proposed model integrates the confidence-guided pseudo-labeling strategy to reduce the adverse effects of unreliable negative samples and improve feature robustness. These findings demonstrate the effectiveness of CFGSCDSA for CDSA prediction, underscoring its capability to enhance feature representation via collaborative learning and self-supervised mechanisms.
(B) Parameter analysis for edge dropout. (C) Parameter analysis for masking probability. (D) Performance of ablation study.
3.3. Parameter sensitivity analysis
Hyperparameter settings significantly affect model performance. We primarily analyzed the effects of masking probability and
, as well as edge dropout probability
on the outcome.
We first evaluated edge dropout probability, as removing edges enhances performance during graph structure learning. The results under varying edge dropout probability settings are depicted in Fig 3B. Optimal performance was obtained at = 0.1, whereas higher values result in progressively decreased performance.
Further, we applied distinct masking probabilities and
, respectively. Fig 3C presents the AUC values under all combinations of masking probabilities. The highest AUC was achieved when
and
, whereas other combinations resulted in varying degrees of performance degradation.
3.4. Ablation study
To assess the contribution of our modifications, we constructed two ablated variants: (1) CFGSCDSA-noCFE, in which the collaborative feature learning module was removed and the input was formed by concatenating similarities; and (2) CFGSCDSA-noCGPL, where the confidence-guided pseudo-labeling strategy was excluded from the graph structure learning process. Fig 3D illustrates the ablation results in terms of AUC, AUPR, F1, and recall. CFGSCDSA- noCGPL achieves the lowest scores across all evaluation metrics, indicating that confidence-guided pseudo-labeling strategy plays a crucial role in stabilizing self-supervised learning and enhancing representation quality. Likewise, the performance drop observed in CFGSCDSA-noCFE indicates that the collaborative feature-sharing module is essential for facilitating multi-source feature interactions. In addition, removing both modules results in a substantial reduction in recall, implying that the full CFGSCDSA framework is more effective at identifying true circRNA–drug sensitivity associations. The cooperative effect of confidence-guided pseudo-labeling strategy and collaborative feature learning helps minimize the omission of positive samples.
3.5. Case study
To assess the practical significance of CFGSCDSA, we performed case studies on crizotinib and etoposide. Data retrieved from the GDSC database were used to train CFGSCDSA and generate prediction outputs. For each drug, the 20 top-scoring circRNAs candidates were identified and validated through Cancer Therapeutics Response Portal (CTRP) to confirm their reliability. Specifically, circRNA–drug associations annotated as “CTRP” indicate that they are supported by the CTRP database with FDR less than 0.05, whereas entries labeled as “No evidence” denote that no supporting evidence was identified in CTRP. The detailed results are provided in Tables 2 and 3.
Crizotinib, as a tyrosine kinase inhibitor, suppresses the activity of multiple oncogenic receptors, including hepatocyte growth factor receptor, anaplastic lymphoma kinase (ALK), and ROS1 [47]. Based on the CFGSCDSA predictions, 20 circRNAs were identified as potential candidates associated with crizotinib, and notably, 16 of them were validated by CTRP. Several of the top predicted circRNAs, such as those derived from VIM, SPINT2, THBS1, and CTTN, are hosted by genes involved in cytoskeletal remodeling, epithelial–mesenchymal transition, and cell adhesion [48–51]. These processes are known to modulate the activity of receptor tyrosine kinase pathways targeted by crizotinib, providing a biologically plausible link between the predicted circRNAs and drug sensitivity.
Etoposide (VP-16) is widely employed as an antineoplastic agent due to its ability to inhibit DNA topoisomerase II [52,53]. Previous studies have demonstrated that this drug is associated with the development of leukemia. As shown in Table 3, among the top 20 circRNAs potentially associated with etoposide predicted by CFGSCDSA, 17 were validated in the CTRP database. Many of the highly ranked circRNAs for etoposide, including those derived from POLR2A, MEF2D, KRT19, and ASPH, originate from genes associated with DNA repair, transcriptional regulation, and apoptosis [54–57]. Because etoposide induces cytotoxicity through DNA double-strand break accumulation, circRNAs from these host genes may influence cellular responses to DNA damage, thereby affecting drug sensitivity.
In order to assess the performance of CFGSCDSA on novel drug prediction, we selected belinostat and vorinostat and all prior associations were removed. The top 10 predicted circRNAs for each drug are listed in Table 4. Validation results from CTRP showed that seven circRNAs for belinostat and seven circRNAs for vorinostat were successfully confirmed. For the HDAC inhibitors belinostat and vorinostat, several top circRNAs are hosted by genes involved in chromatin remodeling and epigenetic regulation, such as LMNA, JUP, FN1, and CALD1 [58–61]. Since HDAC inhibitors modulate transcriptional activity and chromatin accessibility, circRNAs originating from these pathways may plausibly alter cellular responses to epigenetic therapy.
4. Conclusions
This study introduces CFGSCDSA, a model that integrates collaborative learning and self-supervised learning for predicting CDSA. Specifically, CFGSCDSA employs collaborative learning to integrate heterogeneous features, followed by graph structure learning based on confidence-guided pseudo-labeling strategy to refine feature representations. Among seven state-of-the-art methods, CFGSCDSA demonstrates superior performance according to experimental results, and case studies further validate its practical utility. Nevertheless, the model still faces challenges such as data sparsity and class imbalance. Future work will focus on optimizing graph structure learning and addressing imbalance issues. Overall, CFGSCDSA provides a practical valuable framework for CDSA prediction and offers reliable technical support for clinical drug discovery.
References
- 1. Kristensen LS, Jakobsen T, Hager H, Kjems J. The emerging roles of circRNAs in cancer and oncology. Nat Rev Clin Oncol. 2022;19(3):188–206. pmid:34912049
- 2. Zhuang M, Zhang M, Wang Y, Zou L, He S, Liu J, et al. Genome-wide analysis of ovarian cancer-specific circRNAs in alternative splicing regulation. Curr Bioinf. 2025;20.
- 3. Liu Y, Li R, Ding Y, Hei X, Wu F-X. P4PC: a portal for bioinformatics resources of piRNAs and circRNAs. Curr Bioinf. 2024;19(9):873–8.
- 4. Wei M, Wang L, Li Y, Li Z, Zhao B, Su X, et al. BioKG-CMI: a multi-source feature fusion model based on biological knowledge graph for predicting circRNA-miRNA interactions. Sci China Inf Sci. 2024;67(8).
- 5. Qi R, Liu S, Hui X, Shaytan AK, Liu B. AI in drug development: advances in response, combination therapy, repositioning, and molecular design. Sci China Inf Sci. 2025;68(7).
- 6. Huang Z, Guo X, Qin J, Gao L, Ju F, Zhao C, et al. Accurate RNA velocity estimation based on multibatch network reveals complex lineage in batch scRNA-seq data. BMC Biol. 2024;22(1):290. pmid:39696422
- 7. Huang W, Yang Y, Wu J, Niu Y, Yao Y, Zhang J, et al. Circular RNA cESRP1 sensitises small cell lung cancer cells to chemotherapy by sponging miR-93-5p to inhibit TGF-β signalling. Cell Death Differ. 2020;27(5):1709–27. pmid:31728016
- 8. Wang X, Cao Q, Shi Y, Wu X, Mi Y, Liu K, et al. Identification of low-dose radiation-induced exosomal circ-METRN and miR-4709-3p/GRB14/PDGFRα pathway as a key regulatory mechanism in Glioblastoma progression and radioresistance: Functional validation and clinical theranostic significance. Int J Biol Sci. 2021;17(4):1061–78. pmid:33867829
- 9. Qu H, Wang Y, Yan Q, Fan C, Zhang X, Wang D, et al. CircCDYL2 bolsters radiotherapy resistance in nasopharyngeal carcinoma by promoting RAD51 translation initiation for enhanced homologous recombination repair. J Exp Clin Cancer Res. 2024;43(1):122. pmid:38654320
- 10. Lin L, Hu P, Luo M, Chen X, Xiao M, Zhong Z, et al. CircNOP14 increases the radiosensitivity of hepatocellular carcinoma via inhibition of Ku70-dependent DNA damage repair. Int J Biol Macromol. 2024;264(Pt 2):130541. pmid:38460628
- 11. Hu Z, Chen G, Zhao Y, Gao H, Li L, Yin Y, et al. Exosome-derived circCCAR1 promotes CD8 + T-cell dysfunction and anti-PD1 resistance in hepatocellular carcinoma. Mol Cancer. 2023;22(1):55. pmid:36932387
- 12. Sheng N, Liu Y, Gao L, Wang L, Si C, Huang L, et al. Deep-learning-based integration of sequence and structure information for efficiently predicting miRNA-drug associations. J Chem Inf Model. 2025;65(11):5317–26. pmid:40380921
- 13. Peng D, Wei C, Jing B, Yu R, Zhang Z, Han L. A novel protein encoded by circCOPA inhibits the malignant phenotype of glioblastoma cells and increases their sensitivity to temozolomide by disrupting the NONO-SFPQ complex. Cell Death Dis. 2024;15(8):616. pmid:39183343
- 14. Qiao J, Jin J, Wang D, Teng S, Zhang J, Yang X, et al. A self-conformation-aware pre-training framework for molecular property prediction with substructure interpretability. Nat Commun. 2025;16(1):4382. pmid:40355450
- 15.
Zhang Z, Wang Z, Zhao L, Wang J, Wang C. Multimodal Contrastive Learning for Protein–Protein Interaction Inhibitor Prediction. In: 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE; 2024. pp. 1327–33. https://doi.org/10.1109/bibm62325.2024.10822227
- 16. Wang Y, Zhai Y, Ding Y, Zou Q. SBSM-Pro: support bio-sequence machine for proteins. Sci China Inf Sci. 2024;67(11).
- 17. Luo Y, Shi L, Li Y, Zhuang A, Gong Y, Liu L, et al. From intention to implementation: automating biomedical research via LLMs. Sci China Inf Sci. 2025;68(7).
- 18. Zhang Z, Zou Q, Wang C, Wang J, Zhao L. Improving protein–protein interaction modulator predictions via knowledge-fused language models. Inf Fus. 2025;123:103227.
- 19. Qiao J, Jin J, Yu H, Wei L. Towards retraining-free RNA modification prediction with incremental learning. Inf Sci. 2024;660:120105.
- 20. Guo X, Huang Z, Ju F, Zhao C, Yu L. Highly accurate estimation of cell type abundance in bulk tissues based on single-cell reference and domain adaptive matching. Adv Sci (Weinh). 2024;11(7):e2306329. pmid:38072669
- 21. Deng L, Liu Z, Qian Y, Zhang J. Predicting circRNA-drug sensitivity associations via graph attention auto-encoder. BMC Bioinform. 2022;23(1):160. pmid:35508967
- 22. Yang B, Chen H. Predicting circRNA-drug sensitivity associations by learning multimodal networks using graph auto-encoders and attention mechanism. Brief Bioinform. 2023;24(1):bbac596. pmid:36617209
- 23. Ai N, Yuan H, Liang Y, Lu S, Ouyang D, Lai QH, et al. Multi-view multiattention graph learning with stack deep matrix factorization for circRNA-drug sensitivity association identification. IEEE J Biomed Health Inform. 2024;28(12):7670–82. pmid:39186430
- 24. Luo Y, Deng L. DPMGCDA: deciphering circRNA-drug sensitivity associations with dual perspective learning and path-masked graph autoencoder. J Chem Inf Model. 2024;64(10):4359–72. pmid:38745420
- 25. Yin S, Xu P, Jiang Y, Yang X, Lin Y, Zheng M, et al. Predicting the potential associations between circRNA and drug sensitivity using a multisource feature-based approach. J Cell Mol Med. 2024;28(19):e18591. pmid:39347936
- 26. Liu T, et al. Deciphering circRNA-drug sensitivity associations via global-local heterogeneous matrix factorization and hypergraph contrastive learning. Exp Syst Appl. 2025:128548.
- 27. Niu M, Wang C, Chen Y, Zou Q, Luo X. Interpretable multi-instance heterogeneous graph network learning modelling CircRNA-drug sensitivity association prediction. BMC Biol. 2025;23(1):131. pmid:40369616
- 28. Yang W, Soares J, Greninger P, Edelman EJ, Lightfoot H, Forbes S, et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 2013;41:D955–61. pmid:23180760
- 29. Rangwala SH, Kuznetsov A, Ananiev V, Asztalos A, Borodin E, Evgeniev V, et al. Accessing NCBI data using the NCBI sequence viewer and genome data viewer (GDV). Genome Res. 2021;31(1):159–69. pmid:33239395
- 30. Li H, Liu B. BioSeq-Diabolo: Biological sequence similarity analysis using Diabolo. PLoS Comput Biol. 2023;19(6):e1011214. pmid:37339155
- 31. Xu Z, Song L, Liu S, Zhang W. DeepCRBP: improved predicting function of circRNA-RBP binding sites with deep feature learning. Front Comput Sci. 2023;18(2).
- 32. Zou X, Ren L, Cai P, Zhang Y, Ding H, Deng K, et al. Accurately identifying hemagglutinin using sequence information and machine learning methods. Front Med (Lausanne). 2023;10:1281880. pmid:38020152
- 33. Wang Y, Xiao J, Suzek TO, Zhang J, Wang J, Bryant SH. PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res. 2009;37:W623–33. pmid:19498078
- 34. Huang L, Sheng N, Gao L, Wang L, Hou W, Hong J, et al. Self-supervised contrastive learning on attribute and topology graphs for predicting relationships among lncRNAs, miRNAs and diseases. IEEE J Biomed Health Inform. 2025;29(1):657–68. pmid:39316476
- 35. Wang C, Liu Y, Song S, Cao K, Liu X, Sharma G, et al. DHAG-DTA: dynamic hierarchical affinity graph model for drug-target binding affinity prediction. IEEE Trans Comput Biol Bioinform. 2025;22(2):697–709. pmid:40811402
- 36. van Laarhoven T, Nabuurs SB, Marchiori E. Gaussian interaction profile kernels for predicting drug-target interaction. Bioinformatics. 2011;27(21):3036–43. pmid:21893517
- 37. Zeng X, Xiang H, Yu L, Wang J, Li K, Nussinov R, et al. Accurate prediction of molecular properties and drug targets using a self-supervised image representation learning framework. Nat Mach Intell. 2022;4(11):1004–16.
- 38. Ai C, Yang H, Liu X, Dong R, Ding Y, Guo F. MTMol-GPT: De novo multi-target molecular generation with transformer-based generative adversarial imitation learning. PLoS Comput Biol. 2024;20(6):e1012229. pmid:38924082
- 39. Li S, Liu F, Hao Z, Jiao L, Liu X, Guo Y. MinEnt: Minimum entropy for self-supervised representation learning. Pattern Recognition. 2023;138:109364.
- 40. Niu M, Chen Y, Wang C, Zou Q, Xu L. Computational approaches for circRNA-disease association prediction: a review. Front Comput Sci. 2024;19(4).
- 41. Huang Z, Xiao Z, Ao C, Guan L, Yu L. Computational approaches for predicting drug-disease associations: a comprehensive review. Front Comput Sci. 2024;19(5).
- 42.
Yin C, Jiang T, Liu H, Luo L. Multi-channel Hypergraph Convolutional Network Predicts circRNA-Drug Sensitivity Associations. Communications in Computer and Information Science. Springer Nature Singapore; 2025. pp. 130–46. https://doi.org/10.1007/978-981-96-3752-2_9
- 43. Lu S, Liang Y, Li L, Liao S, Zou Y, Yang C, et al. Inferring circRNA-drug sensitivity associations via dual hierarchical attention networks and multiple kernel fusion. BMC Genomics. 2023;24(1):796. pmid:38129810
- 44. Li G, Li Y, Liang C, Luo J. DeepWalk-aware graph attention networks with CNN for circRNA-drug sensitivity association identification. Brief Funct Genomics. 2024;23(4):418–28. pmid:38061910
- 45. Li G, Zeng F, Luo J, Liang C, Xiao Q. MNCLCDA: predicting circRNA-drug sensitivity associations by using mixed neighbourhood information and contrastive learning. BMC Med Inform Decis Mak. 2023;23(1):291. pmid:38110886
- 46. Liu Y, Ren X, Li J, Chen X, Zhu X. Prediction of circRNA-drug sensitivity using random auto-encoders and multi-layer heterogeneous graph transformers. Appl Intell. 2024;55(4).
- 47. Musa S, Amara N, Selawi A, Wang J, Marchini C, Agbarya A, et al. Overcoming chemoresistance in cancer: the promise of crizotinib. Cancers (Basel). 2024;16(13):2479. pmid:39001541
- 48. Mendez MG, Kojima S-I, Goldman RD. Vimentin induces changes in cell shape, motility, and adhesion during the epithelial to mesenchymal transition. FASEB J. 2010;24(6):1838–51. pmid:20097873
- 49. Hwang S, Kim H-E, Min M, Raghunathan R, Panova IP, Munshi R, et al. Epigenetic silencing of SPINT2 promotes cancer cell motility via HGF-MET pathway activation in melanoma. J Invest Dermatol. 2015;135(9):2283–91. pmid:25910030
- 50. Liu X, Xu D, Liu Z, Li Y, Zhang C, Gong Y, et al. THBS1 facilitates colorectal liver metastasis through enhancing epithelial-mesenchymal transition. Clin Transl Oncol. 2020;22(10):1730–40. pmid:32052380
- 51. Shen Z-Z, Xu Q-S, Wu C-J, Feng X-D, Chen Q-M. CTTN overexpression in HNSCC inhibits Anoikis-apoptosis. Sci Rep. 2025;15(1):38018. pmid:41168318
- 52. Zheng N, Pang S, Oe T, Felix CA, Wehrli S, Blair IA. Characterization of an etoposide-glutathione conjugate derived from metabolic activation by human cytochrome p450. Curr Drug Metab. 2006;7(8):897–911. pmid:17168690
- 53. Xie X, Wu C, Dao F. scRiskCell: A single‐cell framework for quantifying pancreatic islet risk cells and unravelling their dynamic transcriptional and molecular adaptation in the progression of type 2 diabetes. iMeta. 2025;:e70060.
- 54. Muñoz MJ, Pérez Santangelo MS, Paronetto MP, de la Mata M, Pelisch F, Boireau S, et al. DNA damage regulates alternative splicing through inhibition of RNA polymerase II elongation. Cell. 2009;137(4):708–20. pmid:19450518
- 55. Zhao L, Zhang P, Galbo PM, Zhou X, Aryal S, Qiu S, et al. Transcription factor MEF2D is required for the maintenance of MLL-rearranged acute myeloid leukemia. Blood Adv. 2021;5(22):4727–40. pmid:34597364
- 56. Saha SK, Choi HY, Kim BW, Dayem AA, Yang G-M, Kim KS, et al. KRT19 directly interacts with β-catenin/RAC1 complex to regulate NUMB-dependent NOTCH signaling pathway and breast cancer properties. Oncogene. 2017;36(3):332–49. pmid:27345400
- 57. Meng H, Jiang L, Jia P, Niu R, Bu F, Zhu Y, et al. Inhibition of circular RNA ASPH reduces the proliferation and promotes the apoptosis of hepatic stellate cells in hepatic fibrosis. Biochem Pharmacol. 2023;210:115451. pmid:36758707
- 58. Shimi T, Pfleghaar K, Kojima S, Pack C-G, Solovei I, Goldman AE, et al. The A- and B-type nuclear lamin networks: microdomains involved in chromatin organization and transcription. Genes Dev. 2008;22(24):3409–21. pmid:19141474
- 59. Aktary Z, Pasdar M. Plakoglobin: role in tumorigenesis and metastasis. Int J Cell Biol. 2012;2012:189521. pmid:22481945
- 60. Okazaki K, Sandell LJ. Extracellular matrix gene regulation. Clin Orthop Relat Res. 2004;(427 Suppl):S123–8. pmid:15480054
- 61. Mayanagi T, Morita T, Hayashi K, Fukumoto K, Sobue K. Glucocorticoid receptor-mediated expression of caldesmon regulates cell migration via the reorganization of the actin cytoskeleton. J Biol Chem. 2008;283(45):31183–96. pmid:18772142