Figures
Abstract
Cancer drug combination therapies offer a promising strategy to overcome resistance and improve treatment efficacy, but identifying synergistic drug pairs is challenging due to complex biological interactions and tumor heterogeneity. Current machine learning algorithms for drug synergy prediction primarily rely on large-scale, multimodal datasets, yet suffer from critical limitations including poor interpretability, difficulty distinguishing causative biological relationships from correlations, and inadequate modeling of cancer-specific molecular interactions. To address these challenges, we propose CASynergy (Causal Attention and Cross-attention Synergy), a novel deep learning model for predicting cancer drug synergy that addresses limitations of prior approaches in accuracy and interpretability. CASynergy introduces a causal attention mechanism to distinguish true causal genomic features from spurious correlations, cell line-specific gene network construction to capture the unique molecular context of each cancer cell line, and a cross-attention module to integrate drug molecular features with cell line gene expression profiles. These improvements allow CASynergy to clearly identify significant drug-gene interactions and provides interpretable insights into why a combination is predicted to be synergistic. Experiments on two benchmark datasets (DrugCombDB and Oncology-Screen) suggests that CASynergy outperformed five state-of-the-art models. CASynergy offers a better and more reliable way to predict effective drug combinations. It works well across different cancer types and is easier to understand, which is important for personalized cancer treatment and finding new drugs.
Author summary
Cancer often requires treatment with multiple drugs simultaneously to effectively combat resistance and improve patient outcomes. However, identifying which drug combinations work best together remains challenging because of the complex and diverse nature of tumors. To address this, we developed a new computational approach called CASynergy. Unlike traditional methods, CASynergy can clearly distinguish which genetic features are truly driving drug responses from those that are merely coincidental. This capability greatly enhances the accuracy and reliability of predicting effective drug combinations. Additionally, CASynergy uses detailed biological knowledge to build unique gene interaction maps for each cancer cell type, allowing our model to capture the individual characteristics of different tumors accurately. We tested CASynergy using well-established datasets and found it consistently outperformed existing methods. Moreover, it clearly shows researchers how different drugs affect specific genes, providing valuable insights into why certain drug combinations are effective. Our method thus not only improves prediction accuracy but also helps researchers and clinicians better understand drug interactions at the molecular level. Ultimately, this can guide more personalized and effective cancer treatments.
Citation: Li H, Zheng L, Li L, Chen Y, Li J, Zheng C, et al. (2025) CASynergy: A causal attention model for interpretable prediction of cancer drug synergy. PLoS Comput Biol 21(10): e1013567. https://doi.org/10.1371/journal.pcbi.1013567
Editor: Lun Hu, Xinjiang Technical Institute of Physics and Chemistry, CHINA
Received: June 17, 2025; Accepted: September 29, 2025; Published: October 15, 2025
Copyright: © 2025 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The code used in this reasarch are publicly available and can be accessed at https://github.com/ahu-bioinf-lab/CASynergy.
Funding: This work was funded by The University Synergy Innovation Program of Anhui Prov-ince(GXXT-2022-035 to YSS); National Natural Science Foundation of China (62302007 to HTL, 62322301 to YSS, 62172002 to YSS); Anhui Provincial Natural Science Foundation 2008085QF294 to YSS; The University Outstanding Youth Re-search Project of Anhui Province (2022AH020010 to YSS); The University Synergy Innovation Program of Anhui Province (No. GXXT-2021-039 to CHZ); The Project of Key Laboratory of Intelligent Computing & Signal Processing (Anhui University), Ministry of Education(2020A005 to YSS). State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia – Anhui Uni-versity Work station Joint Fund Project, Grant No. SKL-HIDCA-2024- AH3 to HTL. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
Cancer, as one of the leading diseases threatening human health, continues to exhibit high incidence and mortality rates. Drug therapy is crucial in cancer treatment; however, monotherapy often faces challenges due to limited efficacy and drug resistance. Consequently, researchers are committed to exploring combination therapies to improve treatment outcomes, making the discovery of synergistic drug combinations a key focus in modern pharmacology and cancer therapy [1].
Traditional screening methods, such as high-throughput screening, have facilitated the discovery of novel drug combinations to some extent, but they are time- consuming, costly, and limited in identifying effective drug combinations [2]. In recent years, advances in machine learning have provided powerful alternatives to accelerate this process. Traditional machine learning methods, such as Support Vector Machines (SVM) [3]and Random Forests [4,5], rely on data including the physicochemical properties of drugs, drug-target interactions (DTI), and gene expression profiles of cell lines.While these methods perform well in predicting the effects of single drugs, their predictive capabilities are constrained by the complexity of drug combinations. Additionally, traditional methods often require manual feature engineering, making it difficult to effectively capture the hidden nonlinear relationships in high-dimensional data [6].
In contrast, deep learning models, with their ability to learn complex patterns from large datasets, have shown significant potential in predicting drug synergy, particularly in the field of anti-cancer therapies [7]. Deep learning approaches are primarily categorized into feature concatenation-based models and graph structure-based models. Feature concatenation- based models extract multimodal features, such as molecular characteristics of drugs and genomic data of cell lines, and concatenate these features into high-dimensional inputs for deep neural networks to predict synergy. For example, DeepSynergy [8] and MatchMaker [9] utilize fully connected neural networks to learn complex patterns from concatenated molecular and cell line features to predict drug synergistic effects. The advantages of these models lie in their simplicity and rapid adaptability to various data types. However, existing methods heavily rely on large-scale input data and complex feature extraction processes, which hinder their ability to effectively identify key causal features essential for drug synergy prediction. Furthermore, these methods often use simple concatenation or independently process drug features and gene expression data, limiting their capability to capture the subtle yet critical interactions between drugs and cell lines. Moreover, these models are often perceived as “black boxes,” lacking interpretability, which limits their clinical application and trustworthiness.
To better model the complex interactions among drugs, proteins, and cell lines, researchers have proposed graph structure-based models. These models use Graph Neural Networks (GNNs) [10] or knowledge graph embeddings [11] to integrate drug molecular structures, target information, and the biological characteristics of cell lines into graphs, thereby capturing higher-order interaction information. For instance, GraphSynergy [12] constructs a drug-protein- cell interaction network and employs Graph Convolutional Networks (GCNs) to capture network topology information, thereby enhancing the accuracy of drug synergy predictions. Similarly, KGANSynergy [13] predicts drug synergy by leveraging knowledge graphs and attention mechanisms to analyze relationships among drugs, targets, and cell lines, using entity embeddings and phenotypic features to represent drug-drug-cell triplets. HypergraphSynergy [14] casts drug–drug–cell line triplets as hyperedges and enhances generalization via an auxiliary similarity-network reconstruction task, achieving strong results and applicability to unseen combinations/cell lines. SDCNet [15] leverages a relational graph convolutional network with attention to learn both common and cell line-specific patterns within a unified model and reports robustness to new cell lines. While these methods demonstrate excellent predictive accuracy and biological interpretability, they fail to adequately consider disease-specific features, which may limit their predictive performance across diverse datasets and in personalized medicine.
In summary, current cancer drug combination synergy algorithms have the following limitations: first, existing deep learning models depend heavily on large amounts of input data and complex feature learning, making it difficult to effectively identify and utilize key causal features, and are viewed as “black boxes” lacking interpretability. Second, in cancer treatment, drug effects vary significantly across different cell lines or cancer types, and traditional models using a unified framework for prediction cannot effectively capture these molecular mechanism differences. Third, existing studies have not sufficiently integrated drug combination features with cell line features, failing to comprehensively reflect the impact and relationship of drug combinations on cell lines.
To address these limitations, we propose the Causal Attention and Cross-attention Synergy (CASynergy) model. CASynergy constructs cell line-specific networks and incorporates causal inference theory to achieve more stable, accurate, and interpretable predictions of synergistic drug combinations across different cancer types and datasets. The main innovations of the CASynergy model include:
- A novel causal attention mechanism significantly enhances drug combination prediction by explicitly distinguishing causal features from non-causal noise, allowing the model to accurately capture crucial causal patterns. Unlike traditional correlation-based methods, this approach effectively identifies and mitigates confounding factors within causal pathways, thereby greatly improving the model’s generalization capability on out-of-distribution (OOD) datasets and ensuring robust predictive performance in diverse clinical scenarios [16].
- By integrating cell line-specific feature topology networks and a sophisticated cross-attention mechanism, CASynergy precisely captures molecular heterogeneity and subtle interactions between drugs and cell lines. The cell line-specific networks leverage prior biological knowledge to construct unique gene association structures for each cell line, facilitating personalized modeling of drug synergy. Concurrently, the cross-attention mechanism seamlessly merges multiple data modalities, such as drug molecular features and gene expression profiles, significantly enhancing the model’s granularity and prediction accuracy.
- Experimental results on the DrugCombDB and Oncology-Screen datasets demonstrate that the CASynergy model outperforms existing advanced algorithms in terms of accuracy, robustness, and interpretability. Particularly, CASynergy has excellent interpretability, allowing clear explanations of how specific drug combinations affect gene expression profiles. This helps researchers understand the biological mechanisms behind drug synergy predictions and provides insights for clinical decisions and personalized cancer treatments.
2 Results
To improve the accuracy and interpretability of drug combination synergy prediction, we propose an algorithmic framework called Causal Attention and Cross-attention Synergy (CASynergy), as shown in Fig 1. The CASynergy algorithm integrates the molecular features of drug combinations with disease cell line-specific features to predict the synergy score of the drug combination for cancer treatment targeting the specific cell line. This approach enables the identification of drug combinations that are specifically effective for treating a particular disease. CASynergy consists of three modules: 1) the cell line-specific gene network extraction module, 2) the drug and cell line feature extraction and fusion module, and 3) the drug combination synergy prediction module based on causal attention learning.
(a) Cell line-specific gene network extraction module: Using the expression data of key genes in the cell line and the PPI network
, a disease-specific topological network is constructed; (b) Drug and cell line feature extraction and fusion module: The fusion of drug combination features and disease cell line features forms a semantic network; (c) Drug combination synergy prediction module based on causal attention learning: By combining the topological and semantic networks, and utilizing a causal inference-based attention mechanism, the synergy of drug combinations is predicted.
2.1 Method comparison
To validate the effectiveness of CASynergy, we compared it with several mainstream drug synergy prediction methods: Random Forest, DeepSynergy [8], DeepDDS [17], GraphSynergy [12],and KGANSynergy [13].
Random Forest: A traditional machine learning method that utilizes drug molecular fingerprints and cell line gene expression information as inputs to predict drug synergy through an ensemble of decision trees (Random Forest).
DeepSynergy [8]: A deep learning-based drug synergy prediction model that employs neural network architectures such as convolutional layers to extract features from drug chemical structures and genetic information, modeling synergistic effects.
DeepDDS [18]: A deep model combining graph neural networks and attention mechanisms to predict drug synergy by aggregating molecular graphs and gene expression information, enabling high-level integration of molecular structures and cell line features.
GraphSynergy [12]: Utilizes spatial graph convolutional networks and attention mechanisms to encode high-order structural information of drug and cell line protein modules, thereby enriching entity embedding representations.
KGANSynergy [13]: Leverages knowledge graph structures and multi-head attention mechanisms to integrate multi-source neighbor node information during hierarchical propagation, enhancing the expressive power for drug synergy prediction.
These methods each have unique characteristics in drug synergy prediction, ranging from traditional machine learning to deep integrations of graph neural networks and knowledge graphs, providing diverse approaches for modeling synergistic interactions between drug combinations and cell lines. Tables 1 and 2 present the comparative experimental results on the DrugCombDB (Fig 2) and Oncology-Screen (Fig 3) datasets, respectively.
From Table 1, it is evident that CASynergy achieved the highest AUC (0.8388) and AUPR (0.8196) on the DrugCombDB dataset, showing significant improvements over other methods. This indicates that CASynergy possesses higher predictive stability in distinguishing between positive and negative samples and balancing precision and recall. The F1 score (0.7451) and ACC (0.7548) also reached the highest levels. Overall, CASynergy maintains stable performance on large-scale and highly heterogeneous drug synergy data, demonstrating its strong generalization capability in integrating molecular graph structures with cell line-specific networks.
As shown in Table 2, on the smaller-scale Oncology-Screen dataset, CASynergy also outperformed other comparison methods with an AUC of 0.8740 and an AUPR of 0.8883, particularly showing an approximate 0.3% improvement in AUC, AUPR, and ACC compared to the latest KGANSynergy. Although both methods share similar concepts in leveraging knowledge graphs and multi-source information integration, CASynergy further enhances prediction accuracy and stability by employing a causal attention mechanism to more finely decouple and intervene on cell line-specific networks and molecular features. CASynergy also demonstrated notable advantages in the F1 score (0.8286) and ACC (0.8072), which measure the model’s comprehensive discrimination ability and overall prediction accuracy.
2.2 Performance evaluation by cold-start task
In practical cancer drug combination prediction, many new drug combinations are not sufficiently recorded in the existing datasets. Moreover, the heterogeneity of cancer causes significant differences in the responses of different cell lines to drugs, which poses challenges to model training and prediction. To validate the role of the CASynergy algorithm in actual drug combination prediction, we designed and conducted three cold-start experimental scenarios:
Scenario 1: Drug Cold-Start Task In this scenario, we randomly selected 80% of the drugs in the dataset as known drugs and used drug combinations composed of these known drugs as the training set. The remaining 20% of the drug combinations served as the test set to evaluate the model’s prediction ability when facing new drug combinations that have not appeared in the training set. This experiment was conducted five times.
Scenario 2: Drug Combination Cold-Start Task. This evaluation randomly samples 10% of drug combinations. For each sampled combination, we remove both constituent drugs from the entire training set (i.e., no training instance contains either drug), retrain the model on the remaining data, and evaluate only on that held-out combination. We repeat this procedure for all sampled combinations and report mean ± standard deviation across runs. This protocol rigorously assesses the model’s ability to generalize to genuinely unseen drug pairs.
Scenario 3: Cell Line Cold-Start Task In this scenario, the cell lines in the test set did not appear in the training set. The experimental setup was as follows: we selected 80% of the cell lines in the dataset as the training set, and the remaining 20% of the cell lines as the validation set. This experiment was also conducted five times.
We used the Oncology-Screen dataset for the experiments and compared CASynergy with several of the latest algorithms. The experimental results under the two cold-start scenarios are shown in Table 3.
In the drug cold-start task experiment, CASynergy significantly outperforms other algorithms in both AUC (0.7512) and AUPR (0.7154), especially compared to DeepSynergy (AUC: 0.673, AUPR: 0.5926). The AUC of CASynergy is about 11. 6% higher and the AUPR is about 20.7% higher. CASynergy shows greater stability and reliability. In the experiment of the cell line cold start task, CASynergy also performs excellently, with an AUC value of 0.75, which is approximately 14.8% higher than DeepSynergy’s 0.6528. The AUPR value of CASynergy is 0.753, about 14.8% higher than DeepSynergy’s 0.6561. In terms of ACC, CASynergy’s performance leads with an ACC of 0.7559, which is about 22% higher than DeepSynergy, demonstrating its ability to provide more accurate drug response predictions when handling cell line cold starts. In the drug combination cold-start task experiment, CASynergy outperforms the baselines across all metrics. It attains AUC 0.6732 and AUPR 0.6745. More importantly for classification, CASynergy achieves F1 0.7512 and ACC 0.7034, substantially exceeding DeepSynergy (0.6345/0.5983) and DeepDDS (0.6112/0.5474). These results show that CASynergy maintains competitive ranking performance while delivering significantly decision quality under the most demanding cold-start condition, underscoring its robustness and practical utility for predicting synergy of entirely unseen drug combinations.
In summary, the effectiveness of CASynergy in drug synergy prediction tasks has been thoroughly validated. It outperforms other latest comparative algorithms, particularly in cold-start tasks, showing significant improvements in AUC and AUPR. CASynergy demonstrates excellent adaptability under different data scales and feature distribution conditions, and its strong generalization capability allows it to effectively handle unseen drug combinations and cell lines. These results indicate that CASynergy provides more accurate and reliable predictions for drug combinations in real-world applications, showcasing its greater potential for both application and clinical value.
2.3 Ablation study
To investigate the influence of different modules on the performance of the CASynergy algorithm, we conducted an ablation study focusing on key components. Specifically, we constructed the following four model variants for comparison against the full CASynergy model:
- w/o spp: Does not use the cell-line-specific PPI network (spp). Instead, it directly adopts the original PPI edges as the cell line graph structure.
- w/o CA fusion: Does not employ the Cross-Attention fusion (CA fusion) mechanism to fuse the drug combination and cell line features; rather, it simply concatenates the two feature sets and feeds the result into downstream modules.
- CASynergy-BLA: cross-modal interactions are modeled via a bilinear scoring function
- w/o cal: Does not use causal attention learning (cal) to decouple the cell line graph structure; instead, it directly utilizes GCNs to obtain cell line embedding vectors for prediction.
As shown in Table 4 and Fig 4, the cell-line-specific PPI network and the causal decoupling layer (cal) play crucial roles in the model. Removing the cell-line-specific PPI network leads to decreases in AUC, AUPR, and F1 (e.g., AUC drops from 0.8740 to 0.8616), suggesting that the cell-line-specific PPI network captures more pertinent protein and gene information for each cell line, thereby enhancing predictive accuracy. In contrast, employing only the raw PPI edges fails to adequately capture the unique gene interaction patterns of each cell line, resulting in diminished performance.
Eliminating the cross-attention module causes the AUC to drop from 0.8740 to 0.8689 and AUPR to drop from 0.8883 to 0.8826, demonstrating that cross-attention allows for deeper integration of drug and cell line features when modeling differences in cell line targets under distinct drugs. Although this decrease is less pronounced than with other modules, the results still reveal the critical role cross-attention plays in further boosting the model’s performance.
Finally, omitting causal attention learning lowers the AUC from 0.8740 to 0.8645, indicating that causal attention learning helps identify genes that exhibit a true causal relationship with the labels while suppressing purely correlational genes or noise, thereby enhancing model robustness and interpretability.
Overall, the ablation study highlights the pivotal contributions of the cell-line-specific network, cross-attention mechanism, and causal attention learning to CASynergy. Only by jointly employing these components can one maximize drug synergy prediction performance and achieve high model interpretability.
2.4 Interpretability analysis
In practical applications, understanding how synergistic drug combinations effectively treat specific diseases generally involves revealing a crucial question: through which critical genes or biological processes do drug combinations exert their therapeutic effects? Addressing this question is central to predicting drug synergy and is a critical step in studying drug therapeutic mechanisms. To better address this question and provide clear interpretability analysis, the CASynergy algorithm integrates causal inference mechanisms and attention mechanisms, effectively identifying the specific impact of drug combinations on gene expression in cell lines, thus providing intuitive and reliable biological explanations.
CASynergy utilizes causal inference methods to analyze the effect of drug combinations on gene expression in cell lines. One of the core interpretability tools provided by CASynergy is the causal subgraph soft mask matrix . By analyzing this matrix, we can clearly identify key genes whose expression levels are significantly altered by drug combinations in cell lines and subsequently discern potential biological mechanisms based on the analysis of these critical genes.
To validate the interpretability of the CASynergy algorithm, this study identifies key therapeutic genes by analyzing the attention score soft mask matrix generated by CASynergy. Firstly, the matrix
of a specific synergistic drug combination was extracted from a trained CASynergy model. Genes ranking in the top 25% based on attention scores were defined as key genes. Then, the cumulative number of key genes across all synergistic drug combinations for the given cell line was calculated. Subsequently, the top 20% of genes from this cumulative distribution were selected as critical genes associated with synergistic drug combinations for that cell line. A KEGG biological pathway enrichment analysis was then performed to identify biological pathways associated with these key genes.
In this study, we exemplify this approach using the KPL-1 (human breast cancer cell line), NCI-H520 (human lung squamous cell line), and ES2 (human ovarian clear cell carcinoma cells). Our experiments demonstrate that the cumulative number of key genes in the cell lines follows a power-law distribution (as shown in Fig S1), indicating that changes in gene expression induced by drug combinations predominantly concentrate on a few critical genes that decisively influence therapeutic outcomes. Subsequently, we performed interpretability analysis using Enrichr (https://maayanlab.cloud/Enrichr/) [19], selecting pathways with adjusted p-values less than 0.1 as critical pathways for enrichment analysis; results are depicted in Fig 5.
(a) Breast cancer cell line KPL-1. (b) Lung squamous cell carcinoma cell line NCI-H520. (c) Ovarian clear cell carcinoma cell line ES2.
For the breast cancer cell line KPL-1 (as shown in Fig 5(a)), apoptosis is a crucial process in cancer therapy. Drug combinations inducing apoptosis can effectively eliminate cancer cells by activating intrinsic and extrinsic apoptotic pathways. In breast cancer contexts, drugs that promote apoptosis may inhibit tumor growth by overcoming resistance to cell death. Research has shown that combining ruxolitinib and MK-2206 can regulate apoptotic pathways by dual inhibition of JAK2/STAT5 and PI3K/AKT signaling pathways, thus treating breast cancer [20]. Central carbon metabolism is essential for cancer cell metabolic reprogramming, involving key metabolic pathways such as glycolysis, TCA cycle, and pentose phosphate pathway. These pathways provide energy and biosynthetic precursors critical for rapid cancer cell proliferation and survival. Metabolic reprogramming is closely related to resistance to chemotherapy and radiotherapy.
Targeting these metabolic pathways can improve therapeutic effectiveness and overcome drug resistance [21]. In vitro studies demonstrated promising effects of metabolic inhibitors combined with chemotherapeutics. For example, combinations of CB-839 (a glutaminase inhibitor) and Oxamate (a lactate dehydrogenase inhibitor), as well as the triple combination of CB-839/Oxamate/D609 (phosphatidylcholine- specific phospholipase C inhibitor), induced significant cell death in breast cancer cell lines. These inhibitors also enhanced the efficacy of doxorubicin, potentially reducing patient exposure and cardiotoxicity while maintaining therapeutic effects [18]. The p53 tumor suppressor gene is pivotal in regulating the cell cycle and apoptosis in response to DNA damage. Loss of p53 function, common in cancers, contributes to tumorigenesis. Certain drug combinations can activate the p53 pathway, enhancing apoptotic responses and effectively inhibiting breast cancer cell growth [22]. For instance, combining Renieramycin M with doxorubicin upregulates p53 protein expression, promotes activation of downstream target genes, enhances cellular responses to DNA damage, and ultimately induces apoptosis. This synergy increases treatment sensitivity in breast cancer cells, offering a potential therapeutic strategy [23].
For lung squamous carcinoma (LUSC) cell line NCI- H520 (Fig 5(b)), numerous studies demonstrate enhanced therapeutic efficacy by drug combinations targeting the MAPK signaling pathway. Such combinations typically simultaneously target different nodes of the MAPK pathway, including upstream receptor tyrosine kinases (such as EGFR) and downstream kinases (such as MEK), achieving comprehensive tumor growth inhibition. For instance, in non-small cell lung cancer (NSCLC) models harboring MET exon 14 skipping mutations, coexisting KRAS activation or NF1 loss can hyperactivate RAS-MAPK signaling, limiting MET inhibitor efficacy. However, combining MET inhibitors (e.g., crizotinib) with MEK inhibitors (trametinib) can concurrently block reactivation of this pathway, thus overcoming resistance. LUSC patients often experience bone metastases, and osteoclast- mediated bone destruction can accelerate tumor progression. Studies indicate that targeting osteoclast-related pathways improves LUSC treatment outcomes. Denosumab, which blocks the RANK/RANKL pathway, inhibits osteoclast activity and bone destruction. Clinically, adding Denosumab to treatment significantly improves overall survival compared to traditional bisphosphonates [24]. Additionally, RANKL inhibition exhibits immune-regulatory effects. Preclinical studies by Ahern et al. demonstrated that combining anti-RANKL agents with anti- PD-1 immune checkpoint inhibitors increases tumor-infiltrating T cells and antitumor cytokines such as IFN-γ, significantly reducing tumor burden. In advanced NSCLC patients with bone metastases, combining Denosumab with PD-1/PD-L1 inhibitors improved response rates and progression-free survival without significant additional toxicity [25].
In ovarian cancer cell line ES2 (Fig 5(c)), cellular senescence and cell cycle pathways are critical targets of drug combinations. Classical chemotherapy drugs such as platinum agents and paclitaxel induce therapy-induced senescence (TIS). Studies in high-grade serous ovarian cancer mouse models with homologous recombination deficiency (HR-deficient, e.g., BRCA mutation) show that platinum chemotherapy efficiently induces cellular senescence [26]. These senescent cells activate the cGAS/STING pathway, inducing a limited senescence-associated secretory phenotype (SASP), recruiting immune cells, and enhancing sensitivity to immune checkpoint inhibitors. Tumors prone to senescence show better responses when chemotherapy is combined with immunotherapy (PD-1/PD-L1 antibodies). Additionally, topoisomerase inhibitors (such as doxorubicin, etoposide) induce senescence; subsequently adding the BCL-2/BCL- XL inhibitor ABT-263 (Navitoclax) selectively eliminates these cells, substantially reducing tumor volume [27]. This mechanism presents a potentially effective drug combination strategy for treating ovarian cancer.
Through the above experimental analyses, we demonstrated the interpretability of the CASynergy algorithm in predicting drug combination synergistic effects. Firstly, causal inference and attention mechanisms effectively identified the critical biological processes impacted by drug combinations in cell lines, elucidating molecular mechanisms underlying synergistic actions. Secondly, by utilizing the soft mask matrix , we provided a deep understanding of how drug combinations achieve synergistic therapeutic effects through specific biological processes. This enhances model transparency, provides a theoretical foundation for subsequent experimental validation, and facilitates practical applications of the model.
2.5 Case study
To verify the adaptability and practical potential of the CASynergy model, we conducted predictions for novel drug combinations on the Oncology-Screen dataset. Specifically, we first constructed candidate drug combinations using the drug and cell line data from Oncology-Screen; then, the CASynergy model evaluated the synergy scores of these combinations in designated cell lines. We selected the top 100 drug combinations with the highest predicted synergy scores for in-depth analysis and found 14 synergy combinations supported by published literature (see Table 5).
For instance, gemcitabine suppresses NF-κB activation induced by vorinostat and concurrently inhibits chromatin remodeling mediated by histone deacetylases, prompting apoptosis in NSCLC [28]. Paclitaxel, administered alone or in combination with other anticancer agents—particularly in HER2-negative breast cancer cells—exerts enhanced cytotoxicity through lapatinib-induced inhibition of ABC transporters, effectively treating both early and metastatic breast cancer [29,30]. Sorafenib, a multi-target kinase inhibitor, mainly acts on the RAF kinase pathway and vascular endothelial growth factor receptors; tanespimycin, as an Hsp90 inhibitor, affects key signaling proteins such as RAF-1. Their synergy arises from acting on distinct but interrelated signaling pathways, inhibiting tumor cell proliferation and angiogenesis, thereby providing a potentially effective therapeutic strategy [31]. Vorinostat(Zolinza) exerts its biological effects by inhibiting histone deacetylases, whereas bortezomib inhibits proteasome formation, affecting the cell cycle and apoptosis. Their combined application in colon cancer cells demonstrates a synergistic effect, significantly enhancing the inhibition of cell proliferation, underscoring the superior efficacy of their joint use [32].
These cases not only underscore the high predictive accuracy of CASynergy in determining synergistic drug effects but also highlight its capability in identifying biologically plausible drug combinations.
3 Discussion
In this study, we developed CASynergy, a computational model that significantly improves the prediction of synergistic cancer drug combinations. The model’s key contributions include the integration of causal inference and attention mechanisms to focus on truly influential molecular features, and the use of cell line-specific gene network topology to account for biological differences between cancer types. By combining causal attention (to filter out confounding signals) and cross- attention (to capture detailed interactions between drug properties and gene expression), CASynergy addresses major challenges in the field. These innovations enable CASynergy to deliver more stable and accurate synergy predictions across different datasets and conditions. Our experimental results demonstrate that CASynergy achieves superior performance on large-scale heterogeneous data, maintaining high predictive accuracy and generalization where previous models struggled. Notably, CASynergy attained the highest AUC and AUPR scores on both the DrugCombDB and Oncology-Screen benchmarks, reflecting its strong overall discrimination ability and balanced precision–recall performance. These findings confirm that the proposed model not only advances prediction accuracy but also offers greater interpretability and robustness, thereby contributing a valuable tool for personalized cancer therapy research. Compared to existing drug synergy prediction approaches, CASynergy offers marked improvements in accuracy, robustness, and interpretability.
While effective to an extent, such feature-concatenation models (including DeepSynergy and DeepDDS) often treat drug–gene interactions as a “black box,” making it difficult to discern which biological factors drive synergy. In contrast, CASynergy’s causal attention mechanism allows it to identify and weight only the truly causal genomic features for synergy, improving interpretability and out-of-distribution generalization.
Graph-based methods (including GraphSynergy and KGANSynergy) represent another advance, using biological interaction networks to inform predictions. These approaches improve prediction accuracy and provide some biological insights. Nonetheless, they do not fully incorporate disease- specific molecular contexts; for example, they treat all cell lines in a unified network without tailoring to individual genetic backgrounds. CASynergy distinguishes itself by constructing cell line-specific gene networks, effectively personalizing the interaction topology to each cancer type or cell line. This design captures critical differences in molecular pathways between cell lines, which prior models might overlook. As a result, CASynergy demonstrated stronger performance and stability across diverse contexts. It outperformed GraphSynergy and KGANSynergy on both benchmarks, with particularly notable gains on the Oncology-Screen dataset (improving AUC, AUPR, and accuracy by 0.3% over the best prior model, KGANSynergy).
Furthermore, in challenging cold-start scenarios where the model must predict synergy for new drugs or new cell lines not seen in training, CASynergy maintained high accuracy. CASynergy integrates advantages of previous deep learning and graph-based methods. It overcomes their limitations, providing a model with improved accuracy, better reliability on new data, and clearer interpretability.
In real-world settings, missing values in cell-line transcriptomic profiles commonly arise from assay dropouts and differences between platforms. Specifically, we randomly masked 10%, 20%, or 30% of genes in each cell line and used imputers the MICE package in R to perform mean imputation on the gene expression matrices. The imputed matrices were then evaluated with five-times five-fold cross-validation using the same training protocol as the main experiments (as shown in Table S4). CASynergy shows minimal performance degradation as the masking ratio increases from 0% to 30%. To further and more systematically examine how performance scales with data volume, we added a controlled data-efficiency study on Oncology-Screen. We created stratified training subsets containing 30%, 50%, and 70% of the data and kept the evaluation protocol identical to the main experiments (Table S5). This high robustness is likely due to the model’s causal inference framework. The model’s causal attention focuses on stable drug-gene interactions that exist across all cell lines, ignoring random associations that can be easily lost. Guided by the structure of cell-line-specific networks, its cross-modal attention mechanism aligns and combines strong signals within the same biological pathway, which helps to remove noise and fill in missing information. Furthermore, the causal design makes the model focus on key causal factors and uses redundant information from multiple sources (drug structure and gene expression), making it less sensitive to errors.
Despite its advancements,CASynergy has certain limitations that should be acknowledged. First, the model’s effectiveness relies on the availability of high-quality structured omics data and prior biological knowledge for each cell line. CASynergy uses predefined cell line-specific gene networks (e.g., protein–protein interaction or gene co-expression networks) to inform its feature topology. This reliance means that for cancer types or cell lines where such network data are incomplete or unavailable, the model’s performance may be impacted. Moreover, CASynergy currently focuses on transcriptomic gene expression and known gene interactions; other molecular factors (such as genetic mutations, epigenetic modifications, or proteomic changes) are not explicitly integrated, which may limit the model’s ability to capture all determinants of drug response. These limitations suggest that, although CASynergy represents a meaningful step forward, there is room for further refinement to enhance its general applicability and ease of use.
In the future, the CASynergy algorithm can be improved and expanded through the following aspects. An important future direction is the integration of multi-omics data to provide a more comprehensive view of tumor biology. Incorporating genomic mutations, copy number variations, proteomic and metabolomic profiles, or epigenetic markers alongside gene expression could allow the model to learn a richer representation of each cell line’s state. To assess the practical feasibility of this direction in our current setting, we performed a controlled multi-omics augmentation on the Oncology-Screen cohort. We matched CCLE mutation and copy-number (CNV) profiles to the same cell lines and compared four feature configurations for CASynergy: (i) gene expression only; (ii) expression + mutation; (iii) expression + CNV; and (iv) expression + mutation + CNV. All variants were capacity-matched and trained under identical protocols with five-times five-fold cross-validation. Expression-only already matched or slightly outperformed the multi-omics variants on AUC/AUPR/F1 (AUC 0.8740 ± 0.004, AUPR 0.8883 ± 0.006), while the triple-omics setting produced only a small ACC uptick (0.8098 ± 0.006) without consistent gains elsewhere (see S6 Table). These findings suggest that, at the current dataset scale and without pathway-level integration, most synergy-relevant variation is already encoded in transcriptomics, whereas naïvely concatenating mutation/CNV features may add redundancy and noise. Going forward, we will investigate principled integration strategies—e.g., pathway/module aggregation, shared–private factorization, and causal multi-view training on harmonized profiles—to more effectively exploit complementary omics in larger cohorts. Another important future direction is to expand the scope of CASynergy to handle a broader range of drug types and combination strategies. This could involve predicting synergies that include not only small-molecule drugs but also biologics (such as therapeutic antibodies), immunotherapies, or even combinations of more than two agents. Adapting the model to these more complex or diverse therapy regimens would increase its relevance to real-world cancer treatment scenarios. Moreover, developing simplified versions of CASynergy or user- friendly tools that highlight the key pathways and gene targets driving a predicted synergy can help oncologists and researchers trust and effectively use the model’s recommendations. In summary, the CASynergy approach can be further evolved into a powerful decision-support system for combination cancer therapy discovery and optimization.
4 Materials and methods
4.1 Datasets
This study systematically prepared datasets by constructing drug synergy datasets and cell line-related datasets to comprehensively evaluate the performance of CASynergy in predicting drug synergy under various cell line conditions.
Drug synergy datasets To evaluate the performance of CASynergy in the task of drug synergy prediction, we selected two publicly available datasets of varying scales and sources: DrugCombDB and Oncology- Screen (see Table 6). These datasets include experimental results such as drug combinations and their synergy scores, effectively demonstrating CASynergy’s adaptability under different scales and conditions.
DrugCombDB [33]: DrugCombDB is a comprehensive drug combination dataset sourced from high-throughput screening experiments, multiple external databases, and relevant literature on PubMed. The DrugCombDB dataset used in this study comprises 69,436 drug combinations and their synergy scores, covering 764 drugs and 76 cancer cell lines.
Oncology-Screen [34]: The Oncology-Screen dataset originates from a large-scale tumor screening conducted by O’Neil et al., encompassing 4,176 drug combinations and their synergy scores, involving 21 drugs and 29 cancer cell lines.
In the experiments, drug combinations with synergy scores greater than 0 are considered positive samples with synergistic effects, while those with scores less than 0 are regarded as negative samples without synergistic effects. This classification facilitates the construction of a binary classification task for synergy prediction.
Cell line-related datasets To further explore the cell line-specific network information and gene expression differences in cancer cell lines, this study incorporates additional cell line-related data based on the drug synergy data. This integration aids CASynergy in better understanding and modeling the molecular-level differences among cell lines.
Protein-Protein Interaction (PPI) Networks: We utilized the ‘Human Protein-Protein Interaction (PPI) Network’ integrated by Cheng et al. [35], which consolidates interactions from 15 experimentally validated common databases. This network comprises 217,160 interactions among 15,790 protein nodes, providing the topological foundation at the gene-protein interaction level for this study.
Cell Line-Protein Relationships: Leveraging the cell line-protein relationship data collected by Yang et al. [12] from the CCLE [36] database, we mapped 18,022 genes to their corresponding proteins and associated them with 1,035 cancer cell lines. This mapping process facilitates the precise identification of active or key gene-protein interactions within each cell line, thereby establishing a data foundation for constructing cell line-specific gene networks.
Cell Line Gene Expression Data: Based on the cell line-protein relationships, we selected protein-coding genes relevant to the target cell lines and removed isolated gene nodes in the constructed specific PPI networks. The DrugCombDB dataset utilizes 7,217 genes, while the Oncology-Screen dataset employs 2,865 genes, with a certain degree of overlap between these gene sets. The gene expression data primarily originate from the CCLE and GEO databases (see S1 Table), and were standardized prior to use.
By integrating the aforementioned drug synergy data with cell line-related data, this study accurately characterizes the potential synergistic responses of different cell lines to drug combinations from multiple perspectives, including gene expression and cell line-specific network topology. This comprehensive data support underpins the construction of cell line-specific models within CASynergy.
4.2 Cell line-specific gene network extraction module
To better reveal the underlying molecular mechanisms of disease onset and progression, this study constructs cell line specific feature topological networks based on protein-protein interaction (PPI) relationships specific to each cell line (Fig 1(a)). Research has shown that although cancer cell lines of the same type may exhibit similarities in certain aspects, their protein interaction networks differ significantly. Therefore, mining specific gene network modules can provide a more effective basis for drug combination prediction.
Given a protein-protein interaction network (PPI) , where
represents the network nodes (proteins), and
represents the edges in the network, with an edge value of
if there is an interaction between two proteins, and
otherwise. For cell line
, the key target proteins
of the cell line are first identified based on the protein-cell line interaction relationship. These key proteins are used as seed nodes in the PPI network. Next, a connected subgraph with minimal total edge count is constructed to generate the cell line-specific PPI network structure by introducing additional protein nodes (i.e., intermediate nodes). Specifically, a heuristic method based on the shortest path is used to approximate the construction of the connected specific PPI network structure
. The Dijkstra algorithm is employed to calculate the shortest paths between seed nodes. Then, intermediate nodes (non-seed nodes) from these shortest paths are gradually introduced to connect unconnected sub-networks. In this process, a greedy strategy is adopted, prioritizing the connection with the shortest total path length to minimize the total number of edges in the network. Through iterative optimization, the network connectivity is progressively improved, ultimately generating a connected, specific PPI network
for the cell line
. This method is highly efficient when handling large-scale PPI networks and effectively preserves the key topological features of the network.
4.3 Drug and cell line feature extraction and fusion module
In order to investigate how drugs affect the expression of key protein-coding genes in cell lines, this study analyzes relevant features within the drug and cell line feature extraction and fusion module (Fig 1(b)). By examining the impact of drugs on gene expression in cell lines, it is possible to more accurately identify key genes jointly influenced by drug combinations at the molecular level, thereby improving the accuracy of synergy prediction and laying a foundation for interpretable analyses of drug mechanisms.
Specifically, based on the previously described cell-line-specific gene network extraction module, we first consider the key topological features of cell line , denoted by
, and their corresponding gene expression features
. We then construct a cell-line-specific key gene expression network
, where
represents the expression data of the key genes in the network,
denotes the number of genes in the cell line, and
refers to the key topological features of cell line
.
Next, for the extraction of drug features, we use the molecular structural information of drugs. The molecular structure of a drug is transformed from its SMILES string into a molecular graph. Here,
denotes the feature matrix of all atoms within the molecular graph, where
is the number of atoms, and
represents the feature dimension of each atom. The adjacency matrix
encodes the chemical bond relationships between atoms. We employ a Graph Convolutional Network (GCN) to process this molecular graph. The operation of the
-th GCN layer can be defined as:
where represents the hidden feature matrix at layer
,
is the normalized adjacency matrix, and
is the trainable weight matrix of the
-th GCN layer.
Among them, , and
denotes the learnable weight matrix of the
-th layer. The matrix
is the normalized adjacency matrix, where
is the diagonal degree matrix of
,
is the identity matrix, and
represents the ReLU activation function. After
layers of GCN processing, the feature matrix of the drug molecular graph is
, where
is the hidden layer dimension. Because different drug molecular graphs may have different numbers of atoms, we apply a global max pooling operation to
so as to obtain drug feature vectors of uniform dimension. Specifically, we derive the global features
for drugs
and
, respectively. For a drug combination
, its feature representation is given by the sum of the two global drug features:
where and
are the feature vectors of drugs
and
, respectively, and
denotes the feature representation of the drug combination.
The feature representation of cell line is derived from the constructed cell-line-specific gene expression graph
. Similar to processing a drug molecular graph, the
-th GCN layer for the cell line graph is defined as:
where , and
is the normalized adjacency matrix. Here,
represents the topological structure of the cell line’s network, and
is the diagonal degree matrix of
. After
GCN layers, the resulting feature matrix of cell line
is
.
When a drug acts on a cell line, it usually affects target proteins, signaling pathways, and other biological processes, thereby inducing therapeutic or other biological effects. Consequently, when two drugs act jointly on a cell line, they necessarily influence gene expression. To capture this effect, we introduce a cross-attention mechanism to fuse the features of the drug combination with those of the cell line, thereby modeling the influence of the drug combination on cell-line gene expression.
In this study, the transposed drug combination feature serves as the key vector
, while the cell-line gene expression features
act as the query vector
and value vector
. Here,
denotes the number of genes in the cell line, and h is the dimension of the gene feature representation. By applying trainable weight matrices
,
, and
, we obtain the following feature transformations:
with and
, where
is the hidden-layer dimension. The attention weight matrix
is computed by
where denotes the dot product. To maintain a consistent hidden-layer dimension, we use a linear layer to reduce the dimension of
to
. According to Eq. (5), we then obtain the cell-line gene expression feature
, which incorporates drug-combination effects:
where denotes the Hadamard product. Additionally, to mitigate the vanishing-gradient problem, we employ a residual strategy by concatenating
and
. This yields a cell-line feature representation
that accounts for drug-combination effects:
where . Hence,
. Finally, we apply global average pooling followed by a transpose operation to obtain the cell-line feature representation
. Thus, the fused network capturing the influence of the drug combination on gene expression in cell line
is given by
4.4 Causal attention-based module for drug combination synergy prediction
In order to accurately identify cell-line-specific synergistic drug combinations and provide interpretable analyses of the underlying molecular mechanisms, this study proposes a graph decoupling method based on a causal attention mechanism (Fig 1(c)). Through this method, the fused network , which encapsulates the effects of drugs on cell-line gene expression, is decoupled into a causal subgraph and a shortcut subgraph. The causal subgraph represents the true synergistic interaction of the drug combination on the cell line, while the shortcut subgraph includes confounding factors (i.e., “shortcut features”) associated with the prediction. The benefit of decoupling for synergy prediction is that, by separating the true synergistic signals (causal features) from potential confounding factors (shortcut features), the model can focus on causal relationships that are crucial to synergy, thereby performing more stable and accurate synergy predictions.
Soft mask estimation. To effectively decouple the causal subgraph and the shortcut subgraph, we employ an attention mechanism and introduce soft masks for both nodes and edges. For a given fused network of drug combination and cell-line gene expression features, we construct two multilayer perceptrons (MLPs) to assess attention scores separately from the perspectives of nodes and edges. Equations (9) and (10) respectively provide the causal and shortcut attention scores for node
and for the edge
:
where is the softmax function,
is the feature vector of node
, and
denotes the concatenation operator; hence,
is the feature vector corresponding to the edge
. The causal and shortcut attention scores satisfy
and
, indicating that the attention scores for the causal subgraph and the shortcut subgraph are complementary in this study.
Based on the calculated node and edge attention scores ,
, we construct matrices
and
as soft masks for the causal subgraph. Meanwhile,
and
are used to construct
and
as soft masks for the shortcut subgraph. We then apply these soft masks to the graph structure via the Hadamard product to obtain the causal subgraph
and the shortcut subgraph
:
where is the adjacency matrix of the cell-line-
-specific PPI network,
is the node feature matrix that integrates both the drug-combination and cell-line gene expression features, and
denotes the Hadamard product.
Graph decoupling. Having obtained the causal subgraph and the shortcut subgraph
at both the node and edge levels, we now decouple the cell-line-specific PPI network subgraphs to extract the corresponding causal and shortcut features. Specifically, we apply a graph neural network (GCN) and a readout model to
and
, respectively, to obtain their global vector representations:
where is the causal feature extracted from
, and
is the shortcut feature extracted from
. We then concatenate the drug combination feature
with the causal feature
, and use a classifier
to predict the probability of synergy:
where is the predicted synergy probability. We employ the cross-entropy loss for training:
where is the training dataset and
is the ground truth label for each drug combination.
Regarding the shortcut features from the shortcut subgraph, we desire them not to affect the true classification outcome. Hence, we randomly generate a label
as the target distribution, use a classifier
to predict the corresponding synergy score
, and then minimize the KL-divergence between
and
:
By optimizing these two objectives, we can effectively disentangle causal features from shortcut features. Because of the presence of noise and shortcut features, the mutual information between the causal features and the labels is higher than that between the full graph and the labels, preventing the causal subgraph from degenerating into the complete, undecoupled graph.
Causal attention-based module for drug combination synergy prediction
Because confounding effects may influence the final results, we employ the backdoor adjustment strategy from causal inference theory to remove biases introduced by confounders. The backdoor adjustment is an effective solution to mitigate confounding effects by stratifying the shortcut features and pairing the causal features with each stratification of shortcut features to form an ``intervened graph.“ However, due to the irregular structure of graph data, it is challenging to perform such interventions directly at the data level. To address this, we adopt a causal attention strategy that avoids this issue by performing implicit interventions at the representation level, guided by a loss function derived from backdoor adjustment. Concretely, as shown in Eqs. (18) and (19):
where is the causal feature obtained from Eq. (13), and
is the feature representation of the shortcut subgraph
from a different stratification, obtained via Eq. (14). The vector
is the feature representation of the drug combination
,
. By combining these representations, we simulate the model’s predictions under varying shortcut features. The output
is the prediction result of the classifier
on the “implicit intervened graph,” and
is the set of strata for the estimated shortcut subgraphs, encompassing various shortcut features in the training data. In practice, we use random addition to implement the intervention in Eq. (18). Random addition simulates different “scenarios” by adding the causal feature representation to randomly selected shortcut feature representations, thereby creating a new, combined graph representation. This approach helps the model maintain stable predictions when confronted with different shortcut features. We refer to Eq. (19) as the causal intervention loss
. It enforces consistent and stable predictions across different shortcut stratifications because they share the same causal features.
Finally, the objective function of our CASynergy algorithm is defined as the weighted sum of these losses:
where and
are hyperparameters that determine the strengths of the decoupling and causal intervention, respectively. The total loss
consists of three parts:
is the supervised classification loss based on causal features, ensuring that the model correctly predicts the labels;
encourages the shortcut features to produce uniformly distributed predictions without bias toward any particular category; and
ensures consistent predictions under different shortcut features, reinforcing the underlying causal relationships.
By leveraging the causal attention mechanism, the CASynergy algorithm is capable of making robust predictions without relying on specific shortcut features. This also endows the model with strong generalization capabilities when handling test data drawn from distributions different from those of the training data.
4.5 Performance measures
To evaluate the effectiveness of CASynergy, we divided the datasets into training, validation, and testing sets with an 8:1:1 ratio. The training and validation sets underwent five- fold cross-validation, and the results on the independent test set were averaged to obtain the final outcomes. To comprehensively describe the performance of the prediction model, we employed common classification metrics, including Area Under the Receiver Operating Characteristic Curve (AUC-ROC), Area Under the Precision-Recall Curve (AUC-PR), Accuracy (ACC), and F1 Score. These metrics respectively reflect the model’s ability to distinguish between positive and negative samples, balance precision and recall, overall prediction accuracy, and comprehensive performance.
In the specific experiments, to identify the optimal hyperparameter configuration, we conducted hyperparameter optimization using grid search. This optimization included searching for learning rates within the set {10−2, 10−3, 10−4, 10−5}, batch sizes within{16, 32, 64, 128, 256}, cross-attention layers within {1, 2, 3},and testing both Global Max Pooling and Global Average Pooling for global pooling. By comparing the experimental results of different hyperparameter combinations in the validation set, we ultimately selected a set of parameter configurations that achieved the best performance on the independent test set, as shown in Tables S2 and S3 in the appendix.
Supporting information
S1 Table.
summarizes the OncologyScreen dataset, comprising 28 cancer cell lines from CCLE and GEO, spanning skin, ovary, large intestine, lung, breast, and prostate tissues. Each entry lists the cell line name, tissue origin, and unique DepMap ID, enabling integration with multi-omics data and cross-study analyses.
https://doi.org/10.1371/journal.pcbi.1013567.s001
(XLSX)
S2 Table.
lists the hyperparameters optimized for the CASynergy model, including learning rates (10 ⁻ ² to 10 ⁻ ⁵), batch sizes (16–156), cross-attention layers (1–3), and global pooling methods (Max/Average). The final configuration was empirically determined through systematic validation.
https://doi.org/10.1371/journal.pcbi.1013567.s002
(XLSX)
S3 Table.
presents the hyperparameter tuning results.
https://doi.org/10.1371/journal.pcbi.1013567.s003
(XLSX)
S4 Table. CASynergy performance under random gene expression masking (0–30%): AUC, AUPR, F1, and ACC (mean ± SD) evaluated with five-times five-fold cross-validation.
https://doi.org/10.1371/journal.pcbi.1013567.s004
(XLSX)
S5 Table. Data-efficiency analysis on Oncology-Screen: CASynergy performance versus training-set size (30%, 50%, 70%) under five-times five-fold cross-validation (mean ± SD).
https://doi.org/10.1371/journal.pcbi.1013567.s005
(XLSX)
S6 Table. Multi-omics augmentation on Oncology-Screen: CASynergy performance across feature configurations (expression only; + mutation; + CNV; + mutation+CNV) under five-times five-fold cross-validation (mean ± SD).
https://doi.org/10.1371/journal.pcbi.1013567.s006
(XLSX)
S1 Fig. Gene distribution probability density graphs.
https://doi.org/10.1371/journal.pcbi.1013567.s007
(TIFF)
References
- 1. Shah MA, Schwartz GK. Cell cycle-mediated drug resistance: an emerging concept in cancer therapy. Clin Cancer Res. 2001;7(8):2168–81. pmid:11489790
- 2. Sliwoski G, Kothiwale S, Meiler J, Lowe EW Jr. Computational methods in drug discovery. Pharmacol Rev. 2013;66(1):334–95. pmid:24381236
- 3. Cortes C, Vapnik V. Support-Vector Networks. Machine Learning. 1995;20(3):273–97.
- 4. Breiman L. Random forests. Machine learning. 2001;45:5–32.
- 5. Caruana R, Niculescu-Mizil A. An empirical comparison of supervised learning algorithms. In: Proceedings of the 23rd international conference on Machine learning - ICML ’06, 2006. 161–8.
- 6. Thafar MA, Albaradie S, Olayan RS, Ashoor H, Essack M, Bajic VB. Computational Drug-target Interaction Prediction based on Graph Embedding and Graph Mining. In: Proceedings of the 2020 10th International Conference on Bioscience, Biochemistry and Bioinformatics, 2020. 14–21.
- 7. Güvenç Paltun B, Kaski S, Mamitsuka H. Machine learning approaches for drug combination therapies. Brief Bioinform. 2021;22(6):bbab293. pmid:34368832
- 8. Preuer K, Lewis RPI, Hochreiter S, Bender A, Bulusu KC, Klambauer G. DeepSynergy: predicting anti-cancer drug synergy with Deep Learning. Bioinformatics. 2018;34(9):1538–46. pmid:29253077
- 9. Wang Y, Wang J, Liu Y. Deep learning for predicting synergistic drug combinations: State‐of‐the‐arts and future directions. Clinical and Translational Dis. 2024;4(3).
- 10. Kipf TN, Welling M. Semi-supervised classification with graph convolutional networks. arXiv preprint. 2016.
- 11. Bonatti PA, Decker S, Polleres A. Knowledge graphs: New directions for knowledge representation on the semantic web (dagstuhl seminar 18371). Dagstuhl reports. 2019;8(9):29–111.
- 12. Yang J, Xu Z, Wu WKK, Chu Q, Zhang Q. GraphSynergy: a network-inspired deep learning model for anticancer drug combination prediction. J Am Med Inform Assoc. 2021;28(11):2336–45. pmid:34472609
- 13. Zhang G, Gao Z, Yan C, Wang J, Liang W, Luo J, et al. KGANSynergy: knowledge graph attention network for drug synergy prediction. Brief Bioinform. 2023;24(3):bbad167. pmid:37130580
- 14. Liu X, Song C, Liu S, Li M, Zhou X, Zhang W. Multi-way relation-enhanced hypergraph representation learning for anti-cancer drug synergy prediction. Bioinformatics. 2022;38(20):4782–9. pmid:36000898
- 15. Zhang P, Tu S, Zhang W, Xu L. Predicting cell line-specific synergistic drug combinations through a relational graph convolutional network with attention mechanism. Brief Bioinform. 2022;23(6):bbac403. pmid:36136353
- 16. Sui Y, Wang X, Wu J. Causal attention for interpretable and generalizable graph classification. In: Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining, 2022. 1696–705.
- 17. Wang J, Liu X, Shen S, Deng L, Liu H. DeepDDS: deep graph neural network with attention mechanism to predict synergistic drug combinations. Brief Bioinform. 2022;23(1):bbab390. pmid:34571537
- 18. Draguet A, Tagliatti V, Colet J-M. Targeting Metabolic Reprogramming to Improve Breast Cancer Treatment: An In Vitro Evaluation of Selected Metabolic Inhibitors Using a Metabolomic Approach. Metabolites. 2021;11(8):556. pmid:34436498
- 19. Xie Z, Bailey A, Kuleshov MV, Clarke DJB, Evangelista JE, Jenkins SL, et al. Gene Set Knowledge Discovery with Enrichr. Curr Protoc. 2021;1(3):e90. pmid:33780170
- 20. Guvenir Celik E, Eroglu O. Combined treatment with ruxolitinib and MK-2206 inhibits the JAK2/STAT5 and PI3K/AKT pathways via apoptosis in MDA-MB-231 breast cancer cell line. Mol Biol Rep. 2023;50(1):319–29. pmid:36331743
- 21. Lin J, Xia L, Liang J, Han Y, Wang H, Oyang L, et al. The roles of glucose metabolic reprogramming in chemo- and radio-resistance. J Exp Clin Cancer Res. 2019;38(1):218. pmid:31122265
- 22. Marvalim C, Datta A, Lee SC. Role of p53 in breast cancer progression: An insight into p53 targeted therapy. Theranostics. 2023;13(4):1421–42. pmid:36923534
- 23. Tun JO, Salvador-Reyes LA, Velarde MC, Saito N, Suwanborirux K, Concepcion GP. Synergistic Cytotoxicity of Renieramycin M and Doxorubicin in MCF-7 Breast Cancer Cells. Mar Drugs. 2019;17(9):536. pmid:31527453
- 24. Cao Y, Afzal MZ, Shirai K. Does denosumab offer survival benefits? -Our experience with denosumab in metastatic non-small cell lung cancer patients treated with immune-checkpoint inhibitors. J Thorac Dis. 2021;13(8):4668–77. pmid:34527308
- 25. Li H-S, Lei S-Y, Li J-L, Xing P-Y, Hao X-Z, Xu F, et al. Efficacy and safety of concomitant immunotherapy and denosumab in patients with advanced non-small cell lung cancer carrying bone metastases: A retrospective chart review. Front Immunol. 2022;13:908436. pmid:36105807
- 26. Paffenholz SV, Salvagno C, Ho Y-J, Limjoco M, Baslan T, Tian S, et al. Senescence induction dictates response to chemo- and immunotherapy in preclinical models of ovarian cancer. Proc Natl Acad Sci U S A. 2022;119(5):e2117754119. pmid:35082152
- 27. Saleh T, Carpenter VJ, Tyutyunyk-Massey L, Murray G, Leverson JD, Souers AJ, et al. Clearance of therapy-induced senescent tumor cells by the senolytic ABT-263 via interference with BCL-XL -BAX interaction. Mol Oncol. 2020;14(10):2504–19. pmid:32652830
- 28. Rundall BK, Denlinger CE, Jones DR. Suberoylanilide hydroxamic acid combined with gemcitabine enhances apoptosis in non-small cell lung cancer. Surgery. 2005;138(2):360–7. pmid:16153448
- 29. Abu Samaan TM, Samec M, Liskova A, Kubatka P, Büsselberg D. Paclitaxel’s Mechanistic and Clinical Effects on Breast Cancer. Biomolecules. 2019;9(12):789. pmid:31783552
- 30. Zajdel A, Nycz J, Wilczok A. Lapatinib enhances paclitaxel toxicity in MCF-7, T47D, and MDA-MB-321 breast cancer cells. Toxicol In Vitro. 2021;75:105200. pmid:34126209
- 31. Vaishampayan UN, Burger AM, Sausville EA, Heilbrun LK, Li J, Horiba MN, et al. Safety, efficacy, pharmacokinetics, and pharmacodynamics of the combination of sorafenib and tanespimycin. Clin Cancer Res. 2010;16(14):3795–804. pmid:20525756
- 32. Pitts TM, Morrow M, Kaufman SA, Tentler JJ, Eckhardt SG. Vorinostat and bortezomib exert synergistic antiproliferative and proapoptotic effects in colon cancer cell models. Mol Cancer Ther. 2009;8(2):342–9. pmid:19174560
- 33. Liu H, Zhang W, Zou B, Wang J, Deng Y, Deng L. DrugCombDB: a comprehensive database of drug combinations toward the discovery of combinatorial therapy. Nucleic Acids Res. 2020;48(D1):D871–81. pmid:31665429
- 34. O’Neil J, Benita Y, Feldman I, Chenard M, Roberts B, Liu Y, et al. An Unbiased Oncology Compound Screen to Identify Novel Combination Strategies. Mol Cancer Ther. 2016;15(6):1155–62. pmid:26983881
- 35. Cheng F, Kovács IA, Barabási A-L. Network-based prediction of drug combinations. Nat Commun. 2019;10(1):1197. pmid:30867426
- 36. Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483(7391):603–7. pmid:22460905