Skip to main content
Advertisement
  • Loading metrics

Systematic modeling predicts synergistic and safe drug combinations for parasitic diseases

  • Yansen Su,

    Roles Conceptualization, Funding acquisition, Methodology, Supervision

    Affiliation Key Laboratory of Intelligent Computing and Signal Processing, Anhui University, Hefei, Anhui, China

  • Hongyu Zhang,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Key Laboratory of Intelligent Computing and Signal Processing, Anhui University, Hefei, Anhui, China

  • Yun Du,

    Roles Formal analysis, Validation, Visualization

    Affiliation College of Pharmacy, Xinjiang Medical University, Urumqi, Xinjiang, China

  • Lei Li,

    Roles Conceptualization, Data curation, Investigation, Methodology

    Affiliation Key Laboratory of Intelligent Computing and Signal Processing, Anhui University, Hefei, Anhui, China

  • Guodong Lv ,

    Roles Funding acquisition, Supervision

    lgd_xj@xjmu.edu.cn (GL); jianghanjing@ahu.edu.cn (HJ)

    Affiliation Joint International Research Laboratory of Prevention and Control of Major Diseases in Central Asia, Clinical Medical Research Institute, First Affiliated Hospital of Xinjiang Medical University, Urumqi, Xinjiang, China

  • Hanjing Jiang

    Roles Investigation, Supervision, Validation, Writing – original draft, Writing – review & editing

    lgd_xj@xjmu.edu.cn (GL); jianghanjing@ahu.edu.cn (HJ)

    Affiliation Key Laboratory of Intelligent Computing and Signal Processing, Anhui University, Hefei, Anhui, China

?

This is an uncorrected proof.

Abstract

Parasitic diseases impose a substantial global health burden due to the widespread transmission and diversity of protozoa and helminths, which cause numerous infections and regional outbreaks. Despite the availability of various antiparasitic drugs, their clinical utility is often constrained by high cost, toxicity, severe side effects, and the growing threat of drug resistance. Combination therapy, designed to enhance efficacy through synergistic effects while reducing toxicity, represents a promising strategy to improve treatment outcomes for parasitic diseases. In this work, we propose MetaSynMT, a novel multi-task learning framework designed to predict synergistic and safe drug combinations, with a specific focus on parasitic diseases. The model integrates a meta-path aggregation mechanism to capture both structural and high-order semantic features of drugs. Alongside the primary task of synergy prediction, we introduce a secondary task of side effect prediction, enabling the joint identification of combinations with high synergy and low toxicity. Experimental results demonstrate that MetaSynMT outperforms several state-of-the-art baselines on parasitic disease dataset and exhibits strong generalization capability across diverse real-world settings. Furthermore, based on MetaSynMT’s predictions, we identified allicin and sodium stibogluconate as a promising combination therapy for echinococcosis. In vitro protoscolex culture experiments showed that the combination achieved a 100% inhibition rate at concentrations of 850 μM allicin and 36.3 μM sodium stibogluconate, significantly surpassing monotherapies. Overall, this work provides a novel computational tool and theoretical foundation for optimizing antiparasitic drug combinations and discovering potential therapeutic strategies.

Author summary

Parasitic diseases affect millions of people worldwide and remain a major public health challenge, especially in regions with limited medical resources. Current treatments are often expensive, toxic, and increasingly less effective due to the rise of drug resistance. Combining two or more drugs is a promising way to improve treatment, but identifying safe and effective combinations is difficult and usually relies on time-consuming laboratory testing.

In this study, we developed MetaSynMT, a new artificial intelligence framework that can predict both the effectiveness and the potential side effects of drug combinations for parasitic diseases. Our model not only performed better than existing methods on large datasets, but also suggested a novel therapy: the natural compound allicin combined with the clinical drug sodium stibogluconate. Laboratory experiments confirmed that this combination was highly effective against the parasite that causes echinococcosis, completely blocking its growth at tested doses.

By integrating advanced machine learning with experimental validation, our work provides a powerful tool for accelerating the discovery of new and safer treatments for parasitic diseases, offering hope for more accessible and effective therapies worldwide.

Introduction

Parasites represent a major class of human pathogens capable of infecting a wide range of hosts, including humans and animals, and causing diseases that span from mild discomfort to life-threatening conditions [1]. Parasitic infections contribute to diverse health complications, such as gastrointestinal disorders, malnutrition, anemia, and allergic reactions [2]. However, most available antiparasitic drugs are expensive, frequently toxic, and often associated with significant side effects [3]. In addition, the extensive and prolonged use of these drugs has accelerated the emergence of resistance in many parasites to first-line therapies [4]. For example, Plasmodium falciparum has developed resistance to nearly all major antimalarial agents, and no highly effective commercial vaccine is currently available [5]. These challenges not only intensify the global health burden but also pose substantial obstacles to modern medicine and drug discovery.

Currently, the development of new drugs for parasitic diseases is hindered by long research cycles, high failure rates, and a limited number of approved therapeutics [6]. Existing antiparasitic drugs include artemisinin derivatives, quinine-related compounds, praziquantel, ivermectin, albendazole derivatives, nitazoxanide, and pyrimethamine, among others. These drugs employ diverse mechanisms of action, primarily targeting the parasite’s metabolism and physiological functions to achieve parasiticidal effects. However, monotherapies for parasitic diseases often face challenges such as limited efficacy, drug resistance, and adverse side effects [79]. These challenges have driven the search for innovative strategies, among which combination therapy has emerged as a vital and rapidly advancing approach in antiparasitic drug development [8]. Combination therapy, which targets multiple proteins simultaneously, not only enhances therapeutic effectiveness but also minimizes side effects and resistance, offering new insights into the pharmacological treatment of parasitic diseases [10,11].

With changing climatic conditions and shifts in ecological transmission patterns, the risk of parasitic disease outbreaks is increasing, placing greater demands on traditional diagnostic and therapeutic approaches. In recent years, researchers have conducted numerous experimental studies based on genomics, proteomics, and host-targeted therapies to investigate host metabolic mechanisms [12]. These efforts have led to more precise treatment strategies for parasite-infected hosts and accelerated the clinical translation of antiparasitic drugs [13]. However, such experimental approaches often suffer from limited candidate drug pools, which may result in the omission of potentially effective combinations. In addition, experimental validation typically requires significant human and material resources. To overcome these limitations, computational methods have been increasingly adopted. For example, Mason et al. [14] proposed a machine learning–based method called the Combination Synergy Estimator (CoSynE), which utilizes chemical structural features of drugs and a support vector machine classifier to predict synergy scores. Ferreira Junior [15] employed machine learning algorithms such as random forest, naive bayes, support vector machine, and probabilistic neural network to develop a computational model based on the phenotypic data of trypanosoma inhibition in vitro. Roche-Lima et al. [16] developed a tool named MLSyPred for predicting synergistic antimalarial drug combinations. It extracts molecular fingerprint features of drugs and employs five different classifiers to recommend potentially effective combinations. Compared to traditional biological screening, these computational approaches can evaluate large-scale drug combinations more efficiently in a shorter time, significantly reducing experimental cost and workload. However, these methods are currently limited to treating specific parasitic diseases and have not been extended to other types of parasitic diseases. In fact, many other parasitic infections also urgently require effective therapeutic strategies.

Compared to traditional machine learning methods, deep learning offers notable advantages in handling large-scale, high-dimensional data and holds great potential for more effectively identifying promising drug combinations for a wide range of parasitic diseases [17]. Deep learning, with its powerful capabilities in data processing and pattern recognition, has emerged as a valuable tool for improving the diagnosis and drug development efficiency of parasitic diseases. On one hand, deep learning models based on epidemiological data can support disease prediction and intervention strategies. On the other hand, deep learning has shown great potential in drug target identification and drug combination design, laying the foundation for innovative therapies and enhancing the resilience and adaptability of global public health systems. Currently, a number of deep learning models have been developed for predicting synergistic anticancer drug combinations, offering valuable insights for constructing models aimed at parasitic diseases. For example, DeepDDS [18] leverages drug molecular structures and gene expression profiles to construct features for drugs and cell lines, which are then fed into a multi-layer feedforward neural network to identify synergistic drug pairs. GAECDA [19] utilizes the same input information as DeepDDS but incorporates graph autoencoders and convolutional neural networks for prediction. These models rely solely on the chemical structures of drugs while overlooking the underlying mechanisms of drug–target interactions [20].

Previous studies have demonstrated that protein–protein interaction(PPI) networks play a crucial role in identifying interactions between drug pairs. Building on this insight, GraphSynergy [21] predicts synergistic drug combinations based on the PPI network, where each drug or cell line aggregates information from its related H-hop proteins via inner product operations. LGSyn [22] is a novel framework for drug synergy prediction that integrates local molecular features with global biological interaction networks. It builds a knowledge graph, using multi-head attention modules and Bi-interaction aggregator to represent global features. However, these graph convolution based models mechanically aggregate first-order and second-order neighbors and are unable to dynamically select or weight different semantic paths according to task requirements. MRHGNN [23] is a dual-channel multimodal relational hypergraph neural network used for synergy drug prediction. It integrates multimodal drug features to learn high-quality node features. Yet, hypergraph methods are more suitable for feature extraction in scenarios with explicit associations and high complexity in heterogeneous network. HGCLSynergy [24] integrates biological data on drugs and cell lines to construct a heterogeneous information network, and then employs neighbor-based and meta-path-based view encoders for feature learning. However, its meta-path design still has certain limitations, lacking a comprehensive perspective to balance drug and disease nodes. As the diversity of node types increases, the design of its meta-paths also faces growing challenges such as noise interference, semantic redundancy, and node imbalance. Therefore, we aim to explicitly design meta-paths that guide the model to focus on the most interpretable and predictive local structures in the parasitic drug-target graph network, thereby achieving semantic filtering and exploring more complex association patterns.

Many models have successfully leveraged deep learning techniques to explore the relationships between drug combinations and cancer. However, most existing models for drug combination prediction focus solely on synergistic efficacy, with little attention paid to potential adverse effects [25,26]. Although combination therapy can enhance therapeutic outcomes, it often comes with side effects [27,28]. Such adverse effects may worsen the condition of a patient or cause new health issues, which pose challenges to ensuring the safety of proposed drug combinations [29]. Therefore, it is essential not only to identify highly synergistic drug combinations but also to evaluate the potential side effects between drugs. Multi-task learning models offer an effective solution by simultaneously optimizing and updating parameters across different tasks through backpropagation. Moreover, multi-task learning enables the extraction of shared features that are important for all tasks, which can enhance the overall performance of each individual task.

In this study, we propose a multi-task learning model, MetaSynMT, designed to predict synergistic and safe drug combinations for parasitic diseases. MetaSynMT introduces a meta-path-guided parasitic drug-target graph network to construct drug representations and leverages a gene–disease association matrix and a disease similarity matrix to generate representations for parasitic diseases. These learned drug and disease features are jointly used to predict both the synergistic effect and potential side effects of drug combinations. In comparative experiments, MetaSynMT demonstrated superior performance compared to state-of-the-art prediction models. Ablation studies and meta-path effectiveness analyses confirmed the contributions of the side effect prediction module, attention mechanism, and information aggregation strategy. Furthermore, case studies showed that MetaSynMT can identify novel drug combinations with potential therapeutic value. Therefore, MetaSynMT can be regarded as an effective model for predicting synergistic and safe drug combinations targeting parasitic diseases.

Materials and methods

Ethics statement

The in vitro experiments in this study utilized protoscoleces of Echinococcus granulosus. These samples were obtained from the livers of naturally infected sheep. No animals were subjected to additional specialized breeding or in vivo procedures specifically for this research. Therefore, this study did not require specific approval for animal ethics or biosafety handling.

Overview of the MetaSynMT

MetaSynMT is developed for drug combination synergy prediction and consists of three core components: drug feature construction, parasitic disease feature construction, and synergy effect and side effect prediction module. The overall architecture of the MetaSynMT is illustrated in Fig 1. MetaSynMT first constructs a parasitic drug-target graph that integrates diverse types of interactions. To capture the semantic relationships within the drug–target network, multi-hop propagation paths are generated to model meta-paths, and a meta-path-guided aggregation mechanism is subsequently applied to hierarchically extract semantic drug features. Then leverages a gene–disease association matrix and a disease similarity matrix to generate parasitic diseases features. Finally, drug features and parasitic diseases features are used to predict drug combination synergy and potential side effects.

thumbnail
Fig 1. Flowchart of the MetaSynMT framework.

A. Drug feature construction: Drug representations are learned by aggregating information in the parasitic drug-target graph guided by meta-paths. B. Parasitic disease feature construction: The parasite disease–gene association matrix and the disease similarity matrix are independently processed by Multilayer Perceptron (MLP) to generate two feature matrices, which are subsequently concatenated. C. Prediction module: The drug–drug features and drug–drug–disease triplet features are input into the task-specific (side effect and synergy) prediction modules to produce corresponding prediction scores.

https://doi.org/10.1371/journal.pntd.0013991.g001

Data acquisition

Currently, no publicly available dataset is specifically designed for predicting drug combinations against parasitic diseases. To fill this gap, we constructed a benchmark dataset that integrates synergy information for drug combinations, drug–drug interaction data, and drug- and disease-associated targets collected from the MalaCards and Harmonizome databases [30,31]. Detailed statistics are summarized in Table 1.

Parasitic Disease Benchmark Dataset. This dataset contains drug combination information relevant to parasitic diseases, annotated with both synergy and side effects. We selected 25 parasitic diseases with rich annotations and high clinical relevance from the MalaCards database—such as malaria, ascariasis, schistosomiasis, and echinococcosis. Drug combination data for these diseases were collected through a comprehensive PubMed literature search [32]. In total, 336 synergistic drug combinations were extracted from more than 190 research articles and labeled as positive samples, while 337 antagonistic combinations were labeled as negative samples. Across all samples, 393 unique drug pairs were identified, of which 117 are reported to induce adverse interactions and were thus annotated as side-effect positive.

Drug-Related Target Data. The dataset involves 232 unique drugs. For these drugs, a total of 3781 direct and indirect targets were retrieved from the Harmonizome database [31].

Parasitic Disease–Related Gene Data. Disease-associated genes were obtained from the GeneCards database [33], which provides comprehensive annotations for both predicted and validated human gene–disease associations. A union of genes associated with various parasitic diseases was collected, resulting in 1741 unique genes.

Parasitic drug-target graph construction

MetaSynMT constructs a parasitic drug-target graph using drug and target information. From the perspective of synergy, each drug can interact with multiple targets, and therapeutic effects are generally achieved through binding to specific biological targets. From the perspective of safety, overlapping target modules may lead to shared exposures, potentially causing toxic side effects.

In this section, we integrate drug–drug, drug–target, and target–target interaction data to construct a parasitic drug-target graph, denoted as , where V is the set of nodes and E is the set of edges. The node set V consists of two disjoint subsets: the drug set and the target set , where N1 and N2 represent the total numbers of drugs and targets, respectively.

The specific data sources used to construct the parasitic drug-target graph G are summarized in Table 2. An example of the constructed parasitic drug-target graph is illustrated in Fig 2(a), which includes three types of undirected edges:

  • Drug–drug interaction (DDI),
  • Drug–target interaction (DTI),
  • Target–target interaction (TTI).
thumbnail
Fig 2. Schematic illustration of drug representation learning.

(a) An example of parasitic drug-target graph. (b) types of meta-paths. (c) Representative instance sequences corresponding to meta-paths M2 and M3. (d) An example of and based on (c). (e) The process of hierarchical information aggregation from neighboring nodes at each layer to obtain the node embedding based on the specific meta-path Mk. (f) Aggregate information from all different types of meta-paths to obtain the final feature representation of drug node di.

https://doi.org/10.1371/journal.pntd.0013991.g002

thumbnail
Table 2. Summary of data types and numbers in the parasitic drug-target graph.

https://doi.org/10.1371/journal.pntd.0013991.t002

Meta-path construction for parasitic drug-target graph

Since both tasks require learning drug representations, drugs are designated as the endpoints (i.e., start and end nodes) of meta-paths to capture their attribute features. By exploiting the topological relationships between drugs and targets, the model can learn high-order associations that strengthen semantic representation and reasoning. Specifically, we design -type meta-paths, where both ends are drug nodes and the intermediate nodes consist of an arbitrary number of targets, thereby approximating potential interaction routes and facilitating information propagation.

We begin with direct drug-drug interactions, represented by the meta-path M1: Drug-Drug (DD). To further capture indirect associations, we construct the two-hop meta-path M2: Drug-Target-Drug (DTD), which reflects relationships mediated by a single target. However, such two-hop structures only model direct drug-target links and fail to incorporate target-target interactions. To address this limitation, we incorporate TTI to construct higher-order meta-paths that encode more complex semantic dependencies.

To generalize this design, we introduce a parameter K (), denoting the number of intermediate target nodes in a -type meta-path. Accordingly, types of meta-paths are generated, as illustrated in Fig 2(b). A meta-path instance is defined as a specific node sequence that conforms to a meta-path pattern, where the start and end drugs are connected through edges following that pattern. Each meta-path type contains multiple such instances, with examples for M2 and M3 shown in Fig 2(c). Based on the meta-paths , we then construct the corresponding subgraphs , as shown in Fig 2(d).

Information aggregation of parasitic drug-target graph construction

After constructing the meta-paths, we adopt a meta-path-guided information aggregation mechanism to generate feature representations for drug nodes. This process consists of two stages: (1) drug embeddings are extracted for each type of meta-path, and (2) these embeddings are then aggregated across all meta-path types to obtain the final representations of the drug nodes.

Aggregation of information within each type of meta-path.

In the parasitic drug-target graph G, drug and target nodes are initialized as one-hot vectors unique to their node types, allowing the model to distinguish between them during feature extraction. We then project these one-hot vectors into a higher-dimensional space to capture more complex patterns and relationships. For a node i of type t, the transformation is:

(1)

where Wt is the type-specific transformation matrix, b is the bias vector, and is the activation function.

For the subgraph induced by meta-path Mk, we denote the drug set as and the target set as . As show in Fig 2(e), at the l-th layer, the embedding of drug node di is updated by aggregating drug- and target-type neighbors:

(2)

Since different neighbors contribute unequally, we apply an attention mechanism. The attention score of node i at layer l is:

(3)

where ak is the learnable attention vector and ba is the bias term.

The final embedding of drug node di at the last layer L is:

(4)

After L layers, the embeddings of all drugs are denoted as , and similarly for targets .

Aggregation of information between different types of meta-paths.

After aggregating information from each individual meta-path type, the drug representations obtained from all meta-paths are integrated to derive the final drug features. Since meta-paths capture diverse semantic information of varying importance, we employ an attention mechanism to adaptively assign their weights.

Fig 2(f) illustrates the process of aggregating representations across meta-paths. For a given subgraph constructed from meta-path Mk, we first compute a summary vector by averaging the transformed features of all nodes in the subgraph:

(5)

where n is the number of nodes in , is a learnable weight matrix, and bn is the bias term. The importance score for meta-path Mk is computed using a trainable attention vector q:

(6)

Next, the softmax function is used to normalize the importance weights to obtain the weight coefficients:

(7)

where K+1 is the number of meta-path types. After obtaining the normalized importance weights, we aggregate the drug representations across all meta-path types. Finally, we can obtain the final feature representation of drug node di based on all types of meta-paths:

(8)

Parasitic disease feature construction

The input features for the 25 parasitic diseases are composed of two parts. The first is the parasite–associated gene matrix R1, a binary matrix where rows correspond to diseases and columns to protein-coding genes associated with them. Entries are set to 1 if a disease is associated with a gene.

The second is the parasite disease similarity matrix R2, a matrix derived from R1 using the Jaccard similarity coefficient [34]. Both R1 and R2 are processed by a two-layer MLP with ReLU activations. Each input is projected into a 64-dimensional feature space.

The final representation of each parasitic disease is obtained by concatenating the outputs of the two MLPs:

(9)

where He denotes the feature vector for disease e.

Predict module

For the drug pair , we have constructed their respective meta-path-based features. The model is trained to predict both side effects and synergy effects. The side effect and synergy prediction modules both consist of multi-layer perceptron with ReLU activation function. For the side effect prediction task, the features of two drugs are concatenated to form drug-drug feature , which is fed into the side effect prediction module to predict the side effect score pside.

For the synergy prediction task, the features of two drugs and feature of parasitic disease are concatenated to form drug-drug-parasitic disease triplet feature. Then the triple feature is fed into the synergy prediction module to produces synergy probability score psyn. The detailed process is as follows:

The binary cross-entropy (BCE) loss is employed as the objective function. Let n be the total number of training samples, and let yi and pi denote the true label and the predicted probability for the i-th sample, respectively. The BCE loss is defined as:

(10)

The total loss of the model is computed as a weighted sum of the losses for the two tasks:

(11)

where and denote the BCE losses for the synergy prediction and side effect prediction tasks, respectively. The scalar is a learnable parameter that balances the two objectives.

Results

Experiment settings

We employ the hold-out method to split the parasitic disease benchmark dataset into training, validation, and test sets with a ratio of 6:2:2. To ensure robustness, the dataset is randomly partitioned ten times, and the average performance across these runs is reported.

To prevent information leakage, drug–drug–disease triplets in the test set are excluded from the training set. Since drug combinations are order-independent (i.e., drug_row-drug_col and drug_col-drug_row are equivalent), each training sample is duplicated in both orders. For validation and test sets, predictions from both orders are averaged to mitigate any bias introduced by drug ordering.

MetaSynMT is implemented in PyTorch based on a meta-path-guided information aggregation framework. The embedding dimension of node features is set to 64, and the MLP hidden layers contain [2048, 1024, 512] units. The learning rate is 0.0005, and training runs for up to 50 epochs with early stopping if validation metrics do not improve for 10 consecutive epochs. The model is optimized using the Adam optimizer.

Model performance is evaluated using multiple metrics: area under the receiver operating characteristic curve (AUC), area under the precision-recall curve (AUPR), accuracy (ACC), precision, recall, and F1 score. AUC measures the model’s ability to distinguish positive and negative samples, while AUPR reflects the trade-off between precision and recall. ACC indicates the proportion of correctly classified samples. Precision denotes the fraction of true positives among all predicted positives. The F1 score, as the harmonic mean of precision and recall, captures both accuracy and completeness. Higher values for these metrics indicate better predictive performance.

Baseline methods

To evaluate the performance of MetaSynMT, we compare it against several representative baseline methods. Among them, the recently published MLSyPred relies primarily on traditional machine learning algorithms, such as Random Forest, Support Vector Machines, and Gradient Boosting, to generate predictions.

In addition, we include six state-of-the-art deep learning-based methods for drug combination prediction. GraphSynergy [21], Graph Convolutional Network (GCN) [35], Graph Attention Network(GAT) [36], LGSyn [37], and MRHGNN [23] are are graph-based models that utilize heterogeneous network structures, while GAECDS [19], MFSynDCP [38], and DeepDDS [18] rely solely on the molecular structures of drugs for feature extraction. The details of these baselines are as follows:

GCN is a neural network model specifically designed to operate on non-Euclidean graph-structured data using convolutional operations.

GAT introduces an attention mechanism to adaptively weigh the importance of neighboring nodes, thereby enabling more informative feature aggregation for each node.

GraphSynergy constructs a heterogeneous graph architecture and applies spatial graph convolution along with attention mechanisms to encode high-order structural information from drug–cell line and protein modules.

GAECDS integrates graph autoencoders and convolutional neural networks to model the drug synergy prediction task.

MFSynDCP designs a graph aggregation module with an adaptive attention mechanism that dynamically focuses on key information within drug molecular graphs. This allows the model to comprehensively capture crucial interaction patterns between drug pairs.

DeepDDS is a deep learning framework that combines graph neural networks and attention mechanisms. Baseline models are implemented using the hyperparameters reported in their respective original publications.

LGSyn is a novel framework for drug synergy prediction that integrates local molecular features with global biological interaction networks. It employs three fusion strategies to effectively combine these heterogeneous features before feeding them into a deep neural network for prediction.

MRHGNN is a novel dual-channel multimodal relational hypergraph neural network designed for synergistic drug combination prediction. It leverages attention mechanisms to integrate multimodal drug features and employs a unified framework combining primary and self-supervised learning tasks to enhance prediction robustness.

To ensure robustness and reproducibility, each experiment is repeated ten times with random splits, and average performance across these runs is reported.

Comparative performance experiment

We compare MetaSynMT with selected baseline models in terms of synergistic prediction performance on the parasitic disease benchmark dataset, using the synergy prediction score for binary classification analysis. The results, presented in Table 3, show that MetaSynMT consistently outperforms all competing methods across multiple evaluation metrics, including ACC, AUC, AUPR, precision, F1-score, and recall, demonstrating its strong capability in predicting synergistic drug combinations. For example, MetaSynMT achieved a higher AUC than the second method, MFSynDCP (mean difference = 0.041, 95% CI [0.025, 0.057], paired t-test, p < 0.05).

thumbnail
Table 3. Performance comparison of MetaSynMT and baseline methods on the parasitic disease benchmark dataset.

https://doi.org/10.1371/journal.pntd.0013991.t003

Among the baseline methods, the traditional machine learning model MLSyPred exhibits relatively poor predictive performance. In contrast, deep learning-based models perform better, likely due to their superior ability to capture complex, non-linear patterns in high-dimensional data. However, models such as GAECDS, DeepDDS, and MFSynDCP, which rely solely on drug chemical structure, show limited performance due to their inability to integrate relevant target-related information. Although GraphSynergy incorporates protein–protein interaction networks to model topological relationships between drugs, its performance remains inferior to that of MetaSynMT, suggesting that the inclusion of the auxiliary side effect prediction task helps improve overall prediction accuracy. Although GraphSynergy, LGSyn, and MRHGNN incorporate protein–protein interaction networks to model topological relationships between drugs, its performance remains inferior to that of MetaSynMT, suggesting that the inclusion of the auxiliary side effect prediction task helps improve overall prediction accuracy.

Additionally, attention-based models—such as GAT, DeepDDS, GAECDS, and MFSynDCP—demonstrate better performance than the standard GCN, underscoring the importance of attention mechanisms in capturing critical structural and neighborhood-level information.

To verify the robustness and generalization performance of our model, we used a widely applied cancer synergy dataset, Oncology-Screen (On-Screen) dataset [21] for the synergy prediction task. The On-Screen dataset encompasses 4176 drug combination samples involving 21 drugs and 29 cancer cell lines. It includes 2257 positive samples, 1919 negative samples, and 405 drug-related targets. Synergy scores in the On-Screen dataset are calculated using the ZIP metric. Gene expression data for cancer cell lines are obtained from Cancer Cell Line Encyclopedia [39], an independent project designed to characterize genomic profiles, mRNA expression, and anticancer drug responses across cancer cell lines.

The comparison results of MetaSynMT and baseline methods on On-Screen dataset are shown in Table 4. As can be seen from the table, the binary classification evaluation metric of MetaSynMT ranks among the top in all baseline methods. This demonstrates that MetaSynMT exhibits stable predictive performance across different types of datasets, strongly confirming that the MetaSynMT framework possesses excellent generalization capabilities.

thumbnail
Table 4. Performance comparison of MetaSynMT and baseline methods on the on-screen dataset.

https://doi.org/10.1371/journal.pntd.0013991.t004

Comparative experiment of four initial feature extraction methods

To determine the optimal strategy for initial feature extraction, we additionally employed three initial drug/target feature extraction methods, i.e., (1) the Word2Vec [40] chemical language model trained on SMILES representations and the Word2Vec model trained on protein sequences; (2) the Rdkit [41] cheminformatics toolkit and the BERT [42] deep protein sequence representation model; (3) the specially pre-trained ChemBERTa-2 [43] chemical large language model, and the ESM-2 [44] evolutionary scale protein language model. In addition, the SMILES string of the drug is obtained from DrugBank [45], and the protein sequence of the target is obtained from UniProt [46]. The evaluation results on the parasitic disease benchmark dataset are presented in Table 5. The comparison results of four methods revealed that one-hot encoding method achieved the best performance across all three core prediction metrics. Therefore, we selected one-hot encoding approach for its advantages of having few parameters, high computational efficiency, and the ability to quickly provide foundational features for downstream processing. This result also indicates that the initial representation is not the decisive factor for performance, the core strength of our model lies in the subsequent meta-path-guided feature extraction and aggregated process, which learns more discriminative representations from complex biological associations. Pre-trained large models like ChemBERTa-2 and ESM-2 still possess significant advantages, as the rich prior knowledge they encapsulate may play a prominent role in more complex deep learning models.

thumbnail
Table 5. Performance comparison of four different initial feature extraction methods on the parasite benchmark dataset.

https://doi.org/10.1371/journal.pntd.0013991.t005

Ablation experiment and meta-path effectiveness validation experiment

To evaluate the contribution of key components in MetaSynMT, we conducted ablation experiments on the parasitic disease benchmark dataset. The ablated variants include:

  1. (1) MetaSynMT-side, which removes the side effect prediction module and performs only the synergy prediction task;
  2. (2) MetaSynMT-atten, which removes the attention mechanism from the meta-path-based information aggregation module. In this variant, embeddings of neighbors at different hierarchical levels are aggregated using simple averaging, and drug embeddings across different meta-path types are directly summed without applying attention weights.

As shown in Table 6, MetaSynMT-side consistently underperforms the full MetaSynMT model across all evaluation metrics, indicating that the auxiliary task of side effect prediction enhances the accuracy of synergy prediction. Similarly, MetaSynMT-atten also exhibits inferior performance compared to MetaSynMT, demonstrating that the attention mechanism improves learning over graph structures by enabling the model to focus on more informative node features.

thumbnail
Table 6. Performance comparison of MetaSynMT and its variants for drug combination prediction.

https://doi.org/10.1371/journal.pntd.0013991.t006

To assess the effectiveness of the meta-path-based information aggregation module in MetaSynMT, we replace it with several widely used heterogeneous graph representation learning methods, including heterogeneous GCN, GAT, and DeepWalk [47]. Accordingly, we construct three model variants: MetaSynMT-GCN, MetaSynMT-GAT, and MetaSynMT-DeepWalk. Each variant replaces the original aggregation component with a different method.

As shown in Table 7, MetaSynMT consistently outperforms all variants across the parasitic disease benchmark dataset. The inferior performance of these alternative models may stem from several limitations. Specifically, heterogeneous GCN and GAT often struggle to capture long-range dependencies, as information propagation weakens with increasing hop distance. In contrast, DeepWalk generates node embeddings through random walks, but tends to introduce high levels of information redundancy, which can negatively impact prediction accuracy.

thumbnail
Table 7. Performance comparison of MetaSynMT and its variants with different meta-path aggregation strategies.

https://doi.org/10.1371/journal.pntd.0013991.t007

These results indicate that the meta-path-based aggregation mechanism is more effective for modeling parasitic drug-target graph structures in the context of synergistic drug combination prediction.

Parameter sensitivity analysis

We conduct a sensitivity analysis to evaluate the impact of three key hyperparameters in MetaSynMT: (1) the maximum number of intermediate targets K in the designed meta-paths, (2) the node embedding dimension d, and (3) the learning rate lr. The corresponding results are presented in Fig 3.

  • Effect of intermediate target number K. As shown in Fig 3(A), the AUC performance initially increases and then decreases as the number of intermediate targets K increases from 1 to 5, with the best performance observed at K = 3. This trend suggests that incorporating moderate levels of target-hopping enhances the expressiveness of meta-path-based feature representations. However, overly long meta-paths (i.e., larger K) introduce excessive noise, which hinders the effectiveness of information aggregation.
  • Effect of embedding dimension d. Fig 3(B) demonstrates a similar pattern for the node embedding dimension d. As d increases from 16 to 128, model performance improves and peaks at d = 64, after which it declines. A small embedding size (e.g., d = 16) may lead to underfitting due to insufficient model capacity, while an overly large dimension (e.g., d = 128) may cause the model to overfit the training data by capturing noise rather than meaningful patterns.
  • Effect of learning rate lr. In Fig 3(C), we evaluate four different learning rates: 0.01, 0.001, 0.0005, and 0.0001. The model achieves the best performance when lr = 0.0005, indicating that this setting provides a good balance between convergence stability and optimization efficiency.
thumbnail
Fig 3. (A) Performance metrics (AUC, AUPR, and ACC) under varying values of parameter K.

(B) Performance metrics (AUC, AUPR, and ACC) under varying embedding dimensions d. (C) Performance metrics (AUC, AUPR, and ACC) under different learning rates lr.

https://doi.org/10.1371/journal.pntd.0013991.g003

Key meta-paths analysis

In the parameter sensitivity analysis, we found that the model achieves optimal performance when the maximum number of intermediate targets in the -type meta-paths is set to 3. Accordingly, we design four meta-paths: Drug-Drug (DD), Drug-Target-Drug (DTD), Drug-Target-Target-Drug (DTTD), and Drug-Target-Target-Target-Drug (DTTTD).

To further investigate the contribution of these meta-paths to drug node embedding aggregation, we randomly selected 120 drug combination samples from the training set. The attention scores assigned to each meta-path for these drug pairs are visualized in a heatmap (Fig 4). The analysis reveals that the DTD and DD meta-paths receive the highest attention scores, indicating that the model regards drug-drug and drug-target-drug interactions as the most critical factors during the learning process.

thumbnail
Fig 4. Heat map of attention scores for four types of meta-paths.

https://doi.org/10.1371/journal.pntd.0013991.g004

Further, the necessity of the two key meta-paths is analyzed by biological instances analysis. Taking the drug combination of albendazole and tetrandrine as an example, the meta-path distribution analysis shows contribution scores of 0.31 (DTD), 0.21 (DTTD), 0.22 (DTTTD), and 0.26 (DD), respectively. The main interactions are shown in Fig 5. From a biological perspective, CYP3A4 is the key metabolic enzyme responsible for activating albendazole [48], while tetrandrine significantly inhibits CYP3A4 activity, thereby affecting the plasma concentrations of other drugs metabolized by this enzyme [49]. When used in combination, tetrandrine inhibits CYP3A4, slowing down the further metabolism of the active metabolite of albendazole, which may lead to increased systemic exposure and prolonged retention of the active metabolite. The “Drug-Target-Drug" meta-path may provide the intrinsic mechanisms of the synergistic anti-echinococcal effects of albendazole and tetrandrine. The usage of the “Drug-Target-Drug" meta-path may effectively capture the essential information between two drugs (albendazole and tetrandrine) and the target (CYP3A4).

thumbnail
Fig 5. Meta-path visualization of the albendazole and tetrandrine combination.

https://doi.org/10.1371/journal.pntd.0013991.g005

Case study on predicting synergistic and safe drug combinations

This section presents a case study to demonstrate the capability of MetaSynMT in predicting synergistic and safe drug combinations for parasitic diseases. In the parasitic disease benchmark dataset, the known drug combinations involve 232 drugs and 25 parasitic diseases, yielding a theoretical total of drug–drug–disease triplets. After removing the collected positive and negative samples, the remaining triplets are used to construct the test set. For each sample in the test set, the model generates a synergy score and a side effect score. A higher synergy score indicates stronger predicted synergistic efficacy, while a lower side effect score suggests a safer combination. We first rank the samples based on synergy scores and then sort them by side effect scores, ultimately selecting eight drug combinations with both strong synergy and low toxicity, as shown in Table 8.

thumbnail
Table 8. The eight candidate synergistic and safe drug combinations targeting parasitic diseases.

https://doi.org/10.1371/journal.pntd.0013991.t008

The combination of miltefosine and curcumin has been reported for the treatment of mucocutaneous leishmaniasis [50]. This drug pair exhibits synergistic activity against both promastigote and amastigote forms in vitro, and experimental results further showed increased production of reactive oxygen and nitrogen species, as well as enhanced lymphocyte proliferation. The combination of rifapentine and moxifloxacin has been investigated for the treatment of onchocerciasis. An in vivo screening model was established to evaluate the anti-Wolbachia efficacy of different antibiotic combinations [51]. The results showed that rifapentine (15 mg/kg) combined with moxifloxacin (2×200 mg/kg) produced the most pronounced treatment-shortening effect, regardless of administration route. Although no direct studies currently support its efficacy against ascariasis, both onchocerciasis and ascariasis are caused by nematodes with similar morphology, life cycles, and parasitic behaviors [52], suggesting potential therapeutic relevance. Clinical trials have demonstrated that the combination of moxidectin and albendazole is effective for treating elephantiasis, with no serious adverse events reported [53]. Oral terbinafine combined with biweekly cryotherapy (meglumine antimoniate) has been evaluated in clinical trials for the treatment of cutaneous and mucosal leishmaniasis [54]. Follow-up of the patients during these trials showed full recovery without any reported adverse events. Previous studies have indicated that the combination of antioxidants and anthelmintic drugs, such as artesunate, exhibits synergistic anthelmintic effects against Schistosoma mansoni [55]. As allicin is also a natural compound with significant antioxidant activity, the combination of allicin and artesunate may likewise possess potential anthelmintic efficacy.

These examples confirm the effectiveness of MetaSynMT in predicting synergistic and safe drug combinations. Although the model-generated high-scoring combinations show promising therapeutic potential in theory, their safety and synergy still require further validation through clinical trials.

In vitro experimental analysis of the combination of allicin and sodium stibogluconate for echinococcosis

After applying MetaSynMT to predict the synergy and side effect scores of drug combinations targeting parasitic diseases, we further selected several combinations that demonstrated both strong synergistic effects and low toxicity for the treatment of echinococcosis. The six drug combinations are shown in the Table 9 below. These six combinations collectively involve eight drugs. Cystic echinococcosis is a zoonotic neglected tropical parasitic disease caused by Echinococcus granulosus, while alveolar echinococcosis is caused by Echinococcus multilocularis. Since only the protoscoleces of Echinococcus granulosus are available, in our work we validated the efficacy of the detected drug combinations against cystic echinococcosis. The protoscoleces were harvested from the livers of sheep infected with echinococcus granulosus and subsequently cultured, process is detailed in S1 Text S1.1. Under aseptic conditions, protoscoleces were isolated and cultured in an incubator at 37°C with 5% CO2. Protoscoleces with a viability exceeding 90% were transferred to a 96-well plate, with approximately 200 larvae per well. Similarly, the experimental groups were set up with three replicate wells per group, and the culture medium and solutions were replaced every two days. Mortality of the protoscoleces was assessed under an optical microscope using 1% eosin staining, and the average of the three replicate results was taken as the final statistical data.

thumbnail
Table 9. The synergistic and safe drug combinations targeting echinococcosis.

https://doi.org/10.1371/journal.pntd.0013991.t009

We first conducted single-drug maximum concentration efficacy experiments against echinococcosis to select the combination where both individual drugs demonstrated the highest efficacy. The survival curve graph of protoscoleces on single drug is shown in Fig 6. Survival rate records of protoscoleces over 7 days showed varying degrees of anthelmintic efficacy for azithromycin, melarsoprol, moxidectin, moxifloxacin, N-acetylcysteine, and levofloxacin at their maximum single-drug concentrations. Detailed experimental procedures are provided in S1 Text S1.2. Furthermore, among these drugs, allicin, sodium stibogluconate, azithromycin and melarsoprol showed strong efficacy. Allicin was identified as a broad-spectrum antimicrobial agent with no significant hepatorenal toxicity in vivo [56]. Therefore, we focused our experimental validation on the combination of allicin and sodium stibogluconate, highlighting the potential of repurposing traditional drugs for new therapeutic indications.

thumbnail
Fig 6. Survival curves of protoscoleces over 7 days with individual drugs at maximum concentrations.

https://doi.org/10.1371/journal.pntd.0013991.g006

The LD50 values of each drug were determined through gradient concentration assays, where LD50 refers to the concentration required to induce 50% mortality in protoscoleces. These values serve as a critical reference for evaluating the inhibitory effects of each drug. Based on the LD50 values predicted by the SPSS software, the gradient concentrations used were set at [0 μM, 7.0 μM, 12.7 μM, 22.6 μM, 36.3 μM] for allicin and [0 μM, 425.0 μM, 850.0 μM, 893.0 μM, 936.2 μM] for sodium stibogluconate. A dose–response matrix of inhibition rate for the allicin and sodium stibogluconate combination was constructed (Fig 7). The value of LD50 and the design of dose-response matrix are detailed in S1 Text S1.3. This matrix serves as an important tool for evaluating combination efficacy and provides a visual representation of protoscolex survival under varying drug concentrations.

thumbnail
Fig 7. The dose-response matrix of the allicin and sodium stibogluconate combination.

https://doi.org/10.1371/journal.pntd.0013991.g007

From the dose-response matrix, it was determined that the combination of allicin (850 μM) with sodium stibogluconate (36.3 μM) achieved an inhibition rate of 100%. We statistically analyzed the effects on protoscoleces viability in the control group, two monotherapy groups and combination group. As shown in Fig 8, the experiments demonstrated that compared with monotherapy, the combination group (allicin and sodium stibogluconate) significantly increased the inhibition rate of protoscoleces (P < 0.001). Specifically, compared with the second most potent monotherapy, i.e., allicin (inhibition rate = 55.0% ± 3.0%; 95% CI: 49.6%-60.4%; n=3), the combination regimen demonstrated complete inhibition against protoscoleces with an inhibition rate of 100.0% (n=3). These results highlight the synergistic effect of allicin and sodium stibogluconate combination therapy against echinococcosis.

thumbnail
Fig 8. Compared with the negative control, DMSO solvent (1%), albendazole (15 μM), allicin (36.3 μM) or sodium stibogluconat (850 μM) alone, the combination of allicin (36.3 μM) and sodium stibogluconat (850 μM) has a significantly stronger inhibitory effect on echinococcosis protoscoleces.

https://doi.org/10.1371/journal.pntd.0013991.g008

We then employed SynergyFinder [57] to calculate the synergy scores at various concentration combinations. As shown in Fig 9, the heatmap and 3D synergy plots generated using the ZIP, Bliss, HSA, and Loewe models indicate that the ZIP, Bliss, and HSA scores all exceed 10. These results demonstrate that the allicin and sodium stibogluconate combination exhibits significant synergistic effects within a specific concentration range. Notably, the strongest synergy was observed at a concentration of 850 μM allicin and 36.3 μM sodium stibogluconate, as indicated in the heatmap.

thumbnail
Fig 9. Heatmap and 3D surface plot of four synergy scores (ZIP, Bliss, HSA, and Loewe) for allicin and sodium stibogluconate combination.

Synergistic: synergy score > 10; Additive: –10 < synergy score < 10; Antagonistic: synergy score <–10. A t-test was used to calculate the P-values of the four synergy scores (p < 0.09), suggesting a trend toward significance.

https://doi.org/10.1371/journal.pntd.0013991.g009

Overall, these findings confirm the therapeutic potential of the allicin and sodium stibogluconate combination for treating echinococcosis and further validate the practical applicability of MetaSynMT in predicting effective and safe drug combinations. However, further investigations are required to assess the long-term synergy and safety of this combination therapy.

Conclusion

Parasitic diseases, caused by a wide range of protozoa and helminths, remain a pressing global health problem. While combination therapy has long been recognized as a promising approach to enhance efficacy and mitigate toxicity, the search for optimal drug pairs has largely relied on costly and time-consuming experimental screening. This bottleneck is especially acute for neglected tropical diseases, where funding and experimental resources are limited. Against this backdrop, we developed MetaSynMT, a multi-task learning framework that integrates biomedical data and leverages meta-path-guided aggregation to jointly predict both the synergistic potential and safety profile of candidate drug combinations.

Compared with existing computational approaches for drug synergy prediction, MetaSynMT introduces several distinctive advantages. First, the incorporation of meta-path-based semantic aggregation allows the model to capture high-order relational patterns between drugs and targets—relationships often overlooked by standard network embedding or deep learning models that rely solely on direct interactions. Second, by coupling side effect prediction as an auxiliary task, the framework explicitly addresses the clinical need to balance efficacy with safety, a dimension that is rarely integrated into existing models. Third, the model’s design enables it to generalize effectively across different parasitic diseases, as evidenced by its robust performance on multiple datasets.

The experimental results substantiate these advantages: MetaSynMT not only achieved superior predictive accuracy compared to several state-of-the-art baselines, but also demonstrated strong translational relevance. In vitro validation of the allicin–sodium stibogluconate combination against echinococcosis showed complete inhibition of protoscoleces, confirming the practical utility of our computational predictions. These results reinforce the importance of multi-task learning and parasitic drug-target graph network modeling as effective strategies for accelerating the discovery of clinically viable drug combinations, particularly in resource-constrained research domains.

Despite these promising outcomes, several limitations of the current framework warrant discussion. First, the accuracy of MetaSynMT’s predictions is inherently dependent on the quality and completeness of the biomedical datasets used. Incomplete or biased drug–target associations, disease similarity measures, or side effect records may lead to suboptimal predictions. Second, while the meta-path-based aggregation effectively captures structural and semantic relationships, it currently relies on a predefined set of meta-paths, which may not encompass all biologically relevant interaction patterns. Automated or data-driven meta-path discovery could further enhance representational power. Third, the side effect prediction module treats toxicity risk as a single aggregated score, without considering dose-dependence or patient-specific factors, which limits its applicability to personalized treatment planning.

Future work could address these limitations in several ways. Expanding the framework to incorporate additional omics data, such as transcriptomics, metabolomics, and host immune response profiles, may help capture broader biological contexts. Incorporating temporal and dosage-dependent modeling could improve the realism and clinical applicability of synergy and safety predictions. Additionally, integrating patient-specific data could pave the way toward personalized antiparasitic combination therapies. Finally, extending the methodology beyond parasitic diseases to other infectious and neglected diseases could further demonstrate its generalizability and impact.

Taken together, these findings position MetaSynMT as a promising advancement in computational drug combination discovery, offering a framework that not only bridges efficacy and safety considerations but also demonstrates strong potential for translation into real-world therapeutic strategies.

Discussion

Despite proposing a drug combination discovery framework with preliminary validation in this study, we acknowledge several limitations. Firstly, the literature data used possess inherent noise and potential bias. Secondly, there remains a lack of extensive external experimental validation and the reliance on a binary safety proxy indicator. In conclusion, the MetaSynMT framework serves as an early-stage drug discovery and prioritization tool to help identify potentially high-potential, synergistic, and lower-risk candidate combinations. Its predictions are designed to inform subsequent steps in the drug development pipeline, including preclinical dose optimization and regulatory assessment for new combinations, guiding rather than replacing rigorous preclinical and clinical research.

In future work, we plan to integrate multi-omics data and conduct multidimensional biological experiments to further elucidate the synergistic mechanisms of drug combinations. Furthermore, we also plan to develop graph neural network methods capable of adaptively fusing multi-path semantics. Such methods should possess the following capabilities: first, to dynamically learn the importance weights of different meta-paths based on downstream tasks; second, to efficiently screening key subsets within complex path spaces to avoid noise interference; and third, to maintain computational efficiency even as meta-path combinations grow combinatorially. Achieving such data-driven meta-path discovery and aggregation would enable models to transcend manually designed patterns and uncover latent associations with enhanced biological interpretability.

Supporting information

S1 Text. Experimental design on the effect of drug combinations on protoscolex viability and supplementary graph of the PR curve.

https://doi.org/10.1371/journal.pntd.0013991.s001

(DOCX)

Acknowledgments

We gratefully acknowledge the technical support provided by the Key Laboratory of Intelligent Computing and Signal Processing of Anhui University.

References

  1. 1. Ward KE, Fidock DA, Bridgford JL. Plasmodium falciparum resistance to artemisinin-based combination therapies. Curr Opin Microbiol. 2022;69:102193. pmid:36007459
  2. 2. Sharma R, Sharma H, Jones S, Borghini-Fuhrer I, Domingo GJ, Gibson RA, et al. Optimal balance of benefit versus risk for tafenoquine in the treatment of Plasmodium vivax malaria. Malar J. 2024;23(1):145. pmid:38741094
  3. 3. Singh N, Vayer P, Tanwar S, Poyet J-L, Tsaioun K, Villoutreix BO. Drug discovery and development: introduction to the general public and patient groups. Front Drug Discov. 2023;3.
  4. 4. Ertabaklar H, Malatyal E, Ertu S. Drug resistance in parasitic diseases. 2020.
  5. 5. Wen T-H, Tsai K-W, Wu Y-J, Liao M-T, Lu K-C, Hu W-C. The framework for human host immune responses to four types of parasitic infections and relevant key JAK/STAT signaling. Int J Mol Sci. 2021;22(24):13310. pmid:34948112
  6. 6. Rao SPS, Manjunatha UH, Mikolajczak S, Ashigbie PG, Diagana TT. Drug discovery for parasitic diseases: powered by technology, enabled by pharmacology, informed by clinical science. Trends Parasitol. 2023;39(4):260–71. pmid:36803572
  7. 7. Fairhurst RM, Nayyar GML, Breman JG, Hallett R, Vennerstrom JL, Duong S, et al. Artemisinin-resistant malaria: research challenges, opportunities, and public health implications. Am J Trop Med Hyg. 2012;87(2):231–41. pmid:22855752
  8. 8. Hanboonkunupakarn B, White NJ. Advances and roadblocks in the treatment of malaria. Br J Clin Pharmacol. 2022;88(2):374–82. pmid:32656850
  9. 9. Ahmad SS, Rahi M, Ranjan V, Sharma A. Mefloquine as a prophylaxis for malaria needs to be revisited. Int J Parasitol Drugs Drug Resist. 2021;17:23–6. pmid:34339933
  10. 10. Borisy AA, Elliott PJ, Hurst NW, Lee MS, Lehar J, Price ER, et al. Systematic discovery of multicomponent therapeutics. Proc Natl Acad Sci U S A. 2003;100(13):7977–82. pmid:12799470
  11. 11. Jitobaom K, Boonarkart C, Manopwisedjaroen S, Punyadee N, Borwornpinyo S, Thitithanyanont A, et al. Synergistic anti-SARS-CoV-2 activity of repurposed anti-parasitic drug combinations. BMC Pharmacol Toxicol. 2022;23(1):41. pmid:35717393
  12. 12. Rao SPS, Manjunatha UH, Mikolajczak S, Ashigbie PG, Diagana TT. Drug discovery for parasitic diseases: powered by technology, enabled by pharmacology, informed by clinical science. Trends Parasitol. 2023;39(4):260–71. pmid:36803572
  13. 13. Singh B, Varikuti S, Halsey G, Volpedo G, Hamza OM, Satoskar AR. Host-directed therapies for parasitic diseases. Future Med Chem. 2019;11(15):1999–2018. pmid:31390889
  14. 14. Mason DJ, Eastman RT, Lewis RPI, Stott IP, Guha R, Bender A. Using machine learning to predict synergistic antimalarial compound combinations with novel structures. Front Pharmacol. 2018;9:1096. pmid:30333748
  15. 15. Ferreira Junior MA. Reposicionamento de fármacos para o tratamento da doença de Chagas: desenvolvimento de modelos computacionais baseados em ligantes ativos fenotipicamente contra o Trypanosoma cruzi e na inibição da diidroorotato desidrog.. Universidade de São Paulo; 2021.
  16. 16. Roche-Lima A, Rosado-Quinones AM, Feliu-Maldonado RA, Figueroa-Gispert MDM, D´ıaz-Rivera J, D´ıaz-Gonz´alez RG. Antimalarial drug combination predictions using the machine learning synergy predictor (MLSyPred) tool. Acta Parasitologica. 2024;69(1):415–25.
  17. 17. Sharifani K, Amini M. Machine learning and deep learning: a review of methods and applications. World Information Technology and Engineering Journal. 2023;10(07):3897–904.
  18. 18. Wang J, Liu X, Shen S, Deng L, Liu H. DeepDDS: deep graph neural network with attention mechanism to predict synergistic drug combinations. Brief Bioinform. 2022;23(1):bbab390. pmid:34571537
  19. 19. Li H, Zou L, Kowah JAH, He D, Wang L, Yuan M, et al. Predicting drug synergy and discovering new drug combinations based on a graph autoencoder and convolutional neural network. Interdiscip Sci. 2023;15(2):316–30. pmid:36943614
  20. 20. Ma J, Wang J, Ghoraie LS, Men X, Haibe-Kains B, Dai P. A comparative study of cluster detection algorithms in protein-protein interaction for drug target discovery and drug repurposing. Front Pharmacol. 2019;10:109. pmid:30837876
  21. 21. Yang J, Xu Z, Wu WKK, Chu Q, Zhang Q. GraphSynergy: a network-inspired deep learning model for anticancer drug combination prediction. J Am Med Inform Assoc. 2021;28(11):2336–45. pmid:34472609
  22. 22. Zuo Y, Zhang Y, Wang L, Yu J, Luo J, Xiao Q. Synergistic drug combination prediction via dual-level feature aggregation and knowledge graph-based deep neural network. IEEE J Biomed Health Inform. 2025;29(11):7945–54. pmid:40327469
  23. 23. Chen M, Zhang M, Yan G, Wang G, Qu C. MRHGNN: enhanced multimodal relational hypergraph neural network for synergistic drug combination forecasting. IEEE Trans Neural Netw Learn Syst. 2025;36(9):17086–98. pmid:40193258
  24. 24. Xue Q, Ni J, Ji C. HGCLSynergy: heterogeneous graph contrastive learning for drug synergy prediction. in: Proceedings of the 2025 5th International Conference on Bioinformatics and Intelligent Computing. 2025. p. 94–100. https://doi.org/10.1145/3724979.3724995
  25. 25. Fang Y, Lin W, Zheng VW, Wu M, Shi J, Chang KC-C, et al. Metagraph-based learning on heterogeneous graphs. IEEE Trans Knowl Data Eng. 2021;33(1):154–68.
  26. 26. Meng C, Cheng R, Maniu S, Senellart P, Zhang W. Discovering meta-paths in large heterogeneous information networks. In: Proceedings of the 24th International Conference on World Wide Web. 2015. p. 754–64. https://doi.org/10.1145/2736277.2741123
  27. 27. Nakamura A. Combination therapy with risperidone and high-dose melatonin is effective despite reversible side effects including breast budding. Cureus. 2024;16(11):e74607. pmid:39735055
  28. 28. Talu Erten P. Evaluation of the efficacy, side effects, and drug survival data with methotrexate-leflunomide combination therapy in patients with inflammatory arthritis. meandros. 2023;24(3):232–6.
  29. 29. Lin J, He Y, Ru C, Long W, Li M, Wen Z. Advancing adverse drug reaction prediction with deep chemical language model for drug safety evaluation. Int J Mol Sci. 2024;25(8):4516. pmid:38674100
  30. 30. Rappaport N, Twik M, Plaschkes I, Nudel R, Iny Stein T, Levitt J, et al. MalaCards: an amalgamated human disease compendium with diverse clinical and genetic annotation and structured search. Nucleic Acids Res. 2017;45(D1):D877–87. pmid:27899610
  31. 31. Rouillard AD, Gundersen GW, Fernandez NF, Wang Z, Monteiro CD, McDermott MG, et al. The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database (Oxford). 2016;2016:baw100. pmid:27374120
  32. 32. White J. PubMed 2.0. Medical Reference Services Quarterly. 2020;39(4):382–7.
  33. 33. Stelzer G, Rosen N, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, et al. The GeneCards suite: from gene data mining to disease genome sequence analyses. Curr Protoc Bioinformatics. 2016;54:1.30.1-1.30.33. pmid:27322403
  34. 34. Niwattanakul S, Singthongchai J, Naenudorn E, Wanapu S. Using of Jaccard coefficient for keywords similarity. In: Proceedings of the international multiconference of engineers and computer scientists. vol. 1; 2013. p. 380–4.
  35. 35. Kipf TN, Welling M. Semi-supervised classification with graph convolutional networks. arXiv preprint 2016. https://arxiv.org/abs/1609.02907
  36. 36. Veličković P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y. Graph attention networks. arXiv preprint 2017. https://arxiv.org/abs/1710.10903
  37. 37. Fillinger L, Walter S, Ley M, Keska-Izworska K, Dehkordi LG, Kratochwill K, et al. Computational modeling approaches and regulatory pathways for drug combinations. Drug Discov Today. 2025;30(5):104345. pmid:40158836
  38. 38. Dong Y, Chang Y, Wang Y, Han Q, Wen X, Yang Z, et al. MFSynDCP: multi-source feature collaborative interactive learning for drug combination synergy prediction. BMC Bioinformatics. 2024;25(1):140. pmid:38561679
  39. 39. Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, et al. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483(7391):603–7. pmid:22460905
  40. 40. Church KW. Word2Vec. Natural Language Engineering. 2017;23(1):155–62.
  41. 41. Bento AP, Hersey A, Félix E, Landrum G, Gaulton A, Atkinson F, et al. An open source chemical structure curation pipeline using RDKit. J Cheminform. 2020;12(1):51. pmid:33431044
  42. 42. Vig J, Madani A, Varshney LR, Xiong C, Socher R, Rajani NF. Bertology meets biology: interpreting attention in protein language models. arXiv preprint 2020. https://arxiv.org/abs/2006.15222
  43. 43. Ahmad W, Simon E, Chithrananda S, Grand G, Ramsundar B. Chemberta-2: towards chemical foundation models. arXiv preprint 2022. https://arxiv.org/abs/2209.01712
  44. 44. Lin Z, Akin H, Rao R, Hie B, Zhu Z, Lu W, et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science. 2023;379(6637):1123–30. pmid:36927031
  45. 45. Knox C, Wilson M, Klinger CM, Franklin M, Oler E, Wilson A, et al. DrugBank 6.0: the DrugBank Knowledgebase for 2024 . Nucleic Acids Res. 2024;52(D1):D1265–75. pmid:37953279
  46. 46. UniProt Consortium. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 2019;47(D1):D506–15. pmid:30395287
  47. 47. Perozzi B, Al-Rfou R, Skiena S. DeepWalk: online learning of social representations. ACM. 2014.
  48. 48. Li C, Zhang Y, Pang M, Zhang Y, Hu C, Fan H. Metabolic mechanism and pharmacological study of albendazole in secondary hepatic alveolar echinococcosis (HAE) model rats. Antimicrobial Agents and Chemotherapy. 2024;68(5):e01449-23.
  49. 49. Liu J, Wang F, Wang X, Fan S, Li Y, Xu M, et al. Antiviral effects, tissue exposure of tetrandrine against SARS-CoV-2 infection and COVID-19. MedComm 2020 . 2023;4(1):e206. pmid:36699286
  50. 50. Fernandes KS, de Souza PEN, Dorta ML, Alonso A. The cytotoxic activity of miltefosine against Leishmania and macrophages is associated with dynamic changes in plasma membrane proteins. Biochim Biophys Acta Biomembr. 2017;1859(1):1–9. pmid:27773565
  51. 51. Specht S, Pfarr KM, Arriens S, Hübner MP, Klarmann-Schulz U, Koschel M, et al. Combinations of registered drugs reduce treatment times required to deplete Wolbachia in the Litomosoides sigmodontis mouse model. PLoS Negl Trop Dis. 2018;12(1):e0006116. pmid:29300732
  52. 52. Jia TW, Wang W, Zhou YB, Zhou J, Mei ZQ, Li SZ. Taxonomic rank of human parasites. Zhongguo Xue Xi Chong Bing Fang Zhi Za Zhi. 2022;34(4):420–8. pmid:36116936
  53. 53. Bjerum CM, Koudou BG, Ouattara AF, Lew D, Goss CW, Gabo PT, et al. Safety and tolerability of moxidectin and ivermectin combination treatments for lymphatic filariasis in Côte d’Ivoire: a randomized controlled superiority study. PLoS Negl Trop Dis. 2023;17(9):e0011633. pmid:37721964
  54. 54. van Henten S, Pareyn M, Tadesse D, Kassa M, Techane M, Kinfe E, et al. Community-based treatment of cutaneous leishmaniasis using cryotherapy and miltefosine in Southwest Ethiopia: the way forward?. Front Med (Lausanne). 2023;10:1196063. pmid:37886367
  55. 55. Gouveia MJ, Brindley PJ, Gärtner F, Vale N. Activity of combinations of antioxidants and anthelmintic drugs against the adult stage of schistosoma mansoni. J Parasitol Res. 2020;2020:8843808. pmid:32832132
  56. 56. Puchkov VM, Lyfenko AD, Koval VS, Revtovich SV, Kulikova VV, Anufrieva NV. Synthesis and antimicrobial activity of thiosulfinates and allicin analogues. Molecular Biology. 2024;58(6):1074–81.
  57. 57. Ianevski A, Giri AK, Aittokallio T. SynergyFinder 3.0: an interactive analysis and consensus interpretation of multi-drug synergies across multiple samples. Nucleic Acids Res. 2022;50(W1):W739–43. pmid:35580060