ProtAttn-QuadNet: An attention-based deep learning framework for protein–protein interaction prediction using ProtBERT embeddings

Md. Shahidul Islam; Md. Muhtasim Rahman Mim; Md. Raihan Kabir

doi:10.1371/journal.pone.0349433

Abstract

Protein–protein interactions (PPIs) form the backbone of most cellular processes, governing signal transduction, gene regulation, and metabolic control. However, experimental approaches to identifying PPIs remain expensive, laborious, and often incomplete. Recent advances in protein language models (PLMs) have transformed sequence-based PPI prediction by enabling deep contextual encoding of biochemical and structural information directly from amino acid sequences. Building upon this progress, we present ProtAttn-QuadNet, an attention-based deep learning framework that leverages ProtBERT embeddings to model reciprocal dependencies between protein pairs. The proposed model employs a quad-stream attention mechanism that integrates individual protein features, synergistic interactions, and complementary differences through multi-level self- and cross-attention layers. This architecture enables the discovery of fine-grained relational patterns while ensuring balanced bidirectional modeling of interacting proteins. Evaluated on the independent test set of a large-scale dataset from UniProt, ProtAttn-QuadNet achieves 97.16% accuracy (AUC-ROC 99.00%) on balanced data and 99.19% accuracy (AUC-ROC 99.76%) on oversampled datasets, surpassing several recent state-of-the-art PPI prediction methods. Statistical validation using the Chi-square and Wilcoxon signed-rank tests confirms the model’s predictive significance and reliability. ProtAttn-QuadNet offers a powerful computational framework for large-scale PPI prediction.

Citation: Islam MS, Mim MMR, Kabir MR (2026) ProtAttn-QuadNet: An attention-based deep learning framework for protein–protein interaction prediction using ProtBERT embeddings. PLoS One 21(6): e0349433. https://doi.org/10.1371/journal.pone.0349433

Editor: Musa Aydin, Samsun University: Samsun Universitesi, TÜRKIYE

Received: November 26, 2025; Accepted: April 30, 2026; Published: June 2, 2026

Copyright: © 2026 Islam et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The primary data are available from UniProt (https://www.uniprot.org/uniprotkb?query=reviewed:true). All reviewed (Swiss-Prot) entries from UniProtKB were used in this study, comprising 573,661 protein sequences. The processed data and code supporting this study are publicly available on Figshare and can be accessed through the following link: https://doi.org/10.6084/m9.figshare.30637145.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Protein–protein interactions (PPIs) are fundamental to almost all cellular processes, including signal transduction, gene expression regulation, metabolic control, and immune responses [1–3]. Understanding the complex network of PPIs provides valuable insights into cellular functions and disease mechanisms [4,5]. Although numerous experimental techniques, such as yeast two-hybrid screening, co-immunoprecipitation, and affinity purification coupled with mass spectrometry, have been developed to detect PPIs, these methods remain time-consuming, costly, and often limited in coverage [6]. Consequently, computational prediction methods have become indispensable for large-scale PPI analysis.

Early computational approaches primarily relied on handcrafted sequence features, including amino acid composition, evolutionary profiles, and physicochemical descriptors. Classical machine learning algorithms such as Support Vector Machines (SVM), Random Forests (RF), and Bayesian classifiers were employed to classify interacting protein pairs based on these features [7–12]. While these models demonstrated moderate success, their dependence on manually engineered descriptors and incomplete structural data limited their generalization capabilities, particularly across species and diverse protein families [11,13–15].

The increasing availability of large-scale protein sequence databases has encouraged sequence-based prediction methods that rely less on structural information. Deep learning has substantially advanced this field by enabling hierarchical feature extraction and representation learning. DeepPPI [16] used a fully connected neural network to model complex non-linear relationships between protein features, whereas DPPI [17] applied a Siamese-like convolutional architecture to learn symmetric relationships between interacting proteins. Similarly, PIPR [18] introduced a residual recurrent convolutional neural network (RCNN) to capture both local motifs and long-range dependencies, while Wu et al. proposed DL-PPI [19], a graph neural network–based model that integrates multi-scale features and attention mechanisms to enhance relational reasoning among proteins. These architectures collectively improved predictive performance but often struggled with interpretability, data imbalance, and computational efficiency.

Recent advances in transformer architectures and PLMs have transformed sequence-based PPI prediction by learning contextualized residue representations through self-attention mechanisms. Pretrained models such as ProtTrans [5], ProtBERT [20], and ESM-2 [21] encode rich biochemical and evolutionary information from massive unlabeled protein corpora, effectively capturing secondary and tertiary structure tendencies directly from primary sequences. Several recent studies have leveraged these embeddings for PPI prediction using hybrid deep architectures. For example, xCAPT5 [22] integrated ProtTrans embeddings with a multi-kernel convolutional network to capture local and global dependencies, while TUnA [23] incorporated uncertainty modeling within a transformer framework to improve robustness. PPI-Graphomer [24] combined pretrained language models with graph transformers to integrate sequence and structural representations, achieving high performance across benchmark datasets.

Despite these advances, existing frameworks often treat protein pairs asymmetrically and fail to explicitly model the reciprocal dependencies inherent in protein–protein binding. Moreover, many attention-based methods focus on single-sequence encoding and overlook bidirectional relationships critical to interaction dynamics.

To address these limitations, this study proposes ProtAttn-QuadNet, an attention-based deep learning framework that leverages ProtBERT embeddings to model mutual interactions between protein pairs. By incorporating a quad-stream attention mechanism and a reciprocal representation module, ProtAttn-QuadNet identifies key sequence regions contributing to binding affinity and ensures balanced interaction assessment. The proposed model was evaluated on a dataset collected from UniProt [25] and demonstrated high predictive accuracy and robustness across species. Overall, ProtAttn-QuadNet offers a reliable computational framework for large-scale PPI prediction and contributes to a deeper understanding of cellular interaction networks.

Materials and methods

Dataset

A total of 573,661 protein entries were collected from the UniProtKB database [25]. Proteins were divided into interacting and non-interacting groups. Duplicate interaction pairs were removed; if both A–B and B–A were present, only one was kept. Each protein pair was labeled 1 for interaction (positive) or 0 for non-interaction (negative). To balance the dataset, two approaches were applied: (a) selecting an equal number of positive and negative samples, resulting in 249,814 protein pairs (124,907 positive, 124,907 negative) and 157,839 unique proteins, and (b) oversampling the positive class to include all proteins, resulting in 1,082,662 protein pairs (541,331 positive, 541,331 negative) and 573,661 unique proteins.

A separate dataset of 573,661 protein sequences was used for sequence embedding. Each protein sequence was converted into a 1024-dimensional numerical vector using ProtBERT embeddings [20], which were then used as model input.

Data preprocessing

In this study, robust scaling was used to normalize protein embedding vectors. The transformation is defined as:

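$$X_{\text{scaled}} = \frac{X - \operatorname{median}(X)}{\operatorname{IQR}(X)}$$ (1)

where $X \in \mathbb{R}^{N \times D}$ is the protein embedding matrix with N proteins and D = 1024 dimensions, and IQR is the interquartile range ($Q_3 - Q_1$).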

Biological data such as protein embeddings often exhibit non-Gaussian distributions, contain outliers, and represent diverse protein families, making standard normalization approaches less effective. Protein embeddings derived from neural language models or structural encoders may show heavy-tailed distributions, with extreme values arising from rare proteins, unusual structural motifs, membrane domains, or intrinsically disordered regions. Furthermore, since these embeddings are pre-computed by external models and generated outside our training pipeline, their statistical properties are not guaranteed to follow a normal distribution. Under such conditions, StandardScaler, which relies on the mean and standard deviation, can be strongly influenced by extreme values. Similarly, MinMaxScaler, which depends on the minimum and maximum values, is sensitive to outliers, which can compress the majority of the data into a narrow range. In contrast, RobustScaler centers the data using the median and scales it by the interquartile range (IQR), both of which are less affected by extreme observations. This makes robust scaling more suitable for stabilizing embedding feature distributions and improving the reliability of downstream model training.
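The median/IQR transformation described above can be illustrated with a minimal standard-library sketch. The function name `robust_scale`, the toy matrix, and the "inclusive" quantile rule are assumptions made for illustration; the study itself refers to RobustScaler, whose exact quantile convention may differ slightly.

```python
from statistics import median, quantiles

def robust_scale(matrix):
    """Scale each column as (x - median) / IQR, where IQR = Q3 - Q1."""
    n_cols = len(matrix[0])
    scaled = [[0.0] * n_cols for _ in matrix]
    for j in range(n_cols):
        column = [row[j] for row in matrix]
        med = median(column)
        # Quartiles by linear interpolation (an assumed convention).
        q1, _, q3 = quantiles(column, n=4, method="inclusive")
        iqr = (q3 - q1) or 1.0  # guard against a constant column
        for i, value in enumerate(column):
            scaled[i][j] = (value - med) / iqr
    return scaled

# Toy 5 x 2 "embedding" matrix; the second column contains one outlier.
toy = [[1.0, 10.0], [2.0, 11.0], [3.0, 12.0], [4.0, 13.0], [5.0, 500.0]]
scaled = robust_scale(toy)
```

In this toy example the outlier (500.0) remains extreme after scaling, but it does not change the median or IQR, so the other four values in that column keep the same spread as the outlier-free column.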

Dataset splitting

A stratified splitting strategy was employed to ensure balanced class distribution across training, validation, and test sets while maintaining statistical rigor for model evaluation. To achieve this, a two-stage stratified sampling approach was adopted instead of a single-stage three-way split. Single-stage splitting can introduce subtle class imbalances due to rounding effects when dividing samples into three partitions simultaneously, whereas two-stage splitting manages one binary decision at a time (train+validation vs. test, then train vs. validation), ensuring precise stratification.

In the first stage, a fully isolated test set (12%) was created and kept untouched during model development, hyperparameter tuning, and validation to prevent data leakage. The second stage applied a test size of 0.225 to the remaining 88% of data, yielding a validation set that accounted for approximately 20% of the total dataset (0.225 × 0.88 ≈ 0.20). This method preserved nested stratification, maintaining the original class distributions at both stages, which a single three-way split does not guarantee.

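Formally, given a dataset $D = \{(x_i, y_i)\}_{i=1}^{N}$ with binary interaction labels $y_i$, where $y_i \in \{0, 1\}$, stratified splitting was performed as described above to preserve the proportions of positive and negative interactions across all partitions.

Stage 1: Training+Validation vs Test Split

$$D \rightarrow D_{\text{train+val}} \cup D_{\text{test}}, \quad |D_{\text{test}}| \approx 0.12\,|D|$$

Stage 2: Training vs Validation Split

$$D_{\text{train+val}} \rightarrow D_{\text{train}} \cup D_{\text{val}}, \quad |D_{\text{val}}| \approx 0.225\,|D_{\text{train+val}}|$$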

Thus, the final balanced dataset contains 169,873 training samples, 49,963 validation samples, and 29,978 test samples, while the oversampled dataset contains 736,210 training samples, 216,532 validation samples, and 129,920 test samples.
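The two-stage stratified procedure can be sketched with the standard library as follows. The helper `stratified_split`, the toy labels, and the fixed seed are hypothetical; only the fractions 0.12 and 0.225 come from the text, and in practice a library routine such as scikit-learn's `train_test_split` with its `stratify` argument would typically be used.

```python
import random

def stratified_split(indices, labels, holdout_fraction, rng):
    """Split indices into (kept, held_out), sampling each class separately."""
    kept, held_out = [], []
    for cls in sorted(set(labels[i] for i in indices)):
        members = [i for i in indices if labels[i] == cls]
        rng.shuffle(members)
        n_hold = round(len(members) * holdout_fraction)
        held_out.extend(members[:n_hold])
        kept.extend(members[n_hold:])
    return kept, held_out

rng = random.Random(0)
labels = [0] * 500 + [1] * 500          # toy balanced labels
all_idx = list(range(len(labels)))

# Stage 1: isolate 12% of the data as the untouched test set.
train_val, test = stratified_split(all_idx, labels, 0.12, rng)
# Stage 2: take 22.5% of the remaining 88% as the validation set.
train, val = stratified_split(train_val, labels, 0.225, rng)

print(len(train), len(val), len(test))
```

Because each class is sampled separately at both stages, every partition of the toy data keeps the 1:1 class ratio.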

Feature engineering

Our objective is to enhance interaction-aware representations that capture diverse functional dependencies in protein–protein interactions (PPIs). We introduce two key feature types, Element-wise Product and Absolute Difference, which collectively capture complementary interaction dynamics.

Element-wise Product focuses on synergistic relationships by emphasizing dimensions where both proteins exhibit strong activations simultaneously. When both embeddings express high values in the same dimension, their product yields a large value, reflecting cooperative or co-regulatory behavior.

For protein embeddings $x_1, x_2 \in \mathbb{R}^{1024}$, we define the interaction feature as:

$$x_{\text{int}} = x_1 \odot x_2$$ (2)

where x_int captures synergistic activation, representing cooperative dimensions where both proteins contribute strongly.

Absolute Difference, on the other hand, models complementary relationships by quantifying divergence between embeddings. Large values indicate distinct functional behaviors or differing biological characteristics between the two proteins.

We define a difference feature as:

$$x_{\text{diff}} = |x_1 - x_2|$$ (3)

where x_diff captures complementary activation, representing the magnitude of contrast across feature dimensions.

Gaussian Noise Injection for Data Augmentation

To enhance the robustness and generalization of the model, we introduce Gaussian noise to the protein embeddings generated by ProtBERT. Since these embeddings capture rich sequence-level features, small perturbations simulate natural variation in protein representations, reduce overfitting, and encourage the model to learn generalizable patterns of interactions rather than memorizing specific embeddings. Formally, Gaussian noise is added independently to both protein embeddings in each interaction pair:

$$\tilde{x}_1 = x_1 + \epsilon_1, \quad \tilde{x}_2 = x_2 + \epsilon_2$$ (4)

where $x_1, x_2$ are the original scaled embeddings, $\epsilon_1, \epsilon_2$ are independent Gaussian noise vectors, $\epsilon_1, \epsilon_2 \sim \mathcal{N}(0, \sigma^2 I)$, and $I$ is the identity matrix.
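A minimal sketch of how the four per-pair inputs could be assembled is shown below. The function name `pair_features`, the `noise_std` parameter, and the choice to add noise before computing the product and difference are assumptions; the text does not state the noise scale or the order of these steps.

```python
import random

def pair_features(x1, x2, noise_std=0.0, rng=None):
    """Return (x1, x2, x_int, x_diff) for one protein pair.

    Assumption: Gaussian noise, when enabled, is added to both embeddings
    before the interaction and difference features are computed.
    """
    rng = rng or random.Random()
    if noise_std > 0:
        x1 = [v + rng.gauss(0.0, noise_std) for v in x1]
        x2 = [v + rng.gauss(0.0, noise_std) for v in x2]
    x_int = [a * b for a, b in zip(x1, x2)]        # element-wise product
    x_diff = [abs(a - b) for a, b in zip(x1, x2)]  # absolute difference
    return x1, x2, x_int, x_diff

# Toy 3-dimensional embeddings (real ProtBERT vectors are 1024-dimensional).
a = [1.0, -2.0, 0.5]
b = [3.0, 2.0, 0.5]
_, _, x_int, x_diff = pair_features(a, b)
_, _, x_int_swapped, x_diff_swapped = pair_features(b, a)
```

Both derived features are symmetric in the two proteins, so `x_int` and `x_diff` are unchanged when the pair order is swapped.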

Model architecture

A multi-stream attention model is designed to predict protein–protein interactions while also estimating interaction uncertainty, binding strength, and interaction type. The model takes four different types of features as input and processes them through parallel attention streams and cross-attention layers to capture both individual protein properties and their relationships. The overall architecture of the proposed model is illustrated in Fig 1.

Fig 1. Architecture of the ProtAttn-QuadNet framework.

Four distinct protein feature representations are processed through parallel attention streams and cross-attention layers to predict protein–protein interactions and simultaneously estimate interaction uncertainty, binding strength, and interaction type.

https://doi.org/10.1371/journal.pone.0349433.g001

Advanced attention block

The Advanced Attention Block is designed as the core computational unit of the architecture, which processes each feature stream through a sophisticated attention mechanism. Given an input , it is first projected into a higher-dimensional hidden space as follows:

(5)

The projected features are normalized using layer normalization:

Subsequently, multi-head self-attention is applied:

A feed-forward network with GELU activation is then applied:

In this work, multi-head self-attention is employed to enable the model to focus on multiple aspects of each input dimension:

Four-stream processing

The proposed architecture processes four feature types through parallel attention streams:

Each attention stream is allowed to focus on the most relevant features for its specific input type. This design facilitates specialized processing of individual protein properties, interactions, and differences, while maintaining a consistent architecture across all streams.
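For readers unfamiliar with the attention operation used inside each stream, the following is a generic single-head scaled dot-product attention sketch in plain Python. It is not the authors' implementation: learned projections, multiple heads, and layer normalization are omitted, and how the model forms the token vectors is not specified in the text, so the toy `tokens` are purely hypothetical.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V."""
    d = len(keys[0])
    outputs = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        weights = softmax(scores)
        outputs.append([sum(w * v[j] for w, v in zip(weights, values))
                        for j in range(len(values[0]))])
    return outputs

tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Self-attention: queries, keys, and values all come from the same stream.
self_attended = attention(tokens, tokens, tokens)
# A zero query scores every key equally, so the output is the mean value.
uniform = attention([[0.0, 0.0]], tokens, tokens)
```

The same function covers cross-attention when the queries come from one stream and the keys and values from another.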

Cross-attention fusion

To enable information exchange between streams, cross-attention mechanisms are implemented as follows:

Protein Cross-Attention:

Feature Cross-Attention:

Residual connections are applied to the final stream representations to preserve the original information:

Quad-stream fusion

After the final representations of each stream are computed, they are concatenated to form a combined feature matrix:

(6)

The concatenated representation is then passed through a two-layer fusion network with GELU activation, which reduces dimensionality while preserving critical information:

(7)

where the weight matrices and their dimensions are explicitly defined as:

and b₁, b₂ are the corresponding bias vectors.

This fusion network integrates information from all four streams (Protein 1, Protein 2, Interaction, and Difference) into a single, compact representation while retaining the most important features for downstream prediction tasks.

Multi-task learning framework

A multi-task learning framework is employed to simultaneously predict multiple aspects of protein–protein interactions from the fused representation h_fused. Predictions for four related tasks are generated from the fused representation: interaction probability, uncertainty, binding strength, and interaction type.

The interaction probability is predicted using a sigmoid activation:

(8)

and optimized via binary cross-entropy:

(9)

The uncertainty head predicts a scalar u_i per protein pair using a Softplus activation:

(10)

and the uncertainty loss is defined as:

(11)

This encourages the model to assign lower uncertainty to correct predictions and higher uncertainty to incorrect ones, improving calibration.

The binding strength target is computed directly from the predicted interaction probability and true interaction label, reflecting the confidence in interacting pairs:

(12)

while the predicted binding strength is

(13)

and optimized via mean squared error:

(14)

This ensures that binding strength is emphasized for pairs predicted with high interaction confidence, while non-interacting pairs are assigned zero, aligning with biologically meaningful interactions.

The interaction type is predicted using a Softmax activation:

(15)

If the true interaction type is known, standard cross-entropy is used:

(16)

If the type is unknown, the model is regularized toward a uniform distribution using KL-divergence:

(17)

Finally, the total multi-task loss combines all four components with tunable weights :

(18)

This formulation directly mirrors the implementation in the code, ensuring that each task-specific computation and target is explicitly defined and justified with respect to its role in learning biologically meaningful protein–protein interactions.
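As a rough per-sample illustration of how such a weighted multi-task objective can be combined, consider the sketch below. Several details are assumptions rather than the authors' definitions: the binding-strength target is taken as `y * p` (zero for non-interacting pairs), the KL term is computed as KL(predicted || uniform), the uncertainty term is omitted because its exact form is not recoverable from the text, and the default weights are placeholders.

```python
import math

def bce(p, y, eps=1e-7):
    """Binary cross-entropy for one predicted probability p and label y."""
    p = min(max(p, eps), 1.0 - eps)
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))

def kl_to_uniform(type_probs, eps=1e-7):
    """KL(predicted || uniform); zero when the prediction is uniform."""
    k = len(type_probs)
    return sum(p * math.log(max(p, eps) * k) for p in type_probs)

def multitask_loss(p, y, strength_pred, type_probs, weights=(1.0, 1.0, 1.0)):
    """Weighted sum of interaction, binding-strength, and type losses."""
    w_int, w_bind, w_type = weights
    binding_target = y * p                      # assumed target definition
    l_int = bce(p, y)
    l_bind = (strength_pred - binding_target) ** 2
    l_type = kl_to_uniform(type_probs)          # interaction type unknown
    return w_int * l_int + w_bind * l_bind + w_type * l_type

# One interacting pair predicted with probability 0.9 and no type label.
loss = multitask_loss(p=0.9, y=1, strength_pred=0.9, type_probs=[0.25] * 4)
```

In this example the binding and type terms vanish, so the total equals the binary cross-entropy of the interaction head alone.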

Optimization and hyperparameter search strategy

A systematic grid search integrated with advanced optimization techniques is employed to determine the optimal configuration of the ProtAttn-QuadNet architecture. The search space includes optimizer selection, learning rate, weight decay, batch size, learning rate scheduling, and multi-task loss weighting.

Four learning rates (0.0001, 0.0005, 0.001, 0.002), three weight decays (1 × 10⁻⁵, 1 × 10⁻⁴, 1 × 10⁻³), three optimizers (Adam, AdamW, RMSprop), three schedulers (ReduceLROnPlateau, CosineAnnealing, StepLR), and four batch sizes (32, 64, 128, 256) are explored. Additionally, three pre-configured settings are evaluated to assess stability and performance consistency.

Conservative: LR = 0.0005, WD = 1e-4, AdamW, ReduceLROnPlateau
Aggressive: LR = 0.001, WD = 1e-3, Adam, CosineAnnealing
Balanced: LR = 0.0001, WD = 1e-5, RMSprop, StepLR
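The size of the search space listed above can be made concrete by enumerating it. The dictionary keys and the helper `in_grid` are hypothetical names, and whether the authors evaluated the full Cartesian product or a subset is not stated.

```python
from itertools import product

search_space = {
    "learning_rate": [0.0001, 0.0005, 0.001, 0.002],
    "weight_decay": [1e-5, 1e-4, 1e-3],
    "optimizer": ["Adam", "AdamW", "RMSprop"],
    "scheduler": ["ReduceLROnPlateau", "CosineAnnealing", "StepLR"],
    "batch_size": [32, 64, 128, 256],
}

# Full Cartesian product: 4 * 3 * 3 * 3 * 4 = 432 configurations.
grid = [dict(zip(search_space, combo))
        for combo in product(*search_space.values())]

presets = {
    "Conservative": {"learning_rate": 0.0005, "weight_decay": 1e-4,
                     "optimizer": "AdamW", "scheduler": "ReduceLROnPlateau"},
    "Aggressive": {"learning_rate": 0.001, "weight_decay": 1e-3,
                   "optimizer": "Adam", "scheduler": "CosineAnnealing"},
    "Balanced": {"learning_rate": 0.0001, "weight_decay": 1e-5,
                 "optimizer": "RMSprop", "scheduler": "StepLR"},
}

def in_grid(preset):
    """True if some grid configuration matches every field of the preset."""
    return any(all(cfg[k] == v for k, v in preset.items()) for cfg in grid)

print(len(grid))
```

Each of the three pre-configured settings corresponds to points already inside this grid (one per batch size), so they act as named reference points rather than extra search dimensions.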

The best performing configuration is presented in Table 1.

Table 1. Optimal configurations for balanced and oversampled datasets.

https://doi.org/10.1371/journal.pone.0349433.t001

Results and discussions

Evaluation metrics

Evaluating the performance of our protein–protein interaction (PPI) prediction model is essential for assessing its ability to identify true interactions, minimize false predictions, and generalize to unseen protein pairs. We employed five standard performance metrics: accuracy, precision, recall, F1-score, and AUC-ROC. These metrics collectively assess the model’s overall correctness, reliability of positive predictions, sensitivity to true interactions, and balance between precision and recall. Performance was monitored across all training epochs to ensure stable learning and robust generalization.
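For reference, the five metrics can be computed as in the sketch below. The function names and the toy counts and scores are illustrative assumptions; AUC-ROC is computed here through its pairwise-ranking interpretation, which is only practical for small examples.

```python
def classification_metrics(tp, fp, tn, fn):
    """Accuracy, precision, recall, and F1 from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}

def auc_roc(scores, labels):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

metrics = classification_metrics(tp=90, fp=10, tn=85, fn=15)
auc = auc_roc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
```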

Performance evaluation of the proposed model

We evaluated the performance of the proposed protein–protein interaction (PPI) prediction model using two different datasets: a balanced dataset and an oversampled dataset. The balanced dataset allows us to assess how well the model performs when interacting and non-interacting protein pairs are equally represented, while the oversampled dataset increases the number of rare interactions, enabling us to evaluate how the model handles imbalanced data scenarios.

The comparative summary of model performance is presented in Table 2, highlighting the improvement gained through oversampling and confirming the proposed model’s reliability for large-scale PPI prediction tasks.

Table 2. Comparison of model performance on balanced and oversampled datasets.

https://doi.org/10.1371/journal.pone.0349433.t002

On the balanced dataset, the model performed consistently on both validation and test sets, achieving high accuracy, precision, recall, F1-score, and AUC-ROC. These results indicate that the model effectively identifies true interactions while maintaining a low false-positive rate. The high AUC-ROC further demonstrates strong discriminative power, confirming that the model accurately differentiates between interacting and non-interacting protein pairs. The validation curves (Fig 2) illustrate stable learning across epochs.

Fig 2. Validation accuracy of the model using the balanced dataset.

https://doi.org/10.1371/journal.pone.0349433.g002

When trained on the oversampled dataset, the model achieved even higher overall performance on both validation and test sets. The results indicate excellent accuracy, precision, recall, F1-score, and AUC-ROC, reflecting the model’s strong ability to identify true interactions with minimal false positives. The accuracy progression shown in Fig 3 illustrates stable improvement throughout training. These findings suggest that the oversampling strategy effectively enhanced the model’s capacity to learn from minority interaction samples without introducing overfitting.

Fig 3. Validation accuracy of the model using the oversampled dataset.

https://doi.org/10.1371/journal.pone.0349433.g003

The results demonstrate that the proposed model achieves highly competitive performance across both balanced and oversampled datasets. The oversampling strategy enhances recall and F1-score, improving detection of rare protein–protein interactions while maintaining strong precision. Consistently high AUC-ROC values further validate the model’s robust classification ability, and the confusion matrices in Figs 4 and 5 show that most predictions fall along the true positive and true negative axes, indicating minimal misclassification. These findings establish that the model is effective on well-balanced data and remains reliable under imbalanced conditions, making it a promising framework for large-scale protein–protein interaction discovery and downstream bioinformatics applications.

Fig 4. Confusion matrix on the balanced dataset.

https://doi.org/10.1371/journal.pone.0349433.g004

Fig 5. Confusion matrix on the oversampled dataset.

https://doi.org/10.1371/journal.pone.0349433.g005

The performance of the proposed ProtAttn-QuadNet model was compared against several state-of-the-art protein-protein interaction prediction methods across multiple species and datasets (Table 3).

Table 3. Comparison of ProtAttn-QuadNet model with existing state-of-the-art models.

https://doi.org/10.1371/journal.pone.0349433.t003

ProtAttn-QuadNet demonstrates strong predictive performance across multiple organisms. For the large-scale cross-species UniProt dataset, which contains over 541,000 positive and 541,000 negative interactions, ProtAttn-QuadNet achieved an accuracy of 99.19%, precision of 98.66%, recall of 99.70%, and an F1-score of 99.18%. While smaller organism-specific datasets reported in previous studies differ in size and composition, the organisms partially overlap with those included in our evaluation (e.g., S. cerevisiae and H. pylori). Our results indicate that ProtAttn-QuadNet achieves F1-scores that are comparable to or higher than the highest reported values in these organisms, demonstrating its robustness and generalization across diverse protein sequences.
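As a quick consistency check, the reported F1-score can be recomputed from the reported precision and recall, since F1 is their harmonic mean. The variable names are illustrative.

```python
# Reported precision and recall on the oversampled dataset, as fractions.
precision, recall = 0.9866, 0.9970

# F1 is the harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)
f1_percent = round(100 * f1, 2)
```

The recomputed value rounds to 99.18%, in agreement with the reported F1-score.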

Statistical analysis

Comprehensive statistical analyses were performed to rigorously evaluate the reliability and significance of the proposed PPI prediction model. Three primary statistical tests were applied: the Chi-square test to assess statistical associations, the Wilcoxon signed-rank test to compare model performances, and an effect size analysis to determine the magnitude of observed differences. All statistical tests were conducted at a significance level of 0.05.

The Chi-square ($\chi^2$) test was applied to examine whether a statistically significant association existed between the predicted and true PPI classes (i.e., interacting vs. non-interacting protein pairs). The hypotheses were defined as:

$H_0$: the predicted and true PPI classes are independent (no association).
$H_1$: the predicted and true PPI classes are associated.

The Chi-square statistic was computed as:

$$\chi^2 = \sum_i \frac{(O_i - E_i)^2}{E_i}$$

where O_i and E_i represent the observed and expected frequencies, respectively.

The analysis produced a $\chi^2$ value of 26,671.80 with 1 degree of freedom, p < 0.000001, and a Cramér’s V of 0.944, indicating a very strong statistical association between the predicted and actual PPI outcomes.
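The calculation can be illustrated on a small 2×2 confusion table. The counts below are hypothetical, the function name is an assumption, and the sketch uses the plain Pearson statistic without a continuity correction; the text does not say whether a correction was applied.

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square and Cramér's V for a 2x2 contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    # For a 2x2 table, min(rows - 1, cols - 1) = 1, so V = sqrt(chi2 / n).
    cramers_v = math.sqrt(chi2 / n)
    return chi2, cramers_v

# Rows: true class; columns: predicted class (toy counts).
table = [[95, 5], [5, 95]]
chi2, v = chi_square_2x2(table)
```

For a 2×2 table, Cramér's V, Cohen's w, and the absolute value of the Phi coefficient all reduce to the same quantity, sqrt(χ²/N).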

Fig 6 illustrates the percentage distribution across confusion matrix categories, clearly showing the deviation from random classification behavior.

Fig 6. Distribution of predicted versus actual protein–protein interaction categories.

https://doi.org/10.1371/journal.pone.0349433.g006

To confirm that the proposed PPI model significantly outperformed the random-chance baseline of 50%, we applied a Wilcoxon signed-rank test on the distributions of accuracy, F1-score, and AUC-ROC. Each metric yielded a test statistic of W = 990.0 with a p-value below the 0.05 significance level, indicating that the model’s performance gains were highly significant. Building on this, we quantified the strength of association and the magnitude of the predictive effect using multiple effect size measures. Cramér’s V, Cohen’s w, and the Phi coefficient were all 0.944, reflecting a very large effect size. These results collectively demonstrate that the proposed PPI model not only significantly outperforms a random baseline but also produces predictions that are strongly consistent with true interaction labels, highlighting its practical and statistical relevance.

References

1. Stelzl U, Worm U, Lalowski M, Haenig C, Brembeck FH, Goehler H, et al. A human protein-protein interaction network: a resource for annotating the proteome. Cell. 2005;122(6):957–68. pmid:16169070
2. Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, Knight JR, et al. A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature. 2000;403(6770):623–7. pmid:10688190
3. Lee M. Recent Advances in Deep Learning for Protein-Protein Interaction Analysis: A Comprehensive Review. Molecules. 2023;28(13):5169. pmid:37446831
4. Safari-Alighiarloo N, Taghizadeh M, Rezaei-Tavirani M, Goliaei B, Peyvandi AA. Protein-protein interaction networks (PPI) and complex diseases. Gastroenterology and Hepatology from bed to bench. 2014;7(1):17.
5. Elnaggar A, Heinzinger M, Dallago C, Rehawi G, Wang Y, Jones L, et al. ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning. IEEE Trans Pattern Anal Mach Intell. 2022;44(10):7112–27. pmid:34232869
6. Rao VS, Srinivas K, Sujini GN, Kumar GNS. Protein-protein interaction detection: methods and analysis. Int J Proteomics. 2014;2014:147648. pmid:24693427
7. Guo Y, Yu L, Wen Z, Li M. Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences. Nucleic Acids Res. 2008;36(9):3025–30. pmid:18390576
8. Zhou YZ, Gao Y, Zheng YY. Prediction of protein-protein interactions using local description of amino acid sequence. In: Advances in Computer Science and Education Applications: International Conference, CSE 2011, Qingdao, China, July 9-10, 2011. Proceedings, Part II, 2011. 254–62.
9. Hu L, Chan KCC. Extracting Coevolutionary Features from Protein Sequences for Predicting Protein-Protein Interactions. IEEE/ACM Trans Comput Biol Bioinform. 2017;14(1):155–66. pmid:26812730
10. Ding Y, Tang J, Guo F. Predicting protein-protein interactions via multivariate mutual information of protein sequences. BMC Bioinformatics. 2016;17(1):398. pmid:27677692
11. Ding Z, Kihara D. Computational Methods for Predicting Protein-Protein Interactions Using Various Protein Features. Curr Protoc Protein Sci. 2018;93(1):e62. pmid:29927082
12. Tsubaki M, Tomii K, Sese J. Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences. Bioinformatics. 2019;35(2):309–18. pmid:29982330
13. Aytuna AS, Gursoy A, Keskin O. Prediction of protein-protein interactions by combining structure and sequence conservation in protein interfaces. Bioinformatics. 2005;21(12):2850–5. pmid:15855251
14. Göktepe YE. Protein-protein interaction prediction using enhanced features with spaced conjoint triad and amino acid pairwise distance. PeerJ Comput Sci. 2025;11:e2748. pmid:40134873
15. Bonvin AMJJ. Flexible protein-protein docking. Curr Opin Struct Biol. 2006;16(2):194–200. pmid:16488145
16. Du X, Sun S, Hu C, Yao Y, Yan Y, Zhang Y. DeepPPI: Boosting Prediction of Protein-Protein Interactions with Deep Neural Networks. J Chem Inf Model. 2017;57(6):1499–510. pmid:28514151
17. Hashemifar S, Neyshabur B, Khan AA, Xu J. Predicting protein–protein interactions through sequence-based deep learning. Bioinformatics. 2018;34(17):i802–10.
18. Chen M, Ju CJ-T, Zhou G, Chen X, Zhang T, Chang K-W, et al. Multifaceted protein-protein interaction prediction based on Siamese residual RCNN. Bioinformatics. 2019;35(14):i305–14. pmid:31510705
19. Wu J, Liu B, Zhang J, Wang Z, Li J. DL-PPI: a method on prediction of sequenced protein-protein interaction based on deep learning. BMC Bioinformatics. 2023;24(1):473. pmid:38097937
20. Brandes N, Ofer D, Peleg Y, Rappoport N, Linial M. ProteinBERT: a universal deep-learning model of protein sequence and function. Bioinformatics. 2022;38(8):2102–10. pmid:35020807
21. Lin Z, Akin H, Rao R, Hie B, Zhu Z, Lu W, et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science. 2023;379(6637):1123–30. pmid:36927031
22. Dang TH, Vu TA. xCAPT5: protein-protein interaction prediction using deep and wide multi-kernel pooling convolutional neural networks with protein language model. BMC Bioinformatics. 2024;25(1):106. pmid:38461247
23. Ko YS, Parkinson J, Liu C, Wang W. TUnA: an uncertainty-aware transformer model for sequence-based protein-protein interaction prediction. Brief Bioinform. 2024;25(5):bbae359. pmid:39051117
24. Xie J, Zhang Y, Wang Z, Jin X, Lu X, Ge S, et al. PPI-Graphomer: enhanced protein-protein affinity prediction using pretrained and graph transformer models. BMC Bioinformatics. 2025;26(1):116. pmid:40301762
25. UniProt Consortium. UniProt: a hub for protein information. Nucleic Acids Res. 2015;43(Database issue):D204–12. pmid:25348405
26. Asim MN, Ibrahim MA, Malik MI, Dengel A, Ahmed S. ADH-PPI: An attention-based deep hybrid model for protein-protein interaction prediction. iScience. 2022;25(10):105169. pmid:36267921
27. Zhao Z, Qian P, Yang X, Zeng Z, Guan C, Tam WL. SemiGNN-PPI: Self-ensembling multi-graph neural network for efficient and generalizable protein-protein interaction prediction. arXiv:2305.08316 [Preprint]. 2023. https://arxiv.org/abs/2305.08316
28. Zhang Z, Wang Z, Zhao L, Wang J, Wang C. Multimodal Contrastive Learning for Protein–Protein Interaction Inhibitor Prediction. In: 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024. 1327–33. https://doi.org/10.1109/bibm62325.2024.10822227
29. Tang T, Shen T, Jiang J, Li W, Wang P, Yuan S, et al. Prediction of protein-protein interaction based on interaction-specific learning and hierarchical information. BMC Biol. 2025;23(1):236. pmid:40754535
