Peer Review History

Original SubmissionMarch 11, 2025
Decision Letter - Jeffrey Skolnick, Editor, Nir Ben-Tal, Editor

PCOMPBIOL-D-25-00468

AI-first structural identification of pathogenic protein targets

PLOS Computational Biology

Dear Dr. Bryant,

Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript within 30 days Jun 06 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below.

* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Jeffrey Skolnick

Academic Editor

PLOS Computational Biology

Nir Ben-Tal

Section Editor

PLOS Computational Biology

Additional Editor Comments:

This is an interesting and worthwhile study which could benefit from minor revision. With regards to the comments of reviewer 1, most are reasonable but it is not necessary to do MD simulations as they will not provide additional insights. Other than this caveat, please address the other concerns of both reviewers in your revised paper.

Journal Requirements:

1) Please ensure that the CRediT author contributions listed for every co-author are completed accurately and in full.

At this stage, the following Authors/Authors require contributions: Mihkel Saluri, Michael Landreh, and Patrick Bryant. Please ensure that the full contributions of each author are acknowledged in the "Add/Edit/Remove Authors" section of our submission form.

The list of CRediT author contributions may be found here: https://journals.plos.org/ploscompbiol/s/authorship#loc-author-contributions

2) We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type u2018LaTeX Source Fileu2019 and leave your .pdf version as the item type u2018Manuscriptu2019.

3) Please provide an Author Summary. This should appear in your manuscript between the Abstract (if applicable) and the Introduction, and should be 150-200 words long. The aim should be to make your findings accessible to a wide audience that includes both scientists and non-scientists. Sample summaries can be found on our website under Submission Guidelines:

https://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-parts-of-a-submission

4) We do not publish any copyright or trademark symbols that usually accompany proprietary names, eg ©,  ®, or TM  (e.g. next to drug or reagent names). Therefore please remove all instances of trademark/copyright symbols throughout the text, including:

- ® on page: 14

- TM on page: 14.

5) Your manuscript is missing the following section: Discussion.  Please ensure all required sections are present and in the correct order. Make sure section heading levels are clearly indicated in the manuscript text, and limit sub-sections to 3 heading levels. An outline of the required sections can be consulted in our submission guidelines here:

https://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-parts-of-a-submission 

6) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines: 

https://journals.plos.org/ploscompbiol/s/figures

7) We notice that your supplementary Figures, Tables, and information are included in the manuscript file. Please remove them and upload them with the file type 'Supporting Information'. Please ensure that each Supporting Information file has a legend listed in the manuscript after the references list.

8) Please amend your detailed Financial Disclosure statement. This is published with the article. It must therefore be completed in full sentences and contain the exact wording you wish to be published.

1) State what role the funders took in the study. If the funders had no role in your study, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

9) Your current Financial Disclosure states, "This study was supported by the SciLifeLab & Wallenberg Data Driven Life Science Program (grant: KAW 2020.0239, P.B). Computational resources were enabled by the supercomputing resource Berzelius provided by National Supercomputer Centre at Linköping University and the Knut and Alice Wallenberg foundation with project ids berzelius-2021-29 (Arne Elofsson), Berzelius-2023-267, Berzelius-2024-78 and Berzelius-2024-292 (P.B.). M.L. is supported by a KI faculty-funded Career Position, a Cancerfonden Project grant, a VR Research Environment Grant, a Consolidator Grant from the Swedish Society for Medical Research (SSMF), and the Knut and Alice Wallenberg foundation (2022.0032). "

However, your funding information on the submission form indicates one funder. Please ensure that the funders and grant numbers match between the Financial Disclosure field and the Funding Information tab in your submission form. Note that the funders must be provided in the same order in both places as well.

Reviewers' comments:

Reviewer's Responses to Questions

Reviewer #1: This manuscript utilizes AlphaFold-based protein structure prediction methods to predict the structures of host-pathogen protein-protein interactions (HP-PPIs). The authors applied AlphaFold-Multimer (AFM) and FoldDock protocols to predict the structures of 111 curated HP-PPI complexes, comparing TM-scores to determine that FoldDock is more suitable for HP-PPI prediction. Building upon this, the authors predicted the structures of 9,452 HP-PPIs from the HPIDB database using FoldDock, selecting 30 high-quality predictions based on plDDT ≥ 70 and pDockQ ≥ 0.3 for further analysis. In particular, the authors analyzed the complex between human IGKC and Francisella tularensis IPD and observed that IGKC binds at a distinct site from the IPD dimerization interface. Based on this observation, the authors hypothesized that IPD exists as a dimer and interacts with two IGKC molecules, forming a 1:2:1 heterotetramer structure. To validate this hypothesis, the authors conducted native mass spectrometry (nMS) and confirmed the formation of a 1:2:1 complex, demonstrating that AI-based structure prediction can contribute to identifying biologically relevant protein interactions. This manuscript highlights the potential of AI-driven structural modeling in elucidating host-pathogen protein interactions and predicting novel functional interactions. However, there are several issues to be addressed in the current form of the manuscript.

Major comments

1. The authors selected "high-quality predictions" based on structures with an average plDDT ≥ 70 and pTM-score ≥ 0.9. However, it is questionable whether the average plDDT value and pTM-score across the entire protein structure accurately reflect the reliability of the predicted HP-PPI structures. Since this study aims to predict HP-PPI structures, the plDDT of the binding sites and the PAE values between interacting residues are more relevant. However, this aspect seems to be insufficiently addressed.

2. Along the same line with point 1, the authors mention that they found 30 new structures (p. 4, last paragraph). Do they have any experimental evidence to support that those predictions are reliable?

3. The authors mention that pathogen proteins tend to be structurally unstable or contain a high proportion of intrinsically disordered regions (IDRs) (Structure prediction of novel host-pathogen interactions section, second paragraph). It is well known that AlphaFold2 exhibits hallucination effects when predicting secondary structures in IDR regions, which may artificially inflate plDDT values and affect PPI predictions. To address this issue, it would be beneficial to compare the predicted structures using more recent AI tools such as AlphaFold3 or Chai.

4. Did the authors consider HP-PPI interactions in a physiological environment? In vivo, these proteins do not exist in a static state. To better assess the stability of the predicted complexes, molecular dynamics (MD) simulations should be performed, demonstrating that the predicted binding site remains stable throughout the simulation. This would strengthen the credibility of the predictions.

5. The authors may consider comparing the predictions of monomeric and complex forms of the same molecule. This can provide additional insight on conformational flexibility, etc.

6. The interpretation of the predicted complexes in Figure 3 appears questionable. For example, in Figure 3i, the authors suggest that CSNK2B is involved in the Wnt signaling pathway and that its interaction with nifj may disrupt Wnt signaling or contribute to immune evasion. However, this claim is purely based on functional inference and lacks structural evidence. To support this hypothesis, it is necessary to confirm and provide evidence that the functional active site of CSNK2B overlaps with the binding site of nifj.

7. In Figure 4c, the authors report that approximately 25% of IGKC exists as a dimer, based on nMS experimental results. If so, was the IGKC dimer - IPD dimer - IGKC dimer complex observed? Since nMS data alone cannot distinguish between the IGKC dimer - IPD dimer (2:2) complex and the IGKC monomer - IPD dimer - IGKC monomer (1:2:1) complex, if such a structure is possible, the schematic representation should be revised accordingly.

8. More generally, the stoichiometry alone cannot prove that the predicted structure is correct. For example, the authors may suggest a mutation (based on the predicted structure) that can change the stoichiometry and show that it actually works by using MS experiment. If additional data like this are not provided, the current logic is too weak.

Minor comments

1. There are many grammatical errors. I strongly recommend the authors to review the manuscript thoroughly and carefully.

2. The authors provide the hyperlinks of PDB and UniProt databases, but it should be avoided. Instead, I would suggest using PDB and UniProt IDs only. The authors may want to organize the hyperlinks into a separate table.

3. The manuscript presents the IPD-IGKC heterotetramer structure, but does not specify confidence metrics. If this structure was also predicted using FoldDock, the corresponding confidence scores should be reported.

4. The format of reference 2 is inconsistent with the other references.

Reviewer #2: Saluri et al. in their manuscript “AI-first structural identification of pathogenic protein targets,” provide support for a subset of host-microbe interactions by performing an AlphaFold-based FoldDock method for structural prediction and then filtering high-confidence structures using author-defined criteria.

Overall claims:

The title and key claims of the paper are somewhat misleading, as the interactions that the authors investigated were already known prior to this analysis. The authors merely added structural prediction onto the previous evidential data.

The authors should highlight Dyer et al. 2010 in which the interaction between a bacterial dihydrolipoyl dehydrogenase and human IGKC was first identified.

The authors mention interface analysis, although this is missing from the results section, as the results are exclusively at the protein level.

It is unclear how the authors conclude that they have a 4-fold expansion in structures, as they provide 30 new structural predictions, but they claim earlier that there are 52 known complex structures.

Methods:

The authors justify using TM-score instead of DockQ by stating that DockQ does not perform well when native structures differ in length from predicted ones. However, they later used pDockQ in their analysis. Since pDockQ is derived from IF-plDDT and DockQ, wouldn’t it also inherit the same limitations?

The paper does not provide ROC analysis for TM-score thresholds below 0.9. TM-score ≥ 0.5 is already considered the same fold; I would like to see ROC curves for TM-score thresholds at 0.5 or 0.6.

Results:

The statement in Figure 2d is confusing: “Figure 2d shows the number of HP-PPIs with known structure (n=10) and the number of high-quality predictions with an expected TM-score of 0.9 (above 70 plDDT and pDockQ 0.3, n=30). None of the interactions with known structures were accurately predicted.” Could the authors clarify whether “known structures” refer to monomers or complexes?

The HPIDB 3.0 database reports 55,505 unique protein interactions, yet the manuscript states 69,787 interactions in the first sentence of the “Human-pathogen PPIs from the HPIDB” section. Where does this discrepancy come from?

The authors claim to investigate a structural protein-protein interaction network, yet there is no actual network analysis presented in the study.

Minor:

Typo in Figure 4: IDP → IPD

Reviewer #3: This paper reports the results of an effort to predict the 3D structure of host-pathogen protein-protein complexes. The predictions are performed using tools developed based on AlphaFold (AF) and AlphaFold-Multimer (AF-M), respectively trained on single protein chains and protein complexes from the PDB, previously developed by some of the authors. The predictions are performed on two datasets: a set of 111 (24) unique host-pathogen complexes with known 3D structure from the PDB and that of unique binary protein interactions of 9452 human proteins with ten different pathogens with unknown protein structures. The latter interactions, detected by various experimental techniques, were extracted from the HPIDB. Using various structure quality/reliability measures, widely used by the community (with preference given to the TM score) the authors managed to correctly predict (recall) a small fraction of the known complexes and propose highly reliable 3D structures for a small fraction of the tested PPI set from HPIDB. The predicted structure of one of the host-pathogen PPIs from HPIDB was experimentally verified using native MS, and based on this experimental data the stoichiometry of the native complex was defined.

General comments.

The idea of applying AF and AF-M methods to obtain reliable information on the 3D structures of host-pathogen protein interactions makes great sense as it provides valuable information on the 3D structure details of the interacting partners that can be used to design vaccines or drugs likely to counter pathogen infections. The work is solid and represents means of advancing drug and vaccine design developments against pathogen infections.

Minor comments: points demanding clarification.

1 Page 7: The fraction of reliably predicted structures (TM-score of 0.9) is rather low for the full set of 111 known structures and likely slightly lower for the subset of 24 structures that are more recent than those AF/AF-M were trained on (not explicitly shown but surmised from the average values). Why is this the case? What fraction of these two sets were determined by cryo-EM. What is the distribution of the resolution of these structures? Were cryo-EM structures les well predicted? Were lower resolution structures less well predicted?

2-Figure 2c: This Figure shows the distribution of the average plDDT for each chain in the predicted complexes from the HPIDB (n=8441) and the PDB (n=111), and the legend mentions that “93 complexes out of 111 where both chains have over 70 plDDT on average for the PDB set (84%) and 3472 out of 7948 for the HPIDB set (44%)”. These numbers are much higher than the fraction of PPI with TM-Scores above 0.9 for PPIs from these two sources. This seems a bit surprising. It would therefore be helpful to provide a plot illustrating the relationship between the TM-Score and plDDT.

3- Figure 2d plots the number of HP-PPIs with known structure (n=10) and with high-quality predictions (above 70 plDDT and pDockQ 0.3, n=30) for each pathogen. This information is provided here for the first time. What does ‘known structure’ mean in this context: The structure of the full complex or that of one of the partners (human or pathogen)?

4-The question raised above is part a wider imperfection of this otherwise nice manuscript, which is, a lack of a master Table, or 2 such Tables (in the Suppl ), which list the for each PPI dataset (PDB and HPIDB,) all the relevant details : resolution + experimental method (X-ray/cryo-EM; AF/MS/2-Hb/other);origin (host pathogen); # of total predictions; # of predictions with TM-scores≥0.9; pDockQ >0.3; and plDDT >70% for both chain and any other info (diff in sequence length and fraction of disorder when known). And for the PPI’s from HPIDB also provide information on availability experimental structure(s). This will make it much easier to build on this work for future studies.

5- The experimental validation of the predicted 3D structure of the IPD-IGKC, complex by native MS is very encouraging. Are there plans to carry out more extensive validations?

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: No:  See my review report concerning the need of providing more detailed information on the precise content of the data, and the prediction results. I don't remember having seen a link to the data used and that produced in this work. So please double check.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

Figure resubmission:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. If there are other versions of figure files still present in your submission file inventory at resubmission, please replace them with the PACE-processed versions.

Reproducibility:

To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Attachments
Attachment
Submitted filename: Response to Reviewers.pdf
Revision 1
Decision Letter - Jeffrey Skolnick, Editor, Nir Ben-Tal, Editor

Dear Dr Bryant,

We are pleased to inform you that your manuscript 'AI-first structural identification of pathogenic protein target interfaces' has been provisionally accepted for publication in PLOS Computational Biology.

Please notice a few minor reviewer comments. These can be handled while preparing the final draft of the manuscript.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Jeffrey Skolnick

Academic Editor

PLOS Computational Biology

Nir Ben-Tal

Section Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The comments are uploaded as an attachment.

Reviewer #3: The authors have addressed all my comments and most of those raised by the 2 other reviewers.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #3: None

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: No

Reviewer #3: Yes:  Shoshana J. Wodak

Attachments
Attachment
Submitted filename: Comments.pdf
Formally Accepted
Acceptance Letter - Jeffrey Skolnick, Editor, Nir Ben-Tal, Editor

PCOMPBIOL-D-25-00468R1

AI-first structural identification of pathogenic protein target interfaces

Dear Dr Bryant,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Olena Szabo

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .