Peer Review History
| Original SubmissionOctober 19, 2025 |
|---|
|
PCOMPBIOL-D-25-02136 Paraplume: A fast and accurate paratope prediction method provides insights into repertoire-scale binding dynamics PLOS Computational Biology Dear Dr. Walczak, Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jan 14 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter We look forward to receiving your revised manuscript. Kind regards, Lun Hu Academic Editor PLOS Computational Biology Dominik Wodarz Section Editor PLOS Computational Biology Additional Editor Comments: All reviewers see value in this work, but they raised several critical concerns, such as in-depth analysis of experimental results, extension of ablation studies, and interpretability. Authors should carefully revise their manuscript by thoroughly addressing all these issues. Journal Requirements: 1) Please ensure that the CRediT author contributions listed for every co-author are completed accurately and in full. At this stage, the following Authors/Authors require contributions: Gabriel Athènes, Adam Woolfe, Thierry Mora, and Aleksandra M. Walczak. Please ensure that the full contributions of each author are acknowledged in the "Add/Edit/Remove Authors" section of our submission form. The list of CRediT author contributions may be found here: https://journals.plos.org/ploscompbiol/s/authorship#loc-author-contributions 2) We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type u2018LaTeX Source Fileu2019 and leave your .pdf version as the item type u2018Manuscriptu2019. 3) Please provide an Author Summary. This should appear in your manuscript between the Abstract (if applicable) and the Introduction, and should be 150-200 words long. The aim should be to make your findings accessible to a wide audience that includes both scientists and non-scientists. Sample summaries can be found on our website under Submission Guidelines: https://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-parts-of-a-submission 4) Your manuscript is missing the following sections: Abstract. Please ensure all required sections are present and in the correct order. Make sure section heading levels are clearly indicated in the manuscript text, and limit sub-sections to 3 heading levels. An outline of the required sections can be consulted in our submission guidelines here: https://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-parts-of-a-submission 5) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines: https://journals.plos.org/ploscompbiol/s/figures 6) We notice that your supplementary Figures, and Tables are included in the manuscript file. Please remove them and upload them with the file type 'Supporting Information'. Please ensure that each Supporting Information file has a legend listed in the manuscript after the references list. 7) Please amend your detailed Financial Disclosure statement. This is published with the article. It must therefore be completed in full sentences and contain the exact wording you wish to be published. State the initials, alongside each funding source, of each author to receive each grant. For example: "This work was supported by the National Institutes of Health (####### to AM; ###### to CJ) and the National Science Foundation (###### to AM).". If you did not receive any funding for this study, please simply state: u201cThe authors received no specific funding for this work.u201d Reviewers' comments: Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The review is uploaded as an attachment Reviewer #2: ## Summary The paper introduces Paraplume, a novel computational method for antibody paratope prediction. Unlike approaches that rely on 3D structures, Paraplume is purely sequence-based. Its architecture leverages embeddings from six different Protein Language Models (PLMs). These embeddings are concatenated and fed into a MLP to predict the paratope probability for each amino acid. The method was validated on three benchmark datasets (PECAN, Paragraph, MIPE), demonstrating performance that is competitive with or exceeds existing structure-based methods. ## Strength 1. **Scalability:** The method's primary strength is its computational efficiency. By avoiding 3D structure modeling, Paraplume is a highly scalable tool practical for analyzing antibody repertoires at a scale of millions of sequences. 2. **Strong Empirical Performance:** Despite its architectural simplicity, Paraplume demonstrates excellent performance, notably outperforming methods that rely on *modeled* 3D structures. This provides strong support for the utility of rich sequence embeddings. ## Major Points While the work has clear merits, several key issues regarding the methodology and experimental validation must be addressed. 1. **Lack of Cost-Benefit Analysis for PLM Embeddings:** The paper claims "complementarity" among the six PLMs but only provides a cost analysis for the final 6-PLM (Paraplume) vs. 1-PLM (Paraplume-S) models. The marginal performance gain of each individual PLM is not weighed against its specific computational cost (e.g., inference time, VRAM usage). 2. **Confounding Variable: Model Capacity vs. Informational Synergy:** The core methodological claim is confounded. Concatenating six embeddings does not just add "complementary information"; it massively increases the input dimensionality and, consequently, the total parameter count of the MLP (especially in the first hidden layer). The observed performance gain may simply be an artifact of this increased model capacity, rather than true informational synergy. 3. **Inadequate Ablation Studies:** Following Point 2, the ablation studies (Table S1) are insufficient. They compare a 6-PLM model to 1-PLM models *with the same MLP architecture*. A proper control is missing: a 1-PLM model (e.g., Paraplume-S) fed into a "wider" or "deeper" MLP with a parameter count equivalent to the full Paraplume model. Without this control, the central claim of "complementarity" is unsubstantiated. 4. **Omission of Key Model Comparisons:** The paper omits comparisons to a critical third class of models: structure-aware sequence models (e.g., ProstT5, TM-vec). These models explicitly integrate 3D knowledge during pre-training but require only sequence input at inference. To validate the claim that Paraplume's *implicit* structural information is superior, it must be benchmarked against these *explicit* structure-aware SOTA methods. 5. **Insufficient Case Study:** The case study in Figure 2 demonstrates Paraplume's success but not its *superiority*. A more compelling case would compare Paraplume against a structure-based method (e.g., Paragraph-ABB) on an antibody where the underlying 3D structure model (e.g., ABodyBuilder) is known to fail (e.g., a complex CDR loop), showing that Paraplume succeeds where the structure-based method fails. 6. **Lack of Model Interpretability:** The model is treated as a "black box." The claim of "complementarity" is an untested hypothesis. The authors should apply feature attribution methods (e.g., SHAP, Integrated Gradients) to the trained MLP. This would reveal which PLMs the model relies on for different predictions (e.g., CDR3 vs. framework regions) and provide mechanistic evidence for the complementarity claim. 7. **Omission of Interpretable Architectures:** The choice of an MLP, while simple, sacrifices model interpretability. Other architectures, such as those using attention mechanisms (e.g., VenusVaccine), provide clear, residue-level explanations for their predictions. The authors should justify why an opaque MLP was chosen over more transparent alternatives and demonstrate some aspect of their model's interpretability. ## Minor Points 1. **Incorrect Terminology in Figure:** In Figure 1B, the label "Pretrained LLMs" is used, maybe "Pretrained PLMs"? 2. **Figure Quality:** Figures 1 and 2 require improvement. Figure 1B is overly simplistic; a more informative schematic might briefly illustrate the different types of PLMs used. Figure 2 is a basic render; a more aesthetic visualization would be more impactful. 3. **Accessibility of Citations:** The submitted PDF lacks active hyperlinks for citations. 4. **Critical Link Error:** The Zenodo link provided on Page 4 (.../records/17277448) contradicts the link provided on Page 18 (.../records/17021232). Reviewer #3: This manuscript presents Paraplume, a fast and accurate paratope prediction framework that enables repertoire-scale analyses of antibody binding, and it demonstrates solid innovation, good readability, and a rich, convincing set of experiments. The following points require clarification: 1.The Introduction and Discussion could more clearly articulate what is conceptually new relative to existing paratope predictors such as Paragraph, PECAN, MIPE and Parapred. In particular, please spell out which aspects are the true contributions (multi-PLM sequence-only paratope prediction, paratope-weighted embeddings, repertoire-scale analyses, asymmetry-based upper bound) and which are incremental improvements in benchmarking. 2.The manuscript would benefit from a more detailed description of how sequence redundancy and potential train–test leakage are controlled across the PECAN, Paragraph, MIPE, and expanded SAbDab datasets (e.g., sequence-identity thresholds, overlap between ABB3 / AlphaFold training data and the benchmark splits, and between Paraplume’s training set and the repertoires analyzed downstream). Given that you already note ABB3 training/test overlap for Paragraph on MIPE, a systematic treatment of these issues is important for a fair comparison. 3.While Section IV. A briefly justifies the use of six PLMs and mentions an ablation in Table S1, the main text does not fully convey how much each embedding source contributes versus the added computational cost. I suggest moving at least a subset of these ablation results into the main Results, and clarifying whether a learned linear combination or dimensionality reduction of concatenated embeddings was tested as an alternative to the very high-dimensional concatenation. 4.The analysis of paratope/epitope asymmetry in antibodies binding identical antigens is interesting, but the resulting ~0.95 F1 “upper bound” is presented somewhat strongly as a performance ceiling. Please clarify that this bound depends on the chosen distance cutoff, labeling protocol, and dataset composition, and consider showing the full distribution of arm-to-arm F1/PR AUC across complexes to emphasize variability rather than a single number. 5.The conclusion that somatic hypermutation leads to larger paratopes in mouse and human repertoires is compelling but may be influenced by confounders such as total variable-region length, CDR3 length, or V-gene usage. Since “paratope size” is defined as a sum over per-residue probabilities, it should be made clearer how you control for length and composition (e.g., via normalized paratope size, length-stratified analyses, or multivariable models including SHM count and CDR length). 6.The Methods state that hyperparameters were selected on the Paragraph dataset and then reused for all benchmarks, and that multiple seeds with early stopping were employed, but these details are somewhat scattered. For reproducibility, please explicitly summarize in one place the training protocol (number of seeds, early-stopping criterion, epochs, optimizer settings, batch sizes, and hardware) and ensure that the GitHub repository mirrors this exact configuration so that readers can reproduce the main tables. 7.The authors propose an effective strategy. In future work, widely developed deep learning techniques and their application to more complex scenarios (DOI: 10.1145/3664647.3681673, DOI: 10.1109/JBHI.2024.3357979) could be further explored. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: While revising your submission, we strongly recommend that you use PLOS’s NAAS tool (https://ngplosjournals.pagemajik.ai/artanalysis) to test your figure files. NAAS can convert your figure files to the TIFF file type and meet basic requirements (such as print size, resolution), or provide you with a report on issues that do not meet our requirements and that NAAS cannot fix. After uploading your figures to PLOS’s NAAS tool - https://ngplosjournals.pagemajik.ai/artanalysis, NAAS will process the files provided and display the results in the "Uploaded Files" section of the page as the processing is complete. If the uploaded figures meet our requirements (or NAAS is able to fix the files to meet our requirements), the figure will be marked as "fixed" above. If NAAS is unable to fix the files, a red "failed" label will appear above. When NAAS has confirmed that the figure files meet our requirements, please download the file via the download option, and include these NAAS processed figure files when submitting your revised manuscript. Reproducibility: To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols
|
| Revision 1 |
|
PCOMPBIOL-D-25-02136R1 Paraplume: A fast and accurate antibody paratope prediction method provides insights into repertoire-scale binding dynamics PLOS Computational Biology Dear Dr. Walczak, Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Mar 15 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. We look forward to receiving your revised manuscript. Kind regards, Lun Hu Academic Editor PLOS Computational Biology Dominik Wodarz Section Editor PLOS Computational Biology Additional Editor Comments: One of our reviewers still has some concerns regarding the evaluation. Particularly, authors should seperately report the MCC and F1 scores in different cases, rather than simply averaging across all of the sequence distances. Reviewers' comments: Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The revised version is substantially better. Most of my concerns were addressed, but there are some lingering issues, one of which is a major concern on upper-bounding the performance on this dataset: Major comments: 1. Regarding the identical arms asymmetry, the sensitivity bins of 1 to 10 and 10 to 20 are very broad. They can mask a near-linear trend in the 1 to 5 range. Also, Fig S10 suggests that a considerable number of arm pairs have perfect MCC and F1. In this case, you should report the upper bound on distance zero cases (maybe perfect MCCs and F1 scores come from there) and separately report MCC and F1 as a function of narrowly binned distances (not 1 to 10 but much smaller increments) between the two arms of the antibody. The sequence differences may be confounding MCC and F1s since you’re averaging across all of the sequence distances. This should be a straightforward reanalysis. Minor comments: 1. Regarding the use of embeddings and the downstream tasks, I think you shouldn’t remove the negative result but instead change the language. It is important for the reader to know that your proposed weighted embedding is not a one-size-fits-all method but can yield performance improvements in some tasks. I see scientific value in all the results and in transparently reporting them. You can cite that as a limitation of the method. If you discard the negative results, the reader will mistakenly think that the paratope-weighted embeddings will work for whatever relevant downstream task they have. 2. Please specify which ESM-2 model size was used. I don't see this clearly mentioned, but there are a few different ESM-2 options with different sizes and embedding dimensions. 3. When you make scatter plots, it would be helpful not to set alpha (or opacity) to 1 so that the reader can understand the density of the dots in a given cluttered region. 4. In the beginning of page 10/29, you refer to Fig S8C, but it should be Fig S9C based on the supplementary figure tiff files attached. Overall, the paper has made progress in terms of scoping the claims. As stated above, I have only one reservation for acceptance, which doesn't affect the presented performance results but rather the proposed upper bound on the performance. Reviewer #2: The author made improvements to the paper, and my questions had been addressed. Reviewer #3: The draft submitted by the author gave a satisfactory answer. I think this article is acceptable. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: None ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: While revising your submission, we strongly recommend that you use PLOS’s NAAS tool (https://ngplosjournals.pagemajik.ai/artanalysis) to test your figure files. NAAS can convert your figure files to the TIFF file type and meet basic requirements (such as print size, resolution), or provide you with a report on issues that do not meet our requirements and that NAAS cannot fix. After uploading your figures to PLOS’s NAAS tool - https://ngplosjournals.pagemajik.ai/artanalysis, NAAS will process the files provided and display the results in the "Uploaded Files" section of the page as the processing is complete. If the uploaded figures meet our requirements (or NAAS is able to fix the files to meet our requirements), the figure will be marked as "fixed" above. If NAAS is unable to fix the files, a red "failed" label will appear above. When NAAS has confirmed that the figure files meet our requirements, please download the file via the download option, and include these NAAS processed figure files when submitting your revised manuscript. Reproducibility: To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols |
| Revision 2 |
|
Dear Dr Walczak, We are pleased to inform you that your manuscript 'Paraplume: A fast and accurate antibody paratope prediction method provides insights into repertoire-scale binding dynamics' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Lun Hu Academic Editor PLOS Computational Biology Dominik Wodarz Section Editor PLOS Computational Biology *********************************************************** All reviewers were satisified with the changes made in this revision. Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: I appreciate the authors' work on addressing all the issues I've raised. The report looks stronger. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No |
| Formally Accepted |
|
PCOMPBIOL-D-25-02136R2 Paraplume: A fast and accurate antibody paratope prediction method provides insights into repertoire-scale binding dynamics Dear Dr Walczak, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. For Research, Software, and Methods articles, you will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Anita Estes PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .