Peer Review History
| Original SubmissionJune 27, 2025 |
|---|
|
Dear Dr. Li, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Sep 11 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols . We look forward to receiving your revised manuscript. Kind regards, Aiqing Fang Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. Thank you for stating the following financial disclosure: “This work was supported by the National Natural Science Foundation of China [grant numbers 62362034, 62372279]; the Natural Science Foundation of Jiangxi Province of China [grant number 20232ACB202010]; the Natural Science Foundation of Shandong Province [grant number ZR2023MF119]; and the Jiangxi Province Key Laboratory of Advanced Network Computing [grant number 2024SSY03071].” Please state what role the funders took in the study. If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript." If this statement is not correct you must amend it as needed. Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf. 3. If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise. Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? Reviewer #1: Yes Reviewer #2: Partly ********** 2. Has the statistical analysis been performed appropriately and rigorously? -->?> Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available??> The PLOS Data policy Reviewer #1: Yes Reviewer #2: No ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English??> Reviewer #1: Yes Reviewer #2: Yes ********** Reviewer #1: This manuscript presents PLM-GNN, a novel dual-channel deep learning model for classifying bacterial virulence factors (VFs). The model uniquely integrates a sequence channel, utilizing pre-trained language models and Transformer architectures, with a structure channel that employs a geometric graph neural network on ESMFold-predicted 3D structures. The authors focus on classifying the seven most abundant VF types and report superior performance compared to existing sequence- or structure-only methods, demonstrating the synergistic benefit of combining both data modalities. While the work is promising and the methodology is sophisticated, several points should be addressed to elevate the manuscript to the standard of a top-tier journal. 1. A critical missing element is the evaluation of the quality of the predicted 3D structures from ESMFold. The entire structure-based channel relies on these predictions. The authors should report the predicted Local Distance Difference Test (pLDDT) scores for the dataset. It is crucial to analyze if there is a correlation between low-confidence structure predictions and model misclassifications. A model trained on inaccurate or low-confidence structures may learn artifacts rather than true biological signals. 2. The manuscript states that VFs longer than 1240 amino acids were excluded due to GPU memory limitations. This is a significant constraint that could introduce bias. The authors should provide a more precise quantification of what percentage of the total VFs in the initial dataset this exclusion represents. Furthermore, a discussion is needed on the potential biological implications, as longer proteins may possess complex domain architectures or functions that the model is now unable to learn. 3. The model's dataset is limited to the seven most numerous VF categories, excluding seven rarer ones. While this is a practical decision for model training, it limits the claimed generalizability of the model. The authors should be more explicit about this limitation and frame their conclusions accordingly, acknowledging that the model's utility for less common VF types is currently unknown. 4. The method for fusing the sequence and structure channel embeddings is a simple summation. This architectural choice requires justification. The authors should explore and compare this with other fusion strategies, such as concatenation followed by a linear layer or an attention-based fusion mechanism, to demonstrate that summation is indeed an optimal or sufficient choice. 5. The analysis of remote homology detection is compelling but appears anecdotal. To strengthen this claim, the authors should perform a systematic analysis. They could identify all pairs in the test set that qualify as remote homologs (e.g., sequence identity < 30%, TM-score > 0.5) and report the model's performance specifically on this subset, rather than presenting only two cherry-picked examples. 6. When comparing PLM-GNN with other models (Table 3), it is not specified whether the hyperparameters for these baseline models were also rigorously optimized. For a fair comparison, all competing models should be tuned to their best performance on the validation set, just as the proposed model was. The authors should clarify their procedure for this. 7. While the performance improvements of PLM-GNN are shown, the authors have not reported whether these improvements are statistically significant. Statistical tests (e.g., McNemar's test or bootstrapping to generate confidence intervals for the metrics) should be performed on the performance differences between PLM-GNN and the next-best-performing models to validate the superiority of the proposed method. 8. The paper would benefit from a more in-depth error analysis beyond the confusion matrix. The authors should investigate the characteristics of the misclassified VFs. For example, are they more likely to have low pLDDT scores, belong to specific organisms, have lengths close to the 1240 residue cutoff, or share features with another class that explains the confusion? 9. The description of the geometric features for the graph representation (Table 2) is very dense. While comprehensive, its complexity could hinder reproducibility. A more detailed, step-by-step explanation in the supplementary materials, perhaps with a visual diagram illustrating how these features are derived from a protein structure, would greatly improve clarity. 10. The conclusion that the model "can provide valuable theoretical support for the development of antiviral strategies" is an overstatement. The model is a classification tool, which is an important step, but it does not directly provide mechanistic insights or drug targets. A more measured and precise conclusion regarding the model's immediate application—improving the functional annotation of proteins—would be more appropriate. 11. In the "Applied to effector delivery system classification" section, the model's performance on the T7SS category is noted to be weaker due to the small sample size. This highlights a potential weakness in handling highly imbalanced data. The authors could discuss or experiment with advanced data augmentation techniques or few-shot learning approaches as potential future work to address this. 12. The interpretability of the model is primarily explored through t-SNE visualizations. To further this, the authors could consider using model interpretability techniques like integrated gradients or attention map analysis, especially on the Transformer components, to identify which specific amino acid residues or structural motifs are most influential for classifying a given VF type. This would provide more direct biological insights. 13. For a more comprehensive comparison, further studies involving standalone Convolutional Neural Network (CNN) and Vision Transformer (ViT) models could be included. Comparing the proposed method against other sophisticated, single-modality architectures would better highlight the specific advantages of the dual-channel approach. "Hybrid deep learning model for automated colorectal cancer detection using local and global feature extraction","InceptionNeXt-Transformer: A novel multi-scale deep feature learning architecture for multimodal breast cancer diagnosis" Reviewer #2: Reviewer Report This paper introduces PLM-GNN, a dual-channel model for accurate classification of seven major virulence factors. By integrating structural and sequence-based features, the model achieves high performance, aiding in the study of pathogen virulence mechanisms. While the core contributions appear sound, the presentation lacks critical clarifications in methodology and notation that impact reproducibility and readability. I recommend a major revision, as the following methodological clarifications are essential for clarity and rigor. Major Comments 1. The author should add the justification for the 8:1:1 stratified split The paper states: “We then randomly divided the data into three subsets using an 8:1:1 ratio (with stratified sampling).” 2. The manuscript relies heavily on undefined acronyms (e.g., “CNN”, “SVM”, “Cα”). Please define each acronym at its first occurrence. 3. Several symbols appear without explanation, like (X(0)∈RL×1280X^{(0)} ) For each symbol, please include a brief textual definition at the point of first use. ********** what does this mean? ). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy Reviewer #1: No Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
Classification of virulence factors based on dual-channel neural networks with pre-trained language models PONE-D-25-34906R1 Dear Dr. Li, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support . If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author Reviewer #1: All comments have been addressed Reviewer #3: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions??> Reviewer #1: Yes Reviewer #3: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? -->?> Reviewer #1: No Reviewer #3: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available??> The PLOS Data policy Reviewer #1: No Reviewer #3: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English??> Reviewer #1: Yes Reviewer #3: Yes ********** Reviewer #1: Accept , comments have been adressed. Paper can be accepted as its current form Accept , comments have been adressed. Paper can be accepted as its current form. Reviewer #3: In this paper, the authors propose a deep network model to classify virulence factors. The model is called PLM-GNN that includes CNN and transformer structures. The proposed approach can be helpful for other researchers. However, the following revisions are required to enrich and improve the quality of the paper; 1) It is commonly known that traditional machine learning-based methods are not efficient, and deep learning-based methods produce better results. Therefore, the statements about thraditional machine learning (e.g., SVM) in the introduction should be removed. For instance, the sentences "Garg et al. [12] introduced VirulentPred, a prediction .....with machine learning models, which yielded promising results." They are redundant/unnecessary. 2) The meaning of the statement ".....cross-entropy loss is widely adopted as the standard loss function" should be supported by the following works: https://doi.org/10.1016/j.bspc.2025.108083, https://doi.org/10.1016/j.bspc.2025.108138, https://doi.org/10.1016/j.eswa.2024.126290 3) In this work, RELU has been used for activations. However, it may cause dying neuron issues. Therefore, leaky RELU has been used in different network models to obtain higher performance. This can be considered a limitation of this work. In future work, the proposed PLM-GNN structure can be implemented using leaky ReLU to improve its performance. To inform the readers (who may want to apply the same model) about this, the following statements should be in the last paragraph: "A potential future work can be applying the proposed PLM-GNN structure using leaky ReLU rather than ReLU and investigating its performance because leaky ReLU has been used in various recent models [https://doi.org/10.1016/j.neucom.2024.127445, https://doi.org/10.1007/s11042-025-20760-y, https://doi.org/10.1007/s10462-024-10897-x, https://doi.org/10.1016/j.bspc.2025.108370]." ********** what does this mean? ). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy Reviewer #1: No Reviewer #3: No ********** |
| Formally Accepted |
|
PONE-D-25-34906R1 PLOS One Dear Dr. Li, I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS One. Congratulations! Your manuscript is now being handed over to our production team. At this stage, our production department will prepare your paper for publication. This includes ensuring the following: * All references, tables, and figures are properly cited * All relevant supporting information is included in the manuscript submission, * There are no issues that prevent the paper from being properly typeset You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps. Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing. If we can help with anything else, please email us at customercare@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Aiqing Fang Academic Editor PLOS One |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .