Peer Review History
| Original SubmissionApril 9, 2024 |
|---|
|
PONE-D-24-13520Leveraging Machine Learning for Enhanced and Interpretable Risk Prediction of Venous Thromboembolism in Acute Ischemic Stroke CarePLOS ONE Dear Dr. Jiang, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. ============================== ACADEMIC EDITOR: We have now received reports from two independent reviewers and have carefully examined your submission titled "Leveraging Machine Learning for Enhanced and Interpretable Risk Prediction of Venous Thromboembolism in Acute Ischemic Stroke Care." After a thorough review, we have identified several areas that require significant revision before the manuscript can be considered for publication. We request that you carefully address each of these comments in your revised manuscript, including those from reviewers (see below). Please submit a detailed rebuttal letter that explains how each concern was addressed or provide a rationale if certain suggestions were not followed. 1. The manuscript lacks a clear articulation of why machine learning is particularly suited for improving VTE risk prediction in AIS patients compared to traditional statistical models. Please enhance the discussion on the specific limitations of existing models and how your approach addresses these gaps. 2. The current literature review does not sufficiently engage with previous studies on VTE risk prediction in stroke patients. A more comprehensive and critical review of the literature is needed, highlighting how your study builds on or differs from existing work. 3. The manuscript would benefit from a more detailed description of the cohort, including the inclusion and exclusion criteria, demographic details, and the data collection period. This information is crucial for understanding the applicability and limitations of your findings. 4. The manuscript mentions the use of advanced techniques such as K-nearest neighbor and SMOTE but lacks sufficient detail on data preprocessing and the handling of potential biases introduced by these techniques. Further elaboration on your model development process, including hyperparameter tuning and the selection criteria for the final model, is necessary. 5. While SHAP values are used for feature importance, the process of feature selection and engineering should be more thoroughly explained, including how domain knowledge (how these features contributed to the model's predictions) was incorporated. 6. The manuscript currently reports AUC as the primary performance metric. To provide a more comprehensive evaluation, please include additional metrics such as sensitivity, specificity, PPV, NPV, and calibration plots. Furthermore, discussion on the potential for overfitting and steps taken to mitigate this should be included. The study's sample size (1,632 participants) is relatively small for machine learning studies aimed at clinical predictions. Please explain (?). 7. The manuscript mentions that multiple machine learning models were developed, with the Gradient Boosting Machine (GBM) showing the highest AUC. However, it does not provide information about external validation or the use of an independent test set. Without external validation, the generalizability of the model to other populations remains uncertain. If possible, please conduct validation on an independent dataset or provide a discussion on the generalizability of your model. 8. The manuscript mentions IRB approval but lacks detailed information on patient consent and data privacy. Please address. 9. Authors should better articulate how the proposed model would be implemented in or improve clinical practice. A detailed comparison with existing risk scores like the Wells or Caprini scores is needed to contextualize the added value of your model. 10. More information is needed about the patient population, including demographics, stroke subtypes, and comorbidities. This will help assess the model's generalizability across different clinical settings. 11. While your model's AUC is highlighted, the clinical implications of these results are not adequately discussed. We recommend providing a clearer interpretation of the model's predictions, including potential benefits and risks. 12. Please address how the model would be integrated into existing clinical workflows. This includes discussing the protocols for managing patients identified as high risk and the potential impact on treatment decisions. 13. Consider discussing the ethical implications of implementing the model, including the risk of harm from false positives or negatives. 14. The manuscript's abstract is somewhat dense, and the terminology used (e.g., "utilizing advanced technologies," "innovative approach") is vague. This could make it difficult for readers to quickly grasp the key points of the study. 15. The manuscript's terminology is not consistently precise (e.g., the use of "key predictors" without defining them clearly), which could lead to confusion. The writing should be more concise and the terms better defined to ensure clear communication of the study's findings and implications. ============================== Please submit your revised manuscript by Sep 30 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols . We look forward to receiving your revised manuscript. Kind regards, Sonu Bhaskar, MD PhD Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse. 3. Thank you for stating the following financial disclosure: "This study was funded by the High Level Project of Medicine in Longhua, ShenZhen under grant number HLPM201907020102 and construction funds of key medical disciplines in Longhua District, Shenzhen under grant number MKD202007090208." Please state what role the funders took in the study. If the funders had no role, please state: ""The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."" If this statement is not correct you must amend it as needed. Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf. 4. In the online submission form, you indicated that "All data files related to this study can be obtained from the inquiry email 66327285@qq.com(Qingshi Zhao)." All PLOS journals now require all data underlying the findings described in their manuscript to be freely available to other researchers, either 1. In a public repository, 2. Within the manuscript itself, or 3. Uploaded as supplementary information. This policy applies to all data except where public deposition would breach compliance with the protocol approved by your research ethics board. If your data cannot be made publicly available for ethical or legal reasons (e.g., public availability would compromise patient privacy), please explain your reasons on resubmission and your exemption request will be escalated for approval. 5. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. Additional Editor Comments: Please see above. Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: No ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: No ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: 1.The existing risk prediction models for deep vein thrombosis (DVT) in patients with acute ischemic stroke are numerous, yet the manuscript fails to emphasize the distinctive features of the study, thus lacking novelty. 2.The Introduction section fails to introduce the existing risk prediction models for venous thromboembolism (VTE) in acute ischemic stroke. 3.The article mentions the use of Synthetic Minority Over-sampling Technique (SMOTE) to augment the dataset. It remains unclear whether SMOTE was applied directly to the entire dataset or solely to the training set. Employing SMOTE on the entire dataset could lead to data leakage, with the presence of synthetic data in the test set undermining the model's genuine effectiveness. 4.Although a series of machine learning algorithms were employed in the study, there is no mention of the principles governing parameter selection. 5.The model construction incorporates stepwise forward logistic regression and LASSO variable selection. However, it is not specified whether these procedures were executed solely on the training set or on the entire dataset. 6.In the development and validation of the predictive model, the performance of seven different machine learning models was evaluated, yet confidence intervals were not reported. It is advisable to supplement this information and conduct Delong's test to compare the AUC values of each model for significant differences. 7.A comparison between this study and existing literature utilizing logistic regression models is provided; however, a deeper discussion on the disparities between these two methodologies is warranted, elucidating why machine learning approaches are more suitable for the study's objectives than traditional logistic regression methods. 8.The study introduces a novel machine learning model exhibiting excellent predictive performance. However, the manuscript fails to contrast it with similar previous studies. 9.The Discussion section lacks a discussion of the study's limitations and should propose future research directions. Reviewer #2: The researchers employed machine learning algorithms to enhance the prediction of venous thromboembolism (VTE) risk in patients with acute ischemic stroke. VTE significantly impacts the survival prognosis of stroke patients, and accurate, individualized prediction could further improve patient outcomes. However, I believe there are several areas that need improvement in this study: 1.The references in the manuscript are overly concentrated, with multiple studies frequently cited in a single instance. Meanwhile, certain sections of the manuscript that require citations lack the corresponding references. 2.In the “Data processing and feature selection” section, the researchers stated that they used the SMOTE oversampling technique, which is based on the KNN algorithm. References should be added to this section. 3.Since SMOTE generates synthetic data, this could significantly affect the model's results. The correct approach is to first split the data into training and test sets, apply SMOTE only to the training set, and leave the test set unprocessed to reduce the risk of overfitting. 4.The authors did not provide a detailed explanation of the hyperparameter tuning process for the machine learning algorithms. In machine learning, model evaluation is typically conducted using a validation set to find the optimal parameters. The researchers did not indicate whether a validation set was used. 5.In the feature selection process, the study involved a large number of feature variables. I am concerned about the potential for significant multicollinearity among these variables, which the researchers did not assess. The authors only used LASSO and PCA for dimensionality reduction. 6.Given that the researchers identified data imbalance as an issue, experts in this field should recognize that the precision-recall curve can be more valuable than the ROC curve for evaluation. 7.The methods used in this study are quite simplistic. With the availability of libraries like Sklearn and SHAP, the required code to complete this study likely amounts to less than 100 lines, indicating a lack of rigorous methodology in constructing the machine learning models. 8.The web-based calculator is aesthetically unappealing. The data entered by each user should be visualized, for example, to highlight which specific indicators are most significant in determining high VTE risk. The authors used SHAP to improve model interpretability, so this should be reflected in the web-based calculator. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy . Reviewer #1: No Reviewer #2: Yes: Shi-Nan Wu ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step.
|
| Revision 1 |
|
PONE-D-24-13520R1Leveraging Machine Learning for Enhanced and Interpretable Risk Prediction of Venous Thromboembolism in Acute Ischemic Stroke CarePLOS ONE Dear Dr. Jiang, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Dec 08 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols . We look forward to receiving your revised manuscript. Kind regards, Sonu Bhaskar, MD PhD Academic Editor PLOS ONE Additional Editor Comments: Thank you for submitting the revised version of your manuscript. The manuscript has undergone another round of review by independent reviewers. However, several concerns remain, particularly regarding methodological rigor. We invite you to submit a rebuttal addressing the reviewers' comments and concerns. Once we receive your responses, we will proceed with further consideration of your manuscript. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: (No Response) ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: No ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: No ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: No ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: No ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The reviewers of the article provided several suggestions for improvement, including the distribution of citations, discussion of the study's limitations, and proposals for future research directions. The authors have responded to these feedback and made the necessary revisions.I commend the authors for their thorough work and look forward to seeing the article published. Reviewer #2: Thank you to the author for their response. --I believe a critical point in the field of machine learning is the optimization of hyperparameters for each model. However, the author mentioned that they conducted the optimization on the training set. In fact, during each process of hyperparameter tuning, adjustments should be made based on results from the validation set, rather than solely tuning parameters in a black-box manner on the training set. It is unreasonable to adjust parameters using only the training set without data to evaluate the status of the model training. Since the author only explained their process on the training set, it shows a significant lack of understanding regarding the methodology or model-building process in the field of machine learning. Additionally, the author still lacks a systematic understanding of the field of machine learning. Therefore, I believe that under such circumstances, the methodology of this study is not rigorous and should not be accepted. --Furthermore, there is a clear issue of data imbalance in this study. Introducing only the AUC value of the ROC curve to evaluate the performance of the models does not reflect the real situation. In my review comments, I mentioned using the PR curve for evaluation, but the author has not adequately addressed this. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy . Reviewer #1: No Reviewer #2: Yes: Shi-Nan Wu ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step. |
| Revision 2 |
|
Leveraging Machine Learning for Enhanced and Interpretable Risk Prediction of Venous Thromboembolism in Acute Ischemic Stroke Care PONE-D-24-13520R2 Dear Dr. Jiang, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Sonu Bhaskar, MD PhD FANA Academic Editor PLOS ONE Additional Editor Comments (optional): I am pleased to accept the manuscript in its current form. Thank you for considering PLOS One for your work. Reviewers' comments: |
| Formally Accepted |
|
PONE-D-24-13520R2 PLOS ONE Dear Dr. Jiang, I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team. At this stage, our production department will prepare your paper for publication. This includes ensuring the following: * All references, tables, and figures are properly cited * All relevant supporting information is included in the manuscript submission, * There are no issues that prevent the paper from being properly typeset If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps. Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. If we can help with anything else, please email us at customercare@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Sonu Bhaskar Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .