Peer Review History
| Original SubmissionAugust 23, 2022 |
|---|
|
PONE-D-22-23382A systematic review of clinical health conditions predicted by machine learning diagnostic and prognostic models trained or validated using real-world primary health care dataPLOS ONE Dear Dr. Abdulazeem, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by May 20 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Ágnes Vathy-Fogarassy, Ph.D. Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: N/A Reviewer #2: N/A ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: No Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Summary This is a useful review of machine learning methods for risk prediction of diseases or their complications using primary care databases. I’ve identified a number of issues with it, but, if these can be resolved, I would recommend publication. Main comments Many people will object (some strongly) to the inclusion of logistic regression here as an ML method. Some would also argue, for example, that ANNs can be just a set of logistic regressions and would therefore also be included in statistical learning rather than ML. “Classical log reg” is mentioned once, so it’s important to explain what is meant by this and by simply “log reg”. On the same theme, I would not include the studies that used clustering or other unsupervised methods, as these are not really about risk prediction for an individual patient. If reinforcement and unsupervised ML methods, e.g., NLP have been used in pre-processing, they should be separated from the main method used to fit the model. The tables are good. However, the accompanying text often repeats what’s in the tables in terms of how many studies were of which diagnosis or came from which country. I think this would be easier to digest if placed in an extra table instead. Also, the text brings out some important points from a particular study but without developing it or framing it for the reader. For example, “a study reported that variations in the importance of different risk factors depend on the modelling technique [35]”. Was this the only study to do this? Is one of the goals of this paper under review to highlight such modelling issues or just to summarise the frequency of methods used? The two sentences that followed that one also illustrate the point: “Another study reported that ignoring censoring substantially underestimated risk of cardiovascular disease [43]. Also, systolic blood pressure could be used as a predictor of hospitalization or mortality [59].” It’s well known that censoring affects the estimated risk, so this sentence should be deleted. If SBP is worth mentioning, what about other predictors? There are many other such examples in other sections of the Results, which needs to be rethought. Some reflection on WHY the reporting of methods in the studies in this review is often so limited would be useful. There have been many studies highlighting such limitations in non-ML fields that are relevant here. Discussion: “Other drawbacks reported, similar to our findings, were underrepresentation of healthy persons and retrospective temporal dimension of the extracted predictors [142].” This is worth explaining better and unpacking with further discussion. For example, is the underrepresentation due to healthy people not attending PHC and so not being captured in the EHR? Or is something else meant here? Discussion: “Furthermore, ML engineers must be aware of the unintended implications of their algorithms and ensure that they are created with the global and local communities in mind [145].” Again, this needs explaining and unpicking. What are the unintended implications? These could include healthcare rationing and widening of inequalities, but the authors may be thinking of something else (too). Do algorithms need to consider “global communities”? This brings me to a related point about the need for external validation of models. I would argue that external validation of a model in another country is not needed unless the model’s creators intend the model to be used overseas. Healthcare systems – and hence the populations using a particular sector and the available data – vary so much that performance would be either too poor to use or the model simply not relevant. The authors have penalised the lack of external validation perhaps unfairly. However, if what is meant by “external validation” covers another similar data set in the same country, then I’d agree that that validation would be important to do. Minor comments The Abstract should give the name of the risk of bias tool. “Extracted date” should be “Extracted data”. %s should be given for Alzheimer’s disease and diabetes. The Conclusions should summarise the importance / relevance of the findings rather than say it’s the first of its kind. Introduction: “To achieve these PHC care aims, common health disorders require risk prediction for primary prevention, early diagnosis, follow-up, and timely interventions to avoid diseases exacerbations and complications.” I disagree that risk prediction is needed for all these things if it refers to risk-prediction models. Clinician training and experience is used far more than any model for diagnosis, for instance. The rationale for risk-prediction models needs to be clearly and accurately set out to help demonstrate the importance of this study. Introduction: most EHR-type databases available to researchers are not “big data”, simply large. This is one reason why statistical learning is so common in risk-prediction models. Methods: “Health conditions extracted were categorized according to international classification of diseases (ICD)-10 version 2019”. How was this done for UK studies, where primary care EHRs don’t use ICD10? Results: “Sample sizes used for training and/or validating the models across the included studies ranged from 26 to around 4 million participants”. Is 26 right?!? Results: “A study revealed that models can be created using only data from medical records and had prediction values of 70-80% for identifying persons who are at risk of acquiring ankylosing spondylitis (M45) [100].” Do you mean a sensitivity of 70-80%? Or PPV? Or something else? Weng (ref 35) used CPRD, and therefore any models from it are by definition – in contrast to what Weng et al say in their paper – retrospective cohort studies. Table 1 should be amended accordingly and the other CPRD studies checked. Discussion: “it would be advisable that models’ developers propose solutions for the digital documentation systems…” Modellers are not IT or IG experts! There are numerous minor issues with the English. Just one example is: “A few studies (n=10) compared the performance of the developed ML models to other standard reference techniques”, where “to” should be “with” (they mean different things). Another is: “Children obesity” should be “childhood obesity”. “Evitable” should be “avoidable”. There are many others. Reviewer #2: Recommendation: minor revision This work presents a systematic review of the application of ML in the context of primary health care. Literature published between January 1990 to January 2022 is considered. 1. Is the manuscript technically sound, and do the data support the conclusions? Yes 2. Has the statistical analysis been performed appropriately and rigorously? NA 3. Have the authors made all data underlying the findings in their manuscript fully available? NA 4. Is the manuscript presented in an intelligible fashion and written in standard English? Yes 1. No insights related to algorithms applied. Any observation related to the most commonly used algorithms and the best performing ones? 2. What kind of data is usually used? Textual? Images? What kind of features? 3. What ate the current challenges in applying ML to primary health care data? 4. What are the most challenging diseases for ML prediction? 5. What kind of evaluation approaches are used? 6. Has any of the reviewed work been deployed in a real world application? 7. Quality of figures and tables need to be improved. 8. Need to discuss implications of the results. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Alex Bottle Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.
|
| Revision 1 |
|
PONE-D-22-23382R1A systematic review of clinical health conditions predicted by machine learning diagnostic and prognostic models trained or validated using real-world primary health care dataPLOS ONE Dear Dr. Abdulazeem, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Aug 03 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Ágnes Vathy-Fogarassy, Ph.D. Academic Editor PLOS ONE Additional Editor Comments: Dear Authors, For the corrected manuscript, Reviewer #2 gave the following feedback: "Some of my concerns were not addressed in your response." Please take into account all the reviewer's suggestions and requests, otherwise we cannot accept the article. Best regards! [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: (No Response) ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Partly ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: N/A ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors have responded well to my comments, and in my opinion the ms is much improved. I am happy to recommend it for publication. Reviewer #2: (No Response) ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 2 |
|
A systematic review of clinical health conditions predicted by machine learning diagnostic and prognostic models trained or validated using real-world primary health care data PONE-D-22-23382R2 Dear Dr. Abdulazeem, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Ágnes Vathy-Fogarassy, Ph.D. Academic Editor PLOS ONE |
| Formally Accepted |
|
PONE-D-22-23382R2 A systematic review of clinical health conditions predicted by machine learning diagnostic and prognostic models trained or validated using real-world primary health care data Dear Dr. Abdulazeem: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Ágnes Vathy-Fogarassy Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .