Peer Review History
| Original SubmissionJanuary 8, 2020 |
|---|
|
Dear Dr. Liu, Thank you very much for submitting your manuscript "Comparing Machine Learning- with Case-Control-Models to Identify Confirmed Dengue Cases" for consideration at PLOS Neglected Tropical Diseases. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts. Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Alan L Rothman, MD Associate Editor PLOS Neglected Tropical Diseases Samuel Scarpino Deputy Editor PLOS Neglected Tropical Diseases *********************** Associate Editor comments: 1) The Discussion should be substantially shorter and more focused. As a particular example, the discussion about pathogenesis of leukopenia and thrombocytopenia is unnecessary since these findings are well described and the current study provides no new insights into pathogenesis. 2) The analysis of previous literature is incomplete. The authors need to cite previous papers on prediction models and compare them to the current results. Specific papers not addressed include: Tanner et al. PLoS Negl Trop Dis 2008, Potts et al. PLoS Negl Trop Dis 2010, Park et al. PLoS Negl Trop Dis 2018, and Phakhounthong et al. BMC Pediatr 2018. 3) The authors’ analysis was limited for obvious reasons to DLI, which, as defined, required the clinician’s suspicion of arboviral disease. This represents a potentially significant source of bias, of particular relevance to use of the algorithm for real-time syndromic surveillance as proposed by the authors. The authors should at a minimum acknowledge this limitation and discuss the potential effects of this bias, especially for the comparison of sensitivity and specificity during different time periods. Can the authors address how many cases in the full dataset (>100K visits) during each month would potentially have been identified as suspected dengue by these algorithms? 4) Table S1 shows statistically significant differences between included and excluded groups. The authors should address the potential implications of these differences for the results. Did the authors review whether these subjects could be included in validation of the final models? 5) Line 188- Why were these cutoffs selected? Would other cutoff values have had greater discriminatory value? 6) The WHO definitions (noted in Figure 3) should be described in Methods. 7) Table 3- The clinical relevance of showing results for subgroups based on laboratory data included in the models (e.g., WBC, platelet) is unclear. These subgroup analyses should be deleted. 8) Do the authors have data on day of illness at presentation? If so, these data should be included in Table 1. Did they consider including this variable in the analysis? 9) The timing of the epidemic in Taiwan in 2015 should be noted under Study Population. 10) Line 147- The authors should note the number of visits excluded for incomplete records, cancellation, or re-admission. 11) Lines 298-302- It is unnecessary to repeat data presented in Table 1 in the text. 12) Figure S1 and S2- The labels on each branch should be reformatted to avoid overwriting the graphical representation of the decision tree. 13) References 54 and 57 are identical. Reviewer's Responses to Questions Key Review Criteria Required for Acceptance? As you describe the new analyses required for acceptance, please consider the following: Methods -Are the objectives of the study clearly articulated with a clear testable hypothesis stated? -Is the study design appropriate to address the stated objectives? -Is the population clearly described and appropriate for the hypothesis being tested? -Is the sample size sufficient to ensure adequate power to address the hypothesis being tested? -Were correct statistical analysis used to support conclusions? -Are there concerns about ethical or regulatory requirements being met? Reviewer #1: Dengue is a major health problem worldwide over the past 50 years. The challenge of identifying dengue cases is increased during large epidemics, especially in non-endemic areas with limited experienced staff. The study group used machine learning (ML)-based prediction models to identify dengue confirmed cases with the rationale of applying these models in health facilities where dengue confirmed tests are not available. It is obvious that the objectives of the study clearly articulated with a clear testable hypothesis stated, and the study design is appropriate to address the stated objectives. Study population consisted of dengue-like illness (DLI) cases (based on ICD9) admitted to the emergency department (ED) from January 1 to December 31, 2015 at National Cheng Kung University Hospital (NCKUH) in Tainan City. The final dataset included 2,942(60.12%) laboratory-confirmed dengue cases and 1,952 non-dengue control cases. The population is clearly described and appropriate for the hypothesis being tested. However, if possible, the authors should present the definition of DLI rather than ICD code. The sample size (totally, 4,894 DLI cases) is sufficient to ensure adequate power to address the hypothesis being tested. However, there is an imbalance of cases between the two groups (confirmed dengue cases and non-dengue control cases) which need to be taken into account when interpreting the results from deep neural network (DNN) models, but it was not mentioned in the manuscript. Moreover, some other questions should be concerned: Is there any repeated variable in data set? And what is the total number of samples (rows) in the data set? Correct statistical analysis was used to support conclusion, and there are no concerns about ethical or regulatory requirements being met. About data validation, internal validation with 2 time-repeats was used. As a suggestion, whether it is better if the models are validated with new data set (external validation)? Reviewer #2: Objectives were clearly formulated, teh study design is appropriate. -------------------- Results -Does the analysis presented match the analysis plan? -Are the results clearly and completely presented? -Are the figures (Tables, Images) of sufficient quality for clarity? Reviewer #1: For the results, the analysis presented matches the analysis plan, and the results are clearly and completely presented. The key findings of the study are that using just four input variables [age, body temperature, and counts of white blood cells (WBCs) and platelets], areas under curves (AUCs) of the receiver operating characteristic (ROC) curves were reported at high level (above 80%) for all models, and DNN had higher performance than others. Subgroup analyses were informative, all the models were very sensitive particularly in pre-epidemic period. Pre-peak sensitivity (<35 weeks) was 92.6%, 92.9%, and 93.1% in DT, DNN, and LR respectively. The figures (tables, images) in the manuscript are of sufficient quality for clarity. It is easier for readers to follow up the paper, S1 Fig and S2 Figure should additionally be explained in text. One point should be considered, the algorithms of the highest performance models were not reported. Could the authors provide them in their manuscript? Reviewer #2: Results are well presented. -------------------- Conclusions -Are the conclusions supported by the data presented? -Are the limitations of analysis clearly described? -Do the authors discuss how these data can be helpful to advance our understanding of the topic under study? -Is public health relevance addressed? Reviewer #1: The authors’ conclusions are supported by the data presented, and all limitations of the study were clearly reported. Discussion of the manuscript is interesting, and the authors discussed how these data can be helpful to advance our understanding of the topic under study. Results of the study reveal that machine-learning based models can be developed to identify dengue cases with four commonly available key features. This implies that the prediction models can be widely deployed to all levels of medical facilities, including hospitals and local clinics. It would be clearer if the authors emphasize more on how these models will be applied in practice. Reviewer #2: The authors extract a small number of parameters, the combination of which they conclude are predictive of dengue during an outbreak in Taiwan. The identified parameters would also hold true for COVID-19: lymphopenia, fever, etc. In fact, during SARS in Singapore in 2003, overlapping parameters were found for dengue and SARS: Clin Infect Dis. 2004 Dec 15;39(12):1818-23. Epub 2004 Nov 19. Use of simple laboratory features to distinguish the early stage of severe acute respiratory syndrome from dengue fever. The authors should highlight that laboratory confirmation remains the primary goal of surveillance and outbreak investigation. Artificial intelligence and other parameters may aid when laboratories are overwhelmed, but should never replace laboratory confirmation. in fact, the call is for more enhanced laboratory dengue surveillance in all countries, including low to middle income countries. -------------------- Editorial and Data Presentation Modifications? Use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity. If the only modifications needed are minor and/or editorial, you may wish to recommend “Minor Revision” or “Accept”. Reviewer #1: The discussion is sometimes rather long for the readers to focus, and should be more concise. Reviewer #2: none -------------------- Summary and General Comments Use this section to provide overall comments, discuss strengths/weaknesses of the study, novelty, significance, general execution and scholarship. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. If requesting major revision, please articulate the new experiments that are needed. Reviewer #1: The study group used novel machine learning -based prediction models to identify dengue confirmed cases with the rationale of applying these models in health facilities where dengue confirmed tests are not available. As mentioned above, the prediction models to identify dengue cases with four commonly available key features, and can be widely implemented in all levels of medical facilities, and serve as a key component in an integrated dengue surveillance system. Overall, the manuscript is well prepared and organized. I recommend it is considered to be published with minor revision. Reviewer #2: The authors extract a small number of parameters, the combination of which they conclude are predictive of dengue during an outbreak in Taiwan. The identified parameters would also hold true for COVID-19: lymphopenia, fever, etc. In fact, during SARS in Singapore in 2003, overlapping parameters were found for dengue and SARS: Clin Infect Dis. 2004 Dec 15;39(12):1818-23. Epub 2004 Nov 19. Use of simple laboratory features to distinguish the early stage of severe acute respiratory syndrome from dengue fever. The authors should highlight that laboratory confirmation remains the primary goal of surveillance and outbreak investigation. Artificial intelligence and other parameters may aid when laboratories are overwhelmed, but should never replace laboratory confirmation. in fact, the call is for more enhanced laboratory dengue surveillance in all countries, including low to middle income countries. -------------------- PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see https://journals.plos.org/plosntds/s/submission-guidelines#loc-methods |
| Revision 1 |
|
Dear Dr. Liu, Thank you very much for submitting your manuscript "Comparing machine learning- with case-control-models to identify confirmed dengue cases" for consideration at PLOS Neglected Tropical Diseases. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board. The editors appreciated the sincere and significant efforts the authors have made to address the prior review. Based on the response to the prior reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the additional recommendations below. Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Alan L Rothman, MD Associate Editor PLOS Neglected Tropical Diseases Samuel Scarpino Deputy Editor PLOS Neglected Tropical Diseases *********************** We appreciate the significant efforts the authors have made to address the prior review. However, we believe that there are significant issues with this revised manuscript that need to be addressed before it can be considered suitable for publication in the journal: 1) In the previous review, the authors were asked to address potential bias due to the criteria for selection of cases (DLI as defined by ICD-9 codes or request for diagnostic laboratory testing). The authors have provided a detailed tabulation of the number of cases per month, as requested. The authors were also requested to "acknowledge this limitation and discuss the potential effects of this bias"; this point was not addressed. The authors need to modify the Discussion to acknowledge the reliance of their study design on clinicians making a diagnosis of DLI and requesting diagnostic testing. As part of their response, the authors have provided new analyses of excluded cases in the revised manuscript. However, the presentation of these analyses introduces a number of new issues. First, under Results, the section describing their analysis of excluded cases has been placed before the description of the results of the modeling; these sections needs to be re-ordered. Second, the analyses presented in figure S4 and described in the Discussion (lines 592-597) are problematic because they make an assumption that these are true dengue cases and limit consideration to cases with thrombocytopenia who visited the ED after July 2015. It is unsurprising that these steps improve the "accuracy" of the models, but they introduce additional bias into the results; these analyses should be deleted or at least substantially revised. Third, given the distribution of these cases in time relative to the dengue epidemic, it is not surprising that the authors found a high percentage of cases predicted to be dengue. In actuality, it is somewhat surprising that so few non-DLI subjects had complete data for the 4 key parameters. The ready availability of this information in the clinical setting is in fact proposed as a strength of the models. Can the authors comment on what data elements were missing from the other cases? 2) In the previous review, the authors were asked to "highlight that laboratory confirmation remains the primary goal of surveillance and outbreak investigation". The authors have added a very brief acknowledgement of this issue to the Discussion. However, an important limitation of their study is that the sensitivity, specificity, and predictive value of their algorithms depend upon the distribution of other diagnoses in the study population; this should be specifically noted in the Discussion. In addition, a cautionary interpretation note about the need for laboratory confirmation needs to be included in the Abstract and Summary. 3) In the previous review, the authors were asked to cite several additional published papers on prediction models for dengue and "compare them to the current results". The authors have cited these papers in the Introduction but have not addressed the need to compare their results to these other papers in the Discussion. 4) In the previous review, the authors were asked not to repeat data presented in Table 1 in the text (lines 298-302). The authors indicate that this issue was addressed, but their response does not reflect the revised manuscript (lines 334-337). 5) In the previous review, the authors were asked to substantially shorten the Discussion, with specific mention of text discussing mechanisms of leukopenia and thrombocytopenia that are not addressed by the present study. While the Discussion has been reduced in length in this revision, it remains overly long and unfocused; for example, the specific sections noted previously were not deleted. 6) The revised Figures S2 and S3 have misspelled "Negative"; this should be corrected. Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/plosntds/s/submission-guidelines#loc-materials-and-methods |
| Revision 2 |
|
Dear Dr. Liu, We are pleased to inform you that your manuscript 'Comparing machine learning- with case-control-models to identify confirmed dengue cases' has been provisionally accepted for publication in PLOS Neglected Tropical Diseases. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases. Best regards, Alan L Rothman, MD Associate Editor PLOS Neglected Tropical Diseases Samuel Scarpino Deputy Editor PLOS Neglected Tropical Diseases *********************************************************** |
| Formally Accepted |
|
Dear Dr. Liu, We are delighted to inform you that your manuscript, "Comparing machine learning with case-control-models to identify confirmed dengue cases," has been formally accepted for publication in PLOS Neglected Tropical Diseases. We have now passed your article onto the PLOS Production Department who will complete the rest of the publication process. All authors will receive a confirmation email upon publication. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Editorial, Viewpoint, Symposium, Review, etc...) are generated on a different schedule and may not be made available as quickly. Soon after your final files are uploaded, the early version of your manuscript will be published online unless you opted out of this process. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases. Best regards, Shaden Kamhawi co-Editor-in-Chief PLOS Neglected Tropical Diseases Paul Brindley co-Editor-in-Chief PLOS Neglected Tropical Diseases |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .