Peer Review History
Original SubmissionJuly 16, 2019 |
---|
PONE-D-19-20081 Extracting Lung Function Measurements to Enhance Phenotyping of Chronic Obstructive Pulmonary Disease (COPD) in an Electronic Health Record using Automated Tools PLOS ONE Dear Dr. Kathleen Akgun, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please resolve statistical analysis queries and make all data available. References about approaches for extracting FEV1 values from electronic health records, different or similar to that reported in the present study, should be reported and discussed. Limitations of the study need to be implemented and conclusions more clearly described in the abstract for clincians. We would appreciate receiving your revised manuscript by Nov 14 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript:
Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Manlio Milanese Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. Please provide the full name of the IRB that approved the VACS study. Additional Editor Comments (if provided): Please resolve statistical analysis queries and make all data available. References about approaches for extracting FEV1 values from electronic health records, different or similar to that reported in the present study, should be reported and discussed. Limitations of the study need to be implemented and conclusions more clearly described in the abstract for clincians. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Partly Reviewer #3: Partly ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: No Reviewer #3: No ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: No ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: I will focus on methods and reporting. This is an excellent paper, with clear aims and implementation. The code has been made freely available through github, bravo! I only have some minor comments which shouldn't take too long to deliver, to make the evaluation clearer. 1) Why don't you display the distributions of the available values and the extracted values in overlapping histograms. 2) a second histogram of the values extracted by the clinician against the ones extracted by the tool in the validation subsample (or of their differences). 3) you can report the mean bias (is there overall bias in the tool, say reporting values higher) and the mean square error as well, they can be more informative than the kappa. Reviewer #2: The authors present an efficient method for extracting FEV1 from medical notes to improve phenotyping of Veteran patients with COPD. This work commendable because it leverages existing database tools to retrieve and extract concept-values pairs for FEV1 from clinical notes. The intent of the author is unclear in some sections but can be resolved with editing. I made several comments in the attached document but my primary concerns, which center around how the authors chose to analyze the data and communicate their finding are listed below. 1. Lack of justification for using agreement statistics vs. accuracy statistics with chart-review clearly defined as the reference standard. There is a lack of consistency and precision with the primary performance measure (agreement vs. accuracy). The reviewer believes that agreement statistics, such as Kappa, are justifiable if the goal is to determine whether or not the NLP tool is as reliable as two or more human reviewers. Performance measures, such as SE, SP, PPV, accuracy or F-measure, precision, and recall, should be considered since the tool is compared against one human reviewer described as the reference standard. 2. This approach will make it easier to discover additional problems with the analysis and compare to existing literature. For example, conditioning on the presence of FEV quantifiable values and reporting agreement when the values are known from chart-review is problematic, since it will bias the overall performance measures (e.g., SE and SP). Furthermore, it does not support a comprehensive evaluation that would include false positives, accuracy, etc. The distribution of extracted values should also be compared to chart-review findings 3. It is not clear how the tool addressed situations where references to historical FEV results are handled when presented with current tests. Is the goal to identify current FEV test results on specific visit dates? The overall goal is not clearly detailed. 4. The results that reference the CPT population vs. NLP extracted population are not clear. 5. There appears to be a misunderstanding about the CDW and VistA systems that needs to be corrected. 6. The discussion needs more detail regarding the possibility that many FEV1 (PFT) reports are scanned in as image files and not accessible in the TIU notes. 7. Finally, the findings from this study should be compared to other NLP studies of PFTs from within the VA if possible. It is not clear if performance is a trade-off for efficiency or if both the performance and efficiency of this method are superior to other NLP efforts to extract FEV from the medical notes. Reviewer #3: General comments This is an original study which reports on the implementation of an automated tool, based on the Microsoft SQL, for extracting FEV1 values from the data repository of the Veterans Aging Cohort Study implemented using the MS SQLServer. This reviewer has some formal and methodological concerns. Specific comments Major - References about approaches for extracting FEV1 values from electronic health records, different or similar to that reported in the present study, should be reported and discussed. - More detailed methods to replicate the automated tool presented in the present study should be provided as online information. - The spirometric reports usually include the parameter (i.e. the "string term") "FEV1" expressed as: 1. measured value, in liters; 2. calculated percent predicted value; 3. predicted value for the examined subject, based on sex, age and height. It should be clarified what "string term" for "FEV1" was selected/extracted by using the described automated tool. Indeed, the spirometric report of a single subject usually includes all these three "string term" of "FEV1" and the automated tool might have found three different quantifiable "FEV1" values for the same subject. - FEV6 is a spirometric index used instead of "FEV1", in particular when performing office spirometry, and possibly stored as parameter of pulmonary function test. Were "FEV6" "FEV-6", "FEV_6" excluded as string patterns from the string processing procedure? - The results of the validity of the automated tool performance should be better presented. Agreement/disagreement between the automated tool and chart review (performed by a single Pulmonologist) are based on only n=128 documents (51% of those available) (see the section "Assessing SQL tool performance"). - Results from the present study might support the usefulness of the presented automated tool for increasing the detection of quantifiable FEV1 values in those electronic health records which are implemented using MS SQLServer. This does not mean that the automated tool enhance the detection of COPD patients in electronic health records. The discussion section, in particular the paragraph dedicated to the limitations of the study and the conclusions, should be reviewed accordingly. - The abstract lacks of conclusions. Minor - Figure 1. Provide a legend for the abbreviations and unit of measurement for the y axis. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: Yes: Brian C. Sauer Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. |
Revision 1 |
Extracting Lung Function Measurements to Enhance Phenotyping of Chronic Obstructive Pulmonary Disease (COPD) in an Electronic Health Record using Automated Tools PONE-D-19-20081R1 Dear Dr. Kathleen Akgun, We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements. Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication. Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. With kind regards, Manlio Milanese Academic Editor PLOS ONE Additional Editor Comments (optional): It is a pleasure to inform you that the article is now acceptable for publication on PlosOne. Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #2: All comments have been addressed Reviewer #3: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #2: Yes Reviewer #3: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #2: Yes Reviewer #3: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes Reviewer #3: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #2: Yes Reviewer #3: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #2: I believe the reviewers addressed the comments adequately. The one issue that continues to bother me is around the goal and framing of evaluation. It appears the goal of the SQL FEV extraction tool is to identify the "current" test result as of the date the note was written- meaning the most recent PFT on the date of the note vs. a co-referenced historical PFT result that may be typed or pasted into the document for historical reference with the "current" PFT. The authors target appears to be the current result up to the note date, but the tool appears to extract the FEV that appears first in the document. That is fine as a simplifying extraction assumption. The chart-reviewer/annotator was asked to abstract the current PFT results. In this case, accuracy statistics would say FEV was correctly identified but the mean values would not be the same. This is why including mean values or distribution of FEV values is important because you may correctly classify the document as having extractable FEV but extract the historical vs. the current. This can be revealed by the comparison of accuracy statistics with the distribution of FEV results. I would have expected the authors to explain why there was some inconsistencies in the extracted values to the chart-review values given the extremely high accuracy statistics. It is plausible that the differences in the distribution of values extracted from the tool and chart-review may be a result of grabbing the co-referenced "historical" FEV rather than the "current" FEV that the reviewer was asked to abstract. It was also nice to see the distribution of FEV extracted from the tool overlay with PFT file data but I was initially thinking you would use the subset where both the FEV extraction tool and PFT file overlapped as additional validation or to identify records that may not have been reviewed but the tool picked them up - to determine if they were missed by chart-review due to the eligibility requirement of CPT coding that is problematic in this population. I the tool was run on the same eligibility criteria as chart-review then this is not an issue, but it was not previously clear what the note selection criteria was. The thought of a sub-analysis may be flawed anyways as there may not be a clear system ID linking a note to a PFT result and rules connecting the two would introduce additional error. Overall this is excellent work and a more elegant solution than previous work in the VA. It is reassuring to learn that you are finding similar proportion of additional information from the notes that other studies found. It is fair to argue that other NLP approached in the VA m attempted different things and there may not be a direct comparison between your work and theirs. I do not believe the other approaches are more or less sophisticated, but what is in common is an attempt to extract information from notes that was not available in the PFT file since the PFT file is not complete. This approach appears more elegant and transportable than other approaches in the VA and is likely more valuable to the research community as a result. Reviewer #3: (No Response) ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #2: Yes: Brian C. Sauer Reviewer #3: Yes: Francesco Pistelli |
Formally Accepted |
PONE-D-19-20081R1 Extracting Lung Function Measurements to Enhance Phenotyping of Chronic Obstructive Pulmonary Disease (COPD) in an Electronic Health Record using Automated Tools Dear Dr. Akgün: I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. For any other questions or concerns, please email plosone@plos.org. Thank you for submitting your work to PLOS ONE. With kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Manlio Milanese Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .