Peer Review History
| Original SubmissionJune 30, 2021 |
|---|
|
PONE-D-21-21436Identifying Adverse Health Correlates of Intimate Partner Violence against Older Women: Mining Electronic Health RecordsPLOS ONE Dear Dr. Karakurt, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Dec 24 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Astrid M. Kamperman Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. Please update your submission to use the PLOS LaTeX template. The template and more information on our requirements for LaTeX submissions can be found at http://journals.plos.org/plosone/s/latex. 3. Thank you for stating the following financial disclosure: "Gunnur Karakurt (PI) This publication was made possible by R01-LM012518 from the National Library of Medicine. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIH" Please state what role the funders took in the study. If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript." If this statement is not correct you must amend it as needed. Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf. 4. Please amend either the abstract on the online submission form (via Edit Submission) or the abstract in the manuscript so that they are identical. 5. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: No ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: I Don't Know Reviewer #2: No ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Thanks to the authors for taking the time and effort to prepare this article. Here are some comments that the authors can consider for revising their paper. -The mathematical notations used in the article seemed unnecessary in my opinion, an example: the explanation how contingency tables were formed. I suggest authors to find simple analytic language to express their approach concisely and avoid notations when not needed. I am confused why a log odds ratio was used to present prevalence. -This does not seem to be a statistical methods paper; I wonder why readily available statistical software were not used to do the analysis, but rather manual calculation was preferred. For example, do the authors think using a software-based analysis would have saved them the space and effort for the section “Accounting for measurement error due to rounding” and made it easier for the readers to focus on the findings more? -The result section is not elaborate enough. The section mostly refers to tables and does not point out to key and important findings from the work at all. -The limitation section in discussion should list limitations of the study and not limitations in the topic (those can go in the introduction) -The conclusion should pinpoint which symptomatology or diagnoses the authors think could be flags for IPV in older women, which is missing. Rather general statements were made. Reviewer #2: PLOS One referee report PONE-D-21-21436: "Identifying Adverse Health Correlates of Intimate Partner Violence against Older Women: Mining Electronic Health Records" 2021 11 07 First, note that this reviewer is a biostatistician and not a subject matter reviewer. Therefore, most of my review is concerned with the statistical methods used in the analysis. Overview: This paper looks at the relationship between a large number of possible factors and intimate partner violence (IPV) among seniors. The data come from a large data source and the investigators identify a large number of potential factors that are looked at as possibly being related to IPV. (Initially, there are >18K factors considered.) For each examined factor, the authors basically construct a 2x2 table that is standard in epidemiological research. That is, this table looks at the cases versus the controls which are those who are and are not identified as victims of IPV. For each group (case and control), one counts the number who are exposed and not exposed to the individual factor. After creating a 2x2 table for each factor, they compute the log of the odds ratio and the associated confidence interval. If the confidence interval is greater than 0 (so the odds ratios are bigger than 1), the investigators then basically declare the factor important. Also, the reviewers compare the odds ratio for the senior population to the odds ratio of the non-senior population to see if the effect of each of the factors are discernible higher for the senior population. In the main body of the paper, the authors list some of these top factors. There is mention of a supplementary table with a larger list of the factors which are identifies as important. Note: This reviewer does note that there were some other subtle issues that are addressed in the paper. The above overview is this reviewer’s understanding of the main concepts in the paper. Overall, I like the general idea of the paper. I think that electronic health records are a largely untapped source of valuable information. However, I have important concerns that this manuscript does not do enough to account for the high number of multiple tests performed for this analysis. In the statistical literature, this is known as the multiple testing problem. Although the authors don’t mention that they are doing formal hypothesis testing, the essence of this problem still applies to this study. It might seem that I am suggesting substantial changes in the analysis; however, they would probably not take long to do. I would further note, that I did look up some papers that were produced from this general research group. Many of my below comments would have been answered in a satisfactory manner if they used the methods that they used in some other papers. I would specifically point out the paper: Karakurt, Patel, Whiting, Koyuturk, 2017, J Fam Viol 32:79-87, DOI 10.1007/s10896-016-9872-5. I will be referring to this paper as Karakurt et al in the below. I will first list some of the statistical notes/comments/issues and then list more general matters. 1) Multiple testing. Using the criteria that a factor is acceptable in some sense (high or medium confidence) if the confidence interval is greater than 0 is basically mathematically equivalent to a null hypothesis test. (Technically, this would be a 1-way test at the .025 level.) So, the authors are doing 18k null hypothesis test (or, considering only the "valid" test, there are 2056 different test). This is the classical multiple testing problem. You are saying that something is important if the test statistic is would be rare if it was random. That is, rare means that the outcome only occurs with about a 5% probability. Then you run this 2000 or so times and you see lots of “rare” things. This is a classic problem and there are many different ways to approach this. a) In the other paper (Karakurt et al), a Bonferroni correction is made. So, the authors are aware of the problem. When one is doing many test, this method might be too conservative. b) As an alternative to the Bonferroni, I prefer using the False Discovery Rate procedure (reference: Benjamini and Hochberg 1995 and Benjamini and Yekutieli 1999.) This reviewer is a fan of this method. At the end of the analysis, one gets a set of factors. In this set, it is acknowledge that some might still be by chance but overall the set contains factors which are highly likely to be important. 2) The issue that the observations are only know to the nearest "tenth". Here, the authors use a very conservative correction. When creating the log odds ratio (LOR) and the standard error estimate, they assume that the actual value is the worse possible. That is, the counts are rounded off to the nearest 10. a) In the other paper (Karakurt et al), the authors use a Monte Carlo simulation method to sample the possible values which would be the observed value plus or minus 5. That seems very reasonable to me. Basically, this approach is a type of multiple imputation used for missing data. (Also note, the authors in that paper then compute an adjusted confidence interval. This adjusted confidence interval can be used to approximate an adjusted standard error for the LOR.) b) In the other paper (Karakurt et al), take the sampled values are used to create an estimated confidence interval. Instead of that, one option is that one can use multiple imputation methods. That is, take the sampled values and then use multiple imputation methods to get an estimate of the standard error of the LOR which accounts for the interval censoring of the data. (see for example equations 14.7-14.10 in this article: https://www.sciencedirect.com/topics/mathematics/multiple-imputation for one reference for these formulas. These formulas are available many other places.) c) Instead of random sampling, one can do some calculations based on the fact that one is sampling from finite discrete distribution. That is, if one is planning on sampling an integer between -5 and 5, then just calculate the LOR and standard error for each of these 11 points and then use the formulas in (b) above. 3) Prevalence: The term prevalence has a well-defined definition in the epidemiology and population health sciences. It is the number of events in a population at a designated time. There is a similar definition of the prevalence rate. The use of the term "prevalence score" to mean the log of the odds ratio is very confusing and is a misuse of well-established terms. 4) Difference between the log odds ratios between the senior population and the younger population. On page 4, the authors look at “Assessment of Differential Prevalence”. They define this term in equation 5 and then give their formula for the confidence intervals in equation 6. This is a very reasonable thing to do. It is the difference between the two LOR. It is the increase in the odds between the senior population versus the non-senior population. This is a very common statistics to look at therefore the statistical properties are well known. The authors need to change the way that they compute these confidence intervals. Note the following statistical properties: a) the LOR statistics are approximately normally distributed with the standard errors as given in the equation 2. b) Since the senior population and the younger population are completely different populations, then these samples are statistically independence. c) Therefore, the difference of the two statistic defined in equation 5 has an approximate a normal distribution. d) standard error of difference of between the sampled log odd ratio's between these two populations is just "square root of the sum of the squared standard errors". That is, if the standard errors are S1 and S2, then the standard error of the differences is sqrt( S1^2 + S2^2). One can then get the confidence interval of the difference in the log odds ratio by the methods already discussed in the paper. (That is, it is the difference +/- 1.96* standard error of the difference as described in equation 3). 5) I don't see what the value of equation 4. That is, the value of using a mean adjusted prevalence score which is obtained by subtracting the LOR by the average of the LOR and then using that to declare "high" effect. That seems like an attempt to control for the multiple testing, but does not appear to have any validity in controlling for the multiple testing problem. If one took completely random data and then applied this procedure, it would falsely show that some of these random statistics showed a high effect. In fact, since among a random sample of values, by construction, there will always be some that would be "above the average". (Aside: you did not describe how to construct the confidence interval associated to this mean adjusted prevalence score.) Maybe I’m missing something here and there is validity to compute the mean adjusted prevalence score. If so, then please provide some literature to support this method. Some more general comments about the paper: 6) I don't know what the source of the data is. I am not researcher working in USA, so I am not familiar with this data source. a) this data source should be reference and either described or a paper which adequately describes this data source should be reference. b) I don't know where these terms/factors come from. It looks there is a query to the data set program, but I don't know what kinds of queries are being made. I am familiar with ICD codes and know where they come from, but I don't know why or how the authors came up with 18K different factors that were then tested. 7) The type of results reported. I think that if the purpose of this paper is to give information on the factors which contribute to IPV in seniors, then I think more analysis should be done. In the results section of the paper, the authors report that they identified 250 and 1240 terms with high or medium confidence. Then, in tables 1 and 2, they list the top 20 terms. (I did not receive the supplemental file, but I presume the other 1000+ terms are listed there.) My main concern is, “how is a scientist suppose to synthesis this information?” These are a lot of terms here. I would suggest that this information would be more useful if this paper was able to organize or structure this information better. For example, they could cluster the subjects or group together the main terms in some way. (A first pass cluster analysis or factor analysis for example.) I would assume that there is high correlation between these terms. Note that there is a total of only 420 senior IPV subjects. (See Figure 1b) Also, I see that members of this group have done similar work before. (See for example, Hacialiefeniouglu et al, 2021, Scientific Report.) 8) The statistics that are reported in tables 1 and 2. In these tables, the authors report log (base 2) odds ratios and counts. These are not the easiest values to interpret. Usually, one would transform the log odds ratio and confidence interval to the odd ratio scale. The usual population scientist thinks in terms of odds ratios. Most cannot convert from log odds ratios in their heads. (And even the ones that do have a feel for log odds ratios usually works in the natural log scale not in base 2. So, they would have to multiple their usual calibrated LOR by 0.7 to make the conversion.) The raw counts are okay, but the proportions are more convenient for those who want to interpret the results. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: Yes: Michael Escobar [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.
|
| Revision 1 |
|
PONE-D-21-21436R1Adverse health correlates of intimate partner violence against older women: Mining electronic health records PLOS ONE Dear Dr. Karakurt, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. The authors have responded to the concerns expressed in the previous round of review comments in the revised manuscript. The revised manuscript is somewhat improved—thank you. However, the authors should address the following issues further before this manuscript is considered for publication.
Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Kenta Matsumura Academic Editor PLOS ONE [Note: HTML markup is below. Please do not edit.] Reviewers' comments: [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 2 |
|
Adverse health correlates of intimate partner violence against older women: Mining electronic health records PONE-D-21-21436R2 Dear Dr. Karakurt, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Habil Otanga, Ph.D Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: (No Response) ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: I Don't Know Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors seem to have addressed all reviewer comments. No major comments from my part. However, I would have specified "health conditions" instead of just "conditions" in lines 54 and 56. In line 185, authors mention of "detection bias". I would have termed it as "misclassification bias": a term commonly used in epidemiology for this bias category. I agree with the previous reviewer that the use of statistical formulas seem unnecessary at some parts, but authors have made valid arguments to justify their inclusion of formulas. Reviewer #2: I was the 2nd referee for the first version of this paper. I think this paper covers an important issues and collected some important data. As for the analysis, they looped through many important possibly important health correlates of IPV. They controlled for the multiple testing problem and the fact that the data was rounded to the nearest tens. In the body of the main paper they listed the strongest correlates and included the data for the other correlates/ variables that they collected. There are some parts of the analysis that I still find odd. This is the third revision of the paper and some of these issues have been in the first draft and other referees besides me have pointed these outs. There seems to be a stand still on these points. I was tempted to vote that the manuscript be rejected. However, as I pointed out above, this is an important paper and I think the overall message is important. Here are some of the details that I have issues with: 1) The authors insist on including the definition of the log odds ratio and is variance in the main body of the paper. They said that it is important since some might not know about this. Please note that this paper is clearly in the area of trauma epidemiology. I have been in the field of epidemiology and biostatistics for several decades, and this material was standard when I started graduate school. With all due respect, this seems rather odd in a major journal like this one. 2) Detection bias. I am not sure what to make of this. Large administrative data has variables which are under reported. It does seem that detection bias is another variant of this. However, it is not clear how to adjust for it. I think that one would simple note this as a limitation of using administrative/ eHR data. (I see that they have a prescription, and that is my next point.) 3) It seems that they have decided to select only those correlates where the odds ratio is statistically significant to the average odds ratio of all the correlates. (Note: here the average is the geometric mean. Also, rejecting the null hypothesis at the 5% level is mathematically equivalent to having the average not contained in the 95% confidence interval.) I don’t understand what the rational is for doing this. I don’t think that the authors are suggesting that this should be a standard methodological practice. 4) In item 3 above, the criteria turns out to be to consider terms where the odds ratio is significantly bigger than 3. Please note that in general an odds ratio is a rather odds ratio. For example in some studies, the odds ratio between high blood pressure and stroke is around 2 or 3. So, in many studies, this would be a big effect. By only looking at very large effects, you risk only finding effects which are so big that most people in the field are already aware of these risk factors. Sometimes, the real value of these studies is finding small signals that people were not aware of. I would be more concerned about items 2-4, except the authors do include some excellent supplemental materials which give the values for these smaller signals. Also, it is the case that with such studies that contain a lot, it is the authors prerogative to concentrate on a part of the results in the body of the paper. I would feel better about this discuss if they simply said that they were going to discuss the biggest effects and interested readers could look at the supplemental material for other details. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Riffat Ara Shawon Reviewer #2: No ********** |
| Formally Accepted |
|
PONE-D-21-21436R2 Adverse health correlates of intimate partner violence against older women: Mining electronic health records Dear Dr. Karakurt: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Habil Otanga Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .