Peer Review History
| Original SubmissionOctober 15, 2024 |
|---|
|
PONE-D-24-46561Towards using Tweet Sentiment for Infectious Disease DetectionPLOS ONE Dear Dr. Anderson, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jan 18 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Li-Pang Chen Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. We note that you have indicated that there are restrictions to data sharing for this study. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Before we proceed with your manuscript, please address the following prompts: a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., a Research Ethics Committee or Institutional Review Board, etc.). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of recommended repositories, please see https://journals.plos.org/plosone/s/recommended-repositories. You also have the option of uploading the data as Supporting Information files, but we would recommend depositing data directly to a data repository if possible. We will update your Data Availability statement on your behalf to reflect the information you provide. 3. When completing the data availability statement of the submission form, you indicated that you will make your data available on acceptance. We strongly recommend all authors decide on a data sharing plan before acceptance, as the process can be lengthy and hold up publication timelines. Please note that, though access restrictions are acceptable now, your entire data will need to be made freely accessible if your manuscript is accepted for publication. This policy applies to all data except where public deposition would breach compliance with the protocol approved by your research ethics board. If you are unable to adhere to our open data policy, please kindly revise your statement to explain your reasoning and we will seek the editor's input on an exemption. Please be assured that, once you have provided your new statement, the assessment of your exemption will not hold up the peer review process. 4. Please update your submission to use the PLOS LaTeX template. The template and more information on our requirements for LaTeX submissions can be found at http://journals.plos.org/plosone/s/latex. 5. Thank you for stating the following in the Acknowledgments Section of your manuscript: “This study was funded by the National Science Foundation (NSF) Awards #2109647, #2302968, and #2302970.” We note that you have provided additional information within the Acknowledgements Section that is not currently declared in your Funding Statement. Please note that funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: “This study was funded by the National Science Foundation (NSF) Awards #2109647, #2302968, and #2302970. Please include your amended statements within your cover letter; we will change the online submission form on your behalf. Additional Editor Comments (if provided): The manuscript has been reviewed by two referees. One of both provided a positive feedback while the other did not. Provided that your work is interesting, I decide to stand by the positive position and give the authors an opportunity to revise the manuscript. Hence, please carefully address all comments raised by two referees. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: # Overview The paper aims to build on existing studies that find weak or mixed correlation between COVID incidence and Twitter sentiment in the United States by investigating the spatial and temporal variation in this correlation. To this end, the authors combine data from the TBCOV Tweet dataset with data on COVID prevalence from the CDC to generate average sentiment and COVID prevalence at the county–week level. They then calculate Pearson's correlation statistic on different subsets of this data. Below, I give a high-level assessment of (a) the framing and literature review and (b) the methodology used. I then list some smaller issues that I encountered in the manuscript. # Framing and literature An initial issue with the manuscript is a mismatch between the framing and the analysis. The authors situate the importance of their analysis in terms of contributing to the rapid detection (or even prevention) of epidemic outbreaks. However the analysis itself does not provide any treatment of the the predictive value of sentiment for disease outbreaks. One of the main features of the relationship between COVID cases and Tweet sentiment illustrated in the paper is that there is a large amount of unexplained variability in this relationship across space and time. Because of this, the utility of the findings is overstated. The literature that the authors cite at the beginning of the article does a good job motivating the potential for social media in epidemiological studies in general. However the paper overstates the novelty of the spatial analysis ti presents. Notably, the original paper accompanying the TBCOV dataset the paper relies on includes a similar county-level analysis. I was also surprised that the authors did not reference the considerable body of research demonstrating systematic geographic variation in social media sentiment in the United States (e.g. Mitchell et al 2013). Given the emphasis on geographic variation in the analysis, such work seems particularly relevant. # Methodology Overall, the methodology used in the paper is well described, but it is overly simplistic for an investigation of the relationship between social media sentiment and disease prevalence. Among the main findings of the paper is that correlations between sentiment and COVID cases are often stronger when they are calculated on smaller geographically and temporally restricted slices of the data. This kind of finding should not be surprising, though: stronger correlations are exactly what one would expect if the data has any sort of underlying smoothness (i.e. temporal and spatial autocorrelation). That said, the authors note some intriguing patterns such as the strong negative correlation from July to September of 2020 or the East–West differences in direction and clustering of correlation. Unfortunately, the authors do little to try to explain these patterns. Are the temporal (or geographic) patterns associated with national (or local) policy? Do the spatial patterns correspond with population density, socioeconomic differences, or medical resources? If the stated goal is to contribute to early warnings for outbreaks, explaining the differences in magnitude and direction of correlation is vital. In my view, a regression analysis that takes into account spatial and geographic features would be a much more appropriate method to use. # Smaller issues - In Figure 1(b), the rolling 7-day average of COVID cases per 100K in Washington falls below zero at the beginning of 2021. This warrants explanation (a quick view of the cited COVID data source doesn't show any negative values) - On page 4, the authors state "Deriving the sentiment of each such set provides us with a function S(w, r) that returns the sentiment of all tweets observed in region r during week w." I couldn't find a description of how sentiment was aggregated across sets of tweets. - In figure 2(b) and in the text on pages 6 and 7 the authors use p-values in a loose way to assess significance. A statistic like a Z-score would be more appropriate. - On page 6, the authors state that "we observe that our hypothesis of temporal heterogeneity is supported," but relevant the hypotheses put forward on page 4 was that the correlation would be "stable over time." - I do not believe the authors ever justified their choices of dates separating the three 'waves' used in the final spatiotemporal analysis. References Mitchell L, Frank MR, Harris KD, Dodds PS, Danforth CM (2013) The Geography of Happiness: Connecting Twitter Sentiment and Expression, Demographics, and Objective Characteristics of Place. PLoS ONE 8(5): e64417. https://doi.org/10.1371/journal.pone.0064417 Reviewer #2: Towards Using Tweet Sentiment for Infectious Disease Detection The authors explore whether social media sentiment, specifically derived from tweets, can function as an early warning system for infectious disease outbreaks, focusing on COVID-19. Utilizing a dataset of 56,755,894 tweets from the platform X, they analyze spatial, temporal, and spatiotemporal correlations between tweet sentiment and COVID-19 case counts, aggregating data by county and week. Their temporal analysis identifies a significant negative correlation between sentiment and cases from June to September 2020, suggesting that declining sentiment could predict outbreaks. Spatially, they find contrasting trends: a negative correlation in the Eastern U.S. and a positive correlation in the Western U.S., highlighting regional variability. Spatiotemporal analysis uncovers even stronger correlations during specific periods and regions, emphasizing the potential of social media as a complementary tool to traditional public health surveillance systems if contextual dynamics are better understood. The paper is well-structured and employs a consistent methodology. However, there are several areas where the authors could improve their work before publication: 1. Literature on Complex Systems and Social Media Communication The manuscript would benefit from incorporating insights from complex systems research that explore the relationship between social media communication patterns and disaster prediction, prevention or even management after. For instance, studies like “Rapid Assessment of Disaster Damage Using Social Media Activity” could be relevant. 2. Critique of Traditional Data Collection Methods While the authors critique testing bias and the lack of standardized reporting metrics in traditional methods, additional points could strengthen this discussion. For example, official census data often relies on diverse methodologies and is influenced by budgetary constraints, while surveys and questionnaires can introduce subjectivity in evaluation criteria. In the specific context of COVID-19 mapping data, studies such as “Geospatial Analysis and Mapping Strategies for Fine-Grained and Detailed COVID-19 Data with GIS” highlight these issues and could provide a solid foundation to justify the approach taken in this study. 3. Reliability of Social Media Data During Pandemics The authors argue that microblogs like tweets are effective in detecting disease prevalence (e.g., influenza, dengue). However, why did such globally accessible and rapid communication channels fall short during the COVID-19 pandemic? This question warrants exploration to address the limitations of social media in large-scale health crises. 4. Contradictory Results in the Literature The manuscript references findings like those by “Valdez et al were surprised to observe a positive correlation between US wide COVID-19 related Tweet sentiment and cases and deaths, meaning that as cases and deaths increase in the US, sentiment towards COVID-19 trends positive. This contradicts what would intuitively be expected. In another example, Feng and Kirkley find a weak negative or absent correlation between state-level Tweet sentiment and COVID-19 and cases and deaths”. How can these contradictory and counterintuitive results across studies be interpreted? A discussion of these inconsistencies in the literature would enhance the paper’s depth. 5. Complexity of Spatial and Temporal Contexts The authors should consider the temporal and spatial heterogeneity of the pandemic’s impact. Policies and interventions varied significantly across counties and states, influencing social behavior and pandemic outcomes. For instance, Maleki (2022) examines social factors influencing compliance with COVID-19 interventions in the U.S., offering valuable insights into behavioral changes during the pandemic. 6. Matching Real-World and Social Media Populations A key limitation of using Twitter data is the demographic bias of its user base, which tends to be younger, more educated, and urban. How well does this population represent broader societal trends? While some areas may align more closely with Twitter sentiment, others may diverge significantly. The authors should consider a twofold discussion: (a) The replicability of real-world events in digital spaces, and (b) The complexity of social space, where factors like heterogeneity, scale, and context introduce variability in data interpretation (e.g., Morales et al., 2022). 7. Expanding Discussion and Results The discussion and results sections are currently underdeveloped. While the authors present key findings, they provide limited analysis or interpretation. Expanding these sections with a deeper examination of results, including comparisons with previous studies and a critical evaluation of implications, would significantly enhance the paper’s impact. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
Towards using Tweet Sentiment for Infectious Disease Detection PONE-D-24-46561R1 Dear Dr. Anderson, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Li-Pang Chen Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #2: Dear Authors, After the review conducted by the authors in this review round, the modifications have been satisfactorily addressed. For that, I can suggest that this manuscript can be accepted for publication. Congrats on your achievement, Best, the reviewer ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #2: No ********** |
| Formally Accepted |
|
PONE-D-24-46561R1 PLOS ONE Dear Dr. Anderson, I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team. At this stage, our production department will prepare your paper for publication. This includes ensuring the following: * All references, tables, and figures are properly cited * All relevant supporting information is included in the manuscript submission, * There are no issues that prevent the paper from being properly typeset You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps. Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. If we can help with anything else, please email us at customercare@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Li-Pang Chen Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .