Peer Review History
| Original SubmissionMay 19, 2025 |
|---|
|
PONE-D-25-27112The Shifting Landscape of Vaccine Discourse: A Massive Dataset Examining a Decade of Social Media Posts on Vaccines Before and During the COVID-19 PandemicPLOS ONE Dear Dr. Caragea, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. In particular, I have summarized the issues raised by the reviewers as follows. The most important concerns relate to the methodological robustness of your emotion and stance detection. Reviewer 1 noted that the use of the NRC lexicon requires stronger justification, and suggested either providing a clear rationale for this choice or considering the use of more advanced transformer-based methods. Reviewer 2 raised concerns about the stance detection results, observing that the agreement between the Llama 3.1 model and human annotators (around 66%) is too low to support precise quantitative claims. These outputs should therefore be framed as exploratory or approximate rather than definitive. In the same area, Reviewer 1 also asked for fuller reporting of model performance (precision, recall, and F1 scores per class) and clarification of the temperature settings used in your prompting strategy. Another point raised by Reviewer 2 concerns the scope and framing of your findings. At times, the paper suggests broader implications for public perceptions or societal views, but since your dataset is limited to English-language posts on X/Twitter, conclusions should be carefully framed as such rather than generalized to the population at large. Relatedly, Reviewer 2 encouraged a clearer articulation of the theoretical framework, or at least an acknowledgment of its speculative nature if it cannot be fully justified. The reviewers also highlighted data quality and limitations. Reviewer 2 asked for some manual validation of your filtering procedure to ensure that irrelevant posts (such as references to “The Vaccines” band) were not mistakenly included or excluded. They also pointed out that your decision to omit engagement metrics such as likes, shares, and retweets is reasonable, but should be more explicitly presented as a limitation since it constrains conclusions about diffusion, influence, and reach. Finally, Reviewer 1 recommended improvements to figures and measures. Specifically, greater clarity is needed in labeling (for example, using “number of characters” rather than “length” and adding missing y-axis labels). Figure 1 would be more useful if contextualized against overall Twitter activity, while Figure 3 and Table 3 would benefit from using the proportion of emotion words rather than average word count, or at least from visually marking the 2017 change in Twitter’s character limit to avoid misinterpretation. In sum, the main revisions needed are to (1) strengthen and clarify methodological choices (emotion detection, stance detection, dataset filtering), (2) temper the generalization of findings, (3) address theoretical framing, and (4) improve figure clarity and limitations discussion. Please submit your revised manuscript by Oct 19 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Carlos Carrasco-Farré Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: I would like to thank the authors for their submission. I appreciate how comprehensive this paper is in its view of vaccine discourse on Twitter/X over the years, and I find the results about shifts in emotional language and the emotional content of anti-vax posts to be particularly interesting. Overall, I think the paper is nearly ready for publication -- I just have a few notes about how the paper could be revised/improved. My primary concern with the paper is the use of the NRC lexicon as a method of emotion detection. It's possible that this is the best method to use for your case, but I don't see a strong justification for it written in the paper. My sense is that transformer-based supervised learning methods would be better for the task of emotion recognition. Therefore, I would either like to see a more accurate method used for emotion detection, or a justification for why the lexicon should be used over deep learning methods. Another, less important, concern I have is that Figure 1 would be more useful if there were some baseline representing how much overall activity on Twitter has increased over the years (e.g., is the spike in 2015 due to a bunch more people using Twitter or were people really more likely to talk about vaccines at that time?) I understand that these data might be hard to come by now since the API is no longer accessible, but if you're able to find some dataset representing overall Twitter activity during your period of study, it would be useful to include it as a comparison point. Similarly, for Figure 3 and Table 3 -- is average number of words really the best measure here? I would think that measuring the average proportion of emotion words would be better, especially because of the fact that Twitter increased their character limit. The huge spike attributable to that increase in late 2017 makes it difficult to compare any time before that to any time after that. If you do choose to stick to number of words instead of relative proportion, I would encourage you to include a vertical line in Figure 3 at the point in time when Twitter increased their character limit (the figure could be circulated without the caption). In Figure 2, I would also change the word "length" to "number of characters," since the meaning of the word "length" is ambiguous (could also apply to number of words). I've also noticed that y-axis labels are missing from many of these plots, make sure to add those. Finally, I see that you report the accuracy of your stance detection model -- what are the precision, recall, and f1 scores for each class? I also think the temperature of the language model matters a lot for your evaluation setup, where you prompt the model 5 times for each post. If the temperature is too low, it will give you the same output every time. If it's too high, it might be too random. What temperature value did you set for the model, and how would your average/std. error of accuracy vary for different temperature values? Reviewer #2: The manuscript presents a large-scale analysis of vaccine-related discourse on X/Twitter between 2013 and 2022. The dataset is impressive in scope and will be useful for future research. The integration of lexical analyses and LLM-based stance classification is timely. However, in its current form, the manuscript overstates its contributions and requires important revisions before it can be considered acceptable. Major concerns 1/ Reliability of data. The reported agreement between Llama 3.1 and human annotators (66.2% ±4.6) is too low to support precise quantitative claims. It can be used for identifying broad trends, it cannot provide accurate proportions of pro/anti vaccine discourse. The manuscript must explicitly frame these outputs as exploratory and approximates. The claims suggesting exact distributions of results should be removed or softened. 2/ Overgeneralization of results. The paper often refers to "public perceptions" or "societal views." This is misleading: the dataset only covers English-language posts from a single platform, X, which is demographically skewed. Conclusions should consistently be limited to "English-language vaccine discourse on X" rather than generalized to the population at large. 3/ Theoretical framework. The manuscript should provide stronger theoretical justification or acknowledge the speculative nature of their application. 4/ Dataset filtering. The procedure for excluding irrelevant content (i.e., references to "The Vaccines" band) relies on embeddings and thresholds, but no validation is reported. Without a sample-based check, the risk of false positives/negatives remains high. The authors should report at least some manual validation to strengthen confidence in dataset integrity. 5/ Engagement metrics. The exclusion of likes, shares, and retweets is justified as ambiguous, but this decision limits interpretability. The limitations section should more explicitly state that the absence of engagement analysis prevents conclusions about diffusion, influence, or reach of vaccine discourse. [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
The Shifting Landscape of Vaccine Discourse: A Massive Dataset Examining a Decade of English-language Social Media Posts on Vaccines Before and During the COVID-19 Pandemic PONE-D-25-27112R1 Dear Dr. Caragea, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Carlos Carrasco-Farré Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: No ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: (No Response) Reviewer #2: The revised manuscript presents a clear, methodologically sound, and transparent investigation of the relationship between neuroticism and social media addiction, incorporating the mediating role of perceived algorithmic personalization. The authors have satisfactorily addressed all prior reviewer and editorial comments, improving structure, clarity, and theoretical framing. The text is concise, logically organized, and free of overstatement. References and variable naming have been clarified. No ethical, methodological, or data integrity concerns are detected. Nevertheless: Ensure that the data/code availability statement is complete and accessible. Verify that item examples or full wording of the new scale appear in the manuscript or as supplementary material. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: Yes: David Badajoz-Dávila ********** |
| Formally Accepted |
|
PONE-D-25-27112R1 PLOS One Dear Dr. Caragea, I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS One. Congratulations! Your manuscript is now being handed over to our production team. At this stage, our production department will prepare your paper for publication. This includes ensuring the following: * All references, tables, and figures are properly cited * All relevant supporting information is included in the manuscript submission, * There are no issues that prevent the paper from being properly typeset You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps. Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing. If we can help with anything else, please email us at customercare@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Carlos Carrasco-Farré Academic Editor PLOS One |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .