Peer Review History
| Original SubmissionAugust 28, 2021 |
|---|
|
PONE-D-21-27904Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysisPLOS ONE Dear Dr. Baruah Thank you for submitting your manuscript to PLOS ONE, and apologies once again for the length of time it took to send this work out for review. I am pleased to report that I have now managed to secure two expert reviews of your work. Both reviewers feel that it the work has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Both reviewers agree that this is a worthwhile and carefully conducted study, something that I also sensed when reading the manuscript. All of us appreciate the clear writing style. Reviewer 1 raises some "high-level" issues, mostly around the interpretation of your findings. They are particularly concerned about possible causal relationships between the correlates that you identify in your work, and I agree that this needs some finessing in a revision. As Reviewer 1 states, if one wishes to offer a particular causal interpretation, evidence needs to be provided to support this; otherwise, one should instead emphasise the nature of the correlations and at most tentatively suggest causal hypotheses that could be further explored in future work. At the same time, Reviewer 1 notes that the manuscript presents a body of facts, but does not offer possible explanations for them - so clearly some speculation is to be appreciated, as long as it is presented as such. Reviewer 2 makes a similar observation and would appreciate a more extensive discussion of where your results fit within current theoretical frameworks and implications for future work. I also felt this would be necessary to include in a revision, and despite the brevity of Reviewer 2's report, it gets to the heart of the main issue standing in the way of an acceptance decision. My impression is that all comments of the reviewers are reasonable, not in conflict, and can be accommodated with relatively minor additions to the manuscript. Certainly it does not look as though more technical work is required, just a little more effort in discussing and interpreting the results so that the reader can better appreciate the implications of your findings (rather than the nature of the findings themselves, which is well explained). My preference wherever possible is to avoid a second round of review, which I think is feasible if you are able to address all the points raised by the reviewers primarily through changes to the manuscript and only occasionally with a rebuttal that makes a compelling case without recourse to lengthy technical detail. Please submit your revised manuscript by Feb 20 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Richard A Blythe Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match. When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section. 3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: I Don't Know ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: PLOS ONE PONE-D-21-27904 Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis General Comments The paper presents a method for extracting information about trends in the frequency and sentiment words that describe various occupations in movies and TV shows. The frequency and sentiment information is extracted from the subtitles of 136,000 movies and TV shows, using Named Entity Recognition, WordNet, Word Sense Disambiguation, and Sentiment Analysis. In general, the paper is clearly written and the methods used for analysis of the data are appropriate. The following review first discusses some high-level issues and then discusses some low-level details. High-level Concepts Abstract: "Professions that employ more people have increased media frequency, supporting our hypothesis that media acts as a mirror to society." -- Let E be the levels of employment in various professions. Let F be the frequency of various words for professions. Let S be the sentiment of various words applied to professions. The sentence quoted here suggests that variation in E causes variation in F and S: that is, E is the real thing, whereas F and S are merely reflections (mirrors) of E. The paper makes it clear that E is correlated with F and S, but the paper does not demonstrate how the causal connection goes. There are (at least) three possibilities: (1) Variation in E causes variations in F and S. (2) Variations in F and S cause variations in E. (3) Variation in some other thing, X, causes variation in E, F, and S. All three of these causal connections seem possible to me, and it is not at all clear which is the actual connection. The authors need to either (A) be clear that they have only demonstrated correlation and not causation or (B) provide solid evidence that there is indeed a specific causal relation connecting E, F, and S. If the authors choose (A) (causation unknown), then they should remove all expressions, such as "acts as a mirror", that imply a specific causal relation. My guess is option (3): changes in technology (X) drive changes in the level of employment in various professions (E), they drive changes in sentiment towards various professions (S), and they drive changes in the frequency of various words for professions. But I have no rigorous evidence for this hypothesis. Lines 8-10: "Cultivation theory suggests that prolonged exposure to the content we see on TV shapes our outlook and makes us believe that to be our reality [1]." -- There should be some discussion of alternatives to Cultivation theory. For example: The Fruits of Cultivation Analysis: A Reexamination of Some Effects of Television Watching (pp. 287-302), Michael Hughes. -- https://www.jstor.org/stable/2748103 -- Wikipedia says that there are two alternatives to Cultivation theory that are more widely used: "In a 2004 study, surveying almost 2,000 articles published in the top three mass communication journals since 1956, Jennings Bryant and Dorina Miron found that cultivation theory was the third most frequently utilized cultural theory." -- https://en.wikipedia.org/wiki/Cultivation_theory Lines 553-557: "The frequency trend of the SOC group does not reflect the frequency trend of all its professions. For example, Figs 8 e) and f) show increasing trends for many sales-related professions, but the overall frequency of the Sales Occupations SOC group decreased. A large proportion of Sales Occupations mentions are comprised of bankers and cashiers, whose frequencies have decreased." -- Do the authors have an explanation for this? The paper should not be merely a collection of facts. Perhaps it is difficult to prove an explanation is correct, however it would be useful to suggest some hypotheses. Lines 569-570: "Therefore, the number of positive sentiment mentions is greater than the number of negative sentiment mentions for all the SOC groups." -- Again, some hypothetical explanations would be of interest to the reader. Lines 589-598: "Fig 12 shows the Spearman's rank correlation coefficient [...] also one of the most positive sentiment trends." -- Some hypothetical explanations would be of interest to the reader. Lines 676-677: "In this section, we test the hypothesis that media stories reflect real-world events." -- The word "reflect" here is somewhat metaphorical. There is an implication that real-world events cause media stories. But perhaps media stories cause real-world events. Or perhaps both are caused by some third factor. All that the authors can say with certainty is that there is correlation. I would very much welcome some explicit discussion of causality in this paper, but I don't think it's right to smuggle causality into the paper using terms such as "reflect". If the authors are unable to firmly state that there is a certain causal connection, they could still offer some tentative hypotheses about causation. Lines 726-720: "Lastly, we observed that media frequency correlates with the employment trend of most SOC groups. Professions that employed more people were also more frequently mentioned in media content. This supports our hypothesis that media mirrors society and plays a role in our professional choices." -- Again, the word "mirror" suggests that society is the cause and media is the effect. This paragraph starts with "correlates" (quite rightly) and ends with "mirrors" (somewhat unscientific). I encourage some discussion of causality, but it should be explicit and straightforward, acknowledging the difficulty of causal analysis. Low-level Details Abstract: "Societal ideas and trends dictate media narratives and cinematic depictions which in turn influences people's beliefs and perceptions of the real world." -- "ideas", "trends", and "depictions" are plurals; therefore the verb should be "influence", not "influences". Abstract: "study the effect of media attributes like genre, country of production, and title type" -- replace "like" with "such as" -- http://www.differencebetween.net/language/grammar-language/difference-between-such-as-and-like Abstract: "and investigate if the incidence of professions in media subtitles correlate with their real-world employment statistics" -- replace "if" with "whether" -- https://www.merriam-webster.com/words-at-play/if-vs-whether-difference-usage Lines 16-18: "Professions are paid, skilled work we perform to provide services to people and earn a livelihood. It defines our role in society and allows us to contribute to a nation's economy." -- replace "It defines" with "They define" Lines 21-22: "Therefore, it is essential to study and assess the changes in the occupational structure of the nation." -- Why "of the nation"? Which nation are you talking about? Should it be "of a nation" or "of nations"? Lines 40-41: "Several studies have examined the nature of the portrayal of different professions in popular media, like lawyers [13], accountants [14], physicians [15], and cops [16]." -- replace "like" with "such as" Lines 67-69: "2. We describe and share a new corpus of professional mentions, spanning 4,000 professions, 136,000 movies and TV shows, ranging over the years 1950 to 2017 68 (almost 7 decades) created by analyzing job title occurrences in media subtitles." -- This would be a good place to add a reference to the list of references at the end of the paper. The reference would give a URL for downloading the new corpus. I think it is best to insert a reference here, rather than inserting a URL here. Lines 72-75: "4. We analyze the frequency and contextual sentiment trend of professional mentions in media over time. We also investigate the presence of any correlation between incidence of professional mentions and the genre, title type, and country of production of the movie or TV show." -- This would be a good place to add references to the sentiment algorithm and the sentiment dataset. Lines 105-108: "Kalisch et al. analyzed 670 nurse and 466 physician characters in novels, movies and television series, and concluded that compared to physicians, media nurses were consistently less central to the plot, less intelligent, rational, and less likely to exercise clinical judgement." -- I think "rational" should be "less rational". Lines 108-111: "Smith et al. investigated gender representation of occupations in films, prime-time programs, and children TV shows, and found that females are grossly underrepresented compared to males in science, technology, engineering and math jobs (STEM) [24]." -- The politically correct use of "female" and "male" is that these terms should only be used as adjectives ("female doctor", "male nurse"). The appropriate nouns are "men" and "women". -- https://golin.com/2021/03/31/stop-using-female-when-you-mean-woman/ -- https://medium.com/fearless-she-wrote/woman-vs-female-67fd4c36fe59 -- https://www.buzzfeed.com/tracyclayton/stop-calling-women-females Lines 130-131: "The TAC KBP track of entity discovery and linking introduced job titles as entity types to model the person:title relationship [30]." -- What is TAC KBP? Add the long form in parentheses. Lines 139-140: "Liu et al. combined dictionary lookups with semi-Markov CRF architectures and ..." -- What is CRF? Add the long form in parentheses. Lines 147-149: "Aside from careful human coding, such gazetteers can be constructed using different automated methods, including those that leverage existing knowledge bases like Wikipedia and WordNet [35, 36]." -- replace "like" with "such as" Lines 160-161: "Synsets are tagged by semantic classes (the complete tagset can be found at wordnet.princeton.edu)." -- replace the URL with a proper reference at the end of the paper Lines 218-220: "4. You're going to be a lousy architect. (Neutral) Explanation: The person towards which the negative sentiment is expressed, is not an architect." -- "is not yet an architect" Lines 221-222: "Benchmark ABSA datasets exist for several domains like question answering forums, customer reviews and tweets [56{58]." -- replace "like" with "such as" Lines 222-224: "Dong et al. proposed an adaptive recursive neural network for target-dependent twitter sentiment classification, that propagated the sentiments of words to target depending upon the context and syntactic relations [58]." -- replace "words to target" with "words to their targets" Lines 267-268: "An IMDB title can have multiple genres." -- replace "IMDB" with "IMDb" Lines 349-351: "We create a word-document search index using the Whoosh Python package for quick retrieval of mentions." -- add a proper reference to Whoosh at the end of the paper, with a URL Lines 360-361: "We apply the Stanford CoreNLP NER model [31] to find the named entity tag of words." -- I think "tag" should be "tags" Lines 381-382: "The CDM architecture masks less-SRD tokens, whereas the CDW architecture weighs them dynamically." -- This needs more explanation. I don't know what it means. Lines 740-741: "The profession taxonomy and sentiment-annotated subtitle corpus is publicly available at -- Replace the URL with a proper reference at the end of the paper, including the URL in the reference, but not in the main body of the paper. Reviewer #2: This paper has interesting results and methods, and, in my opinion, would be worthy of publication, after minor revisions. Although the data and methodology are clearly presented, I would have appreciated the authors pointing out more explicitly their limitations. The discussion section need substantive rework. While it summarizes well the main results of the article, it fails to engage in a reflective discussion with the findings of other studies. The discussion part would benefit from citing more social sciences references supporting the idea that "media mirrors society and plays a role in our professional choices". Also, it would be relevant to underline the main limitations of the study and to provide recommendations for future research in computational social sciences especially in other national and linguistic contexts Overall, my recommendation is that the manuscript can be published with minor modifications ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
PONE-D-21-27904R1Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysisPLOS ONE Dear Dr. Baruah, Thank you for resubmitting your manuscript to PLOS ONE. I have read through your response and the revisions to the paper. As stated in my previous message, my preference in general is to avoid a second round of review and I think in this instance this would be achievable if you were able to make some further small revisions to the Discussion section.In response to me and Reviewer 1, you have included some suggestions as to factors that might be responsible for the changes that have been detected in the corpus. Clearly these are speculative, as we suggested they might be, but I think some further text is necessary to draw attention to the speculative nature of the proposed connections, and the need for further work to establish whether or not they actually hold.My suggestion would be to insert some text at the start of the Discussion section, nothing that you will tentatively suggest some possible causes of the changes. This might allow you to deviate a little from the "X might be because of Y" template you followed throughout the discussion, which I found a little repetitive. I think it would be more satisfactory to employ forms of language like "There is some evidence of Y [reference], which could lead to [X happening]" and variants that remove repetition. Where possible it could be useful to add comments as to the mechanism by which it might filter through to your dataset and/or what you might need to do in practice to demonstrate the connection. I think overall this would make for a more satisfying and professional discussion, and if you were able to achieve this I would expect to be able to accept the manuscript without the need to consult the reviewers again.I would hope that this would not take you more than a few hours. Please submit your revised manuscript by Apr 21 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-emailutm_source=authorlettersutm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Richard A Blythe Academic Editor PLOS ONE Journal Requirements: Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 2 |
|
Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis PONE-D-21-27904R2 Dear Dr. Baruah, Thank you for reconsidering the discussion section of this manuscript, which is now much improved. I'm pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Richard A Blythe Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: |
| Formally Accepted |
|
PONE-D-21-27904R2 Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis Dear Dr. Baruah: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Prof. Richard A Blythe Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .