Peer Review History
| Original SubmissionAugust 15, 2022 |
|---|
|
PONE-D-22-22873Algorithmic voice transformations reveal the phonological basis of language-familiarity effects in cross-cultural emotion judgmentsPLOS ONE Dear Dr. Nakai, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. ============================== Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. While the manuscript was overall found satisfactory, there are a series of requests for clarifications that need to be carefully addressed to clear out the remaining concerns of the reviewers. I think addressing these points will strengthen the manuscript. Reviewer 1 raised two points I think need addressing before publication. These are 1) how the parameters of each transformation were identified (see reviewer's 1 comment regarding "Auditory manipulation"), and 2) how the transformations applied to the recordings may have impacted the results (see the comment regarding "Stimuli transformations"). Reviewer 2 requested further clarification on 1) the sad response category for Japanese participants and the cultural norm of how sadness is expressed in Japanese, and 2) details on the task method, particularly the clicking error rate and data possibly excluded because of error in clicks. ============================== Please submit your revised manuscript by Feb 26 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Federica Cavicchio Academic Editor PLOS ONE Journal requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf 2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information. 3. Thank you for stating the following in the Acknowledgments Section of your manuscript: “All the data in this experiment was collected at the Centre Multidisciplinaire des Sciences Comportementales Sorbonne Université–Institut Européen d’Administration des Affaires (INSEAD), the University of Tokyo, and the Center for Information and Neural Networks (Japan). The authors thank Maël Garnotel, Gabriël Vogel, and Michiko Asano for their help with data collection. This work was supported by ERC Grant StG 335536 CREAM, Fondation pour l'Audition FPA RD-2018-2, ANR REFLETS and SEPIA, and JSPS KAKENHI Grant Number 20H05023 and 20K07718.” We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: “This work was supported by ERC Grant StG 335536 CREAM, Fondation pour l'Audition FPA RD-2018-2, ANR REFLETS and SEPIA, and JSPS KAKENHI Grant Number 20H05023 and 20K07718. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.” Please include your amended statements within your cover letter; we will change the online submission form on your behalf. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. Additional Editor Comments: Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. While the manuscript was overall found satisfactory, there are a series of requests for clarifications that need to be carefully addressed to clear out the remaining concerns of the reviewers. I think addressing these points will strengthen the MS. Reviewer 1 raised two points I think need addressing before publication. These are 1) how the parameters of each transformation were identified (see reviewer's 1 comment regarding "Auditory manipulation"), and 2) how the transformations applied to the recordings may have impacted the results (see the comment regarding "Stimuli transformations"). Reviewer 2 requested further clarification on 1) the sad response category for Japanese participants and the cultural norm of how sadness is expressed in Japanese, and 2) details on the task method, particularly the clicking error rate and data possibly excluded because of error in clicks. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Thank you for the opportunity to review your paper. This paper presents new research into cross cultural emotion recognition in voice using software to ensure consistency between stimuli. Consistency in vocal stimuli presents a significant challenge for emotion research involving the voice, and this paper is novel both in its findings and in demonstrating how the DAVID software can be applied allowing the experimenter more control over their stimuli. Overall, I think this paper will make an excellent contribution to the journal. However, there are several points that would benefit from revision/clarification before publication. I itemize these below using line numbers to hopefully make these points more easily identifiable in the manuscript. GENERAL POINTS REGARDING FORMATTING, EXPRESSION, CLARITY ETC. Line 42 – 43, additional brackets not required around citation “…(other-race effect; (Shapiro and Penrod 1986; Meissner and Brigham 2001)),…”. Same for citation in like 44. There are other places in the paper where additional brackets appear in in text citations. Recommend checking for these (seems to just be an artifact of the reference management software?). Lines 48 – 52, “These familiarity effects are thought to result primarily from perceptual learning: differential exposure warps an observer's perceptual space to better accommodate distinctions that are informative to discriminate common in-group items, with the result that comparatively less-common out-group items are encoded less efficiently …”. Recommend rephrasing this (perhaps splitting into two sentences?) to make the meaning clearer. Line 59 – 63, “…high or low, a given phoneme bright or dark, whether a specific pitch inflection is expressive or phonological…” recommend moving the citation to the end of the sentence. Line 61, “…all would appear…” recommend rephrasing to “…would all appear…”. Line 64-66, issues with the formatting of the in text citation, are Scherer etc all cited in Elfenbine and Ambady 2002, and Lukka and Elfenbine 2021? Recommend reformatting the citation to make this clear. Line 71, recommend changing “…but such evidence is mixed: Scherer and colleagues…” to “…but such evidence is mixed. Scherer and colleagues” so that Scherer begins a new sentence. Line 102, here the emotions examined are referred to as “happiness, sadness, and fear/anxiety” but in the rest of the paper “afraid” was sometimes used to describe the emotion of fear/anxiety. This terminology should be aligned. What terms were presented to the participants in the experiment? Recommend using these. For example, if afraid was the term used in the experiment (or its translation) then used afraid in the paper rather than fear/anxiety? Line 45, 73 and other places, recommend changing “e.g.” to “for example” throughout the paper. Line 87, would recommend moving “DAVID” out of the brackets to make it clear that this is the name of the software. For example, “…processing software called DAVID (Rachman et al. 2018) to ..” Line 132, remove bracket after “Table 1)…”? Line 203, “…and JP (N = 21, 1 excluded)…” is that 21 with the 1 excluded or 20, plus 1 excluded? Line 263, recommend rephrasing “…provide unequivocal evidence that…” to “…provide evidence that…” Line 297, missing bracket after citation. Line 300 – 301, check citation formatting with page numbers? Line 334, sometimes “suprasegmental” is used, and other times “supra-segmental” is used. Recommend standardizing throughout Line 381, recommend changing “6-12mo” to “6-12 months” Line 395, “More generally, we find it quite remarkable that something as basic as the ability to detect a +30 cents increase of pitch in normal running speech is influenced by one observer's familiarity with the sounds of a given language” Does this refer to a single participant’s ability to detect +30 cents? Or an average threshold across participants? Line 401 – 402, perhaps change “When already pitch changes are difficult …” to “When pitch changes are already difficult…”? Line 416, were the sentences taken from Russ Gur, and Bilker 2008? If so, consider stating this explicitly. Line 437, “In addition, we used four male and four female normal Swedish (SE) recordings from a previous study (Rachman et al. 2018).” Were these recordings of the same sentences recorded by the JP and FR speakers in this study? Line 443, not sure that the URL needs brackets here? Line 475, recommend replacing “complained” with “reported” Line 479, delete repeated word “recordings” Would recommend considering citing your dataset in the references too, following citation format such as that found at: https://www.nlm.nih.gov/bsd/uniform_requirements.html#item43. This may make it easier for your dataset citation counts to be captured. Lines 494 – 495, “Visual stimuli were displayed on a laptop-computer screen,…” was this the written out sentences? Recommend explaining what these visual stimuli were and, if the visual stimuli was the written out sentences, why they were used. SPECIFIC COMMENTS REGARDING METHODS, FINDINGS, INTERPRETATION Sentences: Was the neutrality of the sentences validated for JP and FR translations? For example, a pre test to check that the JP translation of “I forgot my jacket” was also considered neutral? If not, recommend stating this explicitly and discussing why a pre test was not conducted/if there is any reason to think the translations might be less neutral in JP/FR than in English. Auditory manipulation: Line 441 – 462, including Table 2: Would recommend moving the sentence beginning on line 456 “These values were based on previous studies…” to the end of the first paragraph (and rephrasing appropriately) to make it clear up front that the transformations have been validated already. If I understood correctly, I understand that the transformations described in were based on Aucouturier et al. 2016 and Rachman et al. 2018. I think it would be useful to state which transformations were drawn from which studies. For example, if I am understanding correctly, when I look at the transformations tested in Rachman et al. 2018 I see that the transformations used in the present paper don’t exactly match the low/medium/high levels tested in Rachman et al. 2018, Table 2. In this paper, pitch is shifted +50 cents for happy, which matches Rachman pitch shift for “high” happy, but in this paper sad is shifted -50 cents which does not exactly match any of the transformations from Rachman Table 2. Also, is there a reason that vibrato depth was not adjusted for male and female speakers in this study but it was in Rachman et al? It would be useful to explain why any deviations from the validated transformations were made. This could be addressed in the paragraph beginning line 446. Recommend also stating here that the categories of happy, sad, afraid were used due to the validation in Rachman et al (sorry if this is already in the paper and I missed it). Discussion: Lines 297 – 305, beginning “Second, irrespective of listener’s language,…” to what degree might this be related to the intensity of the transformations? Is it possible that the afraid and sad stimuli were not as intensely afraid and sad as happy stimuli? This relates to the point directly above and how the value of the transformations were selected. Stimuli transformations: It’s always possible that results might be impacted by the way stimuli were created. Given this study used computer software to transform voice for the stimuli, DAVID, it would be worth including a paragraph in the discussion exploring how the use of DAVID might have impacted results. For example, are there differences in perceived naturalness across the emotions which are an artifact of the transformations that may have interfered with the results? Do these results say anything about the role of loudness in emotion perception given loudness is not part of the DAVID transformation configurations? This would strengthen the paper and help the reader identify areas in which ecological validity could be improved. Instructions: Can you include in an appendix or in the OSF repository the actual study protocol and instructions? That is, the actual instructions given to participants in the respective languages? This might be useful in understanding the implications of the disclaimer beginning line 496-497 that “Response categories in the recognition task for the French group…”. It may help the reader understand the possible implications of this difference between groups. As noted in the present paper, the different translations of the emotion words might have impacted results (sorry if this is already included in the OSF repo and I have missed it). Reviewer #2: The paper investigates the effect of familiarity with a language phonology in emotion judgements, using audio processing software (DAVID) for testing speakers of different languages on identical acoustic cues. While most previous studies tested language familiarity effects using stimuli created by human actors, the present study uses stimuli that have the exact same modifications, and yet sound natural, to reduce biases which may be introduced by the variability of the human voice in the speech signal. The paper is clear, written in good English and well organized. The literature review is exhaustive. The study is scientifically sound, the experiments have been carried out in a rigorous manner, with an appropriate methodology, and an appropriate number of participants and stimuli. The data and the experimental results support the conclusions. I have a few comments/requests for clarification: @310-313: the authors talk about the lack of specificity of the sad response category and how the JP participants were comparatively further away from the cultural norm of how sadness is expressed in the JP language. This is very interesting, but somewhat obscure for people that are not familiar with this subject. Providing a more detailed explanation of this issue would benefit the reader and help them get a better understanding of lines 320-321. @489-491: "After hearing the two utterances, participants (all right-handed) were instructed to answer whether the second utterance sounded happy, sad, afraid, or neutral by pressing one of four keys (“S”, “D”, “F”, and “G”) with their left fourth, third, second, and first finger, respectively". This task may have been rather confusing and/or somewhat difficult for the participants: they had to memorize the association of the emotions happy, sad, afraid, or neutral, with 4 keyboard keys that had no meaning association with them, and had to click on the keys with 4 different fingers. Did the participants ever get confused? Did this impact the experiment in any way? Had some result to be excluded because of this? Please provide further details. @497-499: why was there a difference in the presentation of terms in the 2 language groups (i.e., the Fr were given English terms and the JP were given JP terms)? The reason for this difference in the procedure needs to be explained. Also, details need to be provided to let the reader know whether this difference affected the results in any way. Minor corrections in the text: @line 280: 'in' is missing in: accuracy seen Experiment 1 @line 402: in 'lost in translation' the initial 'l' is not in italic @line 479: The word recordings is repeated twice in: All sentence recordings recordings described... ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
Algorithmic voice transformations reveal the phonological basis of language-familiarity effects in cross-cultural emotion judgments PONE-D-22-22873R1 Dear Dr. Nakai, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Federica Cavicchio Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #2: My previous comments have been addressed. Thank you for the nice work and all best for your carreers ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #2: No ********** |
| Formally Accepted |
|
PONE-D-22-22873R1 Algorithmic voice transformations reveal the phonological basis of language-familiarity effects in cross-cultural emotion judgments Dear Dr. Nakai: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Federica Cavicchio Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .