Peer Review History
| Original SubmissionJune 27, 2022 |
|---|
|
PONE-D-22-18218Continuous speech with pauses inserted between words enhances cortical envelope entrainmentPLOS ONE Dear Dr. Deoisres, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jan 19 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Michael Döllinger, Ph.D. Academic Editor PLOS ONE Journal requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The manuscript reports an EEG experiment investigating the role of added pauses on the cortical tracking of speech. Specifically, as the neural measure for speech tracking, the authors focus on the reconstruction of the speech envelope from the EEG signal (forward/decoding multivariate Temporal Response Function model). The results show that added pauses yield higher envelope reconstruction values compared to natural speech, a finding that seems to be primarily driven by word onsets. - Terminology: entrainment vs tracking Considering recent debates in the literature (see refs below), I would suggest the authors to use the term “(cortical/neural) tracking of speech” instead of “entrainment”. Obleser, J., & Kayser, C. (2019). Neural Entrainment and Attentional Selection in the Listening Brain. Trends in Cognitive Sciences, 23(11), 913–926. https://doi.org/10.1016/j.tics.2019.08.004 Meyer, L., Sun, Y., & Martin, A. E. (2020). “Entraining” to speech, generating language? Language, Cognition and Neuroscience, 35(9), 1138–1148. https://doi.org/10.1080/23273798.2020.1827155 - Goal of the study: At the beginning of the Introduction, it is mentioned that the goal is to provide the mTRF counterpart of ALRs, while later (lines 137 – 139) the goal shifts to how added pauses modulate the cortical tracking of speech (in terms of forward mTRF). From my understanding, the latter seems to be the main focus throughout the whole manuscript, while the former is barely addressed later on. Could the authors revise this point and provide an explicit and consistent description of their goal(s) in the Introduction? By doing so, the authors could also elaborate more on the theoretical motivations of their goal beyond just its empirical justification. - Expected results: The authors hypothesize stronger cortical responses with added pauses (lines 139-140). However, they do not provide any explanation for this expected result, which might seem even contradictory considering the previous findings reported above (Kayser’s and Hambrook’s studies). Could the authors provide a clear motivation for the mentioned expected results in the Introduction? - Rationale of mTRF approach: Although the authors do specify in the methods section that they will follow the decoding/backward mTRF approach, the terminology used in certain parts of the manuscript is not unambiguous in this respect. In particular, the authors often refer to higher envelope reconstruction correlations as “stronger/enhanced cortical responses”, which might wrongly suggest that encoding/forward mTRF models were used (e.g., referring to higher TRF weights). Moreover, considering that envelope reconstruction correlations could be reflecting phase rather than (only) amplitude alignment between both signals, using terms such as “stronger” or “enhanced” might be misleading. Based on these issues, I suggest the authors (i) specify in the Introduction how mTRF models will be implemented and explain their main rationale (e.g., what does higher envelope reconstruction mean in terms of neural responses?) and (ii) revise how the (interpretation of the) results is conveyed throughout the manuscript. - Stimuli: The method section reports that stimuli were modified by adding short/long pauses between words. Which linguistic (e.g., orthographic/phonological/phonetic?) criteria were used to qualify a speech unit as a word? For example, were contracted forms (e.g., “we’re” instead of “we are”) considered as one or two words? This seems quite relevant considering the use of natural speech, which probably contained many connected speech parts. Providing detailed information about stimuli preparation would be very helpful to better understand the manipulation as well as considering future studies. Also, would it be possible to have access to some of the stimuli? Considering the temporal nature of the manipulation, it would be very informative to provide the power spectrum of the stimuli (e.g., Fast Fourier Transform of the speech envelope) as usually done in neural tracking studies. If there were power differences in the speech envelope across conditions, how would they relate to the reported neural tracking findings? - Behavioural responses: The results from the behavioural task (comprehension questions) are not included in the manuscript. Could the authors report them? Also, could the authors somehow relate the patterns from the behavioural responses and the envelope reconstruction correlations? - Language comprehension/intelligibility: Although the authors sometimes make some reference to intelligibility issues, it would be interesting to elaborate more on this when explaining the results of the current manuscript. How does the current results relate to the mentioned mixed findings in the literature regarding the relationship between neural tracking and intelligibility? Addressing this important issue would also increase the theoretical implications of the reported results. - Onset vs non-onset comparison: In the methods section “Extraction of speech envelope”, it is said that onsets were taken from the first 150 ms of each word. Because of this approach, relevant linguistic aspects could significantly differ across words, which could somehow impact the reported results. For example, uniqueness point (the specific time point at which that word can be unambiguously identified), informativeness (e.g., how surprising/entropic is that word at that time in terms of phonetic/lexical information?) or syllabic chunking (does that speech segment corresponds to a single syllable or covers partial syllable(s)?) might vary considerably across words. From my perspective, these potential confounds represent limitations for the onset vs non-onset comparison as well as the interpretation of such results. In addition, in lines 402-408, the authors mention a few control analyses/measures for the onset vs non-onset contrast. Could the authors report the results of these control analyses/measures? Reviewer #2: Deoisres et al. explored how inserted pauses between words in natural speech stories modulate neural encoding/decoding of speech signals using a mTRF framework. The results indicate that the encoding/decoding performance is increased by inserted pauses between words. I found this study interesting. It is interesting to know how the encoding/decoding performance is changed after one inserts pauses between words or slows down normal speech. For hearing-impaired or elderly people, they can process speech better if the speech is slower, which is probably their encoding of word onsets becomes better. I am positive about this manuscript and here are my comments: 1.The authors do not clearly explain why the study was done. It is difficult to understand the meaning of the study for readers who are not familiar with this field. The interpretation of the data results is not very comprehensive, only the data results are presented, but the interpretation is very limited. More details can be added so that uninitiated readers can find this manuscript more accessible? 2.It seems to me that: The authors seem to confuse, sometimes, between the mTRF technique and the continuous speech processing paradigm, and it is felt that the authors did not express the difference between the two clearly. Can the authors make this difference more clear? 3.Can the authors spell out ALRs when it first appeared in the manuscript? 4.Can the authors elaborate more on how they went from ALRs to mTRF? I understand this logic from my knowledge, but, again, the authors seem to assume that most of readers should understand this, which seems hard just by reading the manuscript. 5.The total duration for continuous speech stimulus for the no pause, short pause, and long pause conditions were therefore approximately 10 minutes and 20 seconds, 16 minutes, and 21 minutes and 42 seconds, respectively. Does the length difference here affect the encoding/decoding performance? The amount of data sometimes affects values of correlations. Can the authors provide some thoughts on this? 6.By the authors' own description in their article, the growth multiplicity of the story has exceeded 0.5 under long pause conditions, which significantly affects the intelligibility of the story, and why the response of mTRF is enhanced under such speech that affects intelligibility, the authors need to explain more on this point. 7.Line 139. “We hypothesize that the cortical responses to continuous speech would become stronger when pauses are added to the stimulus, especially the responses following onsets. “ I haven’t quite understood this hypothesis. Maybe it is my negligence, but if the authors can make this hypothesis easy to understand, that would help readers in my opinion. 8.I do not understand the application implications here, and perhaps the authors could provide more detail on the value of comparing the mTRF from different speech corpora. Minor comments 1 The authors lacked a detailed explanation of what mTRF detection time is and the relevant references. 2 ‘mTRF was designed to measure the strength of envelope entrainment. ‘ The mTRF does not seem to be specifically designed to measure the strength of envelope entrainment, but has a broader use and more general principles, and if the authors insist on this opinion, they should add relevant references. 3 ‘As it is known that the EEG can entrain to the envelope of speech [7], the mTRF was designed to measure the strength of envelope entrainment. ‘ [7] This article is still based on research under the traditional ERP or isolated stimulus paradigm, and there are many references to mTRF paradigm research. 4 ‘Randomization was repeated 100 times to construct the null distribution of Pearson's correlation coefficients. ‘ The 100 times of permutation test is a bit low, and it is recommended to try more permutations (500 or 1000) to obtain more stable results. 5 The results of all paired tests might be better presented together with the figures, as two separate presentations may make it difficult for the reader to read and integrate all statistical results. Just a thought. 6 There are several typos and some sentences do not sound right. It would be nice if the authors proofread the manuscript before resubmission. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
PONE-D-22-18218R1Continuous speech with pauses inserted between words increases cortical tracking of speech envelopePLOS ONE Dear Dr. Deoisres, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jul 15 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Michael Döllinger, Ph.D. Academic Editor PLOS ONE Journal Requirements: Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors have addressed my comments from the previous manuscript. However, I still have a few further comments. - Modulation spectrum of the stimuli (figure 2): The 3 conditions show similar modulations around the theta-band range (4-8 Hz), but it is also evident that the 2 manipulated conditions show an additional peak in the delta-band range (< 4 Hz) mirroring their respective pause durations (lower frequency peak for long compared to short pauses). Although this is most probably an inevitable consequence of the fixed-duration pause manipulation, currently it remains unclear to what extent these delta-band acoustic modulations might impact the reported results. For example, could this explain why delta-band effects seem generally higher than theta-band effects? I am not necessarily asking for additional control analyses, but I think it is a fair point to be addressed in the manuscript. - About the hypothesized effect of pauses (only based on Hamilton et al.’s findings), another recent article that could also support this hypothesis partially: Chalas, Nikos, et al. "Speech onsets and sustained speech contribute differentially to delta and theta speech tracking in auditory cortex." Cerebral Cortex (2023). https://doi.org/10.1093/cercor/bhac502 - Lines 462-464: “However, it may be difficult to clearly quantify whether the cortical tracking of speech envelope when pauses were inserted to speech is dominated by the encoding of acoustic information or not.” I think this affirmation could be rephrased as future work rather than as a problem difficult to solve, considering the possibilities offered by the mTRF approach. In particular, encoding models with different sets of acoustic and/or linguistic regressors are commonly used in the literature to dissociate lower-level (e.g., acoustic) from higher-level (e.g., lexical or semantic components related to comprehension) contributions. - Lines 561-562: “Cortical responses to natural speech in the delta band appear more strongly linked to non-onset segments, while theta band responses are more related to onset segments.”. This contradicts the lack of significant differences in the natural speech conditions reported in Figure 4. - Lines 564-567: “The results could be exploited in the clinic in detecting cortical tracking of speech envelope or to onset segments in speech more effectively, where the sensitivity to acoustic onsets could be an indication of how well a person can segment phrases and sentences during speech comprehension.”. I think this somehow contradicts the previous discussion about the uncertainty on the reported findings actually reflecting comprehension-related processes (like syntactic phrasing in this case). Also taking into account the reduced speech naturalness due to the implemented manipulation, what are the clinical implications of this study? - Minor comment: Figure numbers are missing in the manuscript. Also, figures 3 and 4 show the same results (with and without significance asterisks, respectively). Reviewer #2: I would like to thank the authors for their time and effort. My concerns have been addressed and I have no further comments. Congratulations! ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 2 |
|
Continuous speech with pauses inserted between words increases cortical tracking of speech envelope PONE-D-22-18218R2 Dear Dr. Deoisres, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Michael Döllinger, Ph.D. Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: |
| Formally Accepted |
|
PONE-D-22-18218R2 Continuous speech with pauses inserted between words increases cortical tracking of speech envelope Dear Dr. Deoisres: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Michael Döllinger Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .