Peer Review History

Original SubmissionAugust 22, 2023
Decision Letter - Elena Theodorou, Editor

PONE-D-23-20708Frequencies and functions of vocalizations and gestures in the second year of lifePLOS ONE

Dear Dr. Burkhardt-Reed,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Upon careful review of the text and the reviewers' comments, it is evident that it is a valuable addition to the field of research it pertains to and is potentially worthy of publication.You should focus on addressing the comments made by Rev. 1 regarding the introduction and taking into account the methodological issues raised by both reviewers. Additionally, the way the data is visualized needs to be improved. You are advised to refer to the Rev1 comments for more detailed contributions.

Please submit your revised manuscript by Dec 01 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Elena Theodorou

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Note from Emily Chenette, Editor in Chief of PLOS ONE, and Iain Hrynaszkiewicz, Director of Open Research Solutions at PLOS: 

Did you know that depositing data in a repository is associated with up to a 25% citation advantage (https://doi.org/10.1371/journal.pone.0230416)? If you’ve not already done so, consider depositing your raw data in a repository to ensure your work is read, appreciated and cited by the largest possible audience. You’ll also earn an Accessible Data icon on your published paper if you deposit your data in any participating repository (https://plos.org/open-science/open-data/#accessible-data).

3. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

Once you have amended this/these statement(s) in the Methods section of the manuscript, please add the same text to the “Ethics Statement” field of the submission form (via “Edit Submission”).

For additional information about PLOS ONE ethical requirements for human subjects research, please refer to http://journals.plos.org/plosone/s/submission-guidelines#loc-human-subjects-research.

4. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter."""

5. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

6. Please upload a copy of Figure 7, to which you refer in your text on page 30 (in PDF format). If the figure is no longer to be included as part of the submission please remove all reference to it within the text.

7. Please include a copy of Tables 1 to 3 which you refer to in your text on pages 18 and 21 (in PDF format).

Additional Editor Comments: 

Reviewers' comments:

Reviewer 1

The manuscript consists of a timely study that challenges gesture-first theories of language. The paper begins with a review of the literature, suggesting that gesture-first theories are in the ascendency. The authors then argue that on the contrary, it is more plausible that humans had selection pressure to use vocalisations more (compared at least to other primates). The main hypothesis of the paper is that if earliest language was gestural – we should expect extensive gestural use early in development, whereas if vocal – we should expect extensive vocal use in development. Comparisons between gestures and vocalisation (on any metric) are difficult, so I appreciate the authors descriptive framework (universal, non-social and conventional vocalisations/gestures). The sample size is quite small, but the observation time is densely sampled (60 mins total per participant across three sessions), so this is still very informative. The results suggest that (a) non-social vocalisations are far more frequent than non-social gestures (which does not change across the second year of life), (b) neither universal vocalisations and gestures were more frequent across the second year of life, however the tendency was that vocalisations decreased across the year and gestures increased, (c) conventional vocalisations were more frequent and increased across the second year of life, while conventional gestures were infrequent and did not increase/decrease over time, and finally (d) all types of gestures were more likely to be directed to caregivers (as indexed by gaze) than vocalisations across the second year. The paper is well written and important. I agree with the authors that clearly the claim that early gesture use exceeds vocalisation use in human infancy is unfounded in light of this (and other data). I especially enjoyed the discussion of action gestures – and share the authors concern that “true gestures” are often confounded with actions (that are not gestures) simply because they could one day be ritualised into gestures, but that does not mean that they are true gestures when first used by infants! The same concern might also be applied to chimpanzee gesture coding schemes (which often include manual actions, e.g., splashing, as gestures…). I do however have a number of concerns about the manuscript in its current form, detailed below. I think that all the issues I raise are surmountable and very much look forward to reading a revised manuscript. Introduction and literature review There is a missing step in the introduction. I agree that there are proponents of gesture-first theories (and there is a sense at which many in the field simply uncritically appear to accept it as a premise), but there also vocal detractors (excuse the pun). It would be of benefit to the introduction to include an extra section detailing the body of opinion and evidence already amassed that disputes gesture-first theories (in the Vocalization and Gesture in Communicative Evolution section) before launching into the evolutionary argument (Natural Selection of Communicative Signals). For example, there was a review paper suggesting that great ape data simply doesn’t support a gesture-first or vocal-first origin story (Frohlich et al., 2019 https://doi.org/10.1111/brv.12535). On the developmental side of things, there are other papers that provide evidence that would already count against gesture-first theories (e.g., McGillion et al., https//doi.org/10.1111/cdev.12671 ; Donnellan et al., 2020 https://doi.org/10.1111/desc.12843; McCune et al., 2020 https://doi.org/10.1177/014272372096682; Werwach et al., 2021 https://doi.org/10.1016/j.infbeh.2021.10158). Inclusion of a review of the literature that already supports the authors main contention would strengthen the manuscript. Universal vocalisations I am unconvinced by the justification in the current manuscript that crying, whimpering and laughter should be included as universal vocalisations. Universal gestures are defined with respect to communicative intent, however it’s not clear that laughter, crying and whimpering necessarily are produced with communicative intent (though it’s possible – and of course whether deictic gestures are always produced with communicative intent is debatable too [examples of children pointing for themselves, see Delgado, Gómez, & Sarriá, 2011: https://doi.org/10.1016/j.jecp.2011.04.010]). I am much more convinced that non-word conversational protophones should be included as universal vocalisations – both these and deictic gestures promote engagement with caregivers (and more richly, could be intended for this purpose on the part of the infant – though this is speculative [but often a speculation accorded to gestures only in the literature…]). I don’t quite see the parallels between deictic gestures with affective vocalisations – I do take the authors point that they have a modality-specific function (expressing affect -which gestures can’t do without a lexicon), but that doesn’t seem a good enough reason to group them together, especially because they are also grouped with vocalisations that might not have a modality-specific function (e.g., the conversational protophones – whose function [promoting engagement] could also be achieved gesturally [e.g., through showing: holding up objects to caregivers face, or even pointing at caregivers]). I think as it stands, universal vocalisations should consist either of crying, whimpering and laughter (to maintain the modality-specific function element) or conversational protophones (to capture the engagement element). As is stands, the justification is clearer to me that it should just be the protophones. Measures Sessions are approximately 20 mins (not exactly 20 mins). This makes it hard to interpret individual differences across participants. How close to 20 minutes are the sessions? If within a few seconds, this is fine. However, if discrepancies are significant, behaviours should be standardized as rates per minute (or rate per 20 mins if preferred) instead of as a raw frequency. The analysis should then be re-run with these standardized measures. Reliability Coding The first paragraph of Coder Agreement and Outcomes section is unnecessary – this is simply reporting reliability statistics from a previous study – not the current study – and it’s not clear why this is relevant. I suggest simply omitting this. The final paragraph concerning categorisation of vocalization and gesture into the three functional categories requires more quantitative reliability metrics. I agree (by eyeballing the plot) that this probably shows pretty decent agreement, however, given the way in which the data was coded, it should be possible to calculate kappa reliability (if I have understood the coding procedure correctly). I.e., for the instances in which both coders agreed there was a vocalisation or a gesture – did they agree on the function (i.e., did they both select an illocution code that would be categorized as the same function?). Overall though, it should be noted that reliability correlations currently reported are solid (and my impression is that the kappas will equally demonstrate this) – I have no doubt that this coding has been conducted meticulously, but there do need to be missing metrics included to confirm this. Analysis GEE does seem an appropriate method for the data, and likewise Mann-Whitney tests are appropriate in my understanding. The GEE models obtained need to be included in the paper in full (if even simply in tables in an Appendix). Data visualisation Given that the authors rely heavily on the visualisation of the data to do much of the work of inference (and correctly so I think!), the visualisations need work. I have a few suggestions: Distributions of data: Ideally the distribution of the datapoints themselves needs to be included (not just 95% confidence intervals and a mean). Numerous ways exist to do this, for example violin plots, half violin plots, and/or plotting the individual data points. E.g., in figures 3, 4 and 5, individual points (representing a participant) could be plotted in red at each time point for vocalisations and in blue for gestures, with summary statistics (mean/CIs) overlaid, perhaps in tandem with violin plots or similar. Model predictions. Equally, if GEE was implemented using an R package, it may be possible to plot model predictions (perhaps overlaid) on the figures automatically – which would be very informative. Of course it is possible to do manually, but this does require expertise. This is becoming much more of a standard approach to visualising data (i.e., raw scores – as you have here, overlaid with model estimates). This would really support the authors inferences from the data and I would strongly encourage that they attempt this. This would also remove the need for caveats to be included in the figure captions about what the model shows compared to what the figure seems to show (i.e., the confidence intervals relating to the the raw data not the SE/confidence intervals of the model predictions). Figures 2 and 5. For hypothesis 2 a pie chart is not informative – as the data comes from individual participants. Please plot the data so that the range of proportions of gestures/vocalisations combined with eye gaze across participants is clear (i.e., in the same way as figures relating to hypothesis 1 would be fine, y axis being proportion). As currently plotted, the discrepancy between the data and the model results (especially with regard to 20m timepoints) is hard to interpret (i.e., why there is a main effect of modality, not an interaction with age). Likewise for figure 2, though there are error bars, perhaps consider plotting the individual data points with summary statistics overlaid as suggested above (i.e., not as a bar chart), so the distribution of individual participant data can be observed. Data availability Currently the data for the study is not available. I agree fully that videos cannot be shared. It is perfectly possible however to have the data worksheet mentioned in the availability statement already uploaded into the OSF for example (it can be a private submission), with an anonymized link provided only for the purpose of review. Minor comments Fig 3. - Typo on y axis label (“Frequancy”). Also figure 3 error bars do not look correct and it’s a screenshot containing “plot area” in a box over the middle. I have suggested more substantial changes to figures above. Typo – in the Results paragraph headed “Hypothesis 1a”, the text refers to Figure 5 – this should be figure 3? Please include in the limitations section that this is still a relatively small sample size (in terms of unique participants). Please include line numbers in the next draft to facilitate navigating the text and providing feedback. When discussing possibility of vocalization to serve function of communication very early in development (e.g., page 2) consider citing Papaeliou et al., 2002 (10.1044/1092-4388(2002/024)) and/or Esteve-Gibert & Prieto 2012 (https://doi.org/10.1017/S0305000912000359 ) as these seem relevant too. Note, I could not see tables in my copy of the manuscript. Whether this is an editorial mistake or submission mistake, I’m unsure (I contacted the journal to ask, but didn’t get a response before submitting my review to the deadline). I look forward to reviewing the content of the tables in the next round of review.

Reviewer 2

INTRODUCTION

The literature review is relevant. The paper is clearly written with bibliographic references on par with latest published research. On page 14 the authors provide a clear description of the different types of gestures. I was glad to this this section because it provided a clear pathway to the information following the introduction.

I suggest that the authors provide the term typically developing (experimental group) earlier in the paper.  

METHODOLOGY

I am not clear on how toddlers were examined to exclude developmental/language challenges.  Dis children have any episodes of otitis media? Where they excluded? I mention this because under such circumstances even children and toddlers with neurotypical development use more communicative gestures to compensate for reduced vocal output and compromised hearing sensitivity.

Is there a rater-to-rater reliability score?  

RESULTS

No comments. This is a qualitative examination and data are clearly presented in a via descriptive non-param stats.

DISCUSSION

The authors describe the important performance and provide adequate explanation of the findings.  The writing is clear and the manuscript is well-organized.

The authors provide adequate description of findings which are combined with plausible interpretations.

The novelty of the study rests on the fact that the findings contribute and enrich the  database related to vocalizations versus gestures.

[Note: HTML markup is below. Please do not edit.]

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Please see attached comments (in pdf form). Overall, my review is positive, but there are revisions that need to be made in order to get the manuscript into publishable form. I look forward to the next round of review.

Reviewer #2: I would like to thank you for giving me the opportunity to review this interesting and well-written piece of work. As per your request, here is the review of PONE-D-23-20708 entitled Frequencies and functions of vocalizations and gestures in the second year of life”. My recommendation is “Acceptance with minor changes”. My comments are mostly related to methodological and stylistic issues. The paper focuses on the critical theme related to the importance of gestures vis a vis vocalization as early indications of the origins of language. It follows a previously conducted study, with the current paper adding and supporting evidence of their theoretical EVO-DEVO framework. The theoretical framework of observation highlights the fact that more than ¾ of gestures across these second-year data were deictics (e.g., pointing and reaching), acts that while significant in supporting the establishment of referential vocabulary in both spoken and signed languages, are not signs, but have single universal deictic functions in the here and now. In contrast, words and signs are functionally flexible, making possible reference to abstractions that are not bound to any “particular illocutionary force nor to the here and now”. This topic is long overdue given the scarcity of data available from such topics related to the origins of language.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachments
Attachment
Submitted filename: PLoS One Sep 2023.pdf
Attachment
Submitted filename: PLOS VOCALIZATIONS AND GESTURES.doc
Revision 1

All responses to specific reviewers and editor comments is provided in the attached "Response to Reviewers" document provided with our re-submission materials.

Attachments
Attachment
Submitted filename: response to reviewers.docx
Decision Letter - Elena Theodorou, Editor

PONE-D-23-20708R1Frequencies and functions of vocalizations and gestures in the second year of lifePLOS ONE

Dear Dr. Burkhardt-Reed,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jul 12 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Elena Theodorou

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #3: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Please see my attached comments for one final minor revision. Overall, again my review is very positive.

Reviewer #3: The paper in general is very well written. I do have a few considerations though, regarding methodology and the conclusions. My comments are shown below:

• Line 96: it has been reasoned that in the massive amounts of vocal explorations …

METHODOLOGY/RESULTS

• I have an issue with the inclusion of the category “social plauful” in Universal vocalizations. Although animal sounds or object sounds are echopoietic, they are still conventional across languages, so they are the product of learning/associations. I would have liked to know what the percentage of this category is in Universal Vocalizations across ages, and also I suspect that the fact that there is no significant difference between gestures and vocalizations in the Universal category is due to the fact that this subcategory was wrongly included in Universal instead of conventional. When this happens the difference should become evident.

• The fact that the two coders agree only at .8 and .65 regarding social/non-social interactions creates a problem. The percentage is very low. (line 523/524). This suggests highly ambiguous data and problematic categories. Can you please explain how you handled this problem?

• I cannot see any figures in the Revision 1 attachment. Seeing them in the initial submission, please add descriptions of the axes in all figures

DISCUSSION

• I also have an issue with the discussion about what constitutes a gesture and a general utilitarian movement. Also I would like to bring into this discussion the issue of development of fine motor skills which takes time for infants, even in the first year of life. In other words, gestures involve fine motor movements, which take longer to develop than gross motor movements and sounds (a bit like having protophones and language sounds). So although there are movements with the hands, they are not considered gestures, as the latter take long to be produced in a controlled manner.

• A final issue concerns a point acknowledged in the limitations section – there are no data from deaf children and this suggests that we cannot know whether the patterns observed are due to an evolutionary reason or whether a special emphasis on vocalizations is the result of communicative reinforcement from parent interactions throughout the first year. So if we are going to talk about an evolutionary perspective emerging from the current data, we need to account for the fact that parents respond to vocalizations from day 1 but not to gestures. So how do we know that the current data, and preference for vocalizations over gestures, are the result of a genetic/evolutionary basis of language and not just conditioning/reinforcement for sounds instead of gestures?

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: Yes: Loukia Taxitari

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachments
Attachment
Submitted filename: PLoS One Re-review 21_03_2024.docx
Revision 2

Response to reviewers is addressed in the document included in our resubmissions materials. We have also included the Author responses to the reviewer comments included below:

Reviewer #1: Please see my attached comments for one final minor revision. Overall, again my review is very positive.

Thanks for your comments and noticing the oddity in the figures. They have been corrected. We have provided an explanation for why the minimum whiskers cannot be displayed when the minimum is at zero.

Reviewer #3: The paper in general is very well written. I do have a few considerations though, regarding methodology and the conclusions. My comments are shown below:

• Line 96: it has been reasoned that in the massive amounts of vocal explorations …

Thanks for noticing the ungrammaticality of the sentence. We found and corrected it. The sentence now reads: “It has been reasoned that the massive amount of vocal exploration in human infants forms a foundation for language.”

METHODOLOGY/RESULTS

• I have an issue with the inclusion of the category “social plauful” in Universal vocalizations. Although animal sounds or object sounds are echopoietic, they are still conventional across languages, so they are the product of learning/associations. I would have liked to know what the percentage of this category is in Universal Vocalizations across ages, and also I suspect that the fact that there is no significant difference between gestures and vocalizations in the Universal category is due to the fact that this subcategory was wrongly included in Universal instead of conventional. When this happens the difference should become evident.

The reviewer deserves a big thank you for this comment. Table 2 had an error that misled the reviewer and would surely have misled other readers. In fact the “social playful” category for vocalization was eliminated from the Universal domain prior to our final sorting of data into categories, but we forgot to eliminate the mistaken line in Table 2. There were very few instances of animal or object sounds, but in the final analysis they were all assigned to the Conventional vocal group, because we noticed, like the reviewer, that they were at least somewhat language-specific and thus should be thought of as learned. Table 2 has been revised accordingly, eliminating the offending row. Since the data were actually sorted correctly, the relation between the amount of Universal vocalization and Universal gesture was reported correctly, in accord with the reviewer’s opinion on how the sorting should have been done.

• The fact that the two coders agree only at .8 and .65 regarding social/non-social interactions creates a problem. The percentage is very low. (line 523/524). This suggests highly ambiguous data and problematic categories. Can you please explain how you handled this problem?

The reviewer may know that kappa does not actually represent a percentage, but a special measure of agreement. The standard interpretation is that values between 0.01–0.20 represent slight agreement, 0.21–0.40 are seen as fair, 0.41– 0.60 as moderate, 0.61–0.80 as substantial, and 0.81–1.00 as almost perfect agreement. Both the kappa values are thus in the “substantial” range. Percent agreement is nowadays largely disregarded as a measure of agreement because it can be misleading if expected values in cells of a comparison are widely different, as in the present case—kappa corrects for such imbalances.

In our opinion, the most important agreement issue for our paper is the similarity of the distribution of identified categories across the two coders for the three domains (Non-social, Universal, Conventional) as portrayed in Figure 1. For gesture there were 170 items categorized with only 4 that were not agreed upon. For vocalization there were many more disagreements (73/365), but the outcome still corresponds to 80% agreement (which sounds better than .65 kappa, but that’s because kappa corrects for the imbalance of expected values). A 2x3 chi-square produced a wildly significant agreement (p < .0001) between the coders on vocalization categorization into the three domains. Whenever agreement is above chance, it is justified to conduct comparative analyses across groups of data, because if there are significant differences between the groups, there is good reason to trust them, since coder disagreement constitutes noise in the data and would produce Type II, not Type I error. In essence the more noise there is in the data, the harder it is to detect real group differences.

Finally, we hope the reviewer will notice our explanation in the main text about the primary source of the differences between coders regarding categorization of vocalizations into the domains. There is inherent difficulty in getting high agreement between coders on whether an infant vocalization is directed to an eliciting adult (whether it is non-social or social-interactive-universal) in a circumstance where the infant attention is varying moment to moment. This is not a fault of our method, but an inherent characteristic of the complexity in the actions of the infant and the parent during interaction, which involves eye contact, timing and other contextual factors, all of which are not only influencing each other but also varying moment to moment. One cannot and should not expect very high agreement among coders on all factors of judgments about infant intentions in such a circumstance. Still our agreement level justifies our analyses.

• I cannot see any figures in the Revision 1 attachment. Seeing them in the initial submission, please add descriptions of the axes in all figures

We do not understand why it has been so difficult for all the relevant files to be made available to the reviewers. You are not the first to mention the problem. We will do whatever we can to ensure you have all the files necessary.

DISCUSSION

• I also have an issue with the discussion about what constitutes a gesture and a general utilitarian movement. Also I would like to bring into this discussion the issue of development of fine motor skills which takes time for infants, even in the first year of life.

In other words, gestures involve fine motor movements, which take longer to develop than gross motor movements and sounds (a bit like having protophones and language sounds). So although there are movements with the hands, they are not considered gestures, as the latter take long to be produced in a controlled manner.

This suggestion has arisen in prior comments on the previous paper, as well as in presentations on our work. We have added a paragraph in the Discussion section explaining our opinion. The new paragraph reads as follows:

“It might be suggested that the higher rate of vocalization than gesture in early life can be accounted for by the fact that fine motor control of the arms, hands and fingers does not develop until later than phonatory control. But the suggestion in fact merely restates the finding rather than explaining it. Yes, the motor control of protophones occurs before the motor control of arms, but the appropriate question is why? We contend that an evolutionary explanation is needed and that the fitness signaling theory provides the necessary reasoning. In accord with the theory, it was and is because of the altriciality of infants in the human line (their slow and long-term development of head control, arm control, locomotion, and fine control of the extremities) that there has been selection pressure on them to signal their wellness effectively enough to elicit the long-term care they require because of altriciality. Phonation, according to the theory, long ago in hominin history became the target of that selection pressure, and in that way phonatory activity in humans became a precocial feature of hominins and thus laid a foundation for later developments leading to language. At a later point in evolution, complex gesture and ultimately sign language can be seen as having become, in accord with the reasoning, important secondary developments to the primary evolution of language in the vocal domain.”

The following additional reasoning might be useful as a response to the reviewer’s suggestion.

It is still not widely realized that infants are very active independently in production of phonation. As cited in the ms, even preterm infants not yet released from intensive care, produce highly differentiated phonatory patterns (Oller et al., 2019). In a sense, our earlier publication (Burkhardt-Reed et al., 2021) and those of others showing motor control of phonation proceeds faster than finger and hand motor control (that would be manifest if gesture were more prominent in early development) are the evidence that justifies the reviewer’s observation: phonatory development comes earlier than the fine motor control of gesture. The real issue is why fine motor control of phonation should develop faster than fine motor control of the hands and limbs. Notice that control of the supraglottal articulators comes later in development than phonatory control—canonical babbling, for example, begins in the second half year (Lang et al., 2019; Lohmander et al., 2020; Stoel-Gammon, 1992)—and it is not clear how to scale the rate of development of supraglottal articulation in comparison with that of fine motor control of the hands and fingers. What is eminently clear is that phonation control in humans comes earlier than either fine supraglottal articulatory control or fine motor control of the hands.

One might say that humans (in comparison with many other mammals and especially other primates) are very altricial with regard to such matters as fine finger and hand control, locomotion, head control, and posture. But they are very precocial with regard to phonation. In fact their control of phonation is not only precocial, it is largely unique among primates at any age—other primates have been found to have little if any voluntary control of phonation (Hauser, 1996). Training of any non-human primate to produce any phonatory vocalization on command has proven extremely difficult and success at any level may require years of operant conditioning (Hopkins et al., 2011). In contrast, training of supraglottal acts such as blowing raspberries or sticking the tongue out is easy. An evolutionary interpretation of these facts might be simply that the human line has been naturally selected to have not only the ability but the inclination to produce phonation and to explore it from birth. The fitness signaling theory offers an explanation as to how the human phonatory capability could have been naturally selected (Griebel & Oller, 2024; Locke, 2006). One might say that the overall altriciality of the human infant, requiring very long term provisioning and protection,was compensated for through natural selection of a way that infants could elicit long term care by vocal fitness signaling in the form of exploratory phonation.

• A final issue concerns a point acknowledged in the limitations section – there are no data from deaf children and this suggests that we cannot know whether the patterns observed are due to an evolutionary reason or whether a special emphasis on vocalizations is the result of communicative reinforcement from parent interactions throughout the first year. So if we are going to talk about an evolutionary perspective emerging from the current data, we need to account for the fact that parents respond to vocalizations from day 1 but not to gestures. So how do we know that the current data, and preference for vocalizations over gestures, are the result of a genetic/evolutionary basis of language and not just conditioning/reinforcement for sounds instead of gestures?

Indeed, we would love to make a report on vocal and gestural development in congenitally deaf infants. But the relevant data are not available to us.

On the conditioning suggestion, even if conditioning played a big role in early rates of gesture and vocalization (which seems doubtful, see next paragraphs), vocalization could be reinforced vastly more often than gesture simply because vocalization occurs shortly after birth at high rates while gesture is essentially absent shortly after birth. For gesture, there is virtually nothing to reinforce, while for protophones, there tend to be 4-5 per minute every waking hour (Oller et al., 2021).

But widely available data (especially since all-day recordings have been available) contradict the widespread belief that human infants vocalize largely in response to caregiver elicitation and that they learn vocalization (perhaps even phonation) through operant conditioning/parent reinforcement (Long et al., 2020; Long et al., 2019). We now know that the vast majority (more than 90%) of infant protophones are produced in the absence of any elicitation, and that they are very rarely directed toward caregivers in the natural environment of the home. There is no solid evidence that prior to the end of the first year, infants ever produce sounds on the basis of adaptation to caregiver vocalizations that are not themselves imitations by the caregivers of sounds the caregivers realize are already in the infant repertoires. The widespread belief in vocal imitation as the source of infant protophones is speculation and has no basis in fact that we know of. Experimental studies that claim to show infant vocal imitation prior to the end of the first year have been, we think, widely overinterpreted. In general what they show is that infants at various points across the first year can be experimentally induced to produce sounds already in their repertoires given vocal stimulation by sounds similar to those in their repertoires (e.g., Goldstein & Schwade, 2008; Kessen et al., 1979; Kugiumutzakis, 1999), but we know of no evidence of young infants being stimulated to produce sounds not in their endogenously developed repertoires.

So where do the infant protophones come from? They appear to be the products of endogenous exploration, overwhelmingly. When they do occur in face-to-face interaction, they show signs of adaptation to make them especially good signals of fitness (Long et al., 2022), but they do not show signs of imitation until the end of the first year, and even then only rarely. Vocabulary learning of course depends on vocal imitation, but that is a phenomenon that is extremely limited until the second year.

So, although the evidence is largely observational rather than experimental, it supports the idea that vocal development is overwhelmingly endogenous in the first year.

Of course there is no way to fully rule out the possibility that there is a role for differential reinforcement in the dramatic difference between rates of vocalization and gesture in early life.

If the editor thinks there should be a statement acknowledging a role for differential reinforcement cannot be completely ruled out, we will add it.

________________________________________

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: Yes: Loukia Taxitari

________________________________________

Burkhardt-Reed, M. M., Long, H. L., Bowman, D. D., Bene, E. R., & Oller, D. K. (2021). The origin of language and relative roles of voice and gesture in early communication development. Infant Behav Dev, 65, 101648. https://doi.org/10.1016/j.infbeh.2021.101648

Goldstein, M. H., & Schwade, J. A. (2008). Social feedback to infants’ babbling facilitates rapid phonological learning. Psychological Science, 19, 515-522.

Griebel, U., & Oller, D. K. (2024). From emotional signals to symbols. Frontiers in Pychology. https://doi.org/doi: 10.3389/fpsyg.2024.1135288

Hauser, M. (1996). The evolution of communication. MIT.

Hopkins, W. D., Taglialatela, J., & Leavens, D. A. (2011). Do apes have voluntary control of their vocalizations and facial expressions? In A. Vilain, J. L. Schwartz, C.

Attachments
Attachment
Submitted filename: Response to Reviewers.docx
Decision Letter - Elena Theodorou, Editor

Frequencies and functions of vocalizations and gestures in the second year of life

PONE-D-23-20708R2

Dear Dr. Burkhardt-Reed

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Elena Theodorou

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thanks again for your positive response to my (and the others reviewers') comments. I am fully satisfied that my only outstanding comment about the figures from the previous review has been addressed.

Reviewer #3: Thank you for addressing all my comments. I really like your paper! I would still like the conditioning discussion to be part of the text, as I'm sure other people will have the same question. It's a pity to leave it out. Thank you for a quick and fruitful correspondence.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: Yes: Loukia Taxitari

**********

Formally Accepted
Acceptance Letter - Elena Theodorou, Editor

PONE-D-23-20708R2

PLOS ONE

Dear Dr. Burkhardt-Reed,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Elena Theodorou

Academic Editor

PLOS ONE

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .