Peer Review History
| Original SubmissionFebruary 14, 2020 |
|---|
|
PONE-D-20-04410 Absolute Pitch as a Latent Trait PLOS ONE Dear Dra. Germano, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. I found the manuscript interesting in that it tries to evaluate if absolute pitch is a trait or a categorical variable. The choice of methodology is appropriate, but the reporting is lacking in quality. The comments from the two reviewers are very constructive and should enable you to improve the manuscript. We would appreciate receiving your revised manuscript by Jun 15 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript:
Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Karl Bang Christensen, Ph.D. Academic Editor PLOS ONE Additional Editor Comments: Please provide a revised version addressing the comments from the two reviewers, who are very constructive and have provided comments that should enable you to improve the manuscript. The choice of methodology is appropriate, but the reporting is lacking in quality. You must address test of validity much more rigorously in a revised version. Evaluate fit of a one-dimensional CFA model (reporting chi-square, df and P-value). If you also waht to report the RMSEA with corresponding confidence interval or other indeces of close fit that is OK. The figures are attached on their own with no legends or titles. Some of them are not even mentioned in the main text of the paper. One example line 145 states 'fig. 1', but the next line appears to discuss fig. 2? Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at: https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. Please modify the title to ensure that it is meeting PLOS’ guidelines (https://journals.plos.org/plosone/s/submission-guidelines#loc-title). In particular, the title should be "specific, descriptive, concise, and comprehensible to readers outside the field" and in this case it is not informative and specific about your study's scope and methodology. When modifying the title please be sure to amend both the title on the online submission form (via Edit Submission) and the title in the manuscript so that they are identical. 3. Your ethics statement must appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please move it to the Methods section and delete it from any other section. Please also ensure that your ethics statement is included in your manuscript, as the ethics section of your online submission will not be published alongside your manuscript. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: No ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: No ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: No ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: No ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: PONE-D-20-04410 This is an interesting application of modern psychometric methods. The comparison of IRT models and latent class models appear well suited for the theoretical problem posed. I have some suggestions for the data analysis and presentation of results. 1. You should present some basic descriptive information, e.g. the frequency distribution for each item (e.g. Perfect, Imperfect, Wrong), and the frequency distribution of a simple sum of the items (or two sums according to your two approaches). This allows the reader to get some sense of your data. 2. You estimate a 2-parameter and a 3-parameter IRT model. However, it is unclear which of these models you deem is having the best fit. Also, it was unclear to me, which of these models you compare with the latent class model and for which IRT model you present the score distribution in figure 5. Please clarify. 3. Fit is evaluated through a global chi-square test. However, chi-square tests with app. 1000 DF are not optimal. You report χ2(991) = 6635.882, p-value =0.999 (line 239). This must be a typo, if the chisq value is correct, p=0. I suggest you do two things. 3a. For global tests of fit and comparisons of the 2-P and 3-P models, use AIC and BIC as in your comparison with latent class models. 3b. Evaluate item level fit, e.g. by the item fit tests suggested by Orlando and Thissen (Applied Psychological Measurement 2000). Such tests are available in the IRTPRO software and the free R package mirt (https://cran.r-project.org/web/packages/mirt/mirt.pdf). Such item based fit test may identify some stimuli that are not well modeled by IRT. 4. Your IRT parameter estimates have very large standard errors, in particular for the discrimination parameter in the 3-P model. Difficulties of estimating the discrimination parameter in 3-P models is a known problem, but I am still concerned. You may want to use a prior for the discrimination parameter in addition to a prior for the guessing parameter. 5. I am also a bit worried about the magnitude for the guessing parameter for some items. For pure guessing, you would expect a guessing parameter around 1/12 = 0.08. Is the any theoretical that some items would have a lower asymptote of 0.3? 6. In choosing between latent class model, you argue that the improved fit of the models with more than 2 classes should be ignored, due to the complexity of these models. However, in comparisons between IRT and latent class models for the perfect approach, you suddenly argue that a difference in fit of the same magnitude is important and should be interpreted. You cannot have it both ways. If you compare e.g. a 3 class model with the IRT model, you get the following results: AIC D AIC 3CL BIC D BIC 3CL SSABIC D SSABIC 3CL Perfect 6988.887 6952.444 7082.150 7101.664 7018.640 7000.048 This comparison show no particular superiority of the IRT model in terms of fit. The same reasoning could be applied for a 4-CL or a 5-CL model. You may want to keep a 2-CL model in the comparisons for conceptual reasons, but you should include a latent class model with better fit (e.g. 3 classes or 4 classes). Based on the results I have seen, I would conclude that for the “perfect” approach, a latent class model with 3, 4, or 5 classes has equally good fit as an IRT model. For the “imperfect” approach, the IRT model seems clearly better. 7. You present the score distribution for one IRT model, presumably for the “perfect” approach (the text in lines 272-273 seems to have the numbering of figures wrong). The score distribution is clearly skewed. A large group of people seem to have the same level of ability to identify the pitch of a tone (i.e. no ability). This may pose a problem for standard IRT model estimation, since a normal distribution is assumed for the latent trait. For this reason, a better model for your data may be a latent mixture distribution model with 2 latent classes. Class one consist of persons who are not able to identify the pitch regardless of the stimulus. Class two consists of persons with at least some ability to identify pitch. Within class two responses might follow a 2-P IRT model. Such a model can be estimated in Mplus. It is fairly complex, but you may want to at least discuss it. 8. With regards to the psychometric lingo, I would suggest that : 8a. “Psychometric” is better than “Psychometrical” 8b. “Latent variable models” is better than “structural equation model”. SEM refers to a particular type of latent variable models, a type you do not use in your analysis. 8c. “Continuous” is better than “Dimensional” Specific suggestions: Line 24. I suggest “Through Latent Variable Models (LVM) we can evaluate consistency validity…” Line 27. I suggest “… two LVM approaches: continuous latent variables (LV)…” Line 37, I suggest “The phenomenon of absolute pitch (AP)…” Line 65. It might be helpful to define relative pitch. Line 115. Is there a reference regarding the use of a combination of AP and RP to identify pitches? Line 159. I suggest “…continuous and categorical.” Line 160. I suggest writing “The former approach, Item Response Theory (IRT), …” Lines 166-170. I suggest writing: “a) An IRT model with two parameters for each stimuli: the discrimination parameter (also called parameter a), which describe the ability of this stimuli to distinguish between persons with low and high pitch identification ability and the item location parameter (also called parameter b), representing the level of pitch identification ability where you have 50% chance of correctly identifying the pitch of this stimulus.” Lines 172+173. I suggest writing: “… is the probability of a person with very low pitch identification ability still correctly get a correct answer for a given stimulus. Reviewer #2: The ultimate aim of this research project is not explicitly stated, and it remains a little unclear. The authors have created a new test of assessing absolute pitch, and this was investigated using an item-response theory (IRT) approach and a latent class analysis (LCA) approach. These models were then compared to see which offered the best fit. Although the foundation and rationale for the study seems reasonable, in that the authors wish to create a measure to determine whether individuals have the ability to identify absolute pitch, the applied methodologies are confusing and it should be better explained as to why they are appropriate. • Latent Class Analysis categorises people into groups, under the assumption that the same thing is being measured for all people, by a standardised count, process, or measurement device/scale. • IRT is used to determine whether a set of items are delivering a valid total score of an unobservable latent trait. Thus it should be explained in more detail why the fit of these models should be compared as they look at different things. It should be emphasized that the categorisation of people relies on the measurement process being valid and stable, so in some sense LCA should not be considered until the measure has been validated. For the IRT scale assessment approach, there are also many aspects that have been neglected. There is no indication of item fit. No investigation of response dependency. No assessment of reliability or targeting. Was a single parameter model considered? - A single-parameter (Rasch) model would be appropriate for scale development and validation purposes, and for determining whether a total score from a set of items is a sufficient statistic to assess the level of a latent trait. Additionally, for the ‘imperfect approach’, it may be worth the authors considering a partial-credit model, where an exact pitch classification is awarded a score of 2, a semitone deviation is awarded 1, and all other pitches are scored 0. There are also some further issues within the manuscript that would need attention: The model fit statistics are dubious, and there is no real interpretation of the fit statistics that are presented. Certainly a test with 1000 degrees of freedom will have no statistical power. In the manuscript, it is stated that items are centred around the 0 location, but there are no item locations reported below 0 – where are they centred? The pitch test is based across different musical instruments – have these instruments been calibrated? Has the pitch been externally verified in some way? Additionally, the manuscript is currently in need of a language edit and the Figures are incorrectly labelled. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Jakob Bue Bjorner Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
PONE-D-20-04410R1 A new Approach to Measuring Absolute Pitch on a Psychometric Theory of Isolated Pitch Perception: Is it Disentangling two Groups or Capturing a Continuous Ability? PLOS ONE Dear Dr. Germano, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Thank you for your careful revision. I agree with the reviwers that the manuscript has improved. The comments from the reviewers on the revised version illustretes that more work is needed. I agree with comment that the manuscript is currently too long and too difficult to read. I am also concerned about the results you obtain from the 3PL model. The very large standard errors, and the large guessing parameters makes me worry that these results cannot be trusted. One way to make the manuscript better would be to put less emphasis on these results. Please carefully consider all the points raised in the attached reviews and submit your revised manuscript by Oct 23 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols We look forward to receiving your revised manuscript. Kind regards, Karl Bang Christensen, Ph.D. Academic Editor PLOS ONE [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: (No Response) Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: No Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: I Don't Know ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: No Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Thanks for the responses to my previous comments. I find that your revisions have clarified the data analyses, but there is still some way to go. You have two scoring options for assessment of isolated pitch perception: a. Perfect: only the correct note is classified as a correct response, b. Imperfect: including the half-note below and above. You compare 1. IRT analyses using either a 2-PL or a 3-PL model, 2. Latent class analyses, 3. Mixture model analyses. You conclude that perfect scoring is best fitted using an IRT model, while imperfect scoring is best fitted with a two-class latent class model. 1. I understand that your choice of model is guided by the global indices such as BIC and AIC. However, from a conceptual point of view, this conclusion does not make sense to me. If you conclude that isolated pitch perception with perfect scoring represent a continuous latent trait, how could the use of a less stringent scoring criterion suddenly change this ability to two latent classes? It seems to me that this interpretation of results is theoretically incoherent. I think you need to discuss this and make an overall interpretation whether isolated pitch perception is best considered a continuous trait or two latent classes. 2. Related to the discussion above, it seem to me that the latent class solution for imperfect scoring has some unfortunate interpretation. Even the best group has only a 67% chance of getting item i right ad 72% change of getting item j right within +/- a half tone. This does not concur with the common understanding of absolute pitch. Maybe the absolute pitch group is only a subgroup within the current best latent class. In that case, you may need more than two latent classes. Please discuss whether your current model is theoretically plausible. 3. For perfect scoring, you find that an IRT model represent the best model for the data. However, for perfect scoring, evaluation of item fit finds significant misfit for item c and item d. This suggests problems for these two items as indicators of the latent trait. This is not discussed, but should be. This plots and fit tests do not suggest that a 3-PL model provides a better fit to the data than a 2-PL model. 4. I continue having concerns about your 3-PL model. The discrimination parameter is very high for item b and so it the guessing parameter. Also, the standard errors of the discrimination parameter are high. The BIC suggest that the 2-PL model might be the best. You should conclude which model you regard as the final model. 5. You aim to develop a test for isolated pitch recognition. I assume that part of this development is to decide whether your test is best scored using the perfect or the imperfect approach. I think you should provide your recommendation and the reasoning behind it. 6. The paper is a long and complex read because some many combinations of options are examined. You would increase readability by focusing the main paper on what you consider the best solution and present other options as supplemental analyses. For example, you could focus on perfect scoring, use the 2-PL model as your IRT model to be compared with a latent class and a mixture analysis. Other options could be alluded to briefly and results for these other models could be presented in a web appendix. I think such an approach would make your paper much more readable. 7. You write that the differences in item difficult and discrimination poses difficulties for a simple sum score approach. However, items may be summed without problems even if they have widely different item difficulty. For example, in the Rasch model (where all items have the same discrimination, but may differ in item difficulty) the sum score is a sufficient statistics for the latent trait. So the real issue is whether the items very so much in item difficulty that a simple sum is inappropriate. For the 2-PL model and perfect scoring, I do not think this is the case. Item discrimination varies between 1.2 and 1.9, not a dramatic variation. It is possible that a Rasch (i.e. 1-PL model) may fit these items. Please revise this discussion. 8. While the analyses seems to be well done, the interpretation and discussion of results could use input from English language researchers with psychometric / IRT expertise. Some description of the models is not well structured (e.g. the discussion of IRT models on lines 205-219). Also, while most of the paper is well written, some parts of the psychometric discussion still deviates for the normal language of the field, e.g. in the abstract [my suggestions in square brackets]: “We decided to adopt a psychometric perspective, approaching AP as a latent trait. Via Latent Variable Model (LVM) we can provide [evaluate] consistency and validity for a measurement [measure] to test for AP ability. A total of 783 undergraduate music students took part in the test. The battery test [test battery] consisted of 10 isolated pitches.” Reviewer #2: I would like to thank the authors for responding to the reviewers’ requests and making appropriate amendments to their paper. They have clearly invested time and effort into this, and I believe that the manuscript is now presented better and is much clearer in terms of the purpose of the paper and the process that has been carried out. There are a few additional amendment suggestions that I have, and these are provided below: I would suggest that the title is amended to state ‘specific groups’ or ‘ability groups’ rather than ‘two groups’. As you are using LCA, it is not known a priori how many latent classes will be identified. The Intro reads well, provides good background and makes sense. However, this is an exploratory study, to see how the items in the AP test work among the group tested. Do the items work together to form a measure of an underlying latent continuum (IRT)? Or do they work better as a set of indicator items that can classify people into groups (LCA)? I would suggest that this may be clarified for readers if the authors were to provide a statement in both the abstract and the introduction to state that this is an exploratory data modelling study, to determine which type of model best fits the data for the tested sample. For the IRT analysis, there is still no test of local dependency among the items. This is perhaps unnecessary for the purpose of the current study, but perhaps it could be identified as a potential limitation, or the authors could suggest that it could be assessed in future work if a latent trait IRT approach is pursued further. The authors state ‘Under Maximum Likelihood estimator and using logit parameterization (theta), the constant 1.7 in the logit gives only an approximate closeness to the normal. The translation to IRT parameter values uses factor mean and factor variance to bring them to the N~(0,1) metric used in IRT.’ To clarify this for the reader, I would suggest that the authors might also add a sentence to state that this means that the IRT analysis is centred on the person sample being at 0 logits, and that the item difficulty parameters are provided relative to this. A few additional very minor corrections are as follows: Line 189 states that MPlus is used. R also needs to be added here. Line 287. I believe this should say difficulty rather than discrimination. Line 409. Should be timbres rather than timbers. A list of abbreviations would also be useful, so that the reader can refer back to them without scrolling through all of the manuscript to find the relevant abbreviation. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Jakob Bue Bjorner Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 2 |
|
A new approach to measuring absolute pitch on a psychometric theory of Isolated Pitch Perception: Is it disentangling specific groups or capturing a continuous ability? PONE-D-20-04410R2 Dear Dr. Germano, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Karl Bang Christensen, Ph.D. Academic Editor PLOS ONE Additional Comments: Please edit the manuscript according to these helpful comments from the two reviewers listed below. Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Thanks for the responses to my previous comments. I find that the manuscript is further improved. I only have some suggestions for improvement of language. 1. Line 193. I suggest writing “and the R program” and cite e.g.: R Core Team (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/. 2. Line 194-195. I suggest writing: “The former Item Response Theory (IRT) approach …” 3. Line 198: I suggest writing: “Two different IRT models were used” 4. Line 223: I suggest writing: “Pearson’s X2 (S-X2) implemented in the R package mirt, as per Orlando and Thissen [38]” 5. Line 263, I suggest writing: “given that there were three approaches to statistical modeling…” 6. Lines 315-321: I do not agree with your comments on figure 2 and 3. There are indication of both overestimation and underestimation for low score levels. I suggest just stating: “For items c and d – scored with the criterion of perfect rating – misfit is illustrated by comparisons of predicted and observed proportion of correct results (Fig 2 and 3). In particular, higher than expected proportion of correct answers are seen for theta scores a little higher than 1 and for theta scores a little lower than -1.” 7. Line 417, I suggest writing: “Based on model fit information, we conclude that the continuous…” 8. Line 446, I suggest writing: ”Moreover, in a two-parameter IRT model for the perfect scoring approach, all the items showed…” 9. Line 454-455, I suggest dropping the first part of the sentence and just write: “When comparing LCA to IRT …” 10. Line 472-476, You write “If there are participants who are near infallible in isolated pitch recognition tasks, their prevalence will be reduced as the scores increases (i.e., the higher the score, the lower the number of subjects endorsing all the stimuli correctly). However, under the imperfect approach, all the participants that committed semitone errors were separated from the group that committed more broad errors. More research is necessary to examine the causes for the differences in the underlying models” . This can be misunderstood. I suggest writing: “Using the perfect scoring approach, 1.1% of participants had all items correct. According to the IRT model, these participants would be expected to have greater skills in isolated pitch recognition tasks than participants with lower numbers of correct responses. In contrast, for the imperfect scoring approach, the LCA model assumes that 20.9% of participants have high skills in isolated pitch recognition tasks. Within this group further differentiation in skills cannot be made. The 4.9% who had all 10 responses correct using the imperfect scoring approach were just luckier than the remaining 16% in the high-skill group. More research is necessary to examine the causes for the differences in the underlying models.” Reviewer #2: The authors have addressed all of my comments, and I would like to thank them for considering the suggestions of the reviewers. I have no further amendments to request, except some minor editing changes as listed below: p.8 R is now mentioned – does this need a reference or software version number? Line 308 states: ‘Table 4 shows the items level fit.’ Suggest this is changed to: ‘Table 4 shows item-level fit’ Line 324 states: ‘This table provides the item level for each item...’ Suggest this is changed to: ‘This table provides the item-level fit values for each item...’ Lines 340-342 state: ‘Considering the perfect approach, the lowest BIC was in favor of an IRT with two parameters. However, for the imperfect approach, the lowest BIC was in favor of an IRT with three parameters.’ Suggest this is changed to: Considering the perfect approach, the lowest BIC was in favor of an IRT model with two parameters. However, for the imperfect approach, the lowest BIC was in favor of an IRT model with three parameters. Line 345 states: ‘Therefore, for both perfect and imperfect models, we concluded that the two-parameters models fit better than three-parameters models’ Suggest this is changed to: Therefore, for both perfect and imperfect models, we concluded that the two-parameter model fits better than three-parameter model. Line 404 states: ‘Notably, the red group did not achieve 1, indicating a 100% probability of answering correctly for a giving stimulus’ Suggest this is changed to: ‘Notably, even the red group did not achieve a value of 1 for any of the items, which would indicate a 100% probability of answering correctly for a given stimulus’ Line 409 states: ‘This indicates that the ability to recognize isolated pitches in different timbres and registers without reference is better modeled as a continuous ability when the perfect rating approach is considered in comparison with a categorical and hybrid model’ Suggest this is changed to: ‘This indicates that the ability to recognize isolated pitches in different timbres and registers without reference is better modeled as a continuous ability, rather than when the perfect rating approach is considered with either a categorical or a hybrid model’ Line 446 states: ‘Moreover, in the perfect approach for two parameters, all the items showed high values of discrimination.' Suggest this is changed to: ‘Moreover, for the two-parameter model of the perfect approach, all the items showed high values of discrimination.' Line 490 states: ‘Interestingly, we observed that even the group classified as showing a high probability of choosing the correct answer (less than 20% of the 783 participants) across all the stimuli did not display 100% probability of answering correctly.’ Suggest this is changed to: ‘Interestingly, we observed that none of the individual stimuli were answered correctly 100% of the time, even among the group classified as showing a high probability of choosing the correct answer (less than 20% of the 783 participants).’ ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Jakob Bue Bjørner Reviewer #2: No |
| Formally Accepted |
|
PONE-D-20-04410R2 A new approach to measuring absolute pitch on a psychometric theory of Isolated Pitch Perception: Is it disentangling specific groups or capturing a continuous ability? Dear Dr. Germano: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Karl Bang Christensen Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .