Peer Review History
| Original SubmissionOctober 2, 2020 |
|---|
|
PONE-D-20-31042 A computational lens into how music characterizes genre in film PLOS ONE Dear Dr. Ma, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jan 18 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols We look forward to receiving your revised manuscript. Kind regards, Stavros Ntalampiras Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. Thank you for stating the following in the Financial Disclosure section: "The study was done at the Center for Computational Media Intelligence at USC, which is supported by a research award from Google. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript." We note that you received funding from a commercial source: Google. Please provide an amended Competing Interests Statement that explicitly states this commercial funder, along with any other relevant declarations relating to employment, consultancy, patents, products in development, marketed products, etc. Within this Competing Interests Statement, please confirm that this does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests). If there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared. Please include your amended Competing Interests Statement within your cover letter. We will change the online submission form on your behalf. Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: I Don't Know Reviewer #3: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: This work presents interesting and original findings about the use of music genre in films. Authors provide a proper literature review and the language is clear. ======= Methods ======= You could provide more details about p-values and statistically significance, but, honestly, I do not mind about this too much. The main issue, however, is that the difference among the models used should be inspected further. One method that I suggest is to show the results (i.e. tables 4 and 5) using violin plots, which allow a general qualitative overview of the distribution without falling in type I and II errors. Moreover, violin plots are easily to build. Another option is to just use a scatter plot in a Precision-Recall space and different colors for different models. You say that you have used p-value for checking results in table 5, but what test have you used? Why have you chosen that test? Have you corrected it with some method (e.g. Bonferroni/Holm methods)? Have you used a multi-distribution test such as Kruskal-Wallis or ANOVA? About the features, why have you chosen those features? Which previous studies have you followed? Have you simply used the Matlab standard features? If yes, why? In figure 3, you show the feature importance plot for MIR features; I did not get why you have not computed the feature importance for the VGG features: reducing the number of features used for a classification model may lead to and increase of the overall performance. Again, why have you not used confidence intervals/violin plots/box plots or similar in this figure? It is hard to understand what is the importance of each feature otherwise. ============ Ease of reading ============ You should also declare more precisely the contribute of their work in the abstract and possibly in the introduction: the sentence that is at now is almost unuseful. To my understanding, the paragraph "Visual-musical cross-modal analysis" has almost no reason to be. You repeat everything later, while previous works should be put in "Related works". Note that an extnsive survey about multi-modal and cross-modal music studies extists [1]. Paragraph "Multiple Instance Learning" is very unclear. You should say as soon as possible what is a bag and what is an instance in your study. Everything should then be referenced to your case (e.g. hypothesis etc.). This allows the reader to understand MPI with a concrete example. Results reported about ScoreStamper in paragraph "Automatically extracting musical cues in film" are scientifically unuseful. You have tested it in only 3 bags. How many instances, stamps, multiple occasions there were? How much were differente the music pieces in the song track? Reasons about the low recall are unclear. You should describe with more details the structure of the models used, even if these models were already used in previous papers. ========= References ========= [1] F. Simonetta, S. Ntalampiras, and F. Avanzini, “Multimodal Music Information Processing and Retrieval: Survey and Future Challenges,” in Proceedings of 2019 International Workshop on Multilayer Music Representation and Processing, Milan, Italy, 2019, pp. 10--18. https://arxiv.org/abs/1902.05347 Reviewer #2: This paper presents a study on the use of films' soundtracks to help automatic classification of their broad genre (e.g. Action, Comedy). To do so, the authors 1) curated a new dataset of films and their corresponding sound tracks with automatic fingerprinting annotations; 2) explored different features for describing the music content of soundtracks, in particular they explored common MIR features related to dynamics, rhythm, pitch, timbre and tone (such as MFCCs, tempo and chroma) and deep learning derived features (VGGish); 3) investigated different variations classifiers and problem definitions (SVMs, kNNs, DNNs, Multiple Instance Learning, different pooling strategies); 4) performed an ablation study on the importance of the MIR features for the classification of the different genres and its relation with simple image clues such as brightness and contrast. The paper is well written, has good references to previous work and a thoughtful discussion of the results, so I'd recommend it to be accepted. Even though the paper is in good shape already, there is room for improvement in the structure and more details about implementation and experiments should be added. See my comments below. Improvement in structure === In my opinion the contributions should be better highlighted. It is a bit difficult to understand what exactly the contributions are with respect to previous work since the discussion in related work doesn't clearly highlight the limitations in the literature besides some isolated comments spread out in the text (i.e. the first clear statement I saw was in line 46). The innovations in methodology, and the analysis should be also listed as contributions if they're more extensive than previous works. More detail on the experiments === Authors should explain how they assessed statistical significance. It is mentioned in the text but not clarified. Also it would be good to have more details on the network architecture besides a reference (capacity or number of parameters, layer's size, input size a bit more clear). Note that the dataset is not very big and this raises some questions on suitability of the architecture that could be partially answer with more information on the implementation. I think some reference to the performance of previous works is needed to understand if the models presented here are performing reasonably (which they seem when I went compared to [1]) but an explicit comment would help understand the work better. I'm not sure if I followed the conclusions in L298-L302 that the music style of a clip. The conclusion is that because the brightness and contrast in the clips using predicted labels are not correlating with the "expected behaviour" of each genre then the music in the clip doesn't necessarily correspond to the visual style? Then why do you see that effect in the clips when you use "actual labels"? Could it be an artifact of the model's performance? Nit comments === - The figures are not in the main text, not sure if this is an artifact of the reviewing template or something to correct. - L154: Briefly explain "texture windowed" - L166: Would prefer a short explanation on how brightness and contrast were obtained and refer to paper for further details. - L221: I don't understand this phrase, couldn't parse it. - L283 - L286: Are you trying to say that investigating brightness and contrast mean scores on the model's predictions help understand associations between those visual features and what the model learned? And that could potentially be applied to unknown genres? Maybe rephrase to make it clearer. Reviewer #3: The authors build on research in the field, utilizing well-constructed computational models to retrieve information and data to help us understand how film music operates across several genres and interacts with other film modalities. They have applied this to over 100 films, and I find that their approach in identifying the music from the soundtracks that are actually timestamped in the film itself is a sound and even essential one. Though this study does involve necessary technical information appropriate for a study like this, they are careful to take the reader step-by-step through their process, carefully explaining terms, and leading us to their conclusions in a logical, cogent manner. They have also provided strong data in support of the study. I believe that this study can open the way to further research, as they even suggest in the paper. As a practicing musician and musical scholar myself, I will be interested in seeing this lead to further published work that will help us better understand music’s role as significant and interactive cinematic device, and how viewers respond to the cinematic experience, emotively, perceptively, and cognitively. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: Yes: Joseph L. Rivers Jr. [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
PONE-D-20-31042R1 A computational lens into how music characterizes genre in film PLOS ONE Dear Dr. Ma, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Apr 03 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols We look forward to receiving your revised manuscript. Kind regards, Stavros Ntalampiras Academic Editor PLOS ONE [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: (No Response) Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: (No Response) ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: 1) Statistical significance tests were only performed for the visual feature experiments and not for the VGG-ish and MIR features. 2) the caption of table 5 is unclear about what the bold style means: authors say to have computed p-values using bonferroni correction; this means that they have compared multiple tests, but I cannot understand what are these tests: did they compare all the tests for which they show the average at once? or did they compared "actual", "predicted" and "false positives" in each row? or all rows of "predicted" and then all rows of "actual" and then all rows of "false positives"? 3) Moreover, statistical tests need the same cardinality between the set tested. This would mean that the number of "films labeled with a given genre" is the same as the number of "films excluding the given genre", which seems unlikely. How they managed this problem for the statistical test? 4) I appreciate the addition of supplementary figure S2, but it shows recision-recall curves, not scatter plots. PR-curves comes out when the classification is made based on some threshold, and they are a method for evaluating the model, not the distributions of the predictions. AUC (Area under curve) is similar to F1-score and in this case PR-curves don't add any knowledge. Even if other reviewers find this plot useful, it's not clear what "no skill" line is. It would be more clear if MIR and VGG-ish points were on the same plot. When I wrote "scatter plot in a precision-recall space", I was meaning a scatter plot, not a curve. Scatter plots are plots which shows points in their coordinates; in my example, coordinates were precision and recall. Points could be, for instance, each film. Multiple distributions can be plotted using different colors, eg. multiple models: see for instance the following image https://www.researchgate.net/profile/Mohit_Bansal7/publication/267783907/figure/fig3/AS:669377024258060@1536603327607/BCubed-Precision-Recall-scatter-plot-for-the-Japanese-English-dataset-Each-point.png A plot such that could be useful to qualitatively evaluate the difference models without falling in type I and type II errors. Reviewer #2: In my previous review I mentioned that the paper was in good shape already but I recommended the authors to 1) clarify their contributions and structure; 2) provide more details in experiments and discussion, i.e. explain details in architecture, statistical significance, how the visual features were calculated, among others. The authors addressed all my comments and also improved the results and discussion section which is much more clear now, so I recommend the paper to be accepted in this new version. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 2 |
|
A computational lens into how music characterizes genre in film PONE-D-20-31042R2 Dear Dr. Ma, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Stavros Ntalampiras Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: (No Response) Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: All the comments have finally been addressed. Statistical significance tests have been carried out over all the evaluations shown in the text and an additional plot has been added, giving the rough idea about how different the two models are. Since PLOS One guidelines instruct the reviewers to stress the rigorousity of the scientific procedure in respect to the results, even if the plot added shows how VGG-ish features only marginally outcomes classic MIR features, I find that the paper still adds knowledge worth of publication. Reviewer #2: The authors addressed my comments previously, and they have carefully answered and made the changes requested for the other reviewer in this iteration. The paper is in good shape to me. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No |
| Formally Accepted |
|
PONE-D-20-31042R2 A computational lens into how music characterizes genre in film Dear Dr. Ma: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Prof. Stavros Ntalampiras Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .