Peer Review History
| Original SubmissionDecember 5, 2023 |
|---|
|
PONE-D-23-38302A novel code representation for detecting Java source code clonesPLOS ONE Dear Dr. Quradaa, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Apr 08 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse. 3. Please provide a complete Data Availability Statement in the submission form, ensuring you include all necessary access information or a reason for why you are unable to make your data freely accessible. If your research concerns only data provided within your submission, please write "All data are in the manuscript and/or supporting information files" as your Data Availability Statement. Additional Editor Comments: Although this manuscript has a few of merits, the reviewers gave serious issues and it needs significant revision. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: No Reviewer #3: Partly ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: No Reviewer #3: N/A ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: No Reviewer #3: No ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: No ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Some potential drawbacks of the proposed machine learning approach for code clone detection using high and low level source code representations: 1. Increased complexity: Combining multiple code representations adds complexity over single-representation techniques. 2. Scalability issues: Deriving and analyzing multiple representations could hamper analysis of very large codebases. 3. Sensitivity to changes: Small code modifications may invalidate code representations, requiring re-analysis. 4. Variability across languages: Representations and extracted features may not generalize well across different programming languages. 5. Infrastructure requirements: Extraction and processing of representations requires language-specific toolchains/infrastructure. 6. Evaluation limitations: Approach tested on limited datasets, more rigorous validation on real-world projects needed. 7. False positives risks: Semantic abstraction could wrongly link syntactically different code snippets. 8. Configuration challenges: Proper configuration of machine learning models requires hyperparameter tuning expertise. 9. Explainability challenges: Identifying why certain code is flagged as clones from multiple blended representations is difficult. 10. Intellectual property risks: Extracting and sharing representations and models could inadvertently leak proprietary code. 11. Describe dataset features in more details and its total size and size of (train/test) as a table. 12. Flowchart and algorithm steps need to be inserted. 13. Time spent need to be measured in the experimental results. 14. Limitation Section need to be inserted. 15. All metrics need to be calculated such as Accuracy, Precision, Recall, F1 score, and ROC AUC score in the experimental results as tables. 16. Address the accuracy/improvement percentages in the abstract and in the conclusion sections, as well as the significance of these results. 17. The architecture of the proposed model must be provided 18. The authors need to make a clear proofread to avoid grammatical mistakes and typo errors. 19. The authors need to add recent articles in related work and update them. 20. Add future work in last section (conclusion) (if any) 21. To improve the Related Work and Introduction sections authors are recommended to review this highly related research work paper: a) A high-quality feature selection method based on frequent and correlated items for text classification b) A new feature selection method based on frequent and associated itemsets for text classification c) Building an Effective and Accurate Associative Classifier Based on Support Vector Machine d) An ASP .NET Web Applications Data Flow Testing Approach e) An Approach to Slicing Object-Oriented Programs Reviewer #2: The paper attempts to report work on using AST to detect Java clones. As the motivation of the work is not clearly defined, the methodology presented is not sound and proven. There is no data, experiment or lemma/theorem to support the proposal. Therefore, the paper is recommended for rejection. Reviewer #3: Dear Authors. Thanks for submitting this work. The paper reviews code representation for detecting Java source code clones. The topic looks promising; however, a few issues are available in this manuscript. Detailed feedback: Title: it is recommended to rephrase the title to show the approach of clone detection, for example using bytecode or assembly. Abstract: lines from 16 onwards might be too detailed for the abstract. it is recommended to remove them. The issue could be maintainability, reusability and even ethical consideration. Introduction it is recommended to provide either a table or a tree structure showing subtypes. The role or type of machine learning used in this study is not clear. This also should reflect the abstract to include the idea of machine learning. 94-100 seems to be repeated. Many parts in the introduction are repeated. define the acronyms on first use. the background should come before related works. Methodology instead of saying, "will be normalized and compiled using a compilation tool like the Stub", authors should specify what they use and why. Step 3 is not detailed at all, and it is suggested to be removed. there are no results of this study, so no generalization can be made. no limitations or dataset description. It is highly recommended to have a section for used datasets and tools. Discussion is shallow, focus on research questions and the objectives, please. conclusion it looks like a summary, please revise implications, limitations, and threats are crucial sections. Minor issues line 28- However, this approach ==>However, in this approach , l section numbering and figures are not inserted in their place. all figures are blur. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Tarek Abd El-Hafeez Reviewer #2: No Reviewer #3: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
A novel code representation for detecting Java code clones using high-level and abstract compiled code representations PONE-D-23-38302R1 Dear Dr. Quradaa, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Unil Yun, Ph.D. Academic Editor PLOS ONE Additional Editor Comments (optional): Authors revised carefully this manuscript and it is acceptable now. Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #3: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #3: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #3: N/A ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #3: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #3: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: All of the comments and feedback provided have been thoroughly addressed and incorporated into the final version of the document. Given the comprehensive consideration of all suggestions, I highly recommend proceeding with accepting the document for publishing. Reviewer #3: Thank you for addressing all comments. Please pay attention to figures as some of them are still blurry. and kindly ensure all arguments are completely included within the manuscript were applicable. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Tarek Abd El-Hafeez Reviewer #3: No ********** |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .