Peer Review History

Original SubmissionSeptember 7, 2024
Decision Letter - Shahid Akbar, Editor

PONE-D-24-38797Addressing Imbalanced Data Classification with Cluster-Based Reduced Noise SMOTEPLOS ONE

Dear Dr. Hajizadeh,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Dec 01 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Shahid Akbar, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. 

Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. We note that your Data Availability Statement is currently as follows: 

“All relevant data are within the manuscript and its Supporting Information files.”

Please confirm at this time whether or not your submission contains all raw data required to replicate the results of your study. Authors must share the “minimal data set” for their submission. PLOS defines the minimal data set to consist of the data required to replicate all study findings reported in the article, as well as related metadata and methods (https://journals.plos.org/plosone/s/data-availability#loc-minimal-data-set-definition).

For example, authors should submit the following data:

- The values behind the means, standard deviations and other measures reported;

- The values used to build graphs;

- The points extracted from images for analysis.

Authors do not need to submit their entire data set if only a portion of the data was used in the reported study.

If your submission does not contain these data, please either upload them as Supporting Information files or deposit them to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of recommended repositories, please see https://journals.plos.org/plosone/s/recommended-repositories.

If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. If data are owned by a third party, please indicate how others may request data access.

Additional Editor Comments:

Major Revision

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have done well but still, some points need to be addressed which increases the readability and look of the manuscript better.

Here are some points

1- The author mentions which dataset results have better accuracy in the abstract.

2- The literature review has two subheadings current research and research gap.

3- The author needs to read the following manuscript and add some applications of Machine learning in different fields in the literature review ”Enhancing software defect prediction: a framework with improved feature selection and ensemble machine learning” “Impact of 3G and 4G technology performance on customer satisfaction in the telecommunication industry” Software defect prediction using an intelligent ensemble-based model.

4- Future work must be a section

5- The author needs to add a table before the conclusion in which results have been compared with some other studies

6- There must be a discussion section at the end

7- The author needs to add tables and figures in the Manuscript rather than down the manuscript so that there will be a better look and easier for the reader

8- What are the issues of the imbalanced dataset and how was it resolved

9- Write the advantages and disadvantages of the proposed method.

10- What are the disadvantages of the Synthetic Minority Oversampling Technique

11- The conclusion must be concise as abstract

Reviewer #2: Overall the paper seems interesting. The topic is valid and well organized. However, the following recommendations are needs to be addressed in order to improve the quality of the papers.

1. The abstract needs more improvement, the authors should mention the results and improvement than existing methods.

2.The problem statement and motivation of the paper needs to be clearly mention to provide a clear background to the readers.

2. The authors should provide the main contributions in points at the end of introduction section.

3. I appreciate the valuable use of machine learning in the model. For the readers concerns, i suggest adding/citing the recent computational models related machine learning and oversampling such as AIPs-DeepEnC-GA, StackedEnC-AOP, DeepAVPTPPred,

iAFPs-Mv-BiTCN, AIPs-SnTCN and Deepstacked-AVPs in order to provide useful information to the readers .

4. How the authors handle the overfitting and generalization of the proposed model.

5. The authors should clearly mention the future directions and real life applications of the proposed study.

6. What are the limitations of the proposed model.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Tehseen Mazhar

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Revision 1

Dear Dr. Akbar and Reviewers,

Thank you for your valuable feedback and suggestions regarding our manuscript titled "Addressing Imbalanced Data Classification with Cluster-Based Reduced Noise SMOTE" (PONE-D-24-38797). We have carefully considered each comment and made the necessary revisions to improve the clarity and quality of our work. The changes can be tracked in the revised manuscript, and a separate Response to Reviewers sheet has also been prepared.

Summary of Revisions:

Abstract Improvement: We have revised the abstract to include specific results and improvements compared to existing methods, as suggested by Reviewer #2.

Problem Statement and Motivation: We have clarified the problem statement and motivation in the introduction to provide a stronger background for readers.

Main Contributions: The main contributions of our study are now clearly outlined in bullet points at the end of the introduction.

Literature Review Enhancements: We have included additional references and applications of machine learning, as recommended by the reviewers.

Future Work Section: A dedicated section for future work has been added to outline potential directions for further research.

Comparison Table: We have added a table comparing our results with other studies, enhancing the manuscript's comprehensiveness.

Discussion Section: A discussion section has been included to address the implications of our findings.

Addressing Overfitting: We have elaborated on how our method addresses overfitting and generalization issues.

Limitations and Advantages: The advantages and disadvantages of our proposed method has been added to the revised manuscript.

Concise Conclusion: The conclusion has been revised to be more concise, mirroring the abstract's style.

We believe these revisions have strengthened our manuscript and addressed the concerns raised during the review process. We appreciate the reviewers' insights and hope that our revised submission meets the publication criteria for PLOS ONE.

Thank you for considering our revised manuscript.

Kind regards,

Rassoul Hajizaded (Corresponding Author)

Attachments
Attachment
Submitted filename: Response to Reviewers.docx
Decision Letter - Agbotiname Lucky Imoize, Editor

Addressing Imbalanced Data Classification with Cluster-Based Reduced Noise SMOTE

PONE-D-24-38797R1

Dear Dr. Hajizadeh,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Agbotiname Lucky Imoize

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Accept.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: The required concerns are successfully incorporated by the authors and now the paper quality have improved.i recommend to accept the paper.

Reviewer #3: The manuscript titled "Addressing Imbalanced Data Classification with Cluster-Based Reduced Noise SMOTE" addresses a critical issue in machine learning: handling imbalanced datasets. The authors propose a novel method, Cluster-Based Reduced Noise SMOTE (CRN-SMOTE), which enhances oversampling via noise reduction using clustering techniques. The study is evaluated using multiple performance metrics across four datasets, showing promising results compared to established methods such as RN-SMOTE, SMOTE-Tomek Link, and SMOTE-ENN.

Strengths:

The topic is timely and addresses a significant gap in machine learning classification, particularly with imbalanced datasets.The problem of classifier bias toward majority classes is well-articulated, providing a clear motivation for the study. The integration of clustering for noise reduction within SMOTE is a novel approach, offering theoretical and practical contributions. The study utilizes multiple datasets (ILPD, QSAR, Blood, and Maternal Health Risk) and evaluates performance with a comprehensive set of metrics (Cohen's kappa, MCC, F1-score, precision, and recall). Consistent performance improvements across datasets demonstrate robustness and applicability. The results indicate a statistically significant improvement over state-of-the-art methods, which is well-supported by comparative data. The abstract and introduction effectively contextualize the study, making the research accessible to both academic and applied audiences. By advancing oversampling techniques, the manuscript contributes to improved classification performance for minority classes, particularly in critical domains such as health risk assessment. The implications extend beyond the datasets studied, offering potential benefits across various imbalanced datasets. Addressing class imbalance remains a critical research area in machine learning, with applications spanning healthcare, finance, and beyond. This manuscript aligns well with current trends and offers solutions that practitioners and researchers can readily adopt.

Conclusion

The manuscript provides a significant advancement in addressing imbalanced data classification. The proposed CRN-SMOTE method is well-motivated, thoroughly evaluated, and impactful, with the potential to influence both research and practice. The authors have demonstrated rigorous methodology, solid experimental design, and clear presentation of results. Therefore, I strongly recommend accepting the manuscript for publication

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Reviewer #3: Yes: Richard Govada Joshua

**********

Formally Accepted
Acceptance Letter - Agbotiname Lucky Imoize, Editor

PONE-D-24-38797R1

PLOS ONE

Dear Dr. Hajizadeh,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Mr. Agbotiname Lucky Imoize

Academic Editor

PLOS ONE

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .