Peer Review History
| Original SubmissionDecember 11, 2023 |
|---|
|
PONE-D-23-41442Application of Machine Learning-Based Algorithms for Predicting Stunting among Adolescent Girls in EthiopiaPLOS ONE Dear Dr. Zemariam, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that your manuscript will likely be suitable for publication if it is revised to address the points below. Therefore, my decision is "Major Revision". Please submit your revised manuscript by May 06 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Oluwafemi Samson Balogun, Ph.D. Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. We noticed you have some minor occurrence of overlapping text with the following previous publication(s), which needs to be addressed: https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/s12911-023-02102-w https://www.mdpi.com/2227-9067/10/10/1638 https://archpublichealth.biomedcentral.com/articles/10.1186/s13690-015-0093-9 In your revision ensure you cite all your sources (including your own works), and quote or rephrase any duplicated text outside the methods section. Further consideration is dependent on these concerns being addressed. 3. Your ethics statement should only appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please move it to the Methods section and delete it from any other section. Please ensure that your ethics statement is included in your manuscript, as the ethics statement entered into the online submission form will not be published alongside your manuscript. 4. Please include a separate caption for each figure in your manuscript. 5. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: No Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: No Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Introduction could potentially be enhanced: • Expanding on how machine learning has been utilized in public health, especially in developing countries, could provide a more comprehensive backdrop. This includes examples of successful ML applications in similar contexts. • A detailed discussion about the challenges in predicting stunting, especially in low-resource settings, and how machine learning can address these challenges would be informative. This includes limitations of traditional statistical methods and the advantages offered by ML techniques. • For readers who may not be familiar with machine learning, a clearer explanation of key ML concepts and algorithms could be beneficial. This includes a basic overview of the algorithms used in the study and why they are suitable for this type of analysis. • How stunting is influenced by socio-economic and environmental factors, and how ML can help untangle these complex relationships, would provide a more holistic understanding. • Providing a review of previous studies that have used ML for similar purposes, highlighting what has been achieved and where gaps remain • Rationale for the Study , particularly the need for ML approaches in the context of Ethiopia. Other drawbacks • The absence of some potentially relevant clinical, dietary, and socio-economic variables in the DHS data may affect the accuracy and comprehensiveness of the results. • The study focuses specifically on adolescent girls in Ethiopia, which may limit the generalizability of the findings to other populations or demographic groups. • The Boruta algorithm, employed for feature selection in the study, is a robust method that ensures comprehensive inclusion of relevant features. However, it comes with drawbacks such as high computational intensity due to its iterative nature and reliance on random forest classifications, which can be resource-heavy. There's a risk of overfitting, as Boruta tends to retain more features, potentially including noisy or less relevant ones. Its dependence on the performance of the random forest algorithm means its effectiveness is closely tied to how well this model fits the data. Additionally, the process of feature selection can sometimes be arbitrary and lacks clear interpretability, as the reasons for the inclusion or exclusion of certain features aren't always transparent. Furthermore, the efficiency of Boruta can be challenged in handling very high-dimensional data due to the exponential increase in features when creating shadow attributes. • Machine Learning Model Interpretability can be seen as "black boxes". Include interpretable AI • ML models parameter optimization is missing • Include a flowchart to explain the study clearly • Despite using the Synthetic Minority Oversampling Technique (SMOTE) to balance the data, there's a risk that this technique might introduce artificial bias. Oversampling can sometimes lead to overfitting, as the model might be too tailored to the synthetic examples created by SMOTE. Explain this. • Missing Data percentage • The absence of p-values in the summary statistics table of the study could be considered a drawback, particularly in the context of medical and epidemiological research. • Results interpretation please indicate the clearly the results is based on testing dataset. • The study uses association rule mining to identify patterns between features and stunting. However, these rules should be interpreted with caution as they indicate correlation, not causation, and the findings might be influenced by confounding factors. The discussion section of the manuscript, could benefit from a more explicit with a broader and more detailed comparison with existing literature,. Addressing the study's methodological limitations more thoroughly would provide a more balanced view. There's also a need for a more comprehensive discussion on potential biases inherent in the dataset and how these might influence the findings, a more detailed exploration of specific policy recommendations and practical applications would add value. Suggestions for future research, especially addressing the current study's limitations, could provide clearer direction for subsequent studies. Finally, a more in-depth discussion of the socioeconomic, cultural, and environmental factors affecting stunting in the Ethiopian context would offer more understanding of the issue. Reviewer #2: PONE-D-23-41442 Application of Machine Learning-Based Algorithms for Predicting Stunting among Adolescent Girls in Ethiopia The manuscript contains an important insight for handling and exploring the non-linear relationship of variables, it is an excellent supplement to results generated using the classical linear statistical models. In general, the manuscript is interesting and relevant to the field. The following issues needs your reflection and where necessary revisions. • Do you have a rationale to split the dataset to 80% and 20% for training and testing the model respectively, why not other proportions • What strategy has employed to mitigate overfitting in your study • Have you used the same software for data preprocessing and model selection analysis, if not kindly mention which other tools were employed? • You have mentioned a number of performance evaluation metrics, why don’t you use one of the metrics that best fit to your case. • Feature selection methods is not well explained, specifically it is important to depict how the employed feature selection method (Boruta algorithms) is suitable over other methods for your study. • The first paragraph of the discussion part is well explained on the introduction section, no need to repeat it, hence remove it and focus on interpreting your findings and portray its policy and further research implications. • Random forest algorithm has the best predictive model in your case; your discussion is limited to studies that come across with similar result to yours. But there are studies which identify best predictive model other than random forest algorithm, you have to include them and narrate in relation to your findings. • Paragraphs narrating the predictors of stunting and the use of association rule mining on the discussion part is direct replication of the result and devoid of interpretation and comparison with other studies’ findings ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Sorayya Malek Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
PONE-D-23-41442R1Application of Machine Learning-Based Algorithms for Predicting Stunting among Adolescent Girls in EthiopiaPLOS ONE Dear Dr. Zemariam, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that your manuscript will likely be suitable for publication if it is revised to address the points below. Therefore, my decision is "Minor Revision". Please submit your revised manuscript within Nov 02 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Oluwafemi Samson Balogun, Ph.D. Academic Editor PLOS ONE Journal Requirements: Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #3: (No Response) ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #3: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #3: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #3: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #3: No ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #3: Review report Review report of research manuscript entitled “Application of Machine Learning-Based Algorithms for Predicting Stunting among Adolescent Girls in Ethiopia” I would like to say thank you for your invitation to review the above-mentioned manuscript! General Comments The paper is technically much sounded and the questions are informative. The statistical analysis of the data is also highly sounded, and the findings are appropriately discussed in the context of previous literature. It meets the requirements of the research community well in the data processing and management steps. 1. Reporting prevalence in the summary section for this particular study is senseless. You are running out of the scope of your title. Try to remove it please. 2. Starting from the title you should identify and incline towards either the model or the application mainly. Based on that the title, background, objectives, and all others will be shaped in that way! As to me, the ultimate objective looks emphasizing the application. My suggested topic/title: “Prediction of stunting and its socioeconomic determinants among adolescent girls in Ethiopia using machine learning algorithms” 3. Why are you interested to study only adolescent girls? Give a strong justification in your introduction. Do you think that stunting is the only concern of adolescent girls? 4. What was the mechanism that can be used to manage the lack of transparency and interpretability of the data? 5. What were the possible bias and limitations for your study? In addition, how could you handle it? Try to incorporate these issues in your study clearly. 6. Why did you exclude other parts of undernutrition? Why do you insisted to stunting only? Please give a strong justification. 7. Some of very relevant factors of stunting cannot be available in the secondary data including DHS data. How did you handle such issues? Try to incorporate in your limitation. 8. Eight machine-learning algorithms were included for model building and comparison for this particular study. What additional models did you plan in your future studies? What challenges did you face to be restricted to these eight models only for this study? What are the major effects for your study regarding variations of machine learning algorithms? 9. EDHS 2016 is old aged data. Why did you prefer it? There are recent EDHS data in other countries. Have you consider this issue? 10. To keep the logical coherence and flow of ideas, I recommend you to mention the consequences and effects of stunting following to the prevalence of stunting. 11. Why did you re categorize wealth index? 12. I appreciate your mean SHAP value report and Waterfall plot analysis. However, I have not seen any interpretation for such wonderful parts of your effort. I strongly recommend you to interpret it in detail using the log odd value for each specific findings to make it clear for readers. 13. Make sure that all of your pertinent findings have been discussed very well including its implications. 14. Artificial intelligence is the current active area of research particularly for health data. You have particularly employed machine learning for this specific study. Why did you prefer machine-learning parts of artificial intelligence on top of other artificial intelligence options like generative artificial intelligence and deep learning? 15. Try to revise editorial and grammatical issues throughout your entire document. Generally, this research addressed untouched and active research area using multiple machine learning algorithms and large data set. After correcting the given comments, this study is qualified for publication, potentially contribute significant role for the entire scientific community. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #3: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.
|
| Revision 2 |
|
Prediction of Stunting and Its Socioeconomic Determinants among Adolescent Girls in Ethiopia using Machine Learning Algorithms PONE-D-23-41442R2 Dear Dr. Alemu Birara Zemariam, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Oluwafemi Samson Balogun, Ph.D. Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #3: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #3: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #3: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #3: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #3: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #3: The authors have addressed all of my comments. It is now acceptable for publication with the current form. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #3: Yes: Ali Yimer ********** |
| Formally Accepted |
|
PONE-D-23-41442R2 PLOS ONE Dear Dr. Zemariam, I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team. At this stage, our production department will prepare your paper for publication. This includes ensuring the following: * All references, tables, and figures are properly cited * All relevant supporting information is included in the manuscript submission, * There are no issues that prevent the paper from being properly typeset If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps. Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. If we can help with anything else, please email us at customercare@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Oluwafemi Samson Balogun Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .