Classifying complex multimorbidity using latent class analysis and machine learning to generate insights into clustering of mental and cardiometabolic conditions

Moumita Mukherjee; Samhita Mukherjee; Hruthik Reddy Thokala; Raja Hashim Ali

doi:10.1371/journal.pone.0335676

Peer Review History

Original Submission December 12, 2024
9 Jun 2025 Decision Letter - Chiranjivi Adhikari, Editor Dear Dr. Mukherjee, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jul 24 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . 
Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols . We look forward to receiving your revised manuscript. Kind regards, Chiranjivi Adhikari, MPH, MHEd., PhD Candidate Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf . 2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse. 3. Thank you for stating the following in your Competing Interests section: “Not applicable” Please complete your Competing Interests on the online submission form to state any Competing Interests. If you have no competing interests, please state "The authors have declared that no competing interests exist.", as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now This information should be included in your cover letter; we will change the online submission form on your behalf. 4. 
Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.
Additional Editor Comments (if provided):
Dear Authors,
It is an interesting piece of scientific work, with well-documented and reviewed research questions, hypotheses, and methodologies. The write-up is good apart from some errors and some analyses still to be carried out, as listed below. Minor revisions:
1. As the reviewers have also noted, there are errors: in line 158, citation 4 appears with a "?"; in line 163, the 95% CI is missing; in line 165, a citation is missing; and in lines 296, 299, 360 and many other places, "chi2" and "chi-square" are used inconsistently.
2. Similarly, some text is missing, for example in line 297, where only "1" appears and the word "table" is missing.
3. Table 2 shows 10 hypotheses, but the interpretation below it covers only three; please also describe and mention the other hypotheses.
4. In line 197, write in sentence case.
5. In line 305, write in sentence case.
Major revisions:
6. For AUCs, precision, recall, and F1-scores, also report 95% CIs.
7. AUCs, precision, recall, and F1-scores only tell us how well each individual model ranks positive versus negative cases, its true positive rate, and similar performance measures. For policy and clinical decision-making, however, the models should be compared directly, which is lacking. Head-to-head tests such as DeLong’s test (for comparing AUCs statistically) and/or bootstrap tests (to obtain confidence intervals on the given parameters) are therefore recommended, in consultation with a senior statistician.
8. Similarly, AUCs and the other parameters mentioned may consider only ranking, not class label predictions, and so on.
Therefore, McNemar's test, which checks whether the number of disagreements between two classifiers is statistically significant in a head-to-head comparison, or a similar test that compares all models, is strongly advised (in consultation with a statistician).
9. Similarly, the tests you have carried out may not show how performance varies across samples or resamples, and so may miss the variance and stability of performance across datasets or folds. For practical purposes such as public health decision-making, please also consult on permutation tests, to evaluate whether a model performs significantly better than chance or than another model under label shuffling, and/or cross-validated paired t-tests or Wilcoxon signed-rank tests across folds for robust model comparison. Additionally, to improve readability for readers who are less comfortable with such statistics and tests, consider decision curve analysis and calibration plots (how well predicted probabilities reflect true likelihood).
Finally, please also address the comments from both reviewers, whose time I gratefully acknowledge.
Chiranjivi, AE, PLOS
Reviewers' comments:
Reviewer's Responses to Questions
Comments to the Author
1. Is the manuscript technically sound, and do the data support the conclusions?
Reviewer #1: Partly
Reviewer #2: Yes
2. Has the statistical analysis been performed appropriately and rigorously?
Reviewer #1: Yes
Reviewer #2: Yes
3. Have the authors made all data underlying the findings in their manuscript fully available?
Reviewer #1: Yes
Reviewer #2: Yes
4.
Is the manuscript presented in an intelligible fashion and written in standard English?
Reviewer #1: Yes
Reviewer #2: No
Reviewer #1: The aim of this research is to explore the complex multimorbid phenomenon and to inform public health policy makers in designing smart, prediction-based decision-making that avoids delay in specific intervention areas and increases targeting accuracy. Classical and machine learning models are applied to identify the best model for classifying the individuals who fall into this typical subgroup, to help the health system design customized solutions. However, the manuscript seems to have many limitations, including the following:
(i) There are many such works in the domain. What are the limitations of those studies? What is the significance of your work over others' works?
(ii) You should present the related work separately in a new section. Moreover, you should present a summary table for the related works.
(iii) You should have an organization paragraph at the end of your introduction section.
(iv) On the fifth page of your manuscript, you missed many references. Please check that.
(v) You should use a proper reference for the dataset, not just Kaggle.
(vi) The quality of the images used in this manuscript is very bad.
(vii) Your material and method section is very poorly presented.
(viii) The performances of the different machine learning models are not satisfactory. Moreover, you have not compared your results with those of others.
(ix) Your research contributions are not satisfactory, except some analysis.
Reviewer #2: The subject of the study is quite interesting and the findings are impactful. The manuscript is well written, and the methodology and research design are scientific and sound. The results are reported well; however, there is still room for improvement in the following areas:
1. Referencing should be uniform throughout the paper (lines 40, 46, 58, 82, 96, and any others). You cite Rosenkilde et al. (30) in line 82 and (WHO, 2020) in line 96. Which one is correct? Please make the style uniform and follow the journal guideline.
2. Line 120 needs to be rewritten as "A prospective experimental study by.... tested the link between loneliness and the onset of T2D symptoms using data from the Danish National Health Survey (ref) which includes 465290 participants older than 16 years."
3. Lines 165 and 169: "Another study by Uphoff et al. (?) ... explored the impact of behavioural an impact on efficacy (?)." I cannot tell whether a reference is missing or what you intend to write. Please correct.
4. Line 185: I cannot follow the idea of this sentence. Please clarify. I think the sentence structure should be put in the correct order.
5. Lines 297-299: use one name for the test type consistently, either Chi2 test or chi-square test.
6. Regarding methodological choices: the study discusses various classical and machine learning methods, but it does not provide in-depth justification for the chosen methods over others and may overlook potential biases introduced by these choices. Please justify.
7. Table 2 seems to run off the page; the full table cannot be seen in the paper layout. Please correct it.
8. Add one paragraph on policy implications in the future direction section, which will make this paper more comprehensive.
9. Conclusion: duplication must be avoided. Write briefly, addressing your research objectives.
10. Add information about the strengths of your study before the limitations.
Overall, the discussion section reads well. Thank you and best wishes.
PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.
Reviewer #1: No
Reviewer #2: Yes: Sujan Poudel
[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]
While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.
https://doi.org/10.1371/journal.pone.0335676.r001
Revision 1
10 Oct 2025 Author Response
Point by Point Response
Editor’s comments
As the reviewers have also noted, there are errors: in line 158, citation 4 appears with a "?"; in line 163, the 95% CI is missing; in line 165, a citation is missing; and in lines 296, 299, 360 and many other places, "chi2" and "chi-square" are used inconsistently.
Corrected.
Similarly, some text is missing, for example in line 297, where only "1" appears and the word "table" is missing.
Texts are corrected as per changes in the analysis.
Table 2 shows 10 hypotheses, but the interpretation below it covers only three; please also describe and mention the other hypotheses.
These analyses are omitted, and new methods are applied. Research questions are now strengthened. The new analysis (LCA followed by machine learning) does not include the previous hypotheses.
In line 197, write in sentence case. In line 305, write in sentence case.
Sentences are now changed.
For AUCs, precision, recall, and F1-scores, also report 95% CIs.
Computed and added.
AUCs, precision, recall, and F1-scores only tell us how well each individual model ranks positive versus negative cases, its true positive rate, and similar performance measures. For policy and clinical decision-making, however, the models should be compared directly, which is lacking. Head-to-head tests such as DeLong’s test (for comparing AUCs statistically) and/or bootstrap tests (to obtain confidence intervals on the given parameters) are therefore recommended, in consultation with a senior statistician. Similarly, AUCs and the other parameters mentioned may consider only ranking, not class label predictions, and so on. Therefore, McNemar's test, which checks whether the number of disagreements between two classifiers is statistically significant in a head-to-head comparison, or a similar test that compares all models, is strongly advised (in consultation with a statistician).
Following the recommendations, we added the McNemar test and a paired t-test to assess whether the number of disagreements between two classifiers is statistically significant.
For practical purposes such as public health decision-making, please also consult on permutation tests, to evaluate whether a model performs significantly better than chance or than another model under label shuffling, and/or cross-validated paired t-tests or Wilcoxon signed-rank tests across folds for robust model comparison. Additionally, to improve readability for readers who are less comfortable with such statistics and tests, consider decision curve analysis and calibration plots (how well predicted probabilities reflect true likelihood).
Yes, we have computed the paired t-test and decision curve analysis and added them to the new version.
Reviewer 1
There are many such works in the domain. What are the limitations of those studies?
We have modified our focus to complex multimorbidity. We applied latent class analysis to define the clusters. Then we applied different ML algorithms to explore the best algorithm for classification. We considered previous works such as:
1. Polessa Paula, D., Barbosa Aguiar, O., Pruner Marques, L., Bensenor, I., Suemoto, C. K., Mendes da Fonseca, M. J., & Griep, R. H. (2022). Comparing machine learning algorithms for multimorbidity prediction: An example from the Elsa-Brasil study. PloS one, 17(10), e0275619. https://doi.org/10.1371/journal.pone.0275619
2. Wang, X., Zheng, N., & Yin, M. (2025a). Multimorbidity Patterns and Depression: Bridging Epidemiological Associations with Predictive Analytics for Risk Stratification. Healthcare (Basel, Switzerland), 13(12), 1458. https://doi.org/10.3390/healthcare13121458
3. Wang, X., Zhang, D., Lu, L., Meng, S., Li, Y., Zhang, R., Zhou, J., Yu, Q., Zeng, L., Zhao, J., Zeng, Y., & Gao, R. (2025b). Development and validation of an explainable machine learning model for predicting the risk of sleep disorders in older adults with multimorbidity: a cross-sectional study. Frontiers in public health, 13, 1619406. https://doi.org/10.3389/fpubh.2025.1619406
4. Bertrand, A., Zhou, X., Lewis, A., Monfeuga, T., Gupta, R., Grau, V., & Rodriguez, B. (2025). Sex-specific cardiometabolic multimorbidity, metabolic syndrome and left ventricular function in heart failure with preserved ejection fraction in the UK Biobank. Cardiovascular diabetology, 24(1), 238. https://doi.org/10.1186/s12933-025-02788-4
What is the significance of your work over others' works? You should present the related work separately in a new section. Moreover, you should present a summary table for the related works. You should have an organization paragraph at the end of your introduction section.
Recent studies applied machine learning (ML) techniques to classify multimorbid patients. Zaidan et al. (2023) reported that random forests achieved the highest accuracy in predicting complex multimorbidity across 3-, 4-, and 5-class models, with 91–92% accuracy for diabetes + depression + CVD + hypertension combinations. A cross-sectional study with longitudinal mortality follow-up by Zhang et al. (2021) applied latent class analysis (LCA) to create multimorbidity clusters in a clinically meaningful manner. A retrospective modified cross-sectional study examining sex-specific differences in cardiometabolic comorbidity used LCA to identify distinct patient clusters, alongside other analyses, to understand differences in adverse cardiac remodeling (Bertrand et al., 2025). The study by Wang et al. (2025a) employed LCA to cluster multimorbidity patterns using the China Health and Retirement Longitudinal Study and found four distinct morbidity patterns, all of which were significantly associated with depression; predictive performance was evaluated using an XGBoost model. The study by Polessa Paula et al. (2022) applied different machine learning models for multimorbidity prediction. Another study by Wang et al. (2025b) developed and validated an explainable ML model to predict the risk of sleep disorder in an older multimorbid population subgroup, then applied Shapley Additive Explanations to identify the important features contributing to the outcome. Despite these advances, proper clustering and classification of patients by multimorbidity severity, together with identification of the most important contributing features, remain widely under-researched, limiting the design of targeted interventions and resource optimization.
On the fifth page of your manuscript, you missed many references. Please check that.
Corrected.
You should use a proper reference for the dataset, not just Kaggle.
Added. The main data is available at https://www.cdc.gov/brfss/annual_data/annual_2014.html. The excerpt used for the analysis is available on Kaggle.com as the Diabetes Health Indicators Dataset.
The quality of the images used in this manuscript is very bad.
Our sincere apologies. We are now sharing the MS Word version with modified analyses and new figures. We hope you like them.
Your material and method section is very poorly presented.
Our sincere apologies again. We have rewritten it to improve the quality of the writing.
Materials and Methods
Dataset: The dataset is an excerpt of the 2015 Behavioral Risk Factor Surveillance System (BRFSS) from the US CDC, containing 21 features and 253,680 responses on diabetes-related health indicators, behavioral variables, health status, and demographics (Xie et al., 2019, "Building Risk Prediction Models for Type 2 Diabetes Using Machine Learning Techniques"). Feature engineering was performed using Stata 14.0 BE and Python 3.11 to create relevant variables and optimize classification.
The dataset was loaded into Python using pandas, verified to contain no missing values, and split into input features (X) and the target variable (y).
Data Preprocessing and Feature Engineering: Participants who were neither prediabetic nor diabetic or had never experienced stroke were excluded, leaving 46,736 observations. Class imbalance was addressed using SMOTE to generate synthetic samples near decision boundaries. Data were split 80:20 into training and testing sets. Two new variables were created: stroke occurrence and presence/absence of mental health disorders by sex.
Analysis: Latent class analysis (LCA) clustered participants into five complex multimorbidity classes. LCA is a probabilistic, model-based clustering method that identifies unobserved participant subgroups (classes) within a heterogeneous population based on observed responses. It estimates the probability that an individual respondent belongs to each latent class, generating class membership probabilities for each respondent. The LCA model can be expressed as
\[ P(Y_i) = \sum_{k=1}^{K} \pi_k \prod_{j=1}^{J} P(Y_{ij} \mid C_i = k) \]
where \(Y_i\) represents the observed responses for individual \(i\), \(C_i\) is the latent class, \(\pi_k\) is the prior probability of class \(k\), and \(P(Y_{ij} \mid C_i = k)\) is the conditional probability of observing response \(j\) given class \(k\). Model selection was based on the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). LCA allows identification of distinct clusters of complex multimorbid respondents. Control factors included dietary habits, lifestyle, health-seeking behavior, and socioeconomic status (Table 1).
Table 1 here
Machine Learning Models and Evaluation: After participants were assigned to latent classes, six supervised algorithms were applied to predict class membership and its determinants: multinomial logistic regression (MLR), multinomial Naive Bayes (MNB), decision tree (DT), random forest (RF), XGBoost (XGB), and artificial neural networks (ANN).
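As an illustration of the latent class likelihood described above, the following is a minimal sketch, not the authors' implementation. It assumes binary indicator items and takes already-estimated parameters as input; the function name `lca_posteriors` and the arguments `class_priors` and `item_probs` are hypothetical names introduced only for this example.

```python
import numpy as np


def lca_posteriors(Y, class_priors, item_probs):
    """Evaluate the LCA mixture likelihood and class membership probabilities.

    Assumptions (not taken from the manuscript): all J items are binary (0/1)
    and the parameters have already been estimated, e.g. by EM.

    Y            : (n, J) array of observed 0/1 responses
    class_priors : (K,) array, prior probability of each latent class
    item_probs   : (K, J) array, P(Y_ij = 1 | C_i = k)
    """
    Y = np.asarray(Y, dtype=float)
    class_priors = np.asarray(class_priors, dtype=float)
    item_probs = np.asarray(item_probs, dtype=float)

    # log P(Y_i | C_i = k) = sum_j [y_ij log p_kj + (1 - y_ij) log(1 - p_kj)]
    log_cond = Y @ np.log(item_probs).T + (1.0 - Y) @ np.log(1.0 - item_probs).T

    # Joint term for each respondent and class: prior_k * prod_j P(Y_ij | C_i = k)
    joint = class_priors * np.exp(log_cond)

    # Marginal likelihood P(Y_i): sum of the joint terms over classes
    likelihood = joint.sum(axis=1)

    # Posterior class membership probabilities via Bayes' rule
    posteriors = joint / likelihood[:, None]
    return likelihood, posteriors


# Toy example: two classes, two items, two respondents
likelihood, posteriors = lca_posteriors(
    Y=[[1, 1], [0, 0]],
    class_priors=[0.5, 0.5],
    item_probs=[[0.9, 0.9], [0.1, 0.1]],
)
modal_class = posteriors.argmax(axis=1)  # most likely class per respondent
```

Each row of `posteriors` sums to one, and taking the row-wise maximum gives a modal class assignment of the kind that could then serve as the target for supervised classifiers.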
Models were evaluated using accuracy, precision, recall, F1-score, confusion matrices, and AUROC. McNemar tests assessed differences between models. Feature importance analysis, permutation analysis, and SHAP analysis ranked features by contribution, identifying key predictors. This approach enabled robust classification of complex multimorbid subgroups and identification of determinants, supporting targeted health interventions.
1. Multinomial Logistic Regression (MLR): A generalized regression model for predicting categorical outcomes with more than two classes.
2. Multinomial Naive Bayes (MNB): A probabilistic classifier assuming conditional independence of features given the class.
3. Decision Tree (DT): A non-parametric tree-based model that splits data based on feature thresholds.
4. Random Forest (RF): An ensemble of decision trees using bootstrap aggregation to reduce variance and improve predictive accuracy.
5. Extreme Gradient Boosting (XGB): A boosting algorithm that sequentially combines weak learners to minimize prediction error.
6. Artificial Neural Networks (ANN): A shallow machine learning model with input, hidden, and output layers that captures complex nonlinear relationships.
Evaluation Metrics: Models were evaluated using standard classification metrics.
1. Accuracy is the proportion of all cases, positive and negative, that the classifier labels correctly.
\[ \text{Accuracy} = \frac{TP + TN}{TP + FP + FN + TN} \tag{1} \]
2. Precision is the proportion of true positives among all cases predicted as positive (true and false positives).
\[ \text{Precision} = \frac{TP}{TP + FP} \tag{2} \]
3. Recall, also known as sensitivity, is the proportion of actual positives that are correctly classified as positive.
\[ \text{Recall} = \frac{TP}{TP + FN} \tag{3} \]
4. The F1 score is the harmonic mean of precision and recall; it balances the two by giving higher weight to the lower value.
\[ \text{F1-Score} = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \tag{4} \]
Here TP (true positives) is the number of cases correctly predicted as positive; TN (true negatives), correctly predicted as negative; FP (false positives), incorrectly predicted as positive; and FN (false negatives), incorrectly predicted as negative.
Area Under the Receiver Operating Characteristic (AUROC): Measures the ability of the model to discriminate between classes, calculated as the area under the plot of True Positive Rate vs. False Positive Rate.
Feature Importance: To interpret model predictions, we applied the following:
- Random forest feature importance, based on the mean decrease in impurity for each feature.
- Permutation importance, which measures the change in model performance after randomly shuffling feature values.
- SHAP (SHapley Additive exPlanations), which quantifies the contribution of each feature to individual predictions.
Statistical Tests and Decision Analysis: To validate model predictions and assess relevance, we used the following:
- Paired t-test: Compares mean differences in continuous variables across paired samples (e.g., predicted vs. observed probabilities) to assess significant changes.
- McNemar test: Compares classification outcomes of two correlated classifiers on the same dataset, testing if performance differs significantly.
- t-test: For evaluating differences in continuous risk factors between classes.
- Decision curve analysis (DCA): Evaluates net benefit across a range of threshold probabilities, helping to identify the most useful predictive model.
These statistical tests and decision curve analyses allow us to determine whether ML models reliably distinguish between latent classes, identify significant determinants, and support decision-making for targeted interventions. This methodology aids accurate identification and clustering of complex multimorbid populations while informing health systems through interpretable and reliable ML models.
The performances of the different machine learning models are not satisfactory.
Your research contributions are not satisfactory, except some analysis.
Please find below the new results:
Performance of machine learning models in classifying the risk of single, multiple, and complex cardiometabolic multiple morbidities
Table 2 here
Figure 4 here
Among the 6 ML models, RF (AUROC=0.805, 95% CI [0.800, 0.809]) outperforms all the models by model explainability (Table 2, Figure 4). XGBoost (AUROC=0.773, 95% CI [0.769, 0.777]) is the next best model according to AUROC (OvR). The worst-performing models are the base models, MLR and MNB. RF is considered the best classification algorithm when designing the DSS architecture to disaggregate each complex morbid cluster and design an equitable service delivery framework.
Table 3 here
Pairwise model comparisons under the McNemar test show that the performance of each of the 6 models differs significantly from that of every other model (p < 0.0001), indicating disagreement in model performance (Table 3). Additionally, the t statistics support the McNemar test results (p < 0.0001) (Table 3). The McNemar statistic, being a test of paired proportions, shows that each pair of classifiers disagrees significantly. From the t statistics, it is also evident that when RF is compared with any other model, the t value is consistently negative when RF is the second model in the comparison and positive when RF is the first, indicating that RF differs significantly in performance from every other model and is the best of them.
Table 4 here
In addition, as per decision curve analysis, RF and XGB provide higher net benefit across a wide range, implying higher usability as decision support (Table 4).
Moreover, you have not compared your results with those of others.
It is now added in detail in the discussion section, under different subsections.
Reviewer 2
Referencing should be uniform throughout the paper (lines 40, 46, 58, 82, 96, and any others). You cite Rosenkilde et al. (30) in line 82 and (WHO, 2020) in line 96. Which one is correct? Please make the style uniform and follow the journal guideline.
Apologies again. Corrected.
Line 120 needs to be rewritten as "A prospective experimental study by.... tested the link between loneliness and the onset of T2D symptoms using data from the Danish National Health Survey (ref) which includes 465290 participants older than 16 years."
Added as “A prospective experimental study by Rosenkilde et al. (2024) tested the link between loneliness and the onset of T2D symptoms using data from either Danish Health and Morbidity Survey (Jensen et al., 2019) or the Danish National Health Survey (Christensen et al., 2022) between 2000 and 2017 which includes 465290 participants
Attachments
Submitted filename: Point by Point response.docx
https://doi.org/10.1371/journal.pone.0335676.r002
15 Oct 2025 Decision Letter - Chiranjivi Adhikari, Editor Classifying Complex Multimorbidity Using Latent Class Analysis and Machine Learning to Generate Insights into Clustering of Mental and Cardiometabolic Conditions PONE-D-24-56938R1 Dear Dr. Moumita Mukherjee, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support . If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Chiranjivi Adhikari, MPH, MHEd., PhD Candidate Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: https://doi.org/10.1371/journal.pone.0335676.r003
Formally Accepted
Acceptance Letter - Chiranjivi Adhikari, Editor PONE-D-24-56938R1 PLOS ONE Dear Dr. Mukherjee, I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team. At this stage, our production department will prepare your paper for publication. This includes ensuring the following: * All references, tables, and figures are properly cited * All relevant supporting information is included in the manuscript submission, * There are no issues that prevent the paper from being properly typeset You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps. Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing. If we can help with anything else, please email us at customercare@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Mr. Chiranjivi Adhikari Academic Editor PLOS ONE https://doi.org/10.1371/journal.pone.0335676.r004

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio.