Peer Review History
| Original SubmissionDecember 3, 2025 |
|---|
|
-->PONE-D-25-64660-->-->GBCapsNet: A calibrated capsule network for automated gallbladder disease diagnosis via ultrasound imaging-->-->PLOS One Dear Dr. KS, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Feb 20 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:-->
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Maria Y Pakharukova, Ph.D., D.Sc. Academic Editor PLOS One Journal Requirements: -->1. When submitting your revision, we need you to address these additional requirements.-->--> -->-->Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at -->-->https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and -->-->https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf-->--> -->-->2. Please note that PLOS One has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.-->--> -->-->3. Please provide a complete Data Availability Statement in the submission form, ensuring you include all necessary access information or a reason for why you are unable to make your data freely accessible. If your research concerns only data provided within your submission, please write "All data are in the manuscript and/or supporting information files" as your Data Availability Statement.-->--> -->-->4. One of the noted authors is a group or consortium. In addition to naming the author group, please list the individual authors and affiliations within this group in the acknowledgments section of your manuscript. Please also indicate clearly a lead author for this group along with a contact email address.-->--> -->-->5. Please amend either the abstract on the online submission form (via Edit Submission) or the abstract in the manuscript so that they are identical.-->--> -->-->6. Please include a new copy of Table 9 in your manuscript; the current table is difficult to read. Please follow the link for more information: https://journals.plos.org/plosone/s/tables-->--> -->-->7. We note that Figures 1, 2, 6, 7, 8 and 9 in your submission contain copyrighted images. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.-->--> -->-->We require you to either present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or remove the figures from your submission:-->--> -->-->a. You may seek permission from the original copyright holder of Figures 1, 2, 6, 7, 8 and 9 to publish the content specifically under the CC BY 4.0 license. -->--> -->-->We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:-->-->“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”-->--> -->-->Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission. -->--> -->-->In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”-->--> -->-->b. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.-->--> -->-->8. Please upload a new copy of Figures 1 – 9 as the detail is not clear. Please follow the link for more information: https://journals.plos.org/plosone/s/figures-->--> -->-->9. If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions--> -->Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. --> Reviewer #1: Partly Reviewer #2: Partly ********** -->2. Has the statistical analysis been performed appropriately and rigorously? --> Reviewer #1: No Reviewer #2: I Don't Know ********** -->3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.--> Reviewer #1: Yes Reviewer #2: No ********** -->4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.--> Reviewer #1: Yes Reviewer #2: No ********** -->5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)--> Reviewer #1: 1. The research problem is relevant and well aligned with current trends in AI-based image classification. 2. The novelty of the proposed method should be more clearly articulated in the introduction. 3. The methodology section requires additional technical details for reproducibility. 4. Dataset description, including class distribution and preprocessing steps, is insufficient. 5. Experimental evaluation relies heavily on accuracy; additional metrics are recommended. 6. Comparison with recent state-of-the-art methods should be expanded. 7. Figures need clearer captions and better integration with the text. 8. Statistical validation of results is missing and should be included. 9. The literature review can be strengthened with more recent references. Reviewer #2: Major revision recommendation, with potentially another round of revisions: Thank you for submitting this work. I genuinely see potential here: combining a capsule network approach with explicit model calibration for gallbladder disease classification is directionally interesting, and the aim of producing more trustworthy confidence estimates is clinically relevant. Including short training times and near real time inference times makes it more applicable for clinical integration. The inclusion of a practical deployment concept (web interface) is also a strength. That said, in its current form I cannot recommend acceptance. The manuscript needs a substantial rewrite for clarity and scientific rigor, and I think it will likely require at least two review rounds, plus potentially additional experiments, before the claims can be considered reliable. If the authors can address the core methodological and reporting issues below, I will gladly reconsider my assessment. Strengths I see: - The topic is clinically relevant: gallbladder ultrasound interpretation is challenging and automation could be useful. - The manuscript attempts not only classification but also calibration, which is important for clinical decision support and often ignored. - The related work section includes many relevant studies and could become a strong context section with better framing. - The idea of a deployable interface is promising, but it must be presented as a demonstrator and supported with reproducible implementation details. - Including training and inference details strengthens the usecase in a clincial setting. Major issues that must be addressed: - Possible data leakage and inflated performance (overfitting - critical) - You report 10,692 ultrasound images from 1,782 patients. If train test splits were done at the image level, images from the same patient may appear in both training and test sets, which can massively inflate accuracy and AUC. - The near perfect results (confusion matrix essentially perfect, AUC of 1.0 per class) strongly raise concerns about overfitting or leakage. Concrete required actions - State explicitly whether the split was performed by patient ID (group split), not by image. - If not patient level, rerun all experiments with patient level splitting (ideally stratified by class). - Provide the split lists (patient IDs and image filenames) in supplementary material for reproducibility. Calibration procedure is unclear and likely not valid as reported - It is currently unclear how calibration was performed and on which split. The manuscript mentions multiple split ratios in different places. - Calibration must not be tuned on the final test set. Temperature scaling requires fitting the temperature parameter on a validation set and then reporting calibration on a held out test set. Concrete required actions: - Clearly describe the calibration method and protocol, including where the temperature parameter is fitted. - Make the split protocol consistent everywhere (manuscript abstract, submission abstract, methods, results, conclusion, tables). - Report calibration with proper separation of training, validation, test, or use nested cross validation. Baselines and comparisons are not sufficient for the strength of the claims - The paper focuses heavily on CNN limitations but does not provide strong, fair baselines trained and tested under the same protocol, table 9 tries, but it's unclear how this is done, e.g. retrained, tested at all on the same dataset, or compared with literature only. - The literature comparison table is not a substitute for a properly controlled baseline evaluation. Concrete required actions - Add at least one strong modern baseline (for example a recent CNN architecture, and optionally another family such as detection based models like YOLO if appropriate), trained and evaluated on the same dataset and the same split protocol. - If claiming advantages of capsule routing, demonstrate it with controlled comparisons and possibly ablations. Reproducibility is currently too weak, several essentials are missing, making it hard to evaluate or reproduce: - Dataset description is too minimal. Simply citing the dataset reference is not enough. - Lack of details on image sizes, resizing rationale, preprocessing, augmentations, random seed, and stability across runs. - No clarity on whether you used cross validation or repeated random splits. - Calibration code and implementation details are not described sufficiently, ideally the code is given, including that of the web interphase for a local test. - Figures (graphs and flow charts) are not readable at current resolution and therefore hard to judge. Concrete required actions - Expand dataset description in text: what exact ultrasound type is this (your own text suggests images from inside the gastrointestinal tract, which needs clear distinction versus standard transabdominal ultrasound and EUS). - State whether all images from the dataset were used. If not, list the exact subset used, maybe via supplementary materials in a json file, or a short download script that entials these, and why 'only' these and others might be disregarded. - Provide full training details (random seeds, preprocessing, resizing, normalization, augmentation, optimizer, batch size, epochs, stopping criteria). - Provide a GitHub repository or at minimum full supplementary materials enabling reproduction. - Replace low resolution figures with high resolution and preferably vector graphics, especially for Figures 3 and 5. Structure and scientific writing need a rewrite (IMRAD) - The manuscript currently mixes methods, results, discussion, and conclusion. - There is also no clearly separated Results section, and the limitations section is far too short. Concrete required actions - Rewrite to follow IMRAD: Introduction, Methods, Results, Discussion, Conclusion. - Move interpretive statements out of Results into Discussion (for example “very efficient and robust”). - Expand Limitations with the actual limitations implied by the study design, dataset composition, scanner diversity, and generalizability constraints. Specific concrete issues and edits I'd recomment Abstract issues - Sentence one is unclear. Clarify whether increased incidence, differential diagnosis difficulty, or both motivate automated diagnosis, and what “increased incidence” refers to (over years, in which populations, due to what). - Replace “humans have latency issues” with more accurate phrasing such as “manual interpretation can be time consuming and may delay treatment.” - Do not claim “first study” unless you can support it. If you keep it, soften to “to the best of our knowledge” and ensure it is accurate. - The results read like overfitting. Either investigate and report it, or temper the claims. - Explain how calibration was performed in the abstract or remove the claim that it improves trustworthiness if you cannot support it rigorously. - Avoid abbreviations in the abstract (at minimum check journal policy and ensure all are defined). - The trustworthiness and reliability statements are overstated. Your study may indicate this, but it does not prove it without broader validation. - Training and inference time are not interpretable without hardware details and image size information. Also, I would end the abstract with the main scientific takeaway, not runtime. Submission system statements and metadata inconsistencies - Funding statement is missing. Even if there is no external funding, state departmental funding or no funding explicitly. - Ethics statement: you should explain why it is not needed. You are using human image data, so at minimum state that all data are publicly available and de identified according to the dataset source. - Data statement: “not applicable” is inconsistent with the manuscript text stating where the dataset is available. The abstract in the submission system does not match the abstract in the manuscript PDF. Make them identical and consistent. Introduction and clinical framing problems - The “pear shaped” description is not strongly supported by the cited reference and can vary anatomically. - If you say “initial function”, either remove “initial” or discuss other functions. - Replace simplistic language such as “die” with medically accurate explanation (necrosis, infection risk, complications). - You describe ultrasound as “minimally invasive”. Standard transabdominal ultrasound is non invasive. EUS is minimally invasive. You must distinguish these clearly and ensure the dataset matches the modality you describe. - “Lack of radiation exposure” should be stated clearly as “no radiation exposure”. - Reference 13 is a preprint and does not sufficiently support a backbone claim. Use a peer reviewed source, preferably multiple. - When claiming “several articles”, cite several, ideally including more recent work and possibly a systematic review. - “Manual intervention” is unclear. Specify what is manual (image acquisition, interpretation, invasive procedure). - Several parts of the introduction read imprecise and generic. Check every statement for precision and whether the cited literature supports it. - Define AI and CNN before using abbreviations. - Mention years with “et al.” consistently (Author et al., Year), also in related works. - Clarify what “inherent constraints” are and how your objective addresses them. - “To enhance the true likelihood…” needs specificity. Which calibration techniques, and what is the planned pipeline. - “Please reconsider the sentence ‘Although deep learning models have achieved tremendous accuracy in automating of …’ because its placement and contribution are unclear. Either complete it, connect it directly to your study motivation, or remove it.” Authorship contribution statement - “All authors contributed equally” is difficult to believe. Please revise to a realistic contribution statement, e.g. 3 shared first and 2 shared last is kinda pushing it to the ultimate max. Previous works section - Several sentences are hard to read, missing punctuation, and sometimes lack references. - You call it “this study presents…” in a way that reads like a review paper. This is not a review paper, revise phrasing. - Many abbreviations appear without being defined (AGTO, BiGRU, etc.). - This section is strong in breadth, but needs better contextualization: modality type, dataset type, year of publication, and what is clinically standard versus what is experimental. - With such a strong related work base, it is natural to ask why the best models were not retrained or fine tuned as baselines, table 9 shows this but from the experiment section it's unclear how this was done. Methodology issues and requested clarifications - Provide more dataset detail in the manuscript, not only via citation. - Clarify image sizes, resizing choice, and discuss advantages and disadvantages of downsizing. Ideally test sensitivity to resolution. - Provide your splits in supplementary materials. - State whether cross validation or multiple random splits were used. If not, consider adding it to assess stability. - “Fully connected CNN layers” is confusing. Convolutional layers are not fully connected. Revise. - Provide parameter counts per layer and total, if possible (not mandatory, but helpful). - Explain the capsule block, routing weights, and “squash”. If it is standard and well known and already established within the AI/ML community then state that and cite the canonical source - however this weakens the claim of novelty. - The statement about “complex visual features” is vague. Either demonstrate it (example feature visualization or qualitative explanation) or remove. - Calibration methods remain under described without code or detailed protocol. - “Regarding reproducibility of routing and customization: you include Table 4, which may contain key details. However, as written there are still too many missing specifics for me to confidently repeat the method without substantial effort. Please revise Table 4 and the surrounding text so the routing customization is fully and unambiguously reproducible.” Figures must be higher resolution or vector. Currently some are not readable, especially where routing is described. Equation formatting: - clarify vectors and matrices (bold notation, consistent symbols) for Eq. 2 to 5. - Be clear what the formulas add, and refer back to them in discussion. Right now they do not meaningfully improve reproducibility due to missing implementation details, nor add to a discussion. - Routing by agreement explanation currently sounds like generic weighting. Explain what is actually different versus standard CNN decision making, and why it matters. Hardware and efficiency claims - If you report training and inference times, include hardware configuration and relevant data characteristics (image size, number of images, batch size, GPU or CPU type). Web interface section - The interface concept is valuable, but without code or an accessible demo it currently adds limited real world value. - Reframe this section as “clinical integration concept” and explain how it would fit a clinical workflow, clearly labeling it conceptual unless you provide an implementation. - Define OSI and fix the missing reference currently shown as “[?]”. Results, limitations, conclusion problems - There is effectively no dedicated Results section. Separate Methods and Results. - The limitations section is too short and does not reflect the real limitations implied by the current study. - The conclusion introduces Table 9 late and brings in new evidence. Introduce results earlier, then interpret in Discussion. - The conclusion mixes future work, discussion, and conclusions in a confusing way. - Suggestions about generalizing to other modalities are premature. First address scanner variability, population diversity, and real world generalization within the same modality, and state concretely how generalization will be tested. Concrete typos and consistency fixes (examples) - GBCaspNet appears once, should be GBCapsNet. - “biliary Artesia” should be “biliary atresia”. Fix inconsistent citations and numbering (Platt scaling reference mismatch). - Remove placeholder citation markers like “[?]”. - Fix punctuation issues like “Fig. 2..”. - Replace non scholarly references such as Medium posts for key technical claims. Closing recommendation: I recommend Major Revision. I see real potential in the direction and the intent, and I would be happy to reconsider after a substantial rewrite and a leakage free, reproducible evaluation. Right now, the reported near perfect performance combined with unclear split and calibration protocols, missing baseline comparisons, and insufficient reproducibility prevents a reliable assessment and does not justify acceptance in its current form. I'd also really appreciate a more IMRaD style article, especially since this is not a scoping review but an article with problem statement, methods, conclusion -> therefore it misses the results and discussion section. I truly hope the authors aren't discouraged as I see real potential, especially when providing the additional clarifications, potential experiments and easily reproducible code for other sites to profit of and eventually cite this article, and hopefully one day benefit healthcare. ********** -->6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.--> Reviewer #1: No Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] To ensure your figures meet our technical requirements, please review our figure guidelines: https://journals.plos.org/plosone/s/figures You may also use PLOS’s free figure tool, NAAS, to help you prepare publication quality figures: https://journals.plos.org/plosone/s/figures#loc-tools-for-figure-preparation. NAAS will assess whether your figures meet our technical requirements by comparing each figure against our figure specifications. |
| Revision 1 |
|
-->PONE-D-25-64660R1-->-->GBCapsNet: A calibrated capsule network for automated gallbladder disease diagnosis via ultrasound imaging-->-->PLOS One Dear Dr. KS, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by May 15 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:-->
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. As the corresponding author, your ORCID iD is verified in the submission system and will appear in the published article. PLOS supports the use of ORCID, and we encourage all coauthors to register for an ORCID iD and use it as well. Please encourage your coauthors to verify their ORCID iD within the submission system before final acceptance, as unverified ORCID iDs will not appear in the published article. Only the individual author can complete the verification step; PLOS staff cannot verify ORCID iDs on behalf of authors. We look forward to receiving your revised manuscript. Kind regards, Maria Y Pakharukova, Ph.D., D.Sc. Academic Editor PLOS One Journal Requirements: If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise. Additional Editor Comments: Reviewers who are experts in this field are concerned that not all questions have been carefully addressed in the current version of the manuscript. Please pay attention to the questions and make the necessary changes. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions -->Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.--> Reviewer #1: All comments have been addressed Reviewer #2: (No Response) ********** -->2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. --> Reviewer #1: Yes Reviewer #2: Partly ********** -->3. Has the statistical analysis been performed appropriately and rigorously? --> Reviewer #1: Yes Reviewer #2: No ********** -->4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.--> Reviewer #1: Yes Reviewer #2: No ********** -->5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.--> Reviewer #1: No Reviewer #2: Yes ********** -->6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)--> Reviewer #1: 1. The paper focuses on a meaningful problem and the motivation behind the work is clearly presented. 2. The proposed approach seems promising, but the explanation of the methodology could be made more simple and clear for better understanding. 3. Some parts of the paper are difficult to follow due to technical complexity, so improving clarity would help readers. 4. The dataset description is quite limited, more details about data size, sources, and preprocessing would improve reproducibility. 5. The experimental results are encouraging, but including more performance metrics would make the evaluation stronger. 6. The comparison with existing methods is good, but adding more recent models would strengthen the validation. 7. The contribution of each component in the proposed model is not fully clear and could be explained better. 8. The paper would benefit from a clear architecture diagram to visually explain the workflow. 9. There are some minor language and grammatical issues that should be corrected to improve readability. 10. The work is promising overall, and adding discussion on real-time implementation and future scope would further enhance its impact. Reviewer #2: I recommend major revision, or at minimum a more accurate alignment of the abstract and conclusion with the actual experimental evidence. Please note that my comments are only to here to improve the already great work, yet some are a must for me to support publication of the work. The authors have greatly improved readability by restructuring the manuscript, strengthening the limitations section, and acknowledging image-level splitting. However, my main concern remains unresolved experimentally: the data leakage risk is discussed in text but not addressed via controlled experiments, leaving the near-perfect metrics unchanged. These metrics are still emphasized in the abstract and conclusion including the superiority over other models. There are no controlled baselines, and the manuscript still makes comparative/outperformance claims by citing results from other papers without rerunning comparable experiments. More concerning than the missing baseline and leakage risk is that these limitations are not clearly reflected in the abstract and conclusion, which is misleading to readers and unfair to the cited works. Leakage and potential overfitting would almost certainly inflate the reported performance. The calibration protocol is clarified but remains hard to follow (and may contain contradictions); the editor can decide whether this has been addressed sufficiently. In addition, several important textual and referencing issues remain (see below), including ambiguous wording about “minimally invasive” ultrasound, a Medium blog post used as a core reference, and the continued equal contribution statement, which I find difficult to accept as warranted. These issues must be addressed honestly and clearly, including in the abstract and/or conclusion. Otherwise, I will have to recommend rejection in the next round. Level 1 (Critical) 1) Data leakage risk + unrealistic metrics must be addressed experimentally and transparently Image-level splitting is not patient-level splitting and can produce misleading performance. With ~6 images per patient (10K images, 1.7K patients), this will likely have a meaningful effect. This limitation must be clearly stated in the abstract and conclusion, not only in the body text. The reported performance (99.91% accuracy, AUC 1.0 across 9 classes) is exceptionally high for ultrasound classification, making the leakage/overfitting concern a methodological red flag. I recommend requesting patient identifiers from dataset creators (if possible) and at least providing full image IDs and split assignments (e.g., in GitHub and/or supplement) to support reproducibility. Consider an image-level similarity analysis to estimate this leakage, or benchmark on another dataset. At minimum, provide a quantitative discussion estimating the likely degree of leakage and its impact. 2) Unsupported “outperforms existing methods” claims (discussion & conclusion) The manuscript cannot claim the model outperformed existing gallbladder diagnosis methods based on Table 9, which compares accuracy values across different datasets, splits, tasks, and modalities. This is not a controlled comparison and does not support the stated conclusion (and this is central to PLOS ONE expectations). I do appreciate Table 5, which is a valuable contribution. 3) Add controlled baselines and uncertainty estimates Add at least one controlled baseline (e.g., ResNet-50 or MobileNet, readily available), fine-tuned on the same dataset, using the same split and same random seed, and report performance alongside GBCapsNet. Add confidence intervals by running multiple splits/seeds and reporting variability of outcomes. 4) Reproducibility and PLOS ONE code/data availability expectations While the dataset is named, the GitHub is not referenced within the manuscript, and the current setup is not sufficient for full reproducibility. PLOS ONE requires code availability stated in the manuscript. Add a clear statement and include explicit train/test split assignments (requested in round 1). Ideally provide this, and code, as supplementary material so it is linked to the DOI. Clarify whether the full dataset was used or only a subset. 5) Calibration protocol clarity and structure Clarify: what fraction of training data is held out for calibration, and how was it selected? Why is calibration not shown for the primary 90–10 split? Check abstract statements against Table 8 (they appear contradictory). Calibration methods belong in Methods, calibration outcomes in Results (currently there is overlap). Level 2 (Important) 6) Abstract/conclusion must reflect limitations and experimental scope If leakage/baseline experiments cannot be run, this must be reflected explicitly in the discussion, abstract, and conclusion. In the conclusion: separate (i) concise summary of results, (ii) honest interpretation/limitations, and (iii) future work. 7) Clinical/implementation section is too long and generic “Design considerations for clinical use” spans 2 pages but is largely generic (POCUS, DICOM, PACS, OSI networking). Condense to one paragraph or at most two in the discussion to improve focus and flow. 8) Ethics statement Even if formal ethics approval is not required, briefly explain the reasoning in the manuscript for stronger transparency, it could for example be below the data statement. 9) Optional but helpful: additional reporting Consider supporting Table 6 with a confusion matrix. Explicitly state whether data augmentation was used. Level 3 (Minor editorial corrections) 10) Terminology and factual consistency “Ultrasound is minimally invasive” is not generally correct; it depends on modality (e.g., EUS can be minimally invasive). Decide what applies here (EUBUS? transabdominal?) and use consistent terminology. OSI model: clarify whether you mean 5 layers vs 7 layers (currently confusing). 11) Abstract and language corrections “the findings suggest” → “the findings suggest” (plural agreement: findingS suggest). “capsule based” → “capsule-based” “Cholestrol” → “cholesterol” CNN layers: they are typically convolutional, not “fully connected” as stated (unless you refer to a specific head; please clarify). Author contributions: remove extra comma in “Chandrika, , A. Sai…” “Related work” is a better header than “previous works”. 12) References and citation quality Reference 19 (flagged in round 1) is still a Medium blog post; not acceptable as a core reference. Please replace with peer-reviewed works. Reference 2 is a preprint: acceptable if necessary, but label it as a preprint in references. Reference metadata issues: Ref 20 is NeurIPS 2017 (not preprint); similarly verify Ref 16 (MICCAI 2015), 17 (NeurIPS 2015), 46 (ICML 2017). Please double-check all references. 13) Formatting/consistency “Lecun normal” → “LeCun …” Figs 6 and 7 have identical captions; please differentiate them. I have seen great and honest improvements relative to the first submission. Many comments can be addressed quickly; others require additional baseline/controlled experiments and more time. If the additional experiments are not feasible, then the manuscript must honestly reflect these limitations, especially in the abstract and conclusion as this is the first part that is read by other researchers and often used for citations. Especially in the age of AI this is crucial. Under those conditions, I would be willing to support publication. ********** -->7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.--> Reviewer #1: No Reviewer #2: No ********** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] To ensure your figures meet our technical requirements, please review our figure guidelines: https://journals.plos.org/plosone/s/figures You may also use PLOS’s free figure tool, NAAS, to help you prepare publication quality figures: https://journals.plos.org/plosone/s/figures#loc-tools-for-figure-preparation. NAAS will assess whether your figures meet our technical requirements by comparing each figure against our figure specifications. |
| Revision 2 |
|
GBCapsNet: A calibrated capsule network for automated gallbladder disease diagnosis via ultrasound imaging PONE-D-25-64660R2 Dear Dr. KS, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Maria Y Pakharukova, Ph.D., D.Sc. Academic Editor PLOS One Additional Editor Comments (optional): Reviewers' comments: |
| Formally Accepted |
|
PONE-D-25-64660R2 PLOS One Dear Dr. KS, I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS One. Congratulations! Your manuscript is now being handed over to our production team. At this stage, our production department will prepare your paper for publication. This includes ensuring the following: * All references, tables, and figures are properly cited * All relevant supporting information is included in the manuscript submission, * There are no issues that prevent the paper from being properly typeset You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps. Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing. If we can help with anything else, please email us at customercare@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Maria Y Pakharukova Academic Editor PLOS One |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .