Peer Review History
| Original SubmissionJuly 15, 2025 |
|---|
|
PCOMPBIOL-D-25-01410 A multi-omics framework for survival mediation analysis of high-dimensional proteogenomic data PLOS Computational Biology Dear Dr. Ahn, Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jan 12 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter We look forward to receiving your revised manuscript. Kind regards, Chris Amos Guest Editor PLOS Computational Biology Ferhat Ay Section Editor PLOS Computational Biology Additional Editor Comments: This manuscript proposes a novel approach to mediation analysis for survival data and provides a very informative approach that uses an accelerated failure time (AFT) model for the analysis. For survival analysis of observational data from cancer studies, the AFT model is preferable to the usual Cox Proportional Hazards model because it does not make a proportional hazards assumption, which will be violated if there are substantive effects of a genetic factor on survival times. The manuscript is novel and important because it provides key theoretical developments for the application of AFT models in the study of expression data. That said, the reviewers all suggested major revisions. The key issue for these reviewers has been the assumption that the expression levels that are studied are normally distributed and uncorrelated with each other, with the lack of correlation being the more significant concern. In fact, most expression data shows clusters of highly correlated genes and the simulations need to at least evaluate what happens in the presence of some correlations and better would be to simulate clusters of correlated genes and evaluate the impact that will have on the analysis. Also, in the actual application to CPTAC only a single gene-mediator example is given. Since this is a multiomic method a much stronger and more illustrative example would be provided if the authors would evaluate more than one expression level jointly. The reviewers had other concerns about the method that should be addressed. All of them thought a major revision is needed but the significance and importance of this new method makes this an important contribution, if their points can be addressed. I apologize for delays in getting the review back but it was difficult finding reviewers for this important but complex contribution. Journal Requirements: If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise. 1) We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type u2018LaTeX Source Fileu2019 and leave your .pdf version as the item type u2018Manuscriptu2019. 2) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines: https://journals.plos.org/ploscompbiol/s/figures 3) Please amend your detailed Financial Disclosure statement. This is published with the article. It must therefore be completed in full sentences and contain the exact wording you wish to be published. 1) State what role the funders took in the study. If the funders had no role in your study, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript." 2) If any authors received a salary from any of your funders, please state which authors and which funders.. If you did not receive any funding for this study, please simply state: u201cThe authors received no specific funding for this work.u201d Reviewers' comments: Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: Review uploaded as attachment. Reviewer #2: This study presents a multi-omics survival mediation framework of significant methodological value, featuring a mathematically rigorous and innovative three-step workflow that effectively tackles high-dimensional data challenges and overcomes the limitations of traditional Cox models. This framework enables integrated genomic and proteomic analysis, addressing a critical gap in single-omics approaches and providing a generalizable paradigm that better reflects biological complexity. Furthermore, the model demonstrates high persuasiveness through comprehensive multi-scenario simulations, consistently showcasing superior statistical power and robust false discovery rate control compared to other methods. However, the study has several critical issues that require further refinement. 1. Validation of core model assumptions and clarification of applicability boundaries are insufficient: The outcome model of SMAHP is based on a parametric AFT model, assuming the error follows a normal distribution. However, the authors did not validate the validity of this distribution assumption in either simulated data or real-world data (CPTAC). If the actual data (e.g., survival time distribution) deviates from the normal assumption, it may lead to biased estimates of mediation effects. Additionally, the study did not investigate the applicability of the "proportional effect assumption" (linear associations between exposure and mediator, and between mediator and outcome). For example, it did not test for non-linear relationships between gene expression levels and protein abundances, nor did it explain the model’s robustness when interactions exist between variables. It is recommended to supplement residual distribution tests (e.g., Q-Q plots), non-linear relationship tests (e.g., adding quadratic terms), and clarify the limitations of the model when assumptions are violated. 2. Simulated data design is overly idealized and inconsistent with the characteristics of real-world data: In the simulation experiments, exposure variables (X) were set to follow a multivariate normal distribution with no correlations, and the association patterns of mediator variables (M) (e.g., 40% of M associated with 10% of X) were artificially predefined. This differs significantly from the characteristics of real multi-omics data (e.g., high co-expression among genes, heterogeneous associations between proteins and genes, and batch effects). Such an idealized design may overestimate SMAHP’s performance in real-world data. For example, if strong collinearity exists among exposure variables in real data, the variable screening performance of MCP-penalized regression may decline, but this scenario was not simulated. It is recommended to supplement simulation settings that better mimic real multi-omics data (e.g., introducing variable correlations, batch effects, and outliers) and re-validate SMAHP’s performance to ensure the method’s reliability in complex data scenarios. 3. Real-world data validation lacks depth and breadth: The current application is limited to single-cohort CPTAC head and neck squamous cell carcinoma (HNSCC) data (final sample size: 102 patients, after excluding 7 patients with missing overall survival (OS) data). External validation using multi-center, multi-disease data is lacking, making it difficult to demonstrate SMAHP’s generalizability. For example, the finding that "LCE3E mediates the effect of HMGB1P23 on OS" was not validated in other cancer types (e.g., lung cancer, breast cancer) or independent HNSCC cohorts, leaving open the possibility that this pathway is a cohort-specific occasional result. Furthermore, the application study did not extend the analysis to meet clinical practice needs—for instance, it did not explore the association between the mediation pathway and treatment response (e.g., chemotherapy sensitivity), nor did it validate the model’s performance in real-world data with small sample sizes or high censoring rates. It is recommended to supplement external validation with multi-cohort data (e.g., TCGA data, independent clinical study data), conduct analyses on multi-omics data from different disease types, and deepen result interpretation by integrating clinical indicators to enhance the method’s practical application value. 4. Robustness of causal inference assumptions is not fully demonstrated: The core assumption of causal mediation analysis—"no unmeasured confounding" (i.e., no omitted confounders affecting the associations between exposure and outcome, mediator and outcome, or exposure and mediator)—was not validated. The authors did not discuss potential unmeasured confounders in the CPTAC data (e.g., patient smoking history, alcohol consumption, comorbidities), nor did they perform sensitivity analyses (e.g., E-value calculation, simulating the effects of unmeasured confounders) to assess the impact of these factors on mediation effect estimates. Additionally, the study did not consider "interactions between mediators". In real biological systems, proteins often exhibit regulatory relationships, but SMAHP did not include such interaction terms, potentially missing key causal pathways. It is recommended to supplement sensitivity analyses to validate the rationality of the "no unmeasured confounding" assumption, or extend the analysis by incorporating mediator interaction terms into the model to improve the rigor of causal inference. Minor Comments 1. It is recommended to supplement the simulation experiments with an analysis of "the impact of variable dimensionality (p/k) on computational time" to provide readers with references for computational cost when selecting the method. 2. Figure 1 (causal pathway diagram) could be supplemented with 95% confidence intervals to intuitively illustrate the uncertainty of effect estimates. 3. The Discussion section should explicitly propose optimization directions for SMAHP in "scenarios with explosive growth in multi-omics data volume" (e.g., integration of parallel computing, dimensionality reduction algorithms) to provide insights for future research. Reviewer #3: The authors propose the SMAHP framework, which is a novel designed for high-dimensional causal mediation analysis integrating genomic, proteomic, and survival data. This addresses an important gap in existing methods which focus on single exposures, limited applicability to complex multi-omics data where multiple genes may work through multiple proteins to affect survival. The proposed three-step approach combining penalization, sure independence screening, and hypothesis testing is computationally feasible and demonstrates good performance in simulations under the conditions tested. The approach is technically sound, and the paper is generally well-written. However, the current manuscript would be strengthened by addressing several issues: 1. The authors indicate that the exposure variables (genes) are not correlated. By simulating independent genes, the authors test their method under a best-case scenario that likely overestimates performance. When genes are correlated, penalization methods like MCP would have to choose among correlated signals. This could lead to unstable variable selection and difficulty determining which gene is the true mediator versus simply correlated with it. Have the authors examined correlations among the top 100 genes selected from CPTAC data? If HMGB1P23 is highly correlated with other top-ranked genes, how confident can we be in the specificity of this finding? 2. The authors specifically select 100 genes and 200 proteins from pre-screening for SMAHP. Is the selection based on computational constraints or other considerations? Given that the starting dataset contains 60,669 genes and 9,469 proteins, reducing to 100 genes and 200 proteins seems to be a stringent threshold and could lose a true signal ranked just outside these cutoffs. Sensitivity analyses by repeating the analyses with alternative cutoffs could be helpful. 3. Based on my understanding, the method tests each protein mediator independently. However, proteins could have multiple weak sequential effects that combine to create a strong overall pathway. A gene can influence one protein and subsequently activates another protein, then affects survival. SMAHP tests each link separately and may find each individual effect too weak to reach significance even though the complete cascade has a substantial cumulative effect on survival outcome. While this limitation is shared by some other high-dimensional methods, the authors could further clarify this as a current limitation. 4. The authors mentioned that the proposed method identified a gene-protein pair associated with survival, specifically HMGB1P23 and LCE3E. To further elucidate the biological relevance of the identified HMGB1P23–LCE3E–survival pathway, the author could conduct additional analyses? For example, examining the correlation between HMGB1P23 gene expression and LCE3E protein abundance? Stratified survival analyses across joint expression strata of HMGB1P23 and LCE3E might also help to understand whether the mediation pattern translates into distinct survival outcomes. It would also be helpful to evaluate the robustness of the findings with other tests, which would strengthen confidence in the observed HMGB1P23–LCE3E mediation effect. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: None Reviewer #3: None ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy.. Reviewer #1: No Reviewer #2: Yes: Erping LongErping Long Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: While revising your submission, we strongly recommend that you use PLOS’s NAAS tool (https://ngplosjournals.pagemajik.ai/artanalysis) to test your figure files. NAAS can convert your figure files to the TIFF file type and meet basic requirements (such as print size, resolution), or provide you with a report on issues that do not meet our requirements and that NAAS cannot fix. After uploading your figures to PLOS’s NAAS tool - https://ngplosjournals.pagemajik.ai/artanalysis, NAAS will process the files provided and display the results in the "Uploaded Files" section of the page as the processing is complete. If the uploaded figures meet our requirements (or NAAS is able to fix the files to meet our requirements), the figure will be marked as "fixed" above. If NAAS is unable to fix the files, a red "failed" label will appear above. When NAAS has confirmed that the figure files meet our requirements, please download the file via the download option, and include these NAAS processed figure files when submitting your revised manuscript. Reproducibility: To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols
|
| Revision 1 |
|
PCOMPBIOL-D-25-01410R1 A multi-omics framework for survival mediation analysis of high-dimensional proteogenomic data PLOS Computational Biology Dear Dr. Ahn, Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by May 25 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. We look forward to receiving your revised manuscript. Kind regards, Chris Amos Guest Editor PLOS Computational Biology Ferhat Ay Section Editor PLOS Computational Biology Editor Comments : The authors have responded well to the reviewers' comments and this is an important and very detailed manuscript. Therefore, I am suggesting accepting pending the minor revision that the manuscript must include within the body of the text a statement about the availability of code. The response to the review states ' The SMAHP (all upper cases) R package is freely available in the Comprehensive R Archive Network (CRAN) repository (https://cran.r-project.org/web/packages/SMAHP/index.html). Please reach out to the corresponding author (Seungjun Ahn, seungjun.ahn@mountsinai.org) if you have any further inquiries about the R package.' but I do not see this comment included in the manuscript. Therefore, I do not know if this package would be accessible to the readers. Figure resubmission: While revising your submission, we strongly recommend that you use PLOS’s NAAS tool (https://ngplosjournals.pagemajik.ai/artanalysis) to test your figure files. NAAS can convert your figure files to the TIFF file type and meet basic requirements (such as print size, resolution), or provide you with a report on issues that do not meet our requirements and that NAAS cannot fix. After uploading your figures to PLOS’s NAAS tool - https://ngplosjournals.pagemajik.ai/artanalysis, NAAS will process the files provided and display the results in the "Uploaded Files" section of the page as the processing is complete. If the uploaded figures meet our requirements (or NAAS is able to fix the files to meet our requirements), the figure will be marked as "fixed" above. If NAAS is unable to fix the files, a red "failed" label will appear above. When NAAS has confirmed that the figure files meet our requirements, please download the file via the download option, and include these NAAS processed figure files when submitting your revised manuscript. Reproducibility: To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols |
| Revision 2 |
|
Dear Dr. Ahn, We are pleased to inform you that your manuscript 'A multi-omics framework for survival mediation analysis of high-dimensional proteogenomic data' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Chris Amos Guest Editor PLOS Computational Biology Ferhat Ay Section Editor PLOS Computational Biology *********************************************************** We just asked for some minor revisions to your paper. It seems like a very interesting and novel contribution |
| Formally Accepted |
|
PCOMPBIOL-D-25-01410R2 A multi-omics framework for survival mediation analysis of high-dimensional proteogenomic data Dear Dr Ahn, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. For Research, Software, and Methods articles, you will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Judit Kozma PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .