Peer Review History

Original SubmissionSeptember 10, 2024
Decision Letter - Shihua Zhang, Editor, Xiuwei Zhang, Editor

PCOMPBIOL-D-24-01544

Optimal transport reveals dynamic gene regulatory networks via gene velocity estimation

PLOS Computational Biology

Dear Dr. Zhao,

Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

​Please submit your revised manuscript within 60 days Feb 23 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below.

* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter

We look forward to receiving your revised manuscript.

Kind regards,

Xiuwei Zhang

Guest Editor

PLOS Computational Biology

Shihua Zhang

Section Editor

PLOS Computational Biology

Journal Requirements:

1) We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type u2018LaTeX Source Fileu2019 and leave your .pdf version as the item type u2018Manuscriptu2019.

2) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines:

https://journals.plos.org/ploscompbiol/s/figures

3) We notice that your supplementary Figures are included in the manuscript file. Please remove them and upload them with the file type 'Supporting Information'. Please ensure that each Supporting Information file has a legend listed in the manuscript after the references list.

4) Please ensure that the funders and grant numbers match between the Financial Disclosure field and the Funding Information tab in your submission form. Note that the funders must be provided in the same order in both places as well. State the initials, alongside each funding source, of each author to receive each grant. For example: "This work was supported by the National Institutes of Health (####### to AM; ###### to CJ) and the National Science Foundation (###### to AM)." State what role the funders took in the study. If the funders had no role in your study, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.".

If you did not receive any funding for this study, please simply state: u201cThe authors received no specific funding for this work.u201d

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The manuscript titled "Optimal transport reveals dynanic gene regulatory networks via gene velocity estirratiorl" presents an innovative approach to infer dynamic gene regulatory networks (GRNs) using optimal transport (OT) and gene velocity estimation from time-stamped single-cell RNA sequencing data. The method, OTVelo, integrates optimal transport theory with gene velocity estimates to model gene-to-gene interactions across time points. The authors successfully demonstrate the efficacy and accuracy of OTVelo on simulated, curated, and experimental datasets, offering valuable insights into gene regulatory dynamics during processes such as cell differentiation and reprogramming. The manuscript is well-written and the results are clearly presented, but there are several areas that could benefit from further clarification or elaboration. Below are some revision suggestions that may help improve the manuscript.

1. To improve the manuscript, it is suggested that the quality of the figures be optimized, as some of them appear unclear or low-resolution.

2. The authors should carefully verify the accuracy of the results presented in Figure 7, particularly the AUPR scores, which are below 0.64. These low scores may raise concerns regarding the model's performance, and ensuring the accuracy of the data could enhance the reliability of the conclusions drawn from the results.

3. It is suggested that the comparison of methods could be expanded. Including at least SCENIC, a widely used approach in gene regulatory network inference, would make the comparison more persuasive. This addition would provide valuable insights into the strengths and weaknesses of both methods and help highlight the advantages of the proposed approach.

4. It is suggested that the authors elaborate on the impact of different parameters (e.g., time lag, number of time points) on the performance of the OTVelo framework, potentially through sensitivity analyses or additional experiments.

5. It is suggested to provide a detailed user manual for the software available on GitHub, including instructions on how to run the program and explanations of the parameters. This would help users better understand the software and facilitate its application in different research settings.

Reviewer #2: This paper introduces OTVelo, a tool that uses fused Gromov Wasserstein optimal to estimate gene dynamics from time stamped scRNA-seq data and infer GRN edges using a correlation-based approach and a Granger causality approach. The manuscript is well written with clear description of method details. Open-source codes are provided with easy-to-follow instructions. OTVelo is compared with several other methods on both synthetic datasets and real datasets. My main concern is a relative lack of comparison and clarification on performance on real datasets, in addition to some minor questions about the setup of the method. Overall, I believe this is a novel approach with promising results and is worth publishing if the performance evaluation on real datasets could be improved.

Specific points:

1. It would be helpful to explore the stability of the inferred GRN with respect to the marginal distributions, which are set to uniform distributions in the current work. Some growth/death is expected in multi-time point scRNA-seq data as the time points are often relatively distant. For example, WaddingtonOT uses additional information to estimate a “growth rate”.

2. Could you also demonstrate how using FGW would improve the trajectory constructed using just Wasserstein as is done in WaddingtonOT? It seems counter-intuitive to me as the GW part would preserve the pairwise distances which might lead to missing branching structures.

3. Related to the point above, the quality of the inferred GRN highly depends on the accuracy of the reconstructed trajectory. More evaluations of the trajectory part would be helpful. For example, is it sensitive to choice of parameters including the weights in FGW and the strength of entropy regularization.

4. The better performance of OTVelo on simulated datasets is well demonstrated. However, the comparison to other methods on real data is rather insufficient. I suggest: (1) utilizing the real dataset benchmarks in BEELINE and in a more recent comparison paper: Stock, Marco, et al. "Topological benchmarking of algorithms to infer gene regulatory networks from single-cell RNA-seq data." Bioinformatics (2024): btae267.; (2) include more related methods, such as GRNBoost2; (3) if possible, also evaluate the performance on the task of find undirected edges and compare to methods like PIDC. For (3), I am fine if the authors find undirected edges irrelevant to the focus of the current work. In addition, it would be helpful to elaborate more on the evaluation results. For example, OTVelo outperforms SINCERITIES on signed AUPRC and underperforms SINCERITIES on other metrics including signed AUROC. In what application scenario is signed AUPRC more important than signed AUROC?

5. There is another work that applies OT to GRN inference, though with different approaches. It would be helpful to also survey this method in the introduction:

Lamoline, François, et al. "Gene regulatory network inference from single-cell data using optimal transport." bioRxiv (2024): 2024-05.

Reviewer #3: Inferring gene regulatory networks from gene expression data is an important and challenging problem for the systems biology community. The authors proposed OTVelo, a method for inferring gene regulatory relationships using time-stamped single-cell transcriptome data. OTVelo combines optimal transport and Granger causality, and also uses temporal trajectory to infer gene velocity, which is a meaningful and novel attempt. OTVelo shows comparable or slightly better performance than existing algorithms on several simulated and experimental datasets. Yet the authors also showed that the overall performance of all algorithms is not good enough. GRN inference remains a challenging task and deserves further efforts. To improve this paper, further discussion and tests may be needed.

1. According to Fig. 2 and Fig. 3, it seems that in cases where the network is more complex, the advantages of OTVelo would be more obvious. Can the authors further provide rationales behind such a phenomenon?

2. In Section 3.3.1, Fig. 6, and Fig. S8, the accuracy of gene regulatory networks derived by different algorithms should be quantitatively compared using ground-truth. The authors may quantify how well the regulatory relationships identified by the algorithm match the previously reported relationships. For example, identify how well key regulatory genes inferred by algorithms match with transcription factor activity in corresponding ATAC-Seq or ChIP-Seq data.

3. In Fig. 7, the overall performance of all algorithms is poor. It may not be necessary to overemphasize OTVelo's slightly higher AUPRC value in line 428. The authors may want to objectively discuss why GRN inference is challenging on these experimental data.

4. OTVelo appears to be limited to only some specific datasets. Experimentally derived single-cell transcriptome data not only contain cells in different cell cycles, but also different cell types. This may limit the scope of OTVelo. Can appropriate pre-processing, such as removing cell cycle effects and performing cell classification, help improve OTVelo?

5. PCA is a linear dimension reduction method, which works well when the gene dimension is not high. However, when the genetic dimension is increased and the main component is less significant, PCA will have limited performance. For example, in Fig. 9, PCA dimension reduction cannot reflect cell trajectories over time. Non-linear dimensionality reduction methods such as t-SNE or UMAP are recommended.

6. The authors emphasizes that OTVelo can infer gene velocity and reflect cell velocity on a dimension reduction map. Can the authors compare OTVelo with RNA velocity or methods alike on experimental data?

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: None

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

Figure resubmission:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. If there are other versions of figure files still present in your submission file inventory at resubmission, please replace them with the PACE-processed versions.

Reproducibility:

To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Revision 1

Attachments
Attachment
Submitted filename: ptp-response.pdf
Decision Letter - Shihua Zhang, Editor, Xiuwei Zhang, Editor

Dear Zhao,

We are pleased to inform you that your manuscript 'Optimal transport reveals dynamic gene regulatory networks via gene velocity estimation' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Xiuwei Zhang

Guest Editor

PLOS Computational Biology

Shihua Zhang

Section Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: There is no further question.

Reviewer #2: The authors have properly addressed all my comments.

Reviewer #3: The authors have addressed my previous comments. One minor issue: Line 482-483 should be described more accurately.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: None

Formally Accepted
Acceptance Letter - Shihua Zhang, Editor, Xiuwei Zhang, Editor

PCOMPBIOL-D-24-01544R1

Optimal transport reveals dynamic gene regulatory networks via gene velocity estimation

Dear Dr Zhao,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Zsofia Freund

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .