Peer Review History

Original SubmissionOctober 9, 2019
Decision Letter - Roberto Mantovani, Editor

PONE-D-19-28215

Constructing temporal regulatory cascades in the context of development and cell differentiation

PLOS ONE

Dear Dr. Daou,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by Feb 16 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Roberto Mantovani

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please upload a new copy of Figure 1 as the detail is not clear.

Please follow the link for more information: http://blogs.PLOS.org/everyone/2011/05/10/how-to-check-your-manuscript-image-quality-in-editorial-manager/

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors propose a novel approach to explore time-series gene expression data, integrating them with the underlying regulatory network. The idea of identifying time-point-specific regulators candidates from the expression profiles and then mapping them back to a background regulatory network in order to figure out temporal relationships and identify a small set of key regulators for each time-point is surely worth to be investigated. The two examples reported by the authors are also interesting, but perhaps a more comprehensive selection of use cases could add strength to the manuscript. A comparison with other tools to investigate temporal regulatory cascades would also be interesting. Unluckily, the implementation of the user front-end to the method and of the documentation seems still in a very preliminary phase, and this needs to be addressed before publication. Also, the methods used to draw the regulatory network could be somewhat refined.

Major:

- The design of the gene regulatory network is of fundamental importance for this method, but this aspect seems to be a bit overlooked in the manuscript. In particular additional discussion and implementation efforts should be devoted to:

--how is a promoter defined?

--Why only are Transfac PWMs used when there exist other libraries like Jaspar, with the additional advantage that they are not commercial?

--The PWM score threshold method to associate a TF to a given promoter is very blunt and could be refined. In any case, how is the threshold chosen? A simple reference to another paper is not sufficient for a matter of this importance for this method.

-- Why selecting mammals specific conserved regions (human, mouse, dog, and cow) genomes works better than regions conserved among vertebrates instead? Would it be possible, for example, to identify mammal-specific regulators comparing the results obtained with networks built using vertebrate conserved regulatory regions instead of mammals ones?

-- On the other hand, would it be possible to rely on chromatin states instead of conservation in order to identify functional promoter regions and how the results would change?

- A comparison of results with other methods for temporal gene networks adopting other approaches could help readers and potential users in understanding the advantages of this method. For example, the cascade R package [Jung et al.] could be used as a benchmark.

- The Neural and Cardyomyocytes examples reported by the authors are suggestive, but a somewhat more extensive selection of examples could be helpful. In particular, it would be interesting to see if the method can be applied with success also to non-human time series. For example, an interesting dataset could be the time-resolved transcriptome of C. elegans [Boeck et al.]. PWMs for C. elegans are also available in Jaspar.

- This is the weakest point of the manuscript. The implementation of the method is not usable in its current state. At the link that has been provided, there is a very blunt interface without any documentation. No information, for example, is provided on the format of the input file, no license, no terms of use, no tutorial or explanation on how to use it, and understand the output. If this is open software, it should be made available using standard repositories. There is no way to select the background regulatory network to be used, so it seems to works only for human data using the default background regulatory network built by the authors, but it would be much better to provide more topologies to users and maybe also let users provide their topologies.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

Revision 1

Responses to editor’s comments:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at:

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

We adjusted the names of the supplementary data files from “S.. Dataset” to “S.. File” and “S1 Figure” to “S1 Fig.”. We hope that it now fits the style requirements, as described in the referred templates.

2. Please upload a new copy of Figure 1 as the detail is not clear.

As pointed, Figure 1 was not clear, mainly because of the amount of information that rendered the details hard to read. So we decided to substitute the figure with two new separate figures Figure 1 and Figure 2. The manuscript was adjusted accordingly.

Responses to reviewer #1:

The design of the gene regulatory network is of fundamental importance for this method, but this aspect seems to be a bit overlooked in the manuscript.

We agree with the reviewer on the importance of the design of the gene regulatory network (GRN) used. However, since the focus of the present paper was on the construction of regulatory time cascades, which can be done on the basis of any GRN, we intentionally cut this aspect short. Including all technical details of its construction would also require more detailed considerations of data sources used, algorithms used for the detection of transcription factor binding sites, thresholds applied, etc. The network we constructed, described previously and used in a number of studies consistently proved functional. Its construction was a considerable effort, and to re-do this with another data source would preclude a timely publication of the results described in our paper without adding to its main points. As said, the method described can be applied using any other GRN; however, as suggested by the reviewer, we added the possibility for users to exchange the precomputed GRN by a customized one in case the user has one at hand . Of course, the results may differ. We expanded the background regulatory network section in the materials and methods to include the details of the construction of the network.

To address the questions raised by Reviewer #1 in more detail:

--how is a promoter defined?

Based on 49,344 RefSeq-annotated human transcription units (UCSC track refGene, Jan. 22, 2014), the -1kb upstream region was selected as a proximal promoter. The transcription start site (TSS) indicated in RefSeq was used as the reference point.

On the basis of pre-calculated whole-genome alignments provided by the UCSC (46_ WAY_MULTIZ_hg19) these promoter definitions were utilized to retrieve the sequence conserved regulatory regions between human (hg19), mouse (mm9), dog (canFam2) and cow (bosTau4). Afterward, gaps resulting from the multiple genome alignment were removed.

--Why only are Transfac PWMs used when there exist other libraries like Jaspar, with the additional advantage that they are not commercial?

TRANSFAC has been the first database about gene regulatory components and binding site models (PWMs) and still is the most comprehensive data source for these entities, also comprising the Jaspar collection.

--The PWM score threshold method to associate a TF to a given promoter is very blunt and could be refined. In any case, how is the threshold chosen? A simple reference to another paper is not sufficient for a matter of this importance for this method.

MATCH was used to predict potential TFBSs in the previously identified conserved promoter regions, based on all vertebrate defined matrices using the PWM library from TRANSFAC (release 2013.1, 1446 vertebrate matrices) . All matrices with default minFN threshold (minimize false negatives) were used in order to predict potential TFBSs that have at least the quality of an annotated TFBS in TRANSFAC. 1360 out of 1446 TRANSFAC-PWMs had a sequence-conserved TFBS prediction. We ranked all predicted TFBSs associated with each PWM, according to their MATCH score. We chose the best 5% predicted binding sites for each PWM and constructed the background transcriptional regulatory network accordingly. The PWMs are translated to human TF-gene names (HGNC-defined) using the TRANSFAC database. Each TF-gene, identified by its official HGNC-defined gene name, was represented as a node, with a directed edge connecting it with its target gene node.

-- Why selecting mammals specific conserved regions (human, mouse, dog, and cow) genomes works better than regions conserved among vertebrates instead? Would it be possible, for example, to identify mammal-specific regulators comparing the results obtained with networks built using vertebrate conserved regulatory regions instead of mammals ones?

This is certainly a valid suggestion by Reviewer #1, but identifying mammal-specific regulators was not in the scope of our study. When extending conservation to non-mammalian genomes, the number of conserved binding sites would drop considerably and would not help in interpreting, as in this study, human data. On the other side, including monkey or rat would not add since they are too close to human and mouse, resp.

-- On the other hand, would it be possible to rely on chromatin states instead of conservation in order to identify functional promoter regions and how the results would change?

Since chromatin states are highly dependent on the cellular context, corresponding data would always refer to individual cells / cell lines only and would not help in constructing a comprehensive GRN.

- A comparison of results with other methods for temporal gene networks adopting other approaches could help readers and potential users in understanding the advantages of this method. For example, the cascade R package [Jung et al.] could be used as a benchmark.

The unique type of the output of the TRC makes it difficult to accurately compare it to other existing methods, as no other method has the same definition of a regulatory cascade. However, we utilized the context-relevance of the GO enrichment of the gene sets predicted by other methods as a benchmark for the comparison. We considered the suggested Cascade R package for the comparison however we felt it was not properly maintained and subsequently, we couldn't manage to run it on the required dataset, and more importantly we needed a method that also combines some sort of a precomputed regulatory network to make the comparison fair. To compare our suggested template profiles to other profile possibilities, we first applied the STEM in order to predict the top 10 significant gene expression patterns in the cardiac differentiation dataset and evaluated the GO enrichment of the genes set associated with each of these profiles.The GO enrichment of these sets showed very general terms not specific to the cardiac differentiation context. Next, we applied iDREM, which we consider the closest method to the TRC in terms of inputs and aims, using the cardiac differentiation dataset and the regulatory network provided by iDREM (human¬ predicted¬1000), to generate a dynamic regulatory network. The resulting model was in the form of a dynamic regulatory map that highlights major bifurcation events, each of which has a list of associated regulatory genes. The GO enrichment of these gene lists showed a mild enrichment of developmental GO terms in some bifurcation points and no enrichment in most of the others. However, proving the validity of a generated network or cascade requires an actual experimental validation of the predicted regulatory interactions in the particular cellular context, which is currently unpractical. We added a paragraph in the discussion section of the manuscript, where we address the comparison part just discussed.

- The Neural and Cardyomyocytes examples reported by the authors are suggestive, but a somewhat more extensive selection of examples could be helpful. In particular, it would be interesting to see if the method can be applied with success also to non-human time series. For example, an interesting dataset could be the time-resolved transcriptome of C. elegans [Boeck et al.]. PWMs for C. elegans are also available in Jaspar.

The GRN constructed was specifically designed to support the reliable interpretation of mammalian, in particular human gene expression time series for biomedical research. Analyzing non-mammalian data would require a different GRN, as well as specific expertise and field of focus related to that species. We chose those examples where the validation of the results on the basis of existing knowledge was possible. However, with the added option of uploading their own regulatory network, the users can now explore different expression datasets and networks related to other species.

- This is the weakest point of the manuscript. The implementation of the method is not usable in its current state. At the link that has been provided, there is a very blunt interface without any documentation. No information, for example, is provided on the format of the input file, no license, no terms of use, no tutorial or explanation on how to use it, and understand the output. If this is open software, it should be made available using standard repositories. There is no way to select the background regulatory network to be used, so it seems to works only for human data using the default background regulatory network built by the authors, but it would be much better to provide more topologies to users and maybe also let users provide their topologies.

We agree with the reviewer’s points, and thus we adjusted the implementation of the webtool accordingly. We added a manual that explains, for example, the file formats and the underlying methods, workflows, and parameters. We also added a tutorial section as well as help buttons and icons that are intended to guide the user through webtool. And most importantly, we added the option that allows the user to upload his own regulatory network as well and use it as a background network for the analysis.

We also added supportive workflows, which we had in mind from the beginning; these workflows such as the co-expression analysis are not novel in their methodology and are based on classic methods. Thus we did not go into the description of these workflows in the paper, keeping the focus in the novel TRC method; however, in the manual , a detailed description of these workflows can be found. This creates an exploratory platform where a user can further explore different aspects of regulation and gene expression analysis. For this manuscript, the relevant workflow would be the “TRC analysis” workflow which can be found as the first option in the workflows page that appears after the user uploads his data or uses the sample data or built-in network. After choosing the TRC analysis workflow, the user is forwarded to a page where the parameters are adjusted accordingly.

Thank you again for your time and effort,

Best regards

Rayan Daou

Attachments
Attachment
Submitted filename: Response to reviewers.pdf
Decision Letter - Roberto Mantovani, Editor

Constructing temporal regulatory cascades in the context of development and cell differentiation

PONE-D-19-28215R1

Dear Dr. Daou,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Roberto Mantovani

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Formally Accepted
Acceptance Letter - Roberto Mantovani, Editor

PONE-D-19-28215R1

Constructing temporal regulatory cascades in the context of development and cell differentiation

Dear Dr. Daou:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Prof. Roberto Mantovani

Academic Editor

PLOS ONE

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .