Peer Review History

Original SubmissionMay 8, 2023
Decision Letter - Marc Robinson-Rechavi, Editor, Arne Elofsson, Editor

Dear Dr Rivas,

Thank you very much for submitting your manuscript "RNA covariation at helix-level resolution for the identification of evolutionarily conserved RNA structure" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

All three reviewers were extremely positive about your work. I must say that I rarely see such positive unanimity among reviewers, and I congratulate you on excellent research and writing. I am deciding for Minor revision to provide you with the opportunity to take into accoun the comments of the reviewers. None of these are compulsory, but I feel that several would improve your manuscript, and I hope that you will feel the same.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Marc Robinson-Rechavi

Academic Editor

PLOS Computational Biology

Arne Elofsson

Section Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this article, the author introduces an update to the R-Scape software package. Previous versions of R-Scape utilized a statistical test to determine the covariance of base paired RNA structures in the presence of variation (e.g., random mutation). This most recent update expands upon this idea, aggregating base pairs into helix-level structures and providing a measurement of their covariance. Overall, this was an excellent manuscript that presents a very useful new approach for evaluating the evolutionary impact of RNA structure on sequence and for aiding our understanding of RNA structure/function. The author performed extensive testing of the new version of R-scape including and in-depth analysis of lncRNA structure, which provides further evidence in support of her previous findings regarding the significance of proposed covariation in lncRNAs. I think this is important work as it pushes for more rigor and care in the use of evolutionary arguments for RNA structure/function and will be of wide interest to the readership of PLoS Computational Biology. I have only a few comments below.

The author argues provides evidence that the new helix aggregated metrics are more robust than the base-pair metrics. It would be helpful if they could provide suggestions or guidance to readers on what the roles of base pair vs helix level metrics should be: i.e., should each metric be calculated and assessed or are helix level metrics to replace base pair level ones? Are there cases where base pair level metrics could be better?

When the author writes, “I observe that all methods except for Šidák report higher sensitivity for the same number of false positives than taking as supported all helices with at least one covarying base pair.” do they mean that the number of false positives doesn't change with different values of k but sensitivity does, or that sensitivity divided by false positives grows?

Minor:

The manuscript is heavily written in first-person, maybe “depersonalize” it a little bit.

Figure 2 (a, b) has a space but (c,d) does not

The light blue font is a bit hard to read on Fig. 2.

A few noted minor grammar issues: e.g.

“The Lancaster weights allow (one) to incorporate additional”

“four aggregation[s] methods”

“Two structure[s] ha(ve) been proposed”

"explore a[n] structural function"

Reviewer #2: In Elena Rivas PhDs' research article, "RNA covariation at helix-level resolution for the identification of evolutionarily conserved RNA structure," she presents a novel methodology for recognizing RNA structures that have been evolutionarily conserved. The paper highlights a new technique for assessing statistically significant covariation at the RNA helix level, which is achieved by combining the significance and power of covariation calculated at the base pair level. Rivas begins by discussing R-scape, a tool developed to identify base pairs in RNA sequence alignments that covary beyond phylogenetic expectation. R-scape traditionally treats base pairs as independent entities, but Rivas highlights the fact that RNA base pairs do not exist in isolation—they stack together to form helices, which together create the complete structure of the RNA. It's these helix-forming Watson-Crick base pairs that are believed to carry most of the covariation signal in an RNA structure.

Introducing a new covariation measure that aggregates these signals at the helix level, Rivas compares its performance to the conventional base-pair level approach. She demonstrates that this new technique improves sensitivity in detecting evolutionarily conserved RNA structure without compromising specificity. The paper then discusses the concept of p-value aggregation, where a group of base pairs with borderline covariation evidence may collectively yield a more robust helix signal. This is achieved by aggregating the p-values of individual marginally significant pairs. Rivas asserts that this aggregation process doesn't decrease sensitivity; even a single base pair with a strong covariation signal can make the entire helix significant, irrespective of the significance of other base pairs in the helix.

Rivas also establishes a benchmark to evaluate the performance of the helix-level significant covariation test. This benchmark includes real structural RNAs, synthetic structural RNAs, and decoy RNA alignments that lack structure. The results of the R-scape covariation analysis are combined and ranked by helix E-value, and the ranked list of helices is used to plot the fraction of true positive helices detected against an increasing threshold of mean false positives per query alignments.

Finally, Rivas reappraises the evolutionary evidence for a subset of long non-coding RNAs (lncRNAs) using the new helix-level approach. This evaluation uncovers an artifact that arises when covariation is employed to build an alignment for a hypothetical structure, which is then tested for whether its covariation significantly supports the structure. This helix-level reassessment strengthens the argument against these lncRNAs having a conserved secondary structure.

All of these results are important and impactful steps forward for RNA researchers.

Minor Points:

Helix level conservation is a powerful way to build consensus structures (https://academic.oup.com/bioinformatics/article/22/24/2988/208807), does this metric aid R-scape in predicting new structures, either with cacofold or otherwise?

Now that R-scape is moving towards helix-level statistics, is there any chance covariation/consistency metrics (like the previously misused RAFS metric) can be reevaluated for their utility?

Didn’t proofread extensively but found minor typos, like “mayority” instead of “majority” on page 9 for example.

Reviewer #3: The author is looking to extend the work presented in Rscape on statistical identification of evolutionarily conserved RNA structure from the base pair level to the helix level, as RNA base pairs do not occur in isolation.

The author explores four statistical methods to aggregate p-values: Fisher, Lancaster, Sidak, and a weighted Fisher approach.

The p-value aggregation approach is hypothesized increase sensitivity, and the author proposes a benchmark to assess whether that is associated with a loss of specificity, using real and synthetic RNA datasets, respectively with positive examples and decoys.

The author observes that sensitivity/specificity is increased in structured RNAs, offering higher sensitivity with lower false positive rates. As an application, the author looks at potentially conserved structured in well-known long non-coding RNAs, uncovering more precise information than was previously available with state of the art tools.

This paper is well structured, with a well defined objective and hypothesis, transparent and logical methods, clearly presented results and relevant exploratory applications. In particular, the construction of the decoys for the sensitivity/specificity benchmarks is very elaborate and strongly contributes to making the results more convincing.

My only suggestion would maybe suggest discussing why three of the four methods appear to perform very similarly (and extremely well) whereas sidak seems to underperform a bit, as this result raises some curiosity in a non-expert eye. While this is not necessary for the results presented to be convincing, it may help the discussion.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Walter Moss

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

References:

Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Revision 1

Attachments
Attachment
Submitted filename: response.pdf
Decision Letter - Marc Robinson-Rechavi, Editor, Arne Elofsson, Editor

Dear Dr Rivas,

We are pleased to inform you that your manuscript 'RNA covariation at helix-level resolution for the identification of evolutionarily conserved RNA structure' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Marc Robinson-Rechavi

Academic Editor

PLOS Computational Biology

Arne Elofsson

Section Editor

PLOS Computational Biology

***********************************************************

Formally Accepted
Acceptance Letter - Marc Robinson-Rechavi, Editor, Arne Elofsson, Editor

PCOMPBIOL-D-23-00725R1

RNA covariation at helix-level resolution for the identification of evolutionarily conserved RNA structure

Dear Dr Rivas,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Lilla Horvath

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .