Peer Review History
| Original SubmissionJuly 30, 2020 |
|---|
|
PONE-D-20-23843 Similarity between mutation spectra in hypermutated genomes of rubella virus and in SARS-CoV-2 genomes accumulated during the COVID-19 pandemic PLOS ONE Dear Dr. Gordenin, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. 1) Make the suggested minor changes requested by the reviewers. While the additional data analysis suggested by Reviewer #1 might strengthen the manuscript, it is not essential for acceptance for publication (in case it is challenging or technically difficult). 2) I want to emphasize that novelty is not a central criterion for publication in PLoS ONE - hence I would like to encourage you to not put too much focus on what is novel (compared to previously published competing studies) but also present and discuss comprehensively aspects where there is overlap (as the tools employed differ between the studies). 3) Please contact me about anything that is unclear in the reviewers comments (or if there are difficulties in making the suggested changes), so that a I can help to speed up the revision process. Please submit your revised manuscript by Oct 24 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols We look forward to receiving your revised manuscript. Kind regards, Sebastian D. Fugmann, Ph.D. Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: N/A Reviewer #2: Yes Reviewer #3: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: In the manuscript “Similarity between mutation spectra in hypermutated genomes of rubella virus and SARS-CoV-2 genomes accumulated during the COVID-19 pandemic”, by Klimczak et al., the authors use bioinformatics and statistical tools to compare the mutational spectra of rubella and SARS-CoV-2 single-stranded RNA viruses. They extracted four main conclusions: 1) The mutational spectra are similar, “with C to U as well as A to G and U to C being the most prominent in plus-strand genomic RNA of each virus”; 2) “U to C changes universally showed a preference for loops versus stems in predicted RNA secondary structure”; 3) “C to U changes showed enrichment in the uCn motif, which suggested a subclass of APOBEC cytidine deaminase being a source of these substitutions”; 4) There was an “Enrichment of several other trinucleotide- centered mutation motifs only in SARS-CoV-2 - likely indicative of a mutation process characteristic to this virus.” The analysis is competent, and the presentation suitable for publication. However, as there is an extensive overlap between this manuscript and the recently published work of the group of Dr. Silvo Conticello (Di Giorgio et al., Sci. Adv. 2020; 6: eabb5813), it would be beneficial if the authors identify the similar and distinct conclusions concerning this work. Furthermore, besides the two additional references already cited in the manuscript, a recent paper by Dr. Simmonds (https://doi.org/10.1128/mSphere.00408-20) should also be referenced and discussed, particularly given its thorough analysis of the C to U changes in the Sars-Cov-2 genomes. The authors should engage in a thorough comparison between their data and the recently published results. To make the manuscript shine, it is also perhaps a good idea to highlight what seems to be the only aspect that the four published papers on the mutational spectra of Sars-Cov-2 have failed to notice: conclusion 2 (U to C changes universally show a preference for loops). Below we comment on each of the first three conclusions. The mutational spectra are similar Although this reviewer suspects that the contribution of mutations introduced by NGS is negligible, the authors should address this issue in the methods or results, instead of leaving it to the discussion of the G to U changes (Dr. Simmonds deals with this problem compellingly). The rational beyond the generation of the NoDups, NoDupsFunc, and NoDupsNonFunc is clear. However, by removing the information on the number of repeated mutations the authors could be missing hotspots. It should be possible to use the phylogenetic data to determine whether repeated mutations are due to independent mutational events or a shared founder effect. “…density values for specific base substitutions cannot be compared directly between two viruses because they were obtained from vastly different genome numbers”. Given that the major focus of the work is on the comparison between the rubeola and Sars-Cov-2 viruses, it is not apparent why the authors did not subsample the Sars-Cov-2 data to equilibrate the genome numbers and perform the comparison. The comparison may not work for lack of power, but that would be another issue. Figure 2. 1) The definition of the null hypothesis (“positive correlation between rubella and SARS-CoV-2 spectra”) is misleading. Since the P values are all below 0.05, and the authors conclude that there is a correlation, the null hypothesis must be “lack of positive correlation between rubella and SARS-CoV-2 spectra”. 2) There is a slight drop in the correlation when only silent mutations are considered. Could this mean that part of the correlation is explained by selection? More information is needed on the “ADAR score tool.” U to C changes universally show a preference for loops The authors seem to favor the scenario in which A3A mostly drives the U to C changes. Since the group of Conticello favors the action of APOBEC1, it would be useful to read a discussion on the strengths and weaknesses of each scenario. To boost the impact of the manuscript, the authors should also consider changing the title to highlight the possible role of APOBEC3A. Regardless of their decision on the title of the manuscript, it is essential to 1) clarify whether the preference for RNA loops is an exclusive feature of A3A or common to other APOBECs editing RNA and 2) deal with the fact that “none of the trinucleotide motifs containing C to U base substitutions showed loop or stem preference in selection-free SARS-CoV-2 NoDupsNonFunc filtered dataset.” Concerning this latter point, the authors could perhaps estimate how often the significance is lost when considering random subsets of NoDups with the same number of elements of NoDupsNonFunct. In addition, it might be useful to incorporate the number of independent mutations in the same position to increase the sample size and perhaps reach significance in the NoDupsNonFunct. Finally, an analysis of the frequency of codons prone to give rise to non-synonymous changes upon C to U mutation in the stem versus loop regions could help evaluate whether selection explains the bias toward loops. For instance, if the frequency of these codons is considerably higher in the loop regions, than the lack of significance in the NoDupsNonFunct could not be as problematic as it seems. C to U changes showed enrichment in the uCn motif It would be helpful to know what is the degree of overlap between the datasets of the studies already published and the data analyzed in this manuscript. In the introduction, this topic is presented in a very confusing manner. First, the authors focus on the motif preferred by APOBEC3G, which is only known to target (c)DNA. Then, they mention the target preference of APOBEC3A (A3A) and APOBEC3B (A3B) and another trinucleotide that is also a match. It would be better to remove APOBEC3G from the picture and refer the top three (DNA) trinucleotide preferences of A3A and A3B (as percentages). In addition, the authors could use the data available for APOBEC1and A3A to estimate if the trinucleotide motifs in RNA can be predicted from the trinucleotide motifs in DNA (https://doi.org/10.1038/s41467-020-16802-8 provides some hints). Minor issues A figure with a model would be useful. Furthermore, the authors could consider a complete map of the loop and stem regions along the Sars-Cov-2 genome. This may require some creativity due to scale issues, but for a quick reading any other form of data presentation is better than a table. The writing could be improved. There are several passages with weird punctuation or grammar that need editing (e.g., “Similarly, to what,” “with the null hypothesis that, there is,” “genome shown below,” “generating RNA mutation load in population of SARS-CoV-2 genomes”, “in individual isolates We de-duplicated”…) Reviewer #2: In this manuscript by Dmitry A. Gordenin et al. make effective use of published data sets to address a relevant and straightforward research question: Is there a similarity between mutation spectra in hypermutated genomes of rubella virus and in SARS-CoV-2 genomes accumulated during the COVID-19 pandemic. The genomes of six independent isolates of hypermutated 60 vaccine-derived rubella viruses provide a total of 993 mutations, as previously published. This relatively small data set was compared to a large collection of 32,341 whole genome sequence of multiple SARS-CoV-2 isolates (FASTA entries) were included in the mutation calling, of which 251,273 mutations were retained. Mutational patterning and motif searches suggest that the mutation mechanisms underlying hypermutation of the rubella vaccine virus are very similar or even identical to those of the SARS-CoV-2 viruses propagating in the human population with regard to the uCn APOBEC signature. An enrichment of several other trinucleotide-centered mutation motifs was also found in the SARS-CoV-2 . Despite the fact, that the latter have not been characterized in further detail and compared (limited rubella data set) this is a relevant and interesting study that deserves publication in its present form. Reviewer #3: Thank you for the opportunity to review the manuscript of Gordenin et al., on the comparative mutation-spectrum evaluation between two ssRNA viruses (Rubella and SARS-CoV-2), which I had previously read on bioRxiv. The strong points of this study are that the computational methods including the mathematical and statistical calculations that, as applied, make independent and diverse datasets comparable. On the other hand, this study has a few weaknesses with regards to the theoretical background and results explanations that I will summarise point-by-point below. Overall this is the first study that highlights the evolutionary impact of APOBEC and (very likely) ADAR editing of ssRNA viruses from the population-genetics perspective, and also the first one to link this to the mutability and infectivity of the pandemic-responsible SARS-CoV-2. I therefore recommend that the authors add in the discussion a paragraph that walks through a couple of such examples to connect the present study with previous ones. Other than that, this is an excellent study that should be published. Point-by-point comments Introduction • Lines 24-26: RdRPs may be a source of mutation, not a question, however as shown downstream in this work it seems that it’s rather of a “background-frequency” when SNVs are catalogued in aggregate. I suggest adding a “may” be a source of mutation in this sentence. • Lines 66-71: A U-to-C change on the (+) annotated strand of the ss-RNA viruses rubella and SARS-CoV-2 can only be an ADAR effect only if the complementing (-) strand is not from the same virus-genome, as the authors very correctly highlight in lines 72-74. It can perhaps be another RNA or DNA molecule (intermediate? Another viral genome copy?). When SNVs are called by alignment to single-stranded viral genomes the A-to-G on the “other side” which is formed through a loop will be still read as A-to-G. Which leaves an open question about potentially other modifications that may lead to U-to-C on RNAs. I recommend that the authors should have a look into this too (https://www.nature.com/articles/s41467-019-13525-3) and cite it as it highlights potential other sources for U-to-C changes on the mRNA, though not primarily focusing on that. In addition, the fact that A-to-G and U-to-C are probably not always ADAR-related is also supported by the data presented: in Figure 2 it shows that the A-to-G and T-to-C density values are never the close in the robust datasets (A and B panels). The same misconception is also published in reference [23]. (Lines 356 - 364) additional statistical data support this discrepancy. Results • Lines 246-251:This should move at the end of the introduction, between the two sentences ending and starting in line 90. Like this it will set the main goal of this study straight out and help the reader get on board faster. • Lines 257- 326: This part is very technical I suggest that it gets shortened in the results and be included in the Materials and Methods. • Lines 253-255: Please, provide a more self explanatory figure legend. Also in 339-346 legend Figure 2 explain the Dups, NoDups etc • Line 349: For accuracy please consider changing the term “genomic” RNA, which could be very confusing for the reader, and instead I recommend using something like “viral RNA genome”. • Lines 366-371: there is a good chance here to introduce the contribution of the RdRP errors up to the background level frequency. • Lines 402-404: This sentence is a bit self-contradicting. Since secondary structure effects can not be predicted, how can the U-to-C can be explained? Maybe these are cases where these changes are not ADAR-driven? It is confusing, please rephrase clearly. • Lines 424-426: please rephrase here since there’s a separate calculation and visualisation (Fig 4) for C-to-U, G-to-A, U-to-C, A-to-G. Another point that it’s worth keeping in mind is, that in the APOBEC motif sought by the authors it’s not impossible that different enzymes are targeting the (+) or (-) strand in the same double-stranded structure. For instance, the ds structure could be an RNA-DNA hybrid - (some APOBEC3s show preference to such hybrids). Discussion • Lines 498-500: this is a very intriguing point which can also be supported by the findings in reference [23] by the no-specificity in the ADAR motif (25% base composition per nucleotide in the flanking bases). Please, make it a separate paragraph and explain how this can work. Moreover, in Lines 500-501: please rephrase and explain in detail what you mean by “ADARs only in vivo”. Figures • All figures should be reformatted to a better quality. • Figure 1: the text should be minimized. • Figure 2: The panel titles (NoDupsNonFunc etc) should be better explained (also in Figure 3). Perhaps keep the one “dominant” panel (A,B,C?) that really summarizes the mutations in SARS-CoV-2 which can be immediately compared with the corresponding one of Rubella (D). Provide the rest both for SARS-CoV-2 and Rubella in the supplementary material (both for Figure 2 and 3). • Figure 3: in panels A,B,C there is a higher rate of background density from the least dominant mutations that seems higher than the one in Rubella (D). I presume that this is due to the different depth. However if this is an error rate by the polymerase, please calculate it and perform a statistical comparison (wilcoxon test - to consider the rank association of the background?) to Rubella. • Figure 4: please remove the 0 to achieve a less busy plot. Again, keep one of the NoDupsNonFunc, NoDups or Dups. Whichever you think is the most representative to be compared to Rubella (the rest put in the supplementary). And please then colour the bars the same way to highlight potential complementarity between C-to-U / G-to-A or A-to-G and U-to-C (which I can already see that there’s no complementarity to A-to-G and U-to-C. So maybe they aren’t really related?) Please discuss this in Lines 498 and 500. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: Yes: F Nina Papavasiliou [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
Similarity between mutation spectra in hypermutated genomes of rubella virus and in SARS-CoV-2 genomes accumulated during the COVID-19 pandemic PONE-D-20-23843R1 Dear Dr. Gordenin, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Sebastian D. Fugmann, Ph.D. Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: |
| Formally Accepted |
|
PONE-D-20-23843R1 Similarity between mutation spectra in hypermutated genomes of rubella virus and in SARS-CoV-2 genomes accumulated during the COVID-19 pandemic Dear Dr. Gordenin: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Sebastian D. Fugmann Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .