Peer Review History
| Original SubmissionJune 11, 2020 |
|---|
|
Dear Dr. Cole, Thank you very much for submitting your manuscript "REDCRAFT: A Computational Platform Using Residual Dipolar Coupling NMR Data for Determining Structures of Perdeuterated Proteins Without NOEs" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts. Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Alexander MacKerell Associate Editor PLOS Computational Biology Arne Elofsson Deputy Editor PLOS Computational Biology *********************** Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The authors present an update to their REDCRAFT software, which is designed to calculate the structure of proteins from residual dipolar coupling data. Previous iterations of REDCRAFT have been described in several publications: J Magn Reson 2008, J Magn Reson 2010, J Biomol NMR 2014, and JCTC 2016. From what I could gather, this manuscript presents the largest benchmark of REDCRAFT to date, which is commendable. I think, however, that the manuscript could benefit from an extensive reorganization to highlight the novel features of REDCRAFT. This current version reads often as if the algorithm has never been described. In addition, I could not find a mention to exactly what new features the authors included in the latest version of REDCRAFT. The authors also fail to provide a URL for either the source code or the binaries of the software. This manuscript seems to be a benchmark and demo of REDCRAFT's abilities on a variety of proteins. While I find this important, new developments in NMR software should be encouraged, I cannot recommend the article for publication without major revisions. In particular, the authors should trim the contents of the manuscript to focus on a simpler message: what is new in REDCRAFT. Otherwise, the authors should just say, here's our code tested in a large benchmark and compared to traditional methods. Finally, I believe that there is room for some extra experiments (all computational) that could bring some novelty and added value to the work. Below are my comments on each section of the manuscript that I hope can help the authors in their revisions: (1) Abstract / Summary - From the abstract and author summary, I cannot find what novel features have been added to this version of the software. The authors should make it very clear what is new in this version and why it matters to the users. - As much as I like NMR, of the structures deposited in the RCSB PDB in the last 5 years, 50127 were determined by X-ray crystallography, 4468 by EM, and 2074 by NMR (both ssNMR and solution). I would change the very first sentence in the abstract to state that there are 'three primary experimental means of characterizing macromolecular structures'. - I would also note that structure determination with/from RDC data is not recent. There are many papers from the early 2000s (so, nearly 20 years ago) describing applications and methods. - On line 40, the authors write that structure determination with RDCs is limited to small proteins 'using RDC data from many alignment media (>3) that cannot be collected from large proteins'. Then, on line 45, they state that REDCRAFT can be applied to large proteins using simulated RDC data from perdeuterated proteins. So, can we use RDCs for large proteins or not? (2) Introduction - On line 72, if you are going to be as specific as '11.649', then do not use 'nearly'. I'd say 'nearly 12.000'. - Regarding line 75, I would argue that NMR is not determining structures in their 'native' environment. The temperatures at which structures are determined and the concentrations used in the sample preparation are hardly comparable to 'native' conditions. Please remove the word 'native', as it induces the reader in error. - On lines 78-80, I would add molecular dynamics simulations to the list of methods used by NMR software to produce structures. CNS and XPLOR-NIH both use MD for example. I would also argue that not having new methods isn't necessarily bad. Using old forcefields and not refining structures in explicit solvent, that I'd agree with that it's bad! - On line 91, I disagree that the main limitation for studying large proteins with NMR is on the computational side, or with costs. Rather, in my experience, it's more to do with the spectral crowding and the inability to assign peaks productively. - I need a citation for the sentence at lines 92-93: RDCs as a promising alternative to NOEs. By the way, there's a typo in Residual on that sentence. - On line 107, the authors state that 'all the existing RDC-based structure determination approaches [...]', but wouldn't that include REDCRAFT, which was first published in 2008? - On line 113, as in other parts of the manuscript, the authors refer to their search/optimization strategy as 'novel', but I have read the same exact strategy in their 2008 paper. Please remove 'novel'. - On lines 116-117, the authors state the novelty of their work. Please make it easier to find! Do not bury the lede! (3) Results - On line 134, the authors make the assumption that because you can collect some RDCs for a (large) protein, you could collect more and thus pave the way to a RDC-only structure. I am not an expert in RDCs, but I would be very surprised if this is the case. Often, you can only get a few NOEs for a system. - On line 145 the authors state that they calculate RMSDs as a range, when there are missing fragments of the protein. Why not simply present the RMSD over the existing atoms? - When describing the data used for GB1 (lines 155 and below), the authors should state how many RDCs of each class they used in the final set of 209. - On Table 1, the Q factors for some structures are quite high. For example, the REDCRAFT GB3 structure has a range of 0.02 to 0.34, which according to the authors' own definition, corresponds to saying 'extremely good to bad' fit. SR10 has Q factors of ~0.7 and above, so, very poorly fit. Could the authors explain, or maybe illustrate with a figure, which regions are well fitted to the data and which are not? - On the same note, I would find it interesting to see the RMSD of the ensemble of final structures, as a measure of convergence of the structure determination process. - On a more general note, the number of RDCs picked for each case seems quite arbitrary. If I was developing a method like REDCRAFT, I'd like to know what is the lower limit (ballpark) of data I need to use to get a good structure. As the authors state throughout the manuscript, researchers often use 'an excessive' amount of RDCs. It would be interesting to pick one of these cases (or all of them, they are not so many), and take a random sample of 50 RDCs, 100 RDCs, 150 RDCs, ..., and calculate when the structures start to converge to a fold. Maybe there is a relationship to be found between protein size/type and the optimal number of RDCs? - The authors state on lines 205-206 that not using RDCs resulted in a Q factor of ~0.33, but using RDCs lowered it to ~0.28. Why such a small drop? I expected that optimizing the structure using RDCs and then comparing it with respect to how well it fits the data would yield much better Q factors. Can the authors comment on this? - On line 220: 20.000 rounds of Powell? To resolve minor van der Waals (typo in the manuscript, please correct with appropriate capitalization) clashes? From experience, 50 steps of Powell are more than enough to clear clashes. Does XPLOR-NIH even get to the 20.000th step or it just converges before? - Related to the structure determination of PF2048.1, a shift from 2.3 to less than 1.0A with minimization is quite substantial. I would wager that there is a lot that that EM step is doing in terms of optimizing the structure. (4) Discussion - On line 269: 'as little as 11% of the previously used data' is cherry-picking the lowest value. The average reduction is closer to 25%. (5) Methods - The section on RDCs is extremely length and unnecessarily detailed. Please summarize it to the bare minimum to understand the methods used in the manuscript and refer readers to reviews on the topic for more in-depth descriptions. - On lines 346-347 the authors state that the energy landscape of RDC data is too complex to be navigated by GD/MC methods. However, if we refer to their own example of PF2048.1, it seems that simple EM was able to refine that structure quite substantially, using RDC data. I agree with the authors that specialized algorithms to handle RDC data are necessary and could probably be better than those written in the 80s (CNS/XPLOR), but the data presented in the manuscript does not back this statement. - On line 352, the authors again refer to developments in 'recent years' and refer to REDCRAFT. Again, 2008 is 12 years ago, not necessarily recent. Please rephrase. - Line 354: remove novel from the description of the search algorithm. The current manuscript does not describe any new methodology or feature, so I assume the two-stage method is the same as the original REDCRAFT paper, again, of 2008. - On lines 369 and onwards, the authors describe how REDCRAFT is the only OOP software available to date. I hope they mean in the field of structural biology. OOP is only useful for developers and is quite tricky to get right. The authors rightfully state that one of the strengths of the paradigm is the ease of adding extensions to the original code. However, the examples they give (Polypeptide built from Atom and AminoAcid objects) are not novel nor unique to REDCRAFT. Biopython's PDB parser, written in 2003, has a similar hierarchical structure for its Structure representation (incidentally also using OOP). Without a very well-documented description of the API, the code base, the classes, their methods, it's nearly impossible for anyone to 'easily' extend REDCRAFT. Indeed, I see the authors have on their webpage a nice readthedocs page, but there is no description of the code/API. See the OpenMM documentation for a very good example of what you should aim for if you want to attract developers to extend your project. Reviewer #2: uploaded as an attachment ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: João Rodrigues Reviewer #2: No Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions, please see http://journals.plos.org/compbiol/s/submission-guidelines#loc-materials-and-methods
|
| Revision 1 |
|
Dear Dr. Cole, Thank you very much for submitting your manuscript "REDCRAFT: A Computational Platform Using Residual Dipolar Coupling NMR Data for Determining Structures of Perdeuterated Proteins in Solution" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations. Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Alexander MacKerell Associate Editor PLOS Computational Biology Arne Elofsson Deputy Editor PLOS Computational Biology *********************** A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately: [LINK] Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: I would like to start by thanking the authors for their extensive and careful responses to my many previous comments. This revised version of the manuscript is substantially better and clearer for the reader. The methods are also more clearly presented and the results much better organized. I would only subdivide the REDCRAFT methods section to highlight the new algorithms. Nevertheless, I am happy to recommend this work for publication. Reviewer #2: please see attached ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: João Rodrigues Reviewer #2: No Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-materials-and-methods
|
| Revision 2 |
|
Dear Dr. Cole, We are pleased to inform you that your manuscript 'REDCRAFT: A Computational Platform Using Residual Dipolar Coupling NMR Data for Determining Structures of Perdeuterated Proteins in Solution' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Alexander MacKerell Associate Editor PLOS Computational Biology Arne Elofsson Deputy Editor PLOS Computational Biology *********************************************************** |
| Formally Accepted |
|
PCOMPBIOL-D-20-01003R2 REDCRAFT: A Computational Platform Using Residual Dipolar Coupling NMR Data for Determining Structures of Perdeuterated Proteins in Solution Dear Dr Cole, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Alice Ellingham PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .