Peer Review History
| Original SubmissionAugust 18, 2020 |
|---|
|
Dear Dr Sharir-Ivry, Thank you very much for submitting your Research Article entitled 'Quantifying Evolutionary Importance of Protein Sites: A Tale of Two Measures' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by two independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review again a much-revised version. We cannot, of course, promise publication at that time. Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions. Yours sincerely, Jianzhi Zhang Associate Editor PLOS Genetics Kirsten Bomblies Section Editor: Evolution PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: In the manuscript “Quantifying Evolutionary Importance of Protein Sites: A Tale of Two Measures”, the authors reported the relationship between the evolutionary constraint of a site in protein and its conservation gradient. For a site, its gradient is the pearson correlation between the constraints on the other sites and their distances to the focal site. A stronger gradient of a site (larger correlation) indicates that its neighboring sites tend to have stronger constraints than the distal sites. The authors observed a linear relationship, suggesting that the sites with strong constraints also had strong conservation gradients. The authors concluded that such a relationship was likely due to residue-residue contacts among the sites etc. Particularly, the authors found that catalytic sites in enzymes had much stronger conservation gradients than the other sites, and these strong gradients could not be explained by the particularly strong constraints on the focal sites. Therefore, the authors concluded that the observation was likely caused by the catalytic function. Although the results and conclusions are interesting, I have several major concerns. Major points One main result of the manuscript is that the sites with strong constraints tend to have strong conservation gradients. This linear relationship is convincing and expected. It is known that neighboring residues are expected to be involved in the same biological function e.g. catalysis, binding, and protein stability etc., and thus tend to have similar constraints. For example, in the reference 13, the residues in close proximity to the strongly constrained residues also have strong constraints, whereas the distal residues are less constrained. Therefore, the strongly constrained residues are expected to have relatively larger gradients than the residues with weak constraints, resulting in the observed relationship. However, different from what the author claimed, I think the linear relationship is likely quite weak, given the large variance of each bin. And it worth reporting more details on the regression, e.g. R-square and whether only the median/mean of each bin was used for the regression, which artificially reduces the data variation. Another major result is that the catalytic residues have particularly strong conservation gradients. The authors concluded that the high conservation gradients are probably because the unique requirement for the active site to selectively stabilize the transition state of the catalyzed chemical reaction imposes additional selective constraints on the rest of the enzyme. However, the large gradients of catalytic sites may be because all the identified catalytic sites in a catalytic region have strong constraints, whereas taking the PPI sites as an example, it is known that the PPI interface is relatively large but only a few key PPI residues have strong constraints, and the neighboring sites have weak constraints. This renders even the key residues having low conservation gradients. In addition, catalytic sites tend to be buried rather than exposed on protein surfaces as other binding sites. Located at the core region of a protein further increases conservation gradients due to the paths from core (high constraints) to surface (very low constraints). In sum, without controlling for these factors, the high gradients of the catalytic sites may not be due to the intrinsic catalytic properties. It may be interesting to quantify the influences of all these factors using a simple lattice model. The functional sites, core sites and surface sites have their constraints sampled from respective constraint distributions to calculate gradients. The factors may include the size of the functional region in the protein which has core and surface regions, the average and variance of site constraints in the functional region, the location of the region (surface or core/grove) etc. Overall, the gradients are moderate or small, calculated using Pearson correlation. Spearman correlation robust to outliers may be necessary to confirm the discovery. Minor points The authors mentioned very briefly that their discovery is important to phylogenetic inference, accurate quantification of selective pressure at single-site resolution etc. Please discuss a bit more the details in the discussion. The authors used “long-range” conservation in the manuscript. It would be useful to define the long range e.g. up to 30A. However, there is a possibility that many of the general conservation gradients observed by the authors are mainly due to “short” range residues. In this manuscript, many sites from different proteins were pooled together to estimate an average dn/ds for these sites. Many of those sites may have quite different dn/ds. PAML may be used to test whether the sites in a protein have different evolutionary rates, and then estimate the rates respectively for the sites. The multiple groups of sites with different rates may be informative for the analyses. Reviewer #2: Overall, this is a nice contribution. However, I have one major concern: Most proteins have a natural conservation gradient from the outside to the inside. So any study trying to identify some alternative cause for a conservation gradient must very carefully control for this strong confounder. I don't think the present study does so. I would argue the present study doesn't even properly discuss this issue. To me, the key question is to what extent sites create a conservation gradient given where they are in the protein structure. The authors look at buried and exposed sites, but that's a very crude classification. A site can be buried but relatively close to the surface or right in the center of the protein, and these two sites will experience both different selection pressures and different conservation gradients. A good measure to assess how close a site is to the center of the protein core is the weighted contact number (WCN), using an inverse square distance weighting. In fact, WCN is literally a measure of centrality, rather than a measure of number of contacts. (As an aside, many authors in the field mis-understand this issue.) If the authors correlate WCN with conservation gradient, they should find a fairly strong correlation. Then, the authors can build a regression model that regresses the conservation gradient against both WCN and conservation rank. The degree to which conservation rank contributes to such a model is a measure of the intrinsic conservation rank a site generates, independent of where in the structure it is located. It may well be that if the authors perform this analysis, catalytic sites stand out even more. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: No: I think the authors should provide their raw data and analysis scripts. As is, the study is not reproducible. ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No |
| Revision 1 |
|
* Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. * Dear Dr Xia, Thank you very much for submitting your Research Article entitled 'Quantifying Evolutionary Importance of Protein Sites: A Tale of Two Measures' to PLOS Genetics. The manuscript was fully evaluated at the editorial level and by the two original peer reviewers. While one reviewer is fully satisfied, the other has a few comments that need to be addressed. We therefore ask you to modify the manuscript according to the review recommendations. Your revisions should address the specific points made by each reviewer. In addition we ask that you: 1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. 2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images. We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] Please let us know if you have any questions while making these revisions. Yours sincerely, Jianzhi Zhang Associate Editor PLOS Genetics Kirsten Bomblies Section Editor: Evolution PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: In this revision, the authors added more results and analyses, and the conclusions are more solid. However, there are still several concerns. In the manuscript, the authors emphasize “long-range” gradients. I commented on this before suggesting the authors to define long vs short. In this revision, the authors analyzed the residues within 30A. I feel that to support “long-range”, the author should have analyzed the residues beyond a certain cutoff (within a shell) for catalytic residues and other residues. I suggested that the strong correlation between constraint and conservation gradient may be due to the location of catalytic residues, i.e. close to the core of proteins. (The core-surface and catalytic function together lead to the high gradient). To address this comment, the authors used the numbers of contact residues to indicate the packing. However, catalytic residues (other ligand binding residues) may be in a groove (thus close to core) but have no contact residues. Number of contact residues may not be a direct measure for this purpose. Fig2 shows the relationship between the normalized conservation ranks of residues and their conservation gradients, for different types of residues, such as ligand binding sites, catalytic sites, allosteric sites etc. The conclusion is that the relationship for catalytic sites is quite unique. However, from the fig, it seems that this is likely because the catalytic sites have x range 0.95 to 0.65. For some other residue types, the relationships in this range seem similar to that of catalytic sites. The authors may need to add regression lines using only that x range. The r-square (cor squared) is quite small, indicating the fitting is at most moderate. For fig4, are the black dots “all residues (w/o functional sites)”? Their results are missing in panel B. The result of such residues can tell how much the constraints on residues alone influence the gradients. Conservation gradients depend on the relative residue constraints within each protein. The normalization used by fig1&2 is more reasonable than comparing dn/ds from different proteins. It seems that the normalized conservations can be used for those key analyses in fig3&4 with x changed accordingly. About the conclusion in DIscussion, I think the measures in this manuscript probably can not be informative for “de-novo functional site prediction and protein design”, because many functional sites, except catalytic sites, are similar to non-functional sites in terms of the measures. The following sentences may contain typos. When considering only the three most conserved residues from each functional site (Fig 4B), subset of residues exhibits lower evolutionary rates and higher conservation gradients compared with subsets from all functional sites residues (Fig 4A). Beyond the classical, “intrinsic” measure of conservation and the “extrinsic” measure which is conservation gradient the site exerts on the rest of the protein. Reviewer #2: Thank you for your careful revisions. I have no further comments. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No |
| Revision 2 |
|
Dear Dr Xia, We are pleased to inform you that your manuscript entitled "Quantifying Evolutionary Importance of Protein Sites: A Tale of Two Measures" has been editorially accepted for publication in PLOS Genetics. Congratulations! Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made. Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics! Yours sincerely, Jianzhi Zhang Associate Editor PLOS Genetics Kirsten Bomblies Section Editor: Evolution PLOS Genetics Twitter: @PLOSGenetics ---------------------------------------------------- Comments from the reviewers (if applicable): Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The authors addressed my comments ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No ---------------------------------------------------- Data Deposition If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website. The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-01275R2 More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support. Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present. ---------------------------------------------------- Press Queries If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org. |
| Formally Accepted |
|
PGENETICS-D-20-01275R2 Quantifying Evolutionary Importance of Protein Sites: A Tale of Two Measures Dear Dr Xia, We are pleased to inform you that your manuscript entitled "Quantifying Evolutionary Importance of Protein Sites: A Tale of Two Measures" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work! With kind regards, Alice Ellingham PLOS Genetics On behalf of: The PLOS Genetics Team Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom plosgenetics@plos.org | +44 (0) 1223-442823 plosgenetics.org | Twitter: @PLOSGenetics |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .