Peer Review History

Original SubmissionSeptember 24, 2019
Decision Letter - Gregory S. Barsh, Editor, Xuanyao Liu, Editor

Dear Dr Ratnakumar,

Thank you very much for submitting your Research Article entitled 'Protein-Protein interactions uncover ‘core genes’ within Omnigenic disease networks' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review again a much-revised version. We cannot, of course, promise publication at that time.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Xuanyao Liu, Ph.D.

Guest Editor

PLOS Genetics

Gregory Barsh

Editor-in-Chief

PLOS Genetics

Although the reviewers commented favorably on the general goal and approach of the manuscript, there were significant criticisms that preclude publications. These comments indicate that significant amount of re-analysis is need to make the paper convincing (See comments of reviewer1 and reviewer2). "Core genes" or the gene targets that your approach aims to identify need to be carefully defined, and additional analyses need to support your claims on the findings. We would be willing to reconsider the manuscript after it has undergone a major revision that takes into account of the criticisms of the reviewers, with no assurance of acceptance.

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this paper, the authors present a method to identify genes which are enriched for protein-protein interactions (ppi) with genes near GWAS loci (or some other list of genes). The authors argue that these represent “core genes” under the omnigenic model. They describe some examples where the identified genes are known to be highly disease-relevant, and they report that their list of genes is enriched for signal according to clinvar and according to pLI.

I tend to think the authors are onto something with this method, but more work is needed before the paper is convincing. In particular, the authors must distinguish between the following claims: (1) genes with high ppi to GWAS-proximal genes, but without GWAS hits themselves (ppi-only genes), are bonafide disease-relevant genes. (2) among GWAS-proximal genes, those with ppi to other GWAS-proximal genes (ppi-gwas genes) are more disease relevant than those without (gwas-only genes). (3) ppi-only genes, in addition to being disease relevant, are even more disease relevant than gwas-only genes.

This is the sequence of claims that is necessary to justify calling the ppi genes “core”, implying that they are more important than most GWAS genes, which are presumably peripheral. The authors have provided evidence for (1), but not for (2-3). I have some suggestions for how they could make these comparisons. However, I do not think that acceptance of the paper should be contingent on identifying strong evidence for both these claims (see below).

The ClinVar and pLI comparisons are made between ppi genes (including both ppi-gwas and ppi-only) and all genes. There is also a comparison with “parent” genes, i.e. genes with high ppi to the larger set of parent-term GWAS-proximal hits. Missing is a comparison with gwas-only genes. I would like to see a comparison between ppi-gwas and gwas-only, and between ppi -only and gwas-only. These comparisons could provide some evidence of (2) and (3) respectively. Moreover, are ppi-only genes enriched for heritability, even if thehy do not harbor significant GWAS hits? Are ppi-gwas genes more likely to be among the very most significant genes, compared with gwas-only genes? For each parent category, is the ClinVar enrichment signal concentrated among genes associated with parent-category-related pathologies?

Even if all these analyses are positive, a critical caveat is that many GWAS-proximal genes will not actually be the right gene. (Could it be possible to observe this empirically, by showing that the 2nd closest gene to each GWAS hit is also enriched with ppi to other GWAS-proximal genes and this cannot be explained by proximal genes having shared ppi?) The reason this caveat can affect these analyses is that it is difficult to distinguish between two possibilities: (1) ppi-gwas genes are more likely to have ClinVar phenotypes because they are more important for disease than gwas-only genes, even though the gwas-only genes are also bona fide disease genes; and (2) ppi-gwas genes are more likely to have ClinVar phenotypes because they are more likely to be the true disease gene, whereas gwas-only genes are more likely to be the wrong gene at the locus altogether (but the correct gene, if we knew it, would be just as important as a ppi-gwas gene). I think the authors need to discuss this distinction. I don’t see a way that they can realistically rule out possibility 2, but that’s ok: their list of genes is interesting under either possibility 1 or possibility 2.

If the suggested analyses are negative, I don’t think it should necessarily preclude publication of this paper, but I would think that some re-writing would be needed. At a minimum, “core genes” should be replaced with “GWAS-interacting gene” or some other, less aspirational term. It’s still fine to motivate the network analyses using the omnigenic model, and say that your goal is to identify core genes, but if there is evidence that the identified genes are any more “core” than other GWAS genes, then it doesn’t seem appropriate to call them “core genes”. You can call p53 a core gene in cancer, but it doesn’t follow that all the other ppi genes are just as important.

I want to emphasize again that it is still valuable to identify new disease-relevant genes via ppi with other disease genes, even if you aren’t claiming that they are all “core” or even enriched for core genes relative to other GWAS genes.

I have some minor comments as well:

1. I noticed that many HLA genes are detected, and usually they are annotated as “new” even for phenotypes with clear HLA associations. I think there may be some issue at HLA where only one gene is being picked as the GWAS gene, and the others are being called “new” even though this obviously isn’t appropriate at this locus. Probably change the “new” criterion to “not within x Kb of the lead SNP”, and possibly exclude HLA from all analyses.

2. The omnigenic model is usually described in terms of gene-regulatory networks, not ppi networks. I don’t think this distinction is critical to the model, but maybe the authors could clarify that ppi is only one way that genes can interact, that it is not the type of interaction most people would associate with the omnigenic model, but that it is just as plausible for peripheral gene effects to be mediated by ppi as it is for them to be mediated by gene-regulatory networks. In addition, the authors could either apply their method to some gene-regulatory networks or explain why they have chosen not to do so (I don’t actually recommend doing this).

3. For any genes that you especially highlight in the text (BRCA1, APP, etc), make sure to be clear about whether they are annotated as GWAS genes in your dataset and also about whether they have been implicated by genetic studies that you did not include. I think a main table could be helpful here: for example, the BRCA1 row could contain a p-value using your method, the rank of this p-value out of all genes, a “no” indicating that this gene is not annotated as a cancer gene in your GWAS data set, and a reference for a study about rare variation in BRCA1.

4. The writing style is idiosyncratic, especially in the methods section. I found it perfectly readable, but it could be good to run this by someone who is a strong writer.

Reviewer #2: See attachment.

Reviewer #3: The manuscript by Ratnakumar and colleagues presented results analyzing GWAS associated genes in light of protein-protein interaction (PPI) networks. Their hypothesis is that genes physically interacting with disease-associated genes with high frequencies are candidates of "core genes", according to the omni-genetic model of complex trait genetics. Their analysis identified hundreds of putative "core genes" across a large set of phenotypes.

While I think some of the results, especially those about cancer, are interesting, I am not convinced that their analysis really sheds light on core genes under the omnigenetic model. Also the overall novelty of the work is questionable to me. Below are specific comments:

Major comments

1) Core genes have a specific meaning under the omnigenetic model: they are believed to mediate the effects of a potentially large number of genes associated with a complex trait (called "peripheral genes"). The omnigenetic paper discusses regulatory influence of perturbation of peripheral genes on core genes, in partiular, as one main mechanism. It seems to me that this paper mis-interprets the concept of core genes. All the analysis are based on the numbers of PPIs: there is no evidence that the genes interacting with other GWAS genes would "mediate" their effects on disease risks.

The analysis is more like finding "hub" genes in PPI network. There is a large literature on this topic, including some in the context of disease genetics, e.g. 21764832, 21490723, 25915600. It seems to me that none of these papers is properly cited. Given this background, I think the conceptual novelty of the work is quite limited.

2) In Figure 3d: the authors show that their core genes are more enriched with constrained genes (according to pLI scores). The authors compare pLI distribution of the core genes vs. all genes in the PPI network. I think the core genes are likley to have higher degress than background, so this analysis should control for degree distribution of two gene sets being compared. Otherwise, the results would show "highly connected" genes are more likely to have high pLI scores. This would be similar to earlier studies showing hub genes are under stronger evolutionary constraint, but this would have no implications on genetics of diseases.

3) The results of cancer core gene analysis are interesting. However, the analysis seems somewhat superficial. I would recommend the authors to perform some deeper analysis. For example, they reported >100 new genes, what are the overepresented biological pathways other than immune genes? Any specific new genes that may provide some new insights on cancer? If one compares across cancer types, are the genes usually found in one cancer type or multiple cancer types?

Minor comments

1) Figure legends are generally too long and sometimes redundant with the main text.

2) Lines 336-340: the authors mentioned that lipid traits have the highest proportion of core genes that are also top GWAS hits, and offered the possible explanation that lipid traits may be less polygenic. While this may well be true, I think it is important to consider the power of studies. For studies with large sample sizes (likely the case for lipid traits), it is likely that many "core" genes would be found as GWAS hits, so the proportion would be high.

3) Line 358: should cite Figure 3e instead of 3c.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Luke O'Connor

Reviewer #2: No

Reviewer #3: No

Attachments
Attachment
Submitted filename: ratnakumar19.docx
Revision 1

Attachments
Attachment
Submitted filename: Response_to_reviewers.pdf
Decision Letter - Gregory S. Barsh, Editor, Xuanyao Liu, Editor

* Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. *

Dear Dr Ratnakumar,

Thank you very much for submitting your Research Article entitled 'Protein-Protein interactions uncover candidate ‘core genes’ within Omnigenic disease networks' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. While we cannot make a definite commitment, we will probably accept your paper for publication, if you provide an answer to reviewer 1 on the node degree correction for Clinvar analysis and the drug target analysis, and make changes according to the remaining of the reviewers’ comments.

In addition we ask that you:

1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images.

We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

Please let us know if you have any questions while making these revisions.

Yours sincerely,

Xuanyao Liu, Ph.D.

Guest Editor

PLOS Genetics

Gregory Barsh

Editor-in-Chief

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have provided a thorough response to my comments and suggestions. Following the comments of Reviewer 2, I am now curious whether the Clinvar analysis and the drug target analysis were corrected for node degree. Otherwise, I only have presentation-related comments:

• In the current presentation, it feels like you are assuming your hypothesis is true (that ppi genes are candidate core genes) throughout pages 9-15. I would recommend that in this section, you use the term “ppi genes” instead of “candidate core genes”. Then, you could retitle the section starting on page 15 “Clinvar enrichment: ppi genes are candidate core genes.” The title for p. 21 could be “Drug target enrichment: ppi genes are candidate core genes”, and on p. 22, “Cancer driver enrichment: …”.

• The main figures should be readable without referencing the caption. Currently, many axes are unlabeled, and the titles are inconsistent (Figures 3-5). I would recommend adding axis labels everywhere it is not totally obvious and making all titles describe what is being plotted (rather than delivering the punchline).

• Consider including the finding of no excess GWAS signal at ppi-only genes. This analysis suggests that ppi-only genes would not be detected in GWAS of larger sample size.

• In many places, a long sentence should be split into two sentences. For example, see lines 85-89, 104-109, 202-205, 250-259, 637-640. In a number of places, a comma should be deleted (e.g. lines 637, 644, 139).

Reviewer #2: I congratulate the authors for successfully incorporating reviewer feedback into the manuscript. My concerns with the text have been satisfied, and I have only minor comments below.

Line 65: “insulin for” rather than “insulin in”

Line 66: BRCA1, as a protein, should not be italicized

Line 69: Technically the experiments still have to be performed. It is more accurate to say “by utilizing publicly available reference datasets.”

Line 264: delete the second comma

Line 386-389: Sentence is difficult to make sense of.

Line 473: delete the comma

Line 573-586: I think this could be condensed to provide a simple, short summary.

Line 597-603: I think it suffices to say “We present a new application of GWAS data: identifying core genes as envisioned by the omnigenic model that do not themselves exhibit GWAS signal by consulting physical interactions.”

Line 629-631: Sentence should be rewritten.

Line 637: delete first comma.

Line 644-653: I recommend deleting this paragraph. It is not terribly helpful.

Line 666-667: As described earlier, the method does not exactly work on every GWAS – the GWAS must have enough hits with PPI edges. It would be better to say that this method adds another layer to interpreting GWAS signal.

Lines 897-906: This should be rewritten so say very clearly that candidate core genes within 1 Mb of a GWAS hit are PPI-GWAS hits while the rest are PPI only. Any GWAS hits without a candidate core gene are GWAS-only.

Lines 949-952: Delete.

Lines 1046-1050: Delete.

In general: The omnigenic model is by convention not capitalized

Figure 1c: gene symbol names and axis text too small

Figure 1b step 2: axis text way too small

Figure 3d,e: axis text too small

Figure 4e: axis text too small

Figure 5e: number text too small

Figure 5f: axis text too small

Reviewer #3: The reviewers have done a good job at addressing my comments. While conceptually related to several papers published in the past, I think the large-scale analysis the authors performed across many GWAS datasets would be a useful resource for the community.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

Revision 2

Attachments
Attachment
Submitted filename: Response_to_reviewers_round2.pdf
Decision Letter - Gregory S. Barsh, Editor, Xuanyao Liu, Editor

Dear Dr Ratnakumar,

We are pleased to inform you that your manuscript entitled "Protein-Protein interactions uncover candidate ‘core genes’ within Omnigenic disease networks" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional accept, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about one way to make your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Xuanyao Liu, Ph.D.

Guest Editor

PLOS Genetics

Gregory Barsh

Editor-in-Chief

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

The authors have addressed all the comments of the reviewers to their satisfaction. Thank you very much for the effort.

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have fully addressed my comments.

Reviewer #2: I commend the authors for their diligence in responding to reviewer comments. My concerns have been fully addressed.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Luke O'Connor

Reviewer #2: Yes: Evan A Boyle

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-19-01600R2

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Formally Accepted
Acceptance Letter - Gregory S. Barsh, Editor, Xuanyao Liu, Editor

PGENETICS-D-19-01600R2

Protein-Protein interactions uncover candidate ‘core genes’ within omnigenic disease networks

Dear Dr Ratnakumar,

We are pleased to inform you that your manuscript entitled "Protein-Protein interactions uncover candidate ‘core genes’ within omnigenic disease networks" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Matt Lyles

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .