Peer Review History

Original SubmissionJuly 26, 2019
Decision Letter - Gregory P. Copenhaver, Editor, Aimee M. Dudley, Editor

* Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. *

Dear Dr Ryan,

Thank you very much for submitting your Research Article entitled 'Paralog buffering contributes to the variable essentiality of genes in cancer cell lines' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important topic but identified some aspects of the manuscript that should be improved.

We therefore ask you to modify the manuscript according to the review recommendations before we can consider your manuscript for acceptance. Your revisions should address the specific points made by each reviewer.

In addition we ask that you:

1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images.

We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

Please let us know if you have any questions while making these revisions.

Yours sincerely,

Aimee M. Dudley, Ph.D.

Associate Editor

PLOS Genetics

Gregory P. Copenhaver

Editor-in-Chief

PLOS Genetics

The reviewers appreciated the work and the manuscript itself, but had several questions and suggestions that should be addressed. One major concern, which I also had after reading the manuscript, is that (as the authors know) cancer cell lines are well known to harbor extensive aneuploidy and copy number variation. How much of the background-specific non-essentiality could be explained by the presence of multiple copies of the gene or its paralog(s)?

Other reviewer comments of particular importance are:

- the need to provide more detail about how fitness is calculated

- questions/ suggestions about the statistical methods used (continuous versus discretized data and t-tests versus a regression model)

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this paper, De Kegel & Ryan investigate the contribution of paralogs to variation in gene essentiality across human cell lines. The authors re-analyze the DepMap dataset of CRISPR-Cas9 KO screens in >500 cancer cell lines to explore this question. Similar to previous studies in model organisms, they find that duplicated genes are less likely to be essential than singleton genes. They extend this finding by showing that duplicated genes also show more variation in essentiality across many different genetic backgrounds, something that has not yet been feasible to investigate in model organisms. They further investigate the underlying mechanisms driving the essentiality of genes with paralogs, such as the number of paralogs a gene has and the mode of duplication by which the paralogs occurred.

The paper is well-written, well-structured, and uses clear and precise language. All relevant data is included in the supplementary tables. The authors frequently refer to previous work done in other organisms, and - to my knowledge - cite all relevant literature. They clearly describe the used methods and have performed a thorough cleaning of the DepMap dataset before analysis.

While many of the findings in the article are intuitive given previous knowledge from model organisms, these observations have not previously been described in human cells. In addition, the authors take their work a step further than previous studies by looking at variability in gene essentiality among many different genetic backgrounds. This paper is thus certainly useful to the scientific community. However, I have a few comments, mainly about the detection and definition of differential essential genes (“sometimes essential genes”), which may affect all further downstream analyses.

Main comments:

1. Page 6/7 - “~86% of the genes that share 90% or more sequence identity with their closest paralog are targeted by at least one multi-target sgRNA in the Avana library”. Are there still 3 or more gRNAs remaining for these genes in the dataset, or have most of the highly similar paralogs been filtered out? If many or most of the highly similar genes have been removed, can the authors comment on how this affects the results?

2. Page 7/8 – The authors describe how they use a single CRISPR score threshold (-0.47) to determine gene essentiality. This means that genes with an average score close to this cutoff would be classified as differential essential even though the actual scores may be very comparable across cell lines (for example -0.45 in some cell lines and -0.49 in others). In addition, genes with a score just below -0.47 in one cell line and above -0.47 in all others would be classified as differential essential. This could be prevented by for example excluding all scores in the -0.45 to -0.50 range. Could the authors comment and/or show data on the effect of their choice of this single threshold on the set of differential essential genes?

3. Page 6-8 - If a given cell line carries a mutation (compared to the reference genome) in the sequence targeted by a guide, this guide may be less efficient in this cell line. Could the authors comment on whether some of the observed differential essentiality could be caused by differences in guide efficiency due to mutations or other sequence differences between cell lines?

Minor comments:

1. Page 13/14 - The authors use expression data to identify gene pairs where the essentiality of a gene A correlates with lower expression of its paralog B. I agree with the authors that these could indicate potential synthetic lethal relationships, but other explanations are also possible. Although at several places they describe these as “putative” synthetic lethal relationships, at other moments they talk in more definitive terms about synthetic lethal pairs. In my opinion, consistent use of more careful phrasing like “putative synthetic lethal pairs” would be preferable.

Reviewer #2: See uploaded attachment

Reviewer #3: In this study, De Kegel and Ryan examine the set of genes required for viability across 558 tumor cell lines, as determined by the DepMap project at the Broad institute. The authors investigate why certain genes are essential in all, some or none of the tested cell lines and to what extent this variability can be explained by gene duplication/paralogy. The authors report that, while paralogs are overall less likely to be essential, the ones that do have an essential phenotype in at least one context provide insight into the mechanisms of variable essentiality. Specifically, the probability of a gene being required for viability depends on the number of its cognate paralogs, their mutual sequence similarity and relative expression levels, as well as the nature of the duplication event they originated from.

I found this study to be very well designed, executed and described. The question posed is very interesting and takes great advantage of an extensive body of work in model organisms (i.e., yeast) as well as the latest technologies that have recently generated relevant datasets for human cancer cell lines. The analysis is thorough and rigorous, and the description of the results and interpretations is very clear and precise. I think this would be an important contribution to the field of functional genomics and cancer biology.

I don't have any major comments or suggestions on the manuscript.

A few minor comments are:

- The authors estimate that 13-17% of paralogs that are essential in some, but not all, cell lines are explained by their genetic dependence on a paralog partner that's variably expressed across the cell lines. Could the authors provide their intuition/speculation/preliminary insight into what could explain the other (more substantial) fraction of variably essential paralogs? Could it be synthetic lethality with other non-paralog genes that are also variably expressed across the cell lines? And if the genome-wide expression data for these cell lines is available, could one attempt to identify these synthetic lethal partners based on the correlation of their expression levels with fitness levels?

- Maybe I missed it throughout the text and the methods, but could the authors clarify where is their synthetic lethality data coming from? E.g., lines 290-294: "we found that among WGD paralog pairs, ~16.5% are synthetic lethal" -- what dataset was used to perform this assessment?

- Also, lines 303-305: "Approximately 19.1% of the paralog pairs consisting of at least one protein complex member gene are synthetic lethal, compared to only ~9.3% of the non-protein-complex pairs" -- what is the overall frequency of synthetic lethality among complex- and non-complex pairs (regardless of paralogy)?

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Attachments
Attachment
Submitted filename: Review PLoS Genet De Kegel 2019.docx
Revision 1

Attachments
Attachment
Submitted filename: Response_letter.docx
Decision Letter - Gregory P. Copenhaver, Editor, Aimee M. Dudley, Editor

Dear Dr Ryan,

We are pleased to inform you that your manuscript entitled "Paralog buffering contributes to the variable essentiality of genes in cancer cell lines" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional accept, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about one way to make your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Aimee M. Dudley, Ph.D.

Associate Editor

PLOS Genetics

Gregory P. Copenhaver

Editor-in-Chief

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

I have read the detailed response and believe that the authors have thoroughly addressed the concerns of the 3 reviewers. The revision is acceptable for publication in PLoS Genetics.

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-19-01244R1

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Formally Accepted
Acceptance Letter - Gregory P. Copenhaver, Editor, Aimee M. Dudley, Editor

PGENETICS-D-19-01244R1

Paralog buffering contributes to the variable essentiality of genes in cancer cell lines

Dear Dr Ryan,

We are pleased to inform you that your manuscript entitled "Paralog buffering contributes to the variable essentiality of genes in cancer cell lines" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Kaitlin Butler

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .