Peer Review History
| Original SubmissionAugust 20, 2021 |
|---|
|
Dear Dr. Sun, Thank you very much for submitting your Research Article entitled 'eQTL mapping using allele-specific gene expression' to PLOS Genetics. The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a much-revised version. We cannot, of course, promise publication at that time. Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions. Yours sincerely, Mingyao Li Associate Editor PLOS Genetics David Balding Section Editor: Methods PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Reviewer #1: Zhabotynsky et al. present geoP, a new method to approximate permutation p values, to reduce the computational burden of permutation tests in detecting eQTLs with allele-specific gene expression (ASE). The geoP method approximates the permutation p values by estimating the relationship between permutation p values and minimum nominal p values per gene. Overall, the manuscript is well written, and this work has implications for enabling eQTL detection with ASE in large-scale eQTL studies. The authors have devoted efforts in present analyses on 28 GTEx tissues and Geuvadis data. The authors also developed a data processing pipeline to process the RNAseq data saved on cloud. Clarifications and changes that could improve the paper include: Major: 1. The authors compared the number of eGenes identified by different methods in the real datasets. As the same eGene could be affected by more than one functional regulatory variant, I was wondering for the same gene-SNP pairs tested, how many significant pairs are consistent between different methods, and how many pairs are additionally identified by TReCASE equipped with geoP? How would the results change with different permutation p-value cutoffs? 2. The authors showed the functional categories of the top eQTLs identified by TReCASE and MatrixEQTL are consistent with each other. It would be more convincing to provide additional biological insights by TReCASE beyond the consistent findings. 3. In dynamic eQTL detection, what is the biological interpretation for the identified dynamic eQTLs? For example, how would the effect size of eQTLs change over different expression levels of CTCF or TP53? What functional categories are these dynamic eQTLs enriched in? Are these dynamic eQTLs also detected as eQTLs in the original GTEx study or TReCASE? 4. A recently published paper (Liang et al, 2021, NC) developed a method named mixQTL that can also identify eQTLs with allele-specific expression and is scalable to large sample sizes. What are the benefits of TReCASE equipped with geoP over mixQTL? 5. It would be easier for other people to use your method if it is implemented in software packages. Also, the current description of this method on page 16 is not clear enough. For example, the authors mentioned “do additional adjustment and restart the process”, what kind of “additional adjustment” is suggested? Minor: 1. In Figure 3e-f, the authors compared functional category enrichment of eQTLs in enhancers in 5 tissues but compared all 28 tissues in figure 3a-d. Why only concentrate on these 5 tissues in figure 3e-f? 2. In figure 5f, why is the detection power increased with decreased genetic effect? Figure 5f is not mentioned in the main text. Reviewer #2: The authors proposed a geoP method which they claimed to be computationally more efficient and robust at various cutoffs than the eigenMT does in eQTL mapping using ASE information. They also demonstrated the usage of ASE to study dynamic eQTLs and compared two popular methods TReCASE versus RASQUAL. Though the authors did a lot of analyses with various GTEx data sets, I feel the organization and presentation need to be further improved to demonstrate the major contributions of the work. What’s the major contribution and the novelty of the work? They need to be clearly presented and illustrated. I understand the authors try to demonstrate the use of ASE in eQTL mapping. However, the two topics covered in the work, developing a geoP method and studying dynamic eQTLs, do not seem providing a strong support of the title of the work (in fact, they seem to be unrelated). The title is also too broad and not specific. The abstract needs to be polished to summary the major contribution of the work. It is not convincible by the way the authors defined false negatives and false positives shown in Figure 1(c). This is also related to what the authors claimed: “Using a permutation p-value cutoff of 0.01 (corresponding to FDR around 1%), we detected 20-100% more eGenes (genes with at least one significant eQTL) than the most recent GTEx study [1], where ASE was not used in eQTL mapping.” Is this claim based on real data analysis or simulation? If based on real data analysis, then it is not convincible. A method that detects more eQTLs than others does not imply that it is more powerful based on the real data analysis. How do you know what you detected are not false positives? geoP does look like have higher false positives than eigenMT does from Fig 1(c) (again, not sure if the figure is based on simulation or real data analysis). To claim a method is more powerful than others, one should do a simulation study in which the underlying truth is known. The authors stated in the Discussion “Although similar message has been shown in earlier studies, our results are more comprehensive due to the larger number of datasets that we have studied.” Analyzing more datasets does not add to the novelty of the work. The authors spent large efforts in building an online cloud-based data processing pipeline. This is much appreciated. Reviewer #3: Review of Zhabotynsky et al. In this study, Zhabotynsky et al describe and test a method for eQTL mapping that leverages allele-specific read counts to increase mapping power. This is not a novel approach, but the authors do a good job showing that their method is faster and more accurate than other approaches. Overall, I found this paper to be very clearly written with well-justified analyses. This is a very impressive demonstration of the power of utilizing ASE and well-designed statistical modeling to identify eQTL. Detailed Comments: 1. The authors compare their methods to those reported in the GTEx paper. They note some differences in processing and testing pipeline and write “Despite these minor differences, the number of eGenes reported by the two pipelines (when we use MatrixEQTL for eQTL mapping) are highly consistent (Figure 2a).”. I don’t think Figure 2a shows this. I think Figure 2a needs to have GTEx on one axis and MatrixEQTL on the other in order to support the point they are making. Or is this an axis labelling error and the figure does show what they say? In addition to this figure it would be good to report (numerically) the percentage overlap in eQTL reported by the GTEx paper compared to those discovered using these authors’ pipeline with MatrixEQTL. I assume the overlap is >90-95%. 2. Figure 2a-c what are dotted red lines? It looks like the line in (a) is a best fit regression line, and that this same line is plotted in (b)? The line in (c) looks different but I think that is just due to different axis limits. I suggest harmonizing axis limits on both axes for subfigures (a) through (c) 3. In the section “Explore dynamic eQTLs using individual-specific genetic effects estimated by ASE” (note: typo in section title, should be “Exploring”), I’m not sure I understand the point of including the models that condition on CTCF and TP53. The authors report some interesting results on age-associated dynamic eQTL in blood, but don’t report anything of note regarding the CTCF and TP53 models in the main text. 4. I think it would be very useful if the authors could include some more detailed information on eQTL that they are able to detect that were missing from the MatrixEQTL analysis. For example, they could present a detailed portrait of a single eGene where they show the actual data and illustrate why the ASE and beta-binomial modeling are pulling this eGene to significance. 5. To improve accessibility/usability, I suggest that the authors upload their singularity image to a hub/database such as https://cloud.sylabs.io/. Although there pipeline appears to be available at https://github.com/Sun-lab/gtex_AnVIL, there are instructions for other aspects of analysis included (e.g. cloud analysis) that some users may not use. It would be useful to develop a short example walkthrough for the software. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: None Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No |
| Revision 1 |
|
Dear Dr Sun, Thank you very much for submitting your Research Article entitled 'eQTL mapping using allele-specific count data is computationally feasible, powerful, and provides individual-specific estimates of genetic effects' to PLOS Genetics. The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers identified some concerns that we ask you address in a revised manuscript. Please modify the manuscript according to the review recommendations, address the specific points made by each reviewer. In addition we ask that you: 1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. 2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images. We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] Please let us know if you have any questions while making these revisions. Yours sincerely, Mingyao Li Associate Editor PLOS Genetics David Balding Section Editor: Methods PLOS Genetics Reviewer's Responses to Questions Reviewer #1: I am satisfied with most of the review comments that the authors have addressed, except the results of CTCF and TP53 related dynamic eQTLs. In the updated manuscript, the authors "assessed whether the dynamic eQTLs related with CTCF are enriched among CTCF binding sites (CTCFBSDB 2.0 http://insulatordb.uthsc.edu/ [23]) and whether p53’s target genes (Supplementary Table 3 of [24]) are enriched among the eGenes with dynamic eQTLs with respect to TP53", however, "No significant enrichment was found in either case". The authors chose to study CTCF and TP53 based on the assumption that "TF expression can modulate the strength of eQTLs located in TF binding sites", but the current results do not support this assumption. For the CTCF dynamic eQTLs located in the CTCF binding sites, are their effect sizes more correlated with CTCF expression than other eQTLs? If not, can you find more evidence to support that your identified dynamic eQTLs are truly meaningful? Reviewer #2: The authors have addressed my comments and I have not further comment. Reviewer #3: The authors have largely addressed my concerns. I have two minor suggestions: (1) I like that they show in Supplemental section C.3 the different examples of eQTL that are missed by MatrixEQTL but captured by TReCASE. But please clarify what part (b) is showing in Supp Figures 11, 12, 13, and 14. (2) Pipeline usability. The availability of a docker image and the authors’ newer in-depth walkthrough/README are very useful. I would request one small addition. The README at https://github.com/Sun-lab/gtex_AnVIL/blob/master/README.md provides information on the four arguments to their script get_TReC_ASReC.R. It would be really useful to have “toy” versions of those arguments that are files added directly to the GitHub repo. Sometimes when one is running a new pipeline it is very useful to test it on small example files just to ensure that the pipeline is working as expected. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No |
| Revision 2 |
|
Dear Dr Sun, We are pleased to inform you that your manuscript entitled "eQTL mapping using allele-specific count data is computationally feasible, powerful, and provides individual-specific estimates of genetic effects" has been editorially accepted for publication in PLOS Genetics. Congratulations! Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made. Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics! Yours sincerely, Mingyao Li Associate Editor PLOS Genetics David Balding Section Editor: Methods PLOS Genetics Twitter: @PLOSGenetics ---------------------------------------------------- Comments from the reviewers (if applicable): Reviewer #1: The authors have addressed my comments and I have not further comment. Reviewer #3: The authors have satisfied my concerns - very nice paper! ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #3: No ---------------------------------------------------- Data Deposition If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website. The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-21-01129R2 More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support. Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present. ---------------------------------------------------- Press Queries If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org. |
| Formally Accepted |
|
PGENETICS-D-21-01129R2 eQTL mapping using allele-specific count data is computationally feasible, powerful, and provides individual-specific estimates of genetic effects Dear Dr Sun, We are pleased to inform you that your manuscript entitled "eQTL mapping using allele-specific count data is computationally feasible, powerful, and provides individual-specific estimates of genetic effects" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work! With kind regards, Agnes Pap PLOS Genetics On behalf of: The PLOS Genetics Team Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom plosgenetics@plos.org | +44 (0) 1223-442823 plosgenetics.org | Twitter: @PLOSGenetics |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .