Peer Review History
| Original SubmissionFebruary 20, 2024 |
|---|
|
Dear Dr Zhang, Thank you very much for submitting your Research Article entitled 'FABIO: TWAS Fine-mapping to Prioritize Causal Genes for Binary Traits' to PLOS Genetics. The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a much-revised version. We cannot, of course, promise publication at that time. Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions. Yours sincerely, Xiaofeng Zhu Section Editor PLOS Genetics Xiaofeng Zhu Section Editor PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: In this manuscript, you present FABIO, a novel multivariate TWAS method aimed at identifying disease-associated genes. FABIO offers two key advantages over existing methods. First, it is specifically designed for binary traits using individual-level data. Second, it can simultaneously analyze all genes on a chromosome, rather than focusing on a single gene or multiple genes within a single cis-region, as is the case with current methods. I highly commend the methodological innovation of this work and anticipate that it will play a significant role in future TWAS analyses. However, there are some issues with the presentation of the manuscript, and I have a few questions that I hope the authors can address or clarify. Treating binary traits as continuous in existing methods? To my knowledge, existing methods such as FOCUS and FOGS, which are based on GWAS summary data, do not approximate binary variables as continuous ones. In fact, for binary traits, GWAS tests the marginal correlations between a variant and the liability score of this binary trait (i.e., the latent variable z_i in FABIO, equations 2 and 3). While there may be bias due to model misspecification (i.e., the generating model is a liability model (probit model) while the estimation model is a logistic model), this is not the same issue as the one claimed by the authors in the second paragraph of the introduction. The authors also state in their simulations that these existing methods are based on GWAS summary data, rather than performing a linear model on binary outcomes. As mentioned in the General Comment, I have no doubt about the methodological innovation of FABIO, but it does not address these specific issues with existing methods. Fairness of simulation for FOGS I observed unusual results in Figures 2 and 3: FOGS has a very high type-I error (based on FDR) and extremely low power. After reading the authors' processing methods, I found that they used simulations to discover the potential type-I error of each method (2.6% for FABIO, 3.3% for FOCUS, 46.5% for FOGS) and then used these thresholds to calculate power. This is unfair to FOGS. After reading about FOGS, I discovered that it is not based on a sparsity assumption, and it is normal for its type-I error to be higher than that of FABIO and FOCUS, which utilize a sparsity assumption. The authors then use this very high type-I error as the threshold for determining power, which exaggerates the shortcomings of FOGS. In addition, the terminology "true FDR of 0.05" is somewhat misleading. I believe that the analysis of type-I error and power should be independent, and the threshold should be chosen using the same method (e.g., q-value < 0.05). The phrase "simulated threshold of 0.05 FDR" may be more appropriate and fair. Sufficiency of simulation replications As other reviewers may also point out, the number of simulation replications is only 20, which may not be sufficient for evaluating these three methods. I believe 100 replications should be performed for the main cases (e.g., those shown in Figures 2 and 3). Addressing GReX correlation across the chromosome? FABIO utilizes traditional GReX modeling, and the process of predicting SNP weights only uses LD information from the cis-region. In other words, although gene expression may have many variants with strong trans-regulatory effects (trans-eQTL), this trans-eQTL information is not utilized in predicting gene expression, resulting in LD-wise uncorrelated predicted gene expression. Therefore, my initial view is that although FABIO processes all genes on a chromosome simultaneously, it does not leverage GReX correlations among genes between LD blocks. Although these predicted genes are independent, analyzing them in the same model can utilize prior distribution information to control FDR, which I believe is an advantage of FABIO. Optimal tuning parameter selection FABIO is a variable selection method, which belongs to the same category as lasso and SuSiE. For lasso and SuSiE, the penalty parameter lambda and the number of single effects L are the main tuning parameters that control their performance, respectively. In FABIO, the proportion of non-zero effects pi is a crucial tuning parameter. In the supplementary materials, the authors claim that they assume log pi follows a uniform distribution on U(log(1/p),0), while traditional methods tend to assume pi follows a beta distribution, adjusting a and b (the two parameters of the beta distribution) to control the mean and variance of pi. Can the performance of FABIO be improved by modifying the prior distribution of pi if the users have prior knowledge about the approximate proportion of important genes on a chromosome? Software availability and documentation I noticed that the software for FABIO has been made publicly available with a relatively detailed tutorial. I suggest that the authors prominently highlight this point in the manuscript (https://superggbond.github.io/FABIO/). I also suggest adding a short paragraph somewhere to briefly introduce how to use it, what inputs are, and which (important) tuning parameters are recommended to specify manually. Reviewer #2: In the manuscript under consideration, the authors presented a novel method, FABIO, using a Bayesian additive model to fine map genes in a TWAS framework. The mathematical formulation of the method is clearly presented. Simulations and real data analyses are professionally done. Although I think it is a great piece of work, there are some aspects may need to be clarified and explained more intuitively. Here are my comments for the authors’ considerations. Major: First, I feel that the idea of using a linear combination of GReX to predict phenotype for fine mapping a little bit counter intuitive. The default reason why fine mapping is needed is that the genes are correlated to each other, therefore better modeling of such correlations are required. However, the model just adds all genes (GReX) up, despite the potential problem of collinearity. Intuitively, the models that explicitly taking correlation structure (e.g., the pair-wise correlation matrix, analogue to the LD matrix of all variants in GWAS fine-mapping) into account may be the right solution. I didn’t mean the presenting method is total wrong. At least some intuitive interpretation is needed here. Second, there is a lack of details on how the cutoff of the methods are set to ensure a fair comparison between the three methods. It has been repetitively stated that FDR = 0.05 is used without specifics on how this FDR is calculated. However, I can’t figure out a default way to calculate FDR in the present framework with PIP. This is an important procedure that has not been presented. Minor: The explanation of Figure 1 and the first few paragraphs in Results are redundant to existing texts in Methods. It appears to me that the paper was originally prepared in a format that Results show up earlier leaving Methods to the end. Better flow aligning to PLoS Genetics may be required. The comparison to other methods could be more thorough. For instance, there are two newer papers published in Nat Genet on this topic (including the one published from the same group of this paper). It may be nice to carry out some comparisons. Reviewer #3: This study presents a novel method to prioritize causal genes by integrating GWAS and eQTL studies. It is novel in two aspects. First, it is designed for binary traits with a probit model. Second, it accounts for all genes on one chromosome. The authors evaluated the performance of the new method (FABIO) with simulated date and compared its performance to two existing methods (FUSION and FOGS). The performances of the three methods were also compared in the applications to six binary traits in the UK Biobank dataset. This manuscript is very well-written. It is very easy to follow, including the methodological details. With various simulation scenarios (e.g., case/control ratio, phenotypic variance explained, number of causal SNPs, and number of causal genes), the authors presented convincing evidence that FABIO has well-controlled false positive rates and much better power than the other two methods. In the application to real data, FABIO narrowed down to the smallest number of genes while identifying genes in the likely risk regions, defined based on GWAS or TWAS significant signals. The findings from FABIO are more robust to down-sampling. Gene set and pathway enrichment analysis further supported the biological relevance of the candidate genes fine-mapped by FABIO. Lastly, FABIO has acceptable computational efficiency for biobank-scale dataset. Overall, this study developed an outstanding method and package for the field of GWAS. It is likely to be widely used in GWAS of binary traits. It is quite a pleasure to read this manuscript, which is informative and easy to follow. I support the publication of the manuscript in its current form. I do have one suggestion for the author’s consideration. • The very poor performance of one of the comparison method, FOGS, in both simulation and application is kind of surprising. It will be useful to readers if the authors could discuss or propose some explanations. Does it have to do with the specific simulation approaches? Or with the binary traits, instead of quantitative traits? ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No |
| Revision 1 |
|
PGENETICS-D-24-00208R1FABIO: TWAS Fine-mapping to Prioritize Causal Genes for Binary TraitsPLOS Genetics Dear Dr. Zhang, Thank you for submitting your manuscript to PLOS Genetics. As you can see, both reviewer 2 and 3 have no further questions and agree the manuscript is acceptable. However, reviewer 1 still has some minor questions. Therefore, we invite you to submit a revised version of the manuscript that addresses reviewer 1's questions accordingly. Please submit your revised manuscript within 30 days Dec 09 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosgenetics@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pgenetics/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below.* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. We look forward to receiving your revised manuscript. Kind regards, Xiaofeng ZhuSection EditorPLOS Genetics Xiaofeng ZhuSection EditorPLOS Genetics Aimée DudleyEditor-in-ChiefPLOS Genetics Anne GorielyEditor-in-ChiefPLOS Genetics Journal Requirements: Additional Editor Comments (if provided): [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: I am pleased and grateful that you have provided a comprehensive response, and have modified your method using one of my suggestions. I believe you have appropriately incorporated the changes. However, I still have a minor question regarding the calculation of the true FDR and local FDR. I am grateful the you can address, as they may be due to my limited knowledge in this area. 1. When calculating the true FDR in the simulations, why did the you choose to divide the simulations into 5 groups? 2. I am very interested in your approach to adjusting the PIP based on Efron's method. This could potentially benefit all current work based on SuSiE. After reading Efron's paper, I understand that this method requires estimating the density function of a mixture model and pre-specifying some tuning parameters like p0 (equation 2.13). While Efron provided some guidance on how to estimate these, I was hoping you could share a more detailed pipeline on how you applied this method in your work (e.g., whether it is able to leverage any existing R packages to facilitate the estimation process). 3. I would like you to help me understand that the term "local" in the local FDR context refers to a statistical concept, and not a genetic locus or a block in the block-wise GReX estimation. Reviewer #2: The authors have addressed my previous comments satisfactorily. Reviewer #3: The authors did an impressive amount of work to revise the manuscript and address my and other reviewers' concerns. The current manuscript is of publication quality. This research will have significant impact in the field of GWAS. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: Yes: Kaixiong Ye [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. If there are other versions of figure files still present in your submission file inventory at resubmission, please replace them with the PACE-processed versions. Reproducibility: To enhance the reproducibility of your results, we recommend that authors deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols |
| Revision 2 |
|
Dear Dr Zhang, We are pleased to inform you that your manuscript entitled "FABIO: TWAS Fine-mapping to Prioritize Causal Genes for Binary Traits" has been editorially accepted for publication in PLOS Genetics. Congratulations! Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made. Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics! Yours sincerely, Xiaofeng Zhu Section Editor PLOS Genetics Xiaofeng Zhu Section Editor PLOS Genetics Aimée Dudley Editor-in-Chief PLOS Genetics Anne Goriely Editor-in-Chief PLOS Genetics Twitter: @PLOSGenetics ---------------------------------------------------- Comments from the reviewers (if applicable): ---------------------------------------------------- Data Deposition If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website. The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-24-00208R2 More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support. Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present. ---------------------------------------------------- Press Queries If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org. |
| Formally Accepted |
|
PGENETICS-D-24-00208R2 FABIO: TWAS Fine-mapping to Prioritize Causal Genes for Binary Traits Dear Dr Zhang, We are pleased to inform you that your manuscript entitled "FABIO: TWAS Fine-mapping to Prioritize Causal Genes for Binary Traits" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work! With kind regards, Zsofia Freund PLOS Genetics On behalf of: The PLOS Genetics Team Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom plosgenetics@plos.org | +44 (0) 1223-442823 plosgenetics.org | Twitter: @PLOSGenetics |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .