Peer Review History
| Original SubmissionMay 19, 2020 |
|---|
|
Dear Dr Allegrini, Thank you very much for submitting your Research Article entitled 'Multivariable G-E interplay in the prediction of educational achievement' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review again a much-revised version. We cannot, of course, promise publication at that time. In addition, PLOS Genetics policy requires that data be made available to other researchers, and that this availability not be at the sole discretion of authors. Therefore, we would like to have you clarify data sharing prior to a final decision on your manuscript.. Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions. Yours sincerely, Chris Cotsapas, PhD Associate Editor PLOS Genetics Scott Williams Section Editor: Natural Variation PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: Uploaded as an attachment Reviewer #2: See attachment. Reviewer #3: In this paper, the authors consider the joint role of genome-wide polygenic scores (GPSs) and environmental variables at predicting educational achievement. While it's known that such measures are known to be individually good predictors of educational variables, this study quantifies the degree of overlapping signal between a set of genetic and environmental variables. At a high level, they do this using regularized prediction models and mediation-style analyses. They find that 40% of the predictive power of GPSs and 18% of the predictive power of environmental variables is due to the correlation between these variables. They also test whether interactions of environmental and genetic variables add to the predictive power of joint genetic-environmental predictive models, and find no evidence that interactions of the variables they considered contribute significantly to multivariable prediction. This paper represents an impressive effort in a rich data set. There is a lot of interesting information presented. Also the paper was easy to read, the analyses were clearly described in a way that I believe I would be able to replicate the analyses, and the tables and figures had clear and comprehensive captions. That said, upon reading the paper, I found myself with a lot of questions about what the major take-aways are for this project. Major concerns: 1) How does this paper fit relative to other papers that do similar work? The authors make several strong statements about the novelty of their work, but there are a number of papers that ask very closely related questions. For example, Selzam et al (2019) compare the within- and between-family predictive power of GPSs for educational achievement, which would represent the passive rGE that the authors describe. Also, Lee et al. (2018) contains an analysis quite similar to this one looking at the incremental R2 of an education GPS at predicting educational attainment above several environmental variables. While educational attainment and educational achievement are different variables, they are very highly genetically correlated. It would be helpful if the authors could explicitly highlight what we are learning from this paper relative to related existing work. 2) I had a difficult time interpreting their rGE results for a number of reasons. a) How does the sparse variable selection procedure affect the authors' estimates? As far as I could tell, the authors selected their genetic and environmental variables using an elastic net procedure in the same step. If there is strong rGE (as the authors find) some of the genetic variables that are highly correlated with the environmental variables may be selected in the place of the environmental variables in the penalization process, even if they represent independent signals. Likewise, some environmental variables will be selected in the place of the genetic variables. Since you are removing variables that are correlated with each other, this would have an effect on the subsequent rGE analysis. It may be possible to address this concern by selecting the genetic and environmental variables separately. b) How appropriate is it to estimate the mediation analysis in both directions? In order to interpret the estimates of how much the environmental variables mediate the genetic variables, you need to assume the model in Figure 4B. To estimate the amount of genetic confounding, you need to assume the model described in line 324 of the manuscript. Since both of these models can't be simultaneously true, I believe it can't be the case that the two estimates in lines 35 to 39 are both true. c) How do errors in variables affect the interpretation of these analyses? For example, the 'chaos at home' variable is likely a noisy proxy for the true amount of chaos at home. This means that even if 100% of the genetic signal were mediated through this variable, the GPS would still remain predictive after including that variable in the regression. Similarly, the polygenic scores may be thought of as noisy measures of the true additive genetic component. If that's the case, then a mediation analysis will overestimate the amount that the environmental variables mediate the predictive power of the genetic factor. Minor concern: 3) The notation used the methods section is confusing. The authors often use the same variables to represent different things. For example, in the equation on line 463, the authors use 'r' to denote both a vector covariances and also as an indicator that the genotypes come from a reference sample. Also the variables and coefficients in line 463 correspond to different variables and parameters in line 505 despite having the same names. References: Lee, J. J., Wedow, R., Okbay, A., Kong, E., Maghzian, O., Zacher, M., ... & Fontana, M. A. (2018). Gene discovery and polygenic prediction from a 1.1-million-person GWAS of educational attainment. Nature Genetics, 50(8), 1112. Selzam, S., Ritchie, S. J., Pingault, J. B., Reynolds, C. A., O’Reilly, P. F., & Plomin, R. (2019). Comparing within-and between-family polygenic score prediction. The American Journal of Human Genetics, 105(2), 351-363. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: No: All data underlying the figures are available in the supplement. Some restrictions will apply regarding data access. Data used for the submission may be made available on request to the Twins Early Development Study (TEDS), through their data access mechanism (see www.teds.ac.uk/researchers/teds-data-access-policy). Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No
|
| Revision 1 |
|
Dear Dr Allegrini, We are pleased to inform you that your manuscript entitled "Multivariable G-E interplay in the prediction of educational achievement" has been editorially accepted for publication in PLOS Genetics, subject to the proposed waiver of data access restrictions we discussed. Congratulations! Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional accept, but your manuscript will not be scheduled for publication until the required changes have been made. Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. If you have a press-related query, or would like to know about one way to make your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics! Yours sincerely, Chris Cotsapas, PhD Associate Editor PLOS Genetics Scott Williams Section Editor: Natural Variation PLOS Genetics Twitter: @PLOSGenetics ---------------------------------------------------- Comments from the reviewers (if applicable): Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The authors addressed all points that I am concerned. Reviewer #2: I agree with the authors there are good arguments for such a composite measures, as the SES and chaos mesures applied in the manuscript. They also do appear to predict well. I do not agree that life events often represent stochastic events unrelated to each other. They tend to correlate and have similar effect on e.g. psychopathology, I guess that could be the case for effect on EA as well, which is in line with the authors findings. Further, some of these life event variables are probably rather rare, and therefore not powerful in separate analyses. In the G, E model comparisons it makes no difference, but in figure 1C and figure 2, finding significant effect of the composite measures and not of the individual life events is not surprising? For choices of PRS’s applying the most powerful does make sense, implying this may well change in the coming years. How was power assessed? I do acknowledge there is no chance adding all the variables that capture EA relevant influences, therefore of course it should be as clear as possible how you choice your variables, and what the limitation is according to that choice. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: No: there are restriction on access, due to confidentiality for participants ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No ---------------------------------------------------- Data Deposition If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website. The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-00796R1 More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support. Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present. ---------------------------------------------------- Press Queries If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org. |
| Formally Accepted |
|
PGENETICS-D-20-00796R1 Multivariable G-E interplay in the prediction of educational achievement Dear Dr Allegrini, We are pleased to inform you that your manuscript entitled "Multivariable G-E interplay in the prediction of educational achievement" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work! With kind regards, Matt Lyles PLOS Genetics On behalf of: The PLOS Genetics Team Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom plosgenetics@plos.org | +44 (0) 1223-442823 plosgenetics.org | Twitter: @PLOSGenetics |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .