Peer Review History
| Original SubmissionFebruary 7, 2022 |
|---|
|
PONE-D-22-03783GWAS in the Southern African contextPLOS ONE Dear Dr. Swart, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jul 14 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Badri Padhukasahasram Academic Editor PLOS ONE Journal requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match. When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section. 3. Thank you for stating in your Funding Statement: “This research was partially funded by the South African government through the South African Medical Research Council and the National Research Foundation. YS was supported by a Stellenbosch University Postgraduate Bursary.” Please provide an amended statement that declares *all* the funding or sources of support (whether external or internal to your organization) received during this study, as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now. Please also include the statement “There was no additional external funding received for this study.” in your updated Funding Statement. Please include your amended Funding Statement within your cover letter. We will change the online submission form on your behalf. 4. Thank you for stating the following financial disclosure: “This research was partially funded by the South African government through the South African Medical Research Council and the National Research Foundation. YS was supported by a Stellenbosch University Postgraduate Bursary.” Please state what role the funders took in the study. If the funders had no role, please state: ""The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."" If this statement is not correct you must amend it as needed. Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf. 5. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability. Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized. Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access. We will update your Data Availability statement to reflect the information you provide in your cover letter. 6. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. In your revised cover letter, please address the following prompts: a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. We will update your Data Availability statement on your behalf to reflect the information you provide. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. Additional Editor Comments: Dear Dr. Swart, The reviews for your manuscript GWAS in the Southern African context are now available. Based on comments from 3 expert reviewers, I recommend that the manuscript be accepted provided authors make minor revisions outlined by all reviewers. In particular, changes to the text to make it clearer, explanation of definitions of true and false positives, better figure and figure legends can improve the manuscript. In terms of major comments, authors can consider a simplified 3-way and 5-way admixture scenarios to be simulated and tested to avoid additional confounding effects on the GWAS methods tested. Lastly, reviewers also request that the model and scripts used be made readily available to the readers. I look forward to a revised manuscript that addresses all these comments and concerns. Sincerely, Dr. Badri Padhukasahasram [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Yes Reviewer #3: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors set out to validate a recently published approach to use admixture LD to improve GWAS in more complex admixture scenarios (3-way and 5-way). The authors use realistic effect sizes for genetic effects maximizing the applicability of their results to real GWAS situations. However, because of the complex demographic history of the source populations for the admixed communities used as models in this study, it is hard to disentangle effects from varying levels of a particular ancestry and those from a difference in diversity from a single ancestry. The paper has the potential to be a broadly useful resource to the genetic epidemiology community, and could be improved if a simplified 3-way and 5-way admixture scenario were simulated and tested to avoid additional confounding effects on the GWAS methods tested. However, that may be beyond the scope of what the authors hope to present here and I would not require it for accepting this manuscript. I do find several issues with the paper that I think need to be addressed. While GWAS models are well specified (lines 225-253), the conditions of the simulated datasets (both phenotypes and genotypes) are not. It is difficult to evaluate the paper's claims without being able to evaluate how the data were produced. 1. I would suggest the authors include links to the model files used and the custom script used for ancestry assignment (line 161) from the simulated data should be linked to or provided as supplementary data. 2. It is unclear if the authors simulated all the admixture events (arrows) represented in figure 1. 3. It is unclear to me how the genotypes/phenotypes are being linked for analysis? How were the 10 causal SNPs chosen? 4. The three types of phenotypes simulated need to be explained in more detail. The discussion (lines 388-390) seems to suggest that a phenotype caused by a local ancestry-specific SNP was not simulated and tested with the various methods. Would distributing causal SNPs randomly across ancestries not result in biased results of all methods tested that are looking for an ancestry-specific effect? Could this explain the high number of false positives seen in the APA and LAAA models compared to the Standard and GA? 5. The relationship alluded to in Lines 316-318 between the amount of ancestry and false positives should be more formalized, for example simulating datasets with 5-95% of a certain ancestry and plotting the change in false positives. This would be broadly informative beyond the study populations used here. 6. On Line 271, how was LAI accuracy inferred? Were inaccurate segments identified and removed or were they misassigned? 7. The results of this study showing that in many instances the Standard model performs as well or nearly as well as the APA or LAAA, while providing orders of magnitude fewer false positives. Identifying 2-3 fewer true hits but avoiding ~1000 false positives that would make interpreting GWAS results extremely difficult, so to support the use of only a Standard Model. This is quite counter-intuitive and requires more discussion. Reviewer #2: The author is not consistent with abbreviation and full name (under abstract GWAS is not describes) then later LAAA is used as abbreviation and as a full term. The aim and the rational of this study its not clearly stated. The Author is using a public data but it was not mentioned clearly under method section that, this is a public data and therefore there is no need for ethical clearance. I think there is a need for a paragraph of sample/data description. Figures have poor quality (they are blur). All Figure legends have incomplete description of what is happening in a figure. I think sub title will make it easier to follow. Reviewer #3: 1 This is an interesting and self-contained paper that will be very helpful to people analysing complex populations. The authors simulated 3-way and 5-way admixture populations that are broadly representative of two groups in South Africa and then with different phenotypes try to understand which models correctly detect SNPs associated with the phenotypes. There are several different modelling approaches and as seen in the paper how you do this has a significant timpact 2 The paper is written clearly and overall the methods are sound. 3 I have one major criticism 4 There is no definition of true positive and false positive -- see detailed comments below. Without knowing this, I am not sure I agree with the conclusions (or at least I can't independently say I do). If their definition of a false positive would include a SNP that is _not_ a SNP in the "input" of the simulation as being causal but is for example 20 base pairs away and has been detected because of LD effects then I think that needs considerable motivation and explanation to justify calling that a false positive. Anyway -- please be clear and define this. 5. The number of SNPs used is only 400k -- which is small -- does this impact the result? If we were using a dense array and/or imputed data would this help or make things more complicated 6. The limitations expressed on page 15 are significant There are different results for 3-way and 5-way admixture (and as the authors point out this differs from results by others for 2-way admixture). But is that a function of the n in n-way or is it more complex than that? I suppose I want a more definitive answer, which is not possible with the evidence givern. 7. As a point of principle I think the code that was developed should be made available via GitHub or the like. 8. Bibliography -- a very annyoing feature of the paper is that references are numbered in the text but there is no number in the bibliography!!!! 9 Other issues: l97-98: There is no reference for the claim that two-way admixed populations "generally" have simple demographic histories and originate from a single pulse admixture event. I suppose this depends what is meant by "simple" and "pulse", but I think either give some reference that supports this or weaken the claim. l117 : "From" -> from Comma after "Khoe-San" on l128 should deleted -- the subject of the sentence is complex and hence the temptation but the comma is wrong and it would be better to simplify the txt. l122 : See point above about availability -- shouldn't these specifications be made available publicly. Just from a selfish point of view you are more likely to get citations if you do and people use them. l 126-130 : I think the characterisation of Khoe-San as a group culturally related needs care as this is contested esp by members of the communities. This is not my area, but the term Khoe-san language is a term that is more convenient for people outside those communities rather than accurate (perhaps like speaking about the Danish-Japanese language gruop?) and I don't think you can say that pastoralism and hunter-gatherer lifestyles are close culturally. As a relatively recent phenomenon as a result of colonialism since the the late 18th C this may have become true in recent historical times but to extrapolate from one place may be risk over-generalising. l142: wouldn't South Asian be better than East Asian if you are using GIH as the proxy. l143: This modelling of SAC is as the authors recognise a simplification -- white settlement in the Cape only started ~370 years ago with ongoing admixture -- and probably at peak from 300-150 years ago. This is not intended as a criticism and I don't think this result weakens the results but I think the authors should address the consequences of this simplification. l167 Simulating phenotypes: A little more detail would be useful here -- I'd like to understand how this takes LD and both global and local ancestry into account. I didn't understand how the SNPs were chosen -- do you just choose one lead SNP in a region or do you simulate SNPs that though not causal are in LD with the "causal" SNP. This para ia bit opaque. Is different LD in different popualtions considered. l178 -- use dash not hyphen (l 201, 227, 238 and many other palces too) l 288-307 -- I don't think true positives etc should be hyphenated l299 -- semi-colon not comma at end l378 -- principle -> principal l383 -- amount -> number l395 -- I am intrigued by this. When we do a GWAS we often get hits caused by SNPs being in LD wth the causal (or at least a lead SNP). We don't see these as false hits though because we understand that they tag the causal SNP. Of course this depends on how bug the admixture block is -- I wonder whether your framework of true positives/false positives is correct, and I think this needs clarificaition at least. If you define a false positive as a SNP that is found in the simulated data output then I think it is too strong -- I think this related to my comment on line 167 above -- see my general comment. l436 -- no comma -- the subject is complex but it just makes it worse to put a comma in. Similar issue on line 456 l448 I don't think only more complex admixture scenarios. Also just understanding the variability you get that may depend on changes in admixture ratios, LD-structure, population difference and so on. l 462. This sentence needs a main verb. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
GWAS in the Southern African context PONE-D-22-03783R1 Dear Dr. Swart, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Badri Padhukasahasram Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: |
| Formally Accepted |
|
PONE-D-22-03783R1 GWAS in the Southern African context Dear Dr. Swart: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Badri Padhukasahasram %CORR_ED_EDITOR_ROLE% PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .