Peer Review History
| Original SubmissionJuly 30, 2019 |
|---|
|
Transfer Alert
This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.
PONE-D-19-21466 Abundance of Super-Population Specific Microsatellites in Human Gene Regions PLOS ONE Dear Dr. Kinney, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. The reviewers highlight multiple points that need consideration. This involves framing of the study (what are the central questions), definitions of terms (super or just specific), methodogical concerns (exome sequencing origin?) and questions of interpretation. We would appreciate receiving your revised manuscript by Nov 03 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript:
Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Arnar Palsson, Ph.D. Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. Please amend either the author names on the online submission form (via Edit Submission) or in the manuscript so that they are identical. Additional Editor Comments (if provided): The reviewers highlight multiple points that need consideration. This involves framing of the study (what are the central questions), definitions of terms (super or just specific), methodogical concerns (exome sequencing origin?) and questions of interpretation. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Partly ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: No ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Kinney et al. investigate the variation of more than 300,000 microsatellites across five super-populations of human ethnic groups using genotypes called from 100bp Illumina reads as part of the 1000 Genomes Project. The authors identify 3,984 microsatellites they term super-population specific (SPS). Subsequent analyses of SPS microsatellites reveal: (1) PCA on the original set of 300,000+ microsatellites and the reduced set of SPS microsatellites produce highly similar patterns on PC1-PC2 biplots; (2) a significant fraction of SPS microsatellites overlap with a set of previously identified eSTRs – microsatellites associated with differences in gene expression; (3) genes harboring SPS microsatellites are enriched for extra-cellular matrix pathways, and; (4) a significant number of SPS microsatellites are coincidental with sequences previously implicated in selective sweeps. Major Concerns 1. The motivation for identifying SPS microsatellites is never explicitly stated. The authors simply tell us they have done so. The closest we get to a motivating statement is in the abstract: “We discover 3,984 super-population specific microsatellites and investigate their potential biological and clinical significance.” But the connection between SPS microsatellites and their functional significance seems hinted at rather than explicitly reasoned through. We’re told, for example, that the SPS set is enriched for eSTRs under a section heading of “Numerous SPS microsatellites affect gene expression.” Yet, we never get explicit statements relating to the importance of this finding. We are told there is a significant overlap, that it may or may not relate to ethnic difference in disease prevalence, and that is it. While I applaud the Introduction’s excellent summary of the state of research into functional microsatellites, the reader is left to connect dots that are too far apart. The four main findings emphasized by the authors – particularly (2) and (4) above – suggest the main motivation for identifying SPS microsatellites is that differences in genotype frequencies among super populations are potentially explained by the action of natural selection … and are therefore of functional importance. However, the manner in which the paper is written makes it feel like we should just know why SPS microsatellites are of importance and that we should just know why the statistically significant findings of the authors prove the worth of SPS microsatellites. 2. The definition of a super-population specific microsatellite is never given. While the method for finding them is explained well and in detail in the Methods section, a succinct, intuitive definition should be given in the abstract, as well as the Introduction and/or Results. For example, the authors might write, “An SPS microsatellite is a microsatellite in which the genotype frequencies of at least one super-population differs from those of all four of the remaining super-populations.” This is still a little clunky, but something to that effect would have made my reading of the paper much easier from the beginning. 3. Although differences in genotype frequencies may in fact stem from the action of natural selection due to different selective pressures at the continental scale, the authors never mention alternative explanations. An obvious alternative cause of SPS patterns of microsatellite variation is the combination of high mutability, genetic drift, and isolation-by-distance. These population genetic factors could easily lead to substantial differences in populations separated by vast distances. Indeed, the main interest in microsatellites has long been their potential to diagnose population structure. The authors do mention the potential use of microsatellites for population structure analyses near the beginning of the Discussion. However, the Discussion should include an honest appraisal of alternative evolutionary explanations for the development of SPS loci. 4. Another example of hinting at the importance of the findings rather than appraising them in a comprehensive and objective manner is the overlap of SPS microsatellites with genomic regions previously identified as potential regions where natural selection acted. The authors write that a selective sweep is “due to a novel allele [that] reduces nearby genetic variation.” However, natural selection on a microsatellite is a very different animal than selection on a SNP.. Selective sweeps of linked genetic variation result from the emergence of that novel allele on a specific genetic background. If selection actually acts on a microsatellite locus, a novel allele is difficult to define. Due to the high mutability of microsatellite loci, the same favored repeat length can arise many times, linked to many different combinations of linked SNPs. Therefore, it is not clear that selection on a microsatellite locus would lead to the classical patterns associated with selective sweeps. The authors do mention that the overlaps are with soft sweep regions, which themselves leave much messier signatures of natural selection than hard sweeps on a new variant. In my opinion, however, this still overlooks the inherent problem of attempting to link selection on microsatellites with signatures of selection based on theory that assumes a SNP is the target of selection. Indeed, we might think of this in the reverse. Isn’t it possible that selection on a single nucleotide variant (that may only offer a selective advantage in only one or a subset of super populations) would cause patterns of variation at linked microsatellites to become different from each other in different populations? In other words, the differences in microsatellite variation point to selection, but selection on a SNP. This is exactly how selection on SNPs in cis-regulatory regions that lead to lactase persistence in some African populations was identified. Minor Concerns 1. p. 2, ln 44: The authors compare array length variation, not “array length mutations” 2. (throughout) Principal component analysis, not Principle component analysis. 3. (throughout) Strange use of colons. For the most part, the authors seem to use them like an em dash would be used. 4. p. 6, ln 116: just curious if higher PCs (e.g., PC3 or PC4) lead to separation of AMR and SAS super populations. More importantly, how much variation do the first two PCs explain? 5. p. 7, lns 143-146. The authors state that eSTRs are commonly found in regions subject to purifying selection, then define a sweep in terms of positive selection. 6. p. 8, ln 157+ Confused by the genomic compartments used. UTRs are technically part of exons, but not the CDS. So, (1) you might make a distinction between UTRs and full+partial exons that make up the CDS … but then why a separate “coding” compartment? … or (2) you might make a distinction between UTRs and CDS … but then why a separate “exon” compartment, since UTRs and CDS comprise exons. Is there overlap between the compartments here? – i.e., are some STRs double-counted? 7. p 11., ln 197 Reference to support the claim that hydrophobic amino acids are more likely to have deleterious effects on protein function. 8. p 14., 1st paragraph Why devote a whole paragraph of a short discussion section to the idea that we have an incomplete picture of human genetic variation. Isn’t the 1000 Genomes Project (with its 26 individual populations) a big step in the right direction? These days, the reference genome is simply a jumping off point for analyses that do make use of data sets that include much more comprehensive coverage of human genetic variation. Reviewer #2: The manuscript is interesting but several issues need to be addressed before publication in PLOSONE. 1. What were the default parameters of the perl script the authors used for finding the repeats? Did they use a constant length cutoff, or did it change with the repeat unit? Studies have reported way more than 1.6 million STRs in humans, and knowing the exact parameters the authors used will help readers understand why they had such a small set of loci to start with. 2. Line 112 and other relevant places should be modified to clearly convey that they have used existing data and not done the sequencing themselves. 3. Link to eSTRs - Why did the authors choose to use all 80,980 loci rather than the 2060 loci that showed a significant effect on expression as reported by Gymrek et al. ? How many common loci are there between the 2060 eSTRs and 3984 SPS? 4. Authors are advised to elaborate on the p value testing they did to show that 64 common out of 3984 total is significant. In particular, I would like to see if they took into account the bias towards coding regions that is introduced by whole exome sequencing. As can be seen from their study, the loci considered by Gymrek et al were predominantly proximal to genic regions. And as the current study preliminarily uses exome data, the chances of overlap are higher than what is true if whole genome is considered as background. The authors must incorporate this into calculating the significance. 5. Regarding the analysis of over/under representation by repeat unit length, if a larger length cutoff was used for larger repeat unit lengths, that might explain why tetra to hexamers are underrepresented in their analyses. 6. The authors call these microsatellites super-population specific, yet there are a lot of overlaps of these loci indicating they are merely highly polymorphic. Is it really correct to call them “specific”? On a related note, what happens to the analyses if they are done only with the unique loci (~2500 loci added together). ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. |
| Revision 1 |
|
Abundance of Ethnically Biased Microsatellites in Human Gene Regions PONE-D-19-21466R1 Dear Dr. Kinney, We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements. Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication. Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. With kind regards, Arnar Palsson, Ph.D. Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Thank you for addressing my concerns effectively and presenting the changes you made in such a comprehensive and lucid manner. Reviewer #2: (No Response) ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Ryan J. Haasl Reviewer #2: Yes: Rakesh Mishra |
| Formally Accepted |
|
PONE-D-19-21466R1 Abundance of Ethnically Biased Microsatellites in Human Gene Regions Dear Dr. Kinney: I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. For any other questions or concerns, please email plosone@plos.org. Thank you for submitting your work to PLOS ONE. With kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Arnar Palsson Academic Editor PLOS ONE |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .