Peer Review History

Original SubmissionFebruary 27, 2020
Decision Letter - Peng Xu, Editor

PONE-D-20-05745

A 3K Axiom® SNP array from a transcriptome-wide SNP resource sheds new light on the genetic diversity and structure of the iconic subtropical conifer tree Araucaria angustifolia (Bert.) Kuntze

PLOS ONE

Dear Dr. Grattapaglia,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by May 17 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Peng Xu, Ph.D.

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements:

1.    Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We understand that samples were taken from a species listed as Critically Endangered by the IUCN. In your Methods section, please provide additional information regarding the permits you obtained for the work. Please ensure you have included the full name of the authority that approved the field site access and, if no permits were required, a brief statement explaining why.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: For the writing, a lot of very long and complex sentences were included in this manuscript, making it difficult to read and understand. I suggest the authors to revise them and use more short sentences instead.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachments
Attachment
Submitted filename: reviewer opinions.docx
Revision 1

RESPONSE TO EDITORIAL AND REVIEWERS REQUIREMENTS - PONE-D-20-05745

“A 3K Axiom® SNP array from a transcriptome-wide SNP resource sheds new light on the genetic diversity and structure of the iconic subtropical conifer tree Araucaria angustifolia (Bert.) Kuntze”

EDITORIAL REQUIREMENT

EDITOR: We understand that samples were taken from a species listed as Critically Endangered by the IUCN. In your Methods section, please provide additional information regarding the permits you obtained for the work. Please ensure you have included the full name of the authority that approved the field site access and, if no permits were required, a brief statement explaining why.

RESPONSE: Following the editorial requirements, we have included in the Methods section the full information regarding the governmental authorization we received to develop this research with Araucaria angustifolia being it a critically endangered species by the IUCN. The authorization number 02001.007609/2012-77 was issued by the Brazilian Institute of the Environment (IBAMA), the regulating body of the Brazilian Ministry of Environment in December 2012 giving full authorization to collect samples from natural populations for DNA analysis toward genetic studies

REVIEWER #1

REVIEWER #1: In the SNP filtering step, Line 221, the parameters of minimum allele frequency (MAF), missing rate, and Hardy-Weinberg equilibrium (HWE) were not used. The MAF filtering can exclude SNPs showing little variation in the population. The SNPs with a high missing rate indicated the number of detected genotype in the population is limited. The HWE filtering help exclude the SNPs arising from sequencing errors and natural selection. Therefore, these criteria are important for the selection of the SNPs.

RESPONSE: The section that the reviewer is indicating on line 221 does not refer to the filtering of SNPs following genotyping. Rather it describes the steps taken during the SNP discovery process using the VCF files of the RAD and RNA sequencing data or a few individuals used for that objective. The parameters taken during SNP discovery are evidently different ones based sequence context and parameters implemented by the GATK best practices. Also given that only a few individuals were sequenced for SNP discovery, neither MAF nor HWE would make any sense.

The parameters the reviewer is mentioning (MAF, call rate and HWE) were used after SNP genotyping is carried out with the array on population samples. This was done and fully explained in section “ SNP array design and validation” on line 267. In that section these parameters were applied during the SNP validation process that followed the construction of the SNP array.

Still regarding the HWE filter, although this filter makes sense in human genetics or species that abide to the HWE expectations given a dioecious sexual reproduction, large populations and little or no structure, this filter does not apply to plant populations with mixed mating or selfing system in subdivided populations. For example, in selfing species, if a HWE filter is applied, all SNPs would be excluded as no or very few heterozygotes are seen. Additionally, in our case, since we genotyped individuals from several different populations, the consolidated set of data would never abide to HWE due to strong population differentiation that causes reduction n overall heterozygosity. So the HWE filter cannot be applied in our case as in essentially all plant population SNP arrays.

REVIEWER #1: In Line 285, 294, 408, MAF ≥ 0.005 was used to declare a polymorphic SNP. However, the commonly recognized and used MAF threshold is 0.05 or 0.01 for valid SNPs. MAF ≥ 0.005 indicated that the array may contain a number of rare SNPs. A MAF distribution plot should be provided to help evaluate this issue.

RESPONSE: We do acknowledge that commonly the MAF threshold used is MAF> 0.05. However this is really a threshold that depends on the sample size used and the envisaged applications of the SNP array. As we describe in detail in that same section “ SNP array design and validation” we used a call rate ≥ 97% or ≥ 90% depending on the criteria adopted and MAF ≥ 0.005 for two reasons. We did deliberately intend to include rare SNPs in our array as these are in fact informative especially if one intends to use the SNP array for association analysis.

Low-frequency and rare variants have been shown to represent an important and understudied component of complex trait genetics. The literature on the importance of rare variants is abundant especially in human genetics (see for example doi: 10.1126/science.1219240 and doi: 10.1007/978-1-4939-7868-7_5) and methods have been increasingly been developed and used to deal with rare variants in GWAS (see for example doi: 10.1002/cphg.83). The second reason is that we genotyped different Araucaria populations and SNP allele frequencies vary across populations such that a SNP may be rare in one population and frequent in another. This is commonly seen in plant populations across large geographical areas mainly due to drift and occasionally selection. So the information content of SNPs will vary across populations and this fully justifies adopting a much lower MAF threshold to include all SNPs with such behavior. The genotyping quality of the SNPs was evaluated by the dish quality parameter that has to do with the quality of cluster separation among other things.

Regarding a MAF distribution plot, which we called site frequency spectrum (SFS) plot, it was adequately provided this in our supplementary material, Figure S3 and cited in the manuscript on line 427. As can be seen in the figure, the SFS does in fact show an enrichment on rare variants. As explained above, this was fully intentional and justified. However the plot also shows enrichment toward higher frequency SNPs.

REVIEWER #1: In Line 271, the author mentioned that the 3K SNP probes for the conifer were tilled on a multispecies Axiom® myDesign ™ array that contained a total of 51,867 SNPs, shared among five different plant species. What is the phylogenetic relationship of these species and more importantly, the genetic divergence of these species? If they are closely related species, some SNPs may inter-influence and cause false genotyping result.

RESPONSE: We would like to thank the reviewer for this question, which provides an opportunity to explain what we did. We added a sentence in Methods explaining this issue, to better clarify the readership.

The development of this multispecies array will be the topic of a specific methods publication under preparation. For right now we presented it as an invited talk at Plant Genome which can be viewed in Youtube (https://www.youtube.com/watch?v=-Wfb6WftQiU), and the reference was cited in the manuscript (reference 62). The five species were Araucaria angustifolia (Brazilian pine – order Pinales), Anacardium occidentale (cashew – order Sapindales), Manihot esculenta (cassava – order Malpighiales), Coffea robusta (coffee – order Gentianales) and Eucalyptus sp. (order Myrtales). These species are phylogenetically quite distant. Not only they belong to different families but also to different phylogenetic orders. Additionally, during array design all 51 thousand plus probes designed were subject to a detailed bioinformatics analysis to avoid probe homology such that SNP probe cross talking was precluded. Finally, even in the unlikely case that some cross talk signal happens, an analytical step can be used in Axiom Suite to quench the fluorescent signal of unwanted probes such that only the signal of the probes of the species under analysis is captured. Reminding, evidently, that only a single species sample is hybridized to the array at the time, in other words no DNA mixing is used. So we are fully confident that no false genotyping result happened or will happen for any of the five species.

Attachments
Attachment
Submitted filename: Response_to_reviewers_ PONE-D-20-05745_RevisedMS.pdf
Decision Letter - Xiaoming Pang, Editor

PONE-D-20-05745R1

A 3K Axiom® SNP array from a transcriptome-wide SNP resource sheds new light on the genetic diversity and structure of the iconic subtropical conifer tree Araucaria angustifolia (Bert.) Kuntze

PLOS ONE

Dear Dr. Grattapaglia,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Sep 07 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Xiaoming Pang, PhD

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: For the minor allele frequency, the authors have raised two reasons for the selction of 0.005. However, since the total number of the SNPs tilling on the array is only 3k, the rare SNPs are difficult to represent significance in both GWAS and in different populations. I still think a higher filtering parameter for MAD is necessary to ensure a higher genotyping rate.

Reviewer #2: Because the huge genome of conifers, cheap array based genotyping rather than large scale GBS will serve as an excellent foundation for implementing genomic selection. Therefore, this article provides useful information to researchers targeting at conifer species. I think that this article should be published.

Minor comments

1. Line 60, “ low to ultra-low sequencing of the whole genome.” what is low sequencing? Do you mean “low-coverage” or “low-cost”?

2. line 78, I disagree with this comment. I think that in addition to humans and animals, many forest trees already have high-density SNP chips for years, including conifers (Mol Ecol Resour. 2013;13(2):324–36. BMC Genomics, 2020;21:9)

3. line 168, the StringTie was used for transcriptome assembly, I think this tool need BAM format data that aligned reads to genome as input, so any references genome was available in this study?

4. It is true that SNP will definitely outperform SSR because of its large number, but comparing thousands of SNPs with just 8 SSR to make this point is neither fair nor necessary.

5. line 486, “Northern populations were on average 25% less genetically diverse (Ho = 0.262) than southern populations (Ho= 0.353) based on SNPs, but this pattern was not as clearly detected with the microsatellite dataset” , I am not sure if this conclusion is correct. Based on statistical tests, I guess they all have significant differences.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Revision 2

RESPONSE TO ADDITIONAL REVIEWERS REQUIREMENTS - PONE-D-20-05745R1

“A 3K Axiom® SNP array from a transcriptome-wide SNP resource sheds new light on the genetic diversity and structure of the iconic subtropical conifer tree Araucaria angustifolia (Bert.) Kuntze”

REVIEWER #1

REVIEWER #1 COMMENT: For the minor allele frequency, the authors have raised two reasons for the selection of 0.005. However, since the total number of the SNPs tilling on the array is only 3k, the rare SNPs are difficult to represent significance in both GWAS and in different populations. I still think a higher filtering parameter for MAF is necessary to ensure a higher genotyping rate.

RESPONSE: As we had responded previously, the threshold used was in fact justified by the two reasons we outlined before. On one hand we deliberately intended to include rare SNPs in our array as these could be informative in specific situations. However the strongest justification was that the estimates were obtained in limited samples sizes for each one of several populations, and in light of the strong differentiation detected, it is likely that SNP allele frequencies vary across populations, such that a SNP allele may be rare overall but it may have higher frequency in some specific populations and thus informative. So, these rare alleles could in fact be well represented in some populations when larger samples sizes are genotyped. This is commonly seen in outbred plant populations across large geographical areas mainly due to drift and occasionally selection.

Regarding the second point we do agree with the reviewer that to include rarer SNPs it is important to ensure high genotyping rate. As described in the MS, we did use stringent dish quality and call rates metrics supported by the Axiom technology (Call rate > 97% for Axiom criteria) such that the quality of the genotype call is robust, despite the low estimated MAF. So we really don’t see any reason to exclude these seemingly rare SNPs from our evaluation in this initial array performance. Certainly as larger samples are studied across variable populations we are confident that several of these rarer SNPs will display higher frequencies.

Another point that should be mentioned is that by including rarer SNPs on the array, the ascertainment bias toward having largely frequent SNPs typical seen in array design, gets mitigated somehow adding to the value of the array. We had discussed this point in the MS.

In any case to address the reviewer’s concern, we added a paragraph to the MS, based on our arguments above, further detailing the reasons why we used a lower MAF threshold. We are confident that this would clarify the readership about our decision. We wrote:

“Despite the fact that typically only SNPs with MAF ≥ 0.05 are considered as polymorphic, we deliberately intended to take into account rare SNPs in our array for two reasons: (1) rare SNPs could be informative for some specific applications such as gene flow, parentage and association studies; (2) we used stringent dish quality and call rates metrics criteria such that the quality of the genotype calls are robust, despite the low MAF; and (3) by having genotyped limited samples of individuals in this first study for each one of several populations, and in light of the strong differentiation detected (see below), it is expected that SNP allele frequencies will vary across populations, such that a SNP allele may be rare overall but it may have higher frequency and thus be informative in specific populations. So, these rare SNP alleles could turn out to be well represented in some populations when larger samples sizes are genotyped, especially for outbred forest tree populations across large geographical areas.”

REVIEWER #2

Minor comments

REVIEWER #2 COMMENT 1. Line 60, “ low to ultra-low sequencing of the whole genome.” what is low sequencing? Do you mean “low-coverage” or “low-cost”?

RESPONSE: Thanks. We really mentioned low-coverage sequencing which also tend to be low-cost from a general point of view. We fixed that.

REVIEWER #2 COMMENT 2. line 78, I disagree with this comment. I think that in addition to humans and animals, many forest trees already have high-density SNP chips for years, including conifers (Mol Ecol Resour. 2013;13(2):324–36. BMC Genomics, 2020;21:9)

RESPONSE: Yes we agree and we added forest trees to the sentence. Additionally, as can be read a few paragraphs down the introduction, we had fully cited all the conifer chips published up to the time we had submitted the MS, including the Mol Ecol Res 2013 reference mentioned. We have now added the Howe et al. 2020 reference to the citations to make it fully complete.

REVIEWER #2 COMMENT 3. line 168, the StringTie was used for transcriptome assembly, I think this tool need BAM format data that aligned reads to genome as input, so any references genome was available in this study?

RESPONSE: Thank you for this opportunity to better clarify our methods in the manuscript. In fact, we have not used StringTie for the transcriptome assembly. As described in the prior MS version, we have adopted the StringTie approach only as an attempt to reconstruct cDNA fragments for both ends of the reads from the RNA-Seq data. The reviewer is correct to notice that StringTie uses BAM formatted data as input and also requires reads to be aligned to a reference genome. However, the StringTie tool includes a separate routine called SR (from Super-Reads), in which the input is constituted of unaligned paired-end reads. This is an optional step for preparing the alignments for the input of StringTie+SR runs (described in the StringTie manuscript, (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4643835/).

StringTie+SR borrows some algorithmic techniques from de novo genome assembly (i.e, do not need an available reference genome) to help with transcript assembly. Its input can include longer contigs that it assembles de novo from unambiguous, non-branching parts of a transcript. If the RNA-Seq data is paired, it is possible to use the SR routine to reconstruct the RNA-Seq fragments from their end sequences, which the authors of the tool have called Super-Reads (please see the StringTie+SR manual,

http://ccb.jhu.edu/software/stringtie/index.shtml?t=manual#assembly).

It can be used as an optional step for preparing these longer contigs for the input of StringTie+SR or any other transcriptome assembly tool that can handle single-end longer contigs that correspond approximately to the original cDNA fragment (such as an EST assembler tool). Because the StringTie+SR still relies on the alignment of these Super-Reads on the top of an reference genome, which is not available in the case of the Araucaria angustifolia, we just use this data as an input of the miraSearchESTSNPs assembler as described later in the text. To further clarify these points, we rewrote this section in the manuscript as follows:

“As a first step, we used the StringTie+SR [51] method to pre-assemble longer contigs from the RNA-Seq paired-end data aiming to achieve the better performance of the subsequent transcriptome assembly. From StringTie+SR, we ran the perl script SR (Link to external file: http://ccb.jhu.edu/software/stringtie/dl/superreads.pl) separately on the reads of each three megastrobiles and each three somatic embryogenic cultures of A. angustifolia. The SR method pre-assembled de novo the paired-end reads from the unambiguous, non-branching parts of the transcript in these six sources of cDNA data, resulting in super-reads of approximately 300 bp corresponding to the entire sequence of the original cDNA fragment. These super-reads were used as input of the miraSearchESTSNPs from MIRA 4 assembler [52] to generate the consensus sequences of the transcript assembly from the different sources of cDNA and to perform the SNP detection within this assembly.”

REVIEWER #2 COMMENT 4. It is true that SNP will definitely outperform SSR because of its large number, but comparing thousands of SNPs with just 8 SSR to make this point is neither fair nor necessary.

RESPONSE: We definitely agree that 8 microsatellites are not comparable to the power of a large set of SNPs. However the point we intend to make in this study is valid in light of what has been published in the literature so far regarding the population genetics structure and diversity of A. angustifolia. Essentially all studies to date have used small sets of 5 to 10 microsatellites and this is the reason we made this comparison. This is in fact a crucial point of our MS, besides the SNP array development. It is therefore necessary, in our understanding, to clearly compare results obtained with the new array we present with was has been done so far to make the point that several of the conclusions provided in the literature for the population genetics of A. angustifolia are, in fact, misrepresentative.

To this end we made sure we genotyped the same exact individuals with the two methods so that no doubt would persist to the reader regarding population and individual sampling and the conclusions derived thereof regarding diversity, inbreeding and structure of the species. So we thank the reviewer for the comment but we do think it is necessary to keep this analysis exactly to make a strong point about this issue. We go over this in detail in our discussion especially in the final section “SNP array data to advance Araucaria genetics, conservation and breeding”. We discuss that the consequences of deriving conclusions about diversity and structure from microsatellite data could be deceptive to any recommendation for conservation.

REVIEWER #2 COMMENT 5. line 486, “Northern populations were on average 25% less genetically diverse (Ho = 0.262) than southern populations (Ho= 0.353) based on SNPs, but this pattern was not as clearly detected with the microsatellite dataset” , I am not sure if this conclusion is correct. Based on statistical tests, I guess they all have significant differences.

RESPONSE: Thanks for the comment and observation. We agree that the difference also existed but was quite smaller. We improved this sentence in the revised version by stating: “Northern populations were on average 25% less genetically diverse (Ho = 0.262) than southern populations (Ho= 0.353) based on SNPs, but this pattern was less pronounced with the microsatellite dataset for which the difference was only 15% (Northern Ho = 0.587; Southern Ho = 0.694) (Table 3).”

Attachments
Attachment
Submitted filename: Response_to_reviewers_ PONE-D-20-05745_R1_RevisedMS.docx
Decision Letter - Xiaoming Pang, Editor

A 3K Axiom® SNP array from a transcriptome-wide SNP resource sheds new light on the genetic diversity and structure of the iconic subtropical conifer tree Araucaria angustifolia (Bert.) Kuntze

PONE-D-20-05745R2

Dear Dr. Grattapaglia,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Xiaoming Pang, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Formally Accepted
Acceptance Letter - Xiaoming Pang, Editor

PONE-D-20-05745R2

A 3K Axiom® SNP array from a transcriptome-wide SNP resource sheds new light on the genetic diversity and structure of the iconic subtropical conifer tree Araucaria angustifolia (Bert.) Kuntze

Dear Dr. Grattapaglia:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Xiaoming Pang

Academic Editor

PLOS ONE

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .