Peer Review History

Original SubmissionJuly 22, 2020
Decision Letter - Timothy G. Barraclough, Editor, Stefano Allesina, Editor
Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

Dear Dr. Sukumaran,

Thank you very much for submitting your manuscript "Incorporating the Speciation Process into Species Delimitation" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

I share the opinions provided by both reviewers that this manuscript describes an interesting approach that is in need of some major revision to make it suitable for publication. I would expect a revision to address their comments critically with substantive changes. A few comments of my own to add:

1) The reviewers make an important point that it is not fully apparent to superficial readers of this manuscript that the model can only delimit species if some species are already defined. I think this should be clearly explained up front, returned to later, and apparent from the abstract.

2) Line 17-20. The fact that populations will be artificially promoted by MSC methods is independent of species concept, but what represents the correct level to define a species is very much dependent on species concept. A major absence in the introduction is defining what you mean by species – without that definition, it cannot be possible to objectively delimit them with a statistical analysis. I think you define them as “what systematists define them to be”, i.e. that information external from the method is used to define enough species that you can estimate your parameters. I don’t have a problem with that as a definition but it needs to be clearly stated. Without it, species have some mythical status in this approach and appear to be conjured from thin air, as nothing in the model actually specifies the correct level unless you know the parameter – which you can't do without knowing some species already! There is some useful discussion later on, but it needs to be much clearer upfront.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Timothy G. Barraclough

Guest Editor

PLOS Computational Biology

Stefano Allesina

Deputy Editor

PLOS Computational Biology

***********************

I share the opinions provided by both reviewers that this manuscript describes an interesting approach that is in need of some major revision to make it suitable for publication. I would expect a revision to address their comments critically with substantive changes. A few comments of my own to add:

1) The reviewers make an important point that it is not fully apparent to superficial readers of this manuscript that the model can only delimit species if some species are already defined. I think this should be clearly explained up front, returned to later, and apparent from the abstract.

2) Line 17-20. The fact that populations will be artificially promoted by MSC methods is independent of species concept, but what represents the correct level to define a species is very much dependent on species concept. A major absence in the introduction is defining what you mean by species – without that definition, it cannot be possible to objectively delimit them with a statistical analysis. I think you define them as “what systematists define them to be”, i.e. that information external from the method is used to define enough species that you can estimate your parameters. I don’t have a problem with that as a definition but it needs to be clearly stated. Without it, species have some mythical status in this approach and appear to be conjured from thin air, as nothing in the model actually specifies the correct level unless you know the parameter – which you can't do without knowing some species already! There is some useful discussion later on, but it needs to be much clearer upfront.

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This manuscript describes a novel method for species delimitation using the protracted speciation model, which explicitly models splits of populations within species and their subsequent transitions to full species. Authors claim that they can correctly distinguish species from populations and can avoid over-splits of species by including the population-species transition processes.

I think incorporating the model of speciation process is a logical idea to correct the over-splits caused by the MSC-based delimitation. Also, this method can be useful not just for delimitation but for studies of speciation process itself. Nevertheless, I do not think that some of the authors claims in the manuscript is fully convincing and some corrections are required before publication. Especially, that the genomic sequences alone cannot correctly delimit species is not supported by their results. I think it is rather due to their specific model design.

As already pointed out in the Leaché et al. 2018 paper, the protracted speciation (PS) model decouples speciation process and lineage splitting process. The PS model maps transition events on a population tree with a constant rate (sigma). This parameter is independent from the lineage branching process and is not directly affected by population processes like genetic differentiation or gene flow. Therefore, the inference of sigma is impossible with tree branch lengths alone. The species assignment of some tips is always required to infer the transition rate under this model setting. The manuscript should more clearly mention this property.

In real situations, speciation process is very likely to be affected by multiple population level processes, and the transition rate likely correlated with these parameters (Such as Ne*m). Sigma probably could be estimated from them. Without testing the adequacy of the constant-independent-rate PS model, it is not safe to conclude that the delimitation is impossible from genomic data alone.

Apart from this point, I think the explicit modeling of speciation process is a promising approach and guiding delimitation with taxonomic information is still useful in many situations.

Other points:

Line 51.

The gdi thresholds in Jackson et al. and Leaché et al. are informed by biological observations. They set it to match with the observed species-population boundaries. This procedure is similar to estimating the transition rate from taxonomic knowledge used in this manuscript.

Line 79.

A brief model description is required here before reporting results.

Line 92.

"Figure 1" is missing in the text.

Reviewer #2: This manuscript describes use of the protracted birth-death (PBD) model for the purpose of species delimitation, as implemented in the DELINEATE software. A motivation for the work is to avoid the oversplitting that is commonly observed with species delimitation methods based on the multispecies coalescent, which is facilitated by the speciation-completion rate parameter in the PBD model. The application of this model to the problem of species delimitation is interesting and timely, and the DELINEATE software is nice addition to the collection to the set of tools available for this problem.

However, there are a few major issues with the manuscript that must be corrected before publication:

(1) The entire model is not formally described anywhere in the manuscript, making it impossible for the reader to fully understand how the software works. The PBD model is described to some extent in Figure 1, but Figure 1 is not referenced anywhere in the manuscript. However, the parameter $\\sigma$ doesn’t appear in the caption to Figure 1, although speciation completion is discussed there, and this parameter appears to be the key component of the model for this purpose, as it is discussed throughout the text. It seemed that perhaps the model would be carefully described in the section labeled “Statistical Model Description and Inference Algorithm”, but it is not. It is stated that the data are sequences, and partitions of sequences into species are defined. But how, specifically, are the sequence data integrated with the PBD model? How is $Pr(\\Lambda | \\sigma, S)$ actually computed? It is imperative that the full model be carefully defined somewhere in the manuscript.

The authors also need to more carefully define the algorithm used in DELINEATE and/or specify what options were used in running the simulations. My guess is that all partitions are attempted, the likelihood is calculated on a fixed lineage tree for each partition, and the one with the highest probability is selected. If this is indeed correct, it should be stated. Also, how is a 95% confidence set obtained? Are partitions added in decreasing order of probability until 95% of the probability is accounted for? Or something else?

(2) Is the species completion rate underestimated in these types of models because some species die out? It might be good to comment on this.

(3) For the unconstrained simulations, is there any pattern in the kinds of partitions that had the highest probability? Does the method tend to underestimate or overestimate the number of species? Are certain scenarios less likely to be detected than others?

(4) The first sentence of the Discussion doesn’t seem to be to be supported directly by any of the analyses. Is there evidence presented that this method prevents oversplitting, or is this comment just based on the fact that the model is expected to do that?

Similarly, while I don’t necessarily disagree with the claim (lines 152-153) that “.. inferences that rely on genetic data alone, without reference to any other information for delimiting species, are not reliable”, I don’t see that this claim is justified by the work presented here. Just because this model doesn’t allow for accurate delimitation doesn’t mean that there is no model for genetic data that can’t produce an accurate delimitation.

Minor comments/typos:

- line 27, “for their particular” — complete the phrase

- lines 35-39, problem with the latex citations

- line 44, maybe remove “the” before “MSC-based”

- line 58, remove the comma after “divergence”

- line 141, “under at reasonable” — re-word

- line 157, combine the first two paragraphs of this section

- line 214, missing punctuation

- line 255, “any” is repeated

- line 264, “as it is necessary assumption” — re-word

- first paragraph of Materials & Methods reads awkwardly — might be better to say something like “DELINEATE” has 3 modes of inference: …”

- Section on “Constrained Species Delimitation”, this is very difficult to read since the model has not yet been introduced. What does the parameter $\\sigma$ mean?

- line 394, extra semi-colon

- paragraph starting on line 383, I found this redundant/unnecessary

-line 413, refers to the PBD model, not yet introduced

- line 444, “speciation” is misspelled

- line 451, “the” is repeated

- line 505, “partition” -> “partitions”

- Caption to Fig. 3, “minimum” is misspelled

- When referencing the journal Systematic Biology, capitalize both words.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions, please see http://journals.plos.org/compbiol/s/submission-guidelines#loc-materials-and-methods

Revision 1

Attachments
Attachment
Submitted filename: response-letter.pdf
Decision Letter - Timothy G. Barraclough, Editor, Stefano Allesina, Editor

Dear Dr. Sukumaran,

Thank you very much for submitting your manuscript "Incorporating the Speciation Process into Species Delimitation" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

The manuscript now presents an improved account of the method and its fit into the context of other methods. I agree with all the comments by reviewer 1, so please can you complete minor revisions to address the remaining questions. I agree that the manuscript would be much clearer with Methods first, so please do this now.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Timothy G. Barraclough

Guest Editor

PLOS Computational Biology

Stefano Allesina

Deputy Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

The manuscript now presents an improved account of the method and its fit into the context of other methods. I agree with all the comments by reviewer 1, so please can you complete minor revisions to address the remaining questions. I agree that the manuscript would be much clearer with Methods first, so please do this now.

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This is a revised version of the manuscript describing DELINEATE, a species delimitation method which explicitly models species-population transition processes.

The authors did not agree with some of my comments in the last review, but at least they made some essential corrections. I think the manuscript now more correctly delivers their findings than the last version, and I agree that the MSC (but not genetic species delimitation in general) can be unreliable and external information helps improve delimitation accuracy. Although I am not fully convinced by their response, especially about the interpretation of gdi, I leave them to readers' decisions. I have some comments on the descriptions of the model, and I wrote them in the “Other comments”.

Also, I would recommend moving the Materials&Methods before the Results. In the current version of manuscript, it is very hard to understand the Results section because no information about the modes of software and simulation setups is provided beforehand. (I asked them to add some sentences at the beginning of the Results, but they did not.) I think moving the M&M to the front is easier and PLOS computation biology's style looks flexible according to their guideline.

Other comments:

Line 454:

Providing an equation of Pr(Lambda|sigma, S) will help readers to understand this method. If I understand it correctly, its actual form is Pr(No. transition events>0 | sigma, branch length) or Pr(No. transition events==0 | sigma, branch length) assuming Poisson-distributed events and branch lengths are determined by S and Lambda. The calculation of an overall likelihood and dynamic programming can be clearly understood with an explicit equation too.

Line 466:

It is a bit confusing to use “likelihood” and “probability” interchangeably. I think that Pr(Lambda|sigma, S) is a likelihood function of partition, Lambda, and the algorithm calculate a likelihood of delimitation/partition.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-materials-and-methods

References:

Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

Revision 2

Attachments
Attachment
Submitted filename: response-letter2.pdf
Decision Letter - Timothy G. Barraclough, Editor, Stefano Allesina, Editor

Dear Dr. Sukumaran,

We are pleased to inform you that your manuscript 'Incorporating the Speciation Process into Species Delimitation' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Timothy G. Barraclough

Guest Editor

PLOS Computational Biology

Stefano Allesina

Deputy Editor

PLOS Computational Biology

***********************************************************

Formally Accepted
Acceptance Letter - Timothy G. Barraclough, Editor, Stefano Allesina, Editor

PCOMPBIOL-D-20-01275R2

Incorporating the Speciation Process into Species Delimitation

Dear Dr Sukumaran,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Katalin Szabo

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .