Peer Review History

Original SubmissionMay 13, 2025
Decision Letter - Roland Roberts, Editor

Dear Dr Khalaf,

Thank you for submitting your manuscript entitled "Forty New Genomes Shed Light on Sexual Reproduction and the Origin of Tetraploidy in Microsporidia" for consideration as a Research Article by PLOS Biology.

Your manuscript has now been evaluated by the PLOS Biology editorial staff, as well as by an academic editor with relevant expertise, and I'm writing to let you know that we would like to send your submission out for external peer review.

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. After your manuscript has passed the checks it will be sent out for review. To provide the metadata for your submission, please Login to Editorial Manager (https://www.editorialmanager.com/pbiology) within two working days, i.e. by May 22 2025 11:59PM.

If your manuscript has been previously peer-reviewed at another journal, PLOS Biology is willing to work with those reviews in order to avoid re-starting the process. Submission of the previous reviews is entirely optional and our ability to use them effectively will depend on the willingness of the previous journal to confirm the content of the reports and share the reviewer identities. Please note that we reserve the right to invite additional reviewers if we consider that additional/independent reviewers are needed, although we aim to avoid this as far as possible. In our experience, working with previous reviews does save time.

If you would like us to consider previous reviewer reports, please edit your cover letter to let us know and include the name of the journal where the work was previously considered and the manuscript ID it was given. In addition, please upload a response to the reviews as a 'Prior Peer Review' file type, which should include the reports in full and a point-by-point reply detailing how you have or plan to address the reviewers' concerns.

During the process of completing your manuscript submission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF.

Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission.

Kind regards,

Roli Roberts

Roland Roberts, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

Revision 1
Decision Letter - Roland Roberts, Editor

Dear Dr Khalaf,

Thank you for your patience while your manuscript "Forty New Genomes Shed Light on Sexual Reproduction and the Origin of Tetraploidy in Microsporidia" was peer-reviewed at PLOS Biology. It has now been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by three independent reviewers.

You'll see that reviewer #1 is positive about the study, but thinks that the manuscript needs significant streamlining, especially for our broader readership. S/he also wants better treatment of the prior literature around sex in microsporidia, including population genetic support for this claim. There are also some presentational (to include a table of the genomes’ properties) and methodological (to justify the use of prokaryotic annotation pipeline) requests. Reviewer #2 is also positive, but wants improved analysis and presentation of the phylogeny, expressed scepticism about the number of genes in some species, and has multiple queries about the methods (including how you excluded the co-occurrence of multiple microsporidia in a single host). Reviewer #3 starts very positive, but then becomes more critical, raising concerns about inferring ploidy without cytological data, and the overall lack of methodological clarity. Like reviewer #1, s/he also thinks that the paper needs streamlining.

In light of the reviews, which you will find at the end of this email, we would like to invite you to revise the work to thoroughly address the reviewers' reports.

Given the extent of revision needed, we cannot make a decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is likely to be sent for further evaluation by all or a subset of the reviewers.

In addition to these revisions, you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests shortly.

We expect to receive your revised manuscript within 3 months. Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension.

At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may withdraw it.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point-by-point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Revised Article with Changes Highlighted" file type.

*Re-submission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Roli Roberts

Roland Roberts, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

------------------------------------

REVIEWERS' COMMENTS:

Reviewer #1:

The paper by Khalaf and colleagues provides a comprehensive analysis of 40 microsporidian genomes derived from long-read sequencing datasets of infected animal hosts. The study is robust, with results that are generally well-supported by the provided data and analyses. Below, I offer suggestions to improve the paper's structure, readability, and scientific rigor, along with specific technical comments.

Note that the absence of line numbers and the placement of figure legends (separated from figures and embedded mid-text) made the review process unnecessarily challenging, and I recommend addressing this for clarity.

General Comments:

* Streamline Content for Conciseness and Broader Appeal: While the dataset is extensive, the paper could be significantly shortened by consolidating analyses addressing similar questions. For instance, the phylogenetic and species definition analyses could be combined into a single, concise paragraph, as they are closely related. Similarly, analyses of inter-species versus intra-haplotype rearrangement/recombination both explore genomic diversity in microsporidia and should be merged into one section. Reducing overly detailed discussions of niche interest (e.g., specific phylogenetic nuances) would enhance accessibility for the broader readership of PLoS Biology.

* Clarify "Compartments" Terminology: The use of "compartments" in the context of eukaryotic genomes typically refers to A/B compartments—genomic regions with distinct gene/repeat enrichments and conformational differences tied to the cell cycle. The Hi-C data presented suggests compartmentalization, but it's unclear whether these reflect standard A/B compartments, as seen in other eukaryotes, including symbiotic or pathogenic fungi. The authors should investigate and explicitly compare their findings to known A/B compartmentalization to avoid confusion.

* Acknowledge Prior Hypotheses on Microsporidian Sex and Ploidy: The paper's major claims, such as the proposed sexual reproduction model and diploid/tetraploid cycles, build on earlier hypotheses that are not cited. For example, Pelin et al. (2016) and Corradi (2015) discuss diploid-tetraploid cycles and potential karyogamy in microsporidia, suggesting mechanisms for genetic diversity. These seminal works must be referenced to contextualize the current findings. The authors should clarify whether their data supports, refines, or challenges these hypotheses. Additionally, claims of sexual reproduction should be substantiated with population genetics analyses, which are absent here but have been explored elsewhere (e.g., in studies not cited in this manuscript).

Specific Comments:

* Page 7, Compartmentalization: The statement, "We show that one tetraploid genome is organised into two compartments, likely the nuclei of the diplokaryon," requires clarification. Do these Hi-C-defined compartments show differential enrichment in transposable elements (TEs), BUSCO genes, or other features, as expected in A/B compartments? The authors should compare their findings to known eukaryotic compartmentalization to ensure accurate terminology.

* Page 9, Genome Data Accessibility: The paper lacks a clear summary of the 40 genomes' characteristics. I recommend including a table detailing genome statistics (e.g., size, scaffold number, N50), sequencing methods (e.g., Hi-C, telomere resolution), and quality metrics. Additionally, explain why Hi-C was applied to some genomes but not others and outline the sequencing-to-scaffolding pipeline to provide context for methodological choices.

* Page 11, Figure and Legend Placement: Embedding figure legends mid-text, with figures placed at the end, hinders readability. Figures and their legends should be integrated within the main text at relevant points to facilitate review and comprehension.

* Page 12, Annotation Pipeline: The bacterial annotation pipeline is surprising given microsporidia's eukaryotic nature. Although introns are rare and small, a eukaryotic annotation pipeline is standard and should be employed. Justify the use of a bacterial pipeline or revise the methodology.

* Page 12, Transposable Elements and rRNA: The paper notes correlations between retroelement, DNA transposon, and helitron loads with genome size but lacks detail on their genomic distribution. Are TEs or rRNA operons enriched in specific chromosomal regions, such as subtelomeres, as observed in Encephalitozoon? This should be explored and discussed.

* Page 13, Species Definitions: Defining species based on branch length is problematic, as it risks oversimplification (e.g., analogous to grouping marsupials and mammals as conspecific). Species delimitation in microsporidia should incorporate population genetics approaches to assess genetic exchange, as asexual lineages may not align with traditional species concepts. The authors should acknowledge these limitations and consider integrating population genetics data to support taxonomic claims.

* Page 15, Nomenclature: Designations like "idChiSpeb1.μ" are opaque and confusing. If a species name exists (e.g., for the sample from Chironomus sp.), it should be used consistently to improve clarity.

Reviewer #2:

Microsporidia are a large group of intracellular parasites that infect many species of animals. Their genomes have been of interest as they have the smallest known eukaryotic genomes, but their genomic structure has not been thoroughly investigated due to only have a few species with chromosomal level assemblies. Here the authors take advantage of genome reads from serendipitously infected insects to assemble 40 genomes, representing 17 species. The authors use these genome assemblies to investigate ploidy, genome structure, and provide evidence for sexual reproduction. Overall, this is an interesting paper and the high quality microsporidia genome assemblies will be a useful resource.

Major points:

1. The authors' claim "They allowed us to revise microsporidian phylogeny, notably changing the position of Glugeida to the sister group of the ancestor of the 'orphan lineage and Amblyosporida" is unsupported by the provided data. There needs to be support values provided for the nodes of the tree in figure 2 and ideally more than one phylogenetic approach would be used. The authors use terms like "clustering robustly" and "We confidently placed" , but it is unclear what these descriptions are supported by. The authors choice to use metchnikoviids as the root is not explained, as most groups show that Mitosporidium daphnae is sister to the metchnikoviids and microsporidia. The nodes are also very difficult to see as they are compressed. I would recommend creating a separate version of this tree for the main figure that is just species and not strains. The full tree can then be a supplemental figure.

2. The number of genes predicted which are shown in figure 2 seem unreasonably high. Some species of microsporidia are reported to have less than 2000 protein coding genes, so the reporting at least 10 species having over 20,000 proteins is unexpected. What is the explanation for this? Is this different variants for the same gene being predicted multiple times? Or is this just very short proteins? Other methods besides Prokka that have previously been used on microsporidia genomes should be investigated (see the following paper for examples: DOI: 10.1111/jeu.13038.) The authors state that "no functional annotations were required in this piece of work", but gene prediction affects analysis such as figure 4a and figure 8 which some species have 100 gene pairs vs 1000 gene pairs. Also for these genomes to be useful resource, the annotations along with the genome assemblies should be deposited in a central repository such as NCBI.

3. There are very high levels of duplicated buscos (up to close to 100%) in some assemblies. Is this just because some of the assemblies are haploid resolved? This needs to be explained.

4. How the paper deals with purged genomes is confusing. A better explanation of how and why genomes were purged would be helpful to add.

5. The authors do not appear to take any steps to confirm that the microsporidia reads from their samples come from a single species. Co-occurrence of different microsporidia species in mosquitos has been shown to be ~10% (https://doi.org/10.1111/1755-0998.13205). For the non-chromosomal level genomes, some criteria needs to be used to determine whether the microsporidia reads in each sample are from the same species.

6. What criteria is used to classify ploidy in an assembly needs to be defined. For example, why is the ploidy of iyOphElle1.µ, not determined, as this is a chromosome assembly?

7. The number of genome assemblies that were screened using blobtools for microsporidia reads is not mentioned and should be.

Minor points:

1. I would recommend against using µ as to name the novel microsporidia species genomes. There will certainly be instances where others will use these genomes in applications where Greek letters are not able to be used, and this could lead to confusion. This is a known issue in biology nomenclature and see the following paper for this recommendation in the context of gene names: DOI: 10.1006/geno.2002.6748 . I would suggest using some other abbreviation such "m", "micro", or "ms" that makes the same point using the Latin alphabet.

2. Methods are lacking for the primers and genes that were used to identify individuals as microsporidia positive.

3. In figure 1A "number of microsporidia genomes" would be more appropriate as the X-axis label and moved underneath the X-axis.

4. Several acronyms and abbreviations are used that are not defined such as MFP, Asm., and OUT.

5. The paper often uses a lot of jargon that makes the paper less accessible, such as "Operational taxonomic unit classification of species suggests autotetraploidy in Microsporidia". It would be helpful to revise the paper with an eye towards clarity and simplicity that would help non-specialists be able to understand.

6. The scale in figure 3 is too compressed see the branch lengths of .15 that the authors point as delineating species. It would be helpful to use a different scale that allows this level of differences to be visualized.

7. What is being a considered an OTU in this paper not explained.

8. The text in figure 4A is too small to read and needs to be increased to at least 8 pt.

9. In figure 5 the clade names at the bottom are upside down and should be flipped so they can be more easily read.

10. A scale needs to be shown for figure 7.

Reviewer #3:

Khalaf et al. report on 40 new Microsporidia genomes that were sequenced incidentally as part of the Darwin Tree of Life project. The authors generated rather intact, complete genomes of these parasites and used Hi-C to assess contacts between homologous chromosomes in a genome. The authors use the data to address a large number of questions in Microsporidia biology, such as phylogeny, synteny, and most interestingly the evolution of tetraploidy. The authors show that tetraploids are autopolyploids rather than allopolyploids. A novel hypothesis that the two nuclei inside diplokaryotic cells are actually in some cases tetraploids divided into diploid nuclear compartments is proposed. Overall, this study provides new insights into the biology of these fascinating parasites and is easy to read.

However, I have a few major criticisms that dampen my enthusiasm. Firstly, this paper really rests on really accurate estimation of ploidy, which is tricky without cytology. For example, it is impossible to distinguish a haploid from a homozygous diploid or tetraploid. The presentation of how ploidy was assigned to these genomes was not sufficient to evaluate how robust the assignment was. Some of this comes up in Figure 9, where there is a mixed signal which the authors ascribe to segmental duplication. To be clear, I don't think the authors did a poor job in their assignments, I just think they need to be clearer about how they were done and whether there was any ambiguity. A second issue relates to the Hi-C data. Here, the authors could do a better job in walking us through what is happening. The figure itself is not self explanatory, and while their hypothesis is interesting, it's not clear how well it is supported by the data. How much uncertainty is there in this diplokaryotic model? Third, the authors run through many different analyses that are interesting, but again require really solid ploidy estimation and phasing/assembly of data into homeologues, such as the bizarre finding of an uneven and rearranged tetraploid.

I have few other general comments, but again the paper is pretty cleanly written.

1. The authors should probably streamline some of the Results/presentation. For example Figure 1B is not a big result and there is not much they can say about it. Is it worth even mentioning?

2. A new way of defining species is presented using amino acid substitutions per site. This seems more complicated that the recent analysis by Albuquerque and Haag, doi: 10.1111/jeu.12944 using ANI. I'm not sure the field needs a more complicated method that involves more variation in molecular rates.

3. Figure 4 only uses tetraploids. However, if allopolyploidy occurred between two species that were diploid but homozygous, it could look the same as a diploid.

4. Without evidence of deep coalescence between homeologues, is the section "No evidence of recent rediploidisation in Microsporidia" likely to find anything?

5. Figure 12 could use a little polishing as it shows meiosis yielding two products and gametes undergoing mitosis. These simplifications could end up confusing readers.

Revision 2

Attachments
Attachment
Submitted filename: Response_to_Reviewers.docx
Decision Letter - Roland Roberts, Editor

Dear Dr Khalaf,

Thank you for your patience while we considered your revised manuscript "Forty New Genomes Shed Light on Sexual Reproduction and the Origin of Tetraploidy in Microsporidia" for publication as a Research Article at PLOS Biology. This revised version of your manuscript has been evaluated by the PLOS Biology editors, the Academic Editor, and the original reviewers.

Based on the reviews, we are likely to accept this manuscript for publication, provided you satisfactorily address the remaining points raised by the reviewers and the following data and other policy-related requests.

IMPORTANT - please attend to the following:

a) Please address the remaining points from the reviewers...

b) ...of which the Academic Editor said "I think that reviewer #2 has a valid point that it can be hard to discern if these are clonal vs polyclonal infections. I don't think it takes away from their major conclusions to add this as a caveat given that extent of analysis. As for the protein predictions, I think you can ask that they also address this and ensure that the methods they have used are sufficiently documented that this can be repeated by others working in the field." I've included that in case it's helpful.

c) Please address my Data Policy requests below; specifically, we need you to supply the numerical values underlying Figs 1, 2ABCDE, 3AB, 4, 5, 6A, 7, 8ABC, 9AB, and all of the Supp Figs, either as a supplementary data file or as a permanent DOI’d deposition. I note that you already have an associated Zenodo deposition (https://doi.org/10.5281/zenodo.15364388). Please could you confirm whether the data and code in this deposition are sufficient to recreate the Figures (main and supplementary)?

d) Please cite the location of the data clearly in all relevant main and supplementary Figure legends, e.g. “The data underlying this Figure can be found in S1 Data” or “The data underlying this Figure can be found in https://zenodo.org/records/15364388”

e) I note that (unlike in earlier versions) the supplementary Figs and Tables are currently only accessible as part of very large zipped folders – please submit these as separate files, and include their legends in the manuscript file. It’s fine for the other files (non-Fig, non-Table) to simply be provided in the Zenodo deposition.

f) Please make any custom code available, either as a supplementary file or as part of your Zenodo deposition.

g) Please include the URLs of your funders in the Financial Disclosure statement.

As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript.

In addition to these revisions, you may need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests shortly. If you do not receive a separate email within a few days, please assume that checks have been completed, and no additional changes are required.

We expect to receive your revised manuscript within two weeks.

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following:

- a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list

- a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable, if not applicable please do not delete your existing 'Response to Reviewers' file.)

- a track-changes file indicating any changes that you have made to the manuscript.

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://plos.org/published-peer-review-history/

*Press*

Should you, your institution's press office or the journal office choose to press release your paper, please ensure you have opted out of Early Article Posting on the submission form. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please do not hesitate to contact me should you have any questions.

Sincerely,

Roli Roberts

Roland Roberts, PhD

Senior Editor

rroberts@plos.org

PLOS Biology

------------------------------------------------------------------------

DATA POLICY:

You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797

Note that we do not require all raw data. Rather, we ask that all individual quantitative observations that underlie the data summarized in the figures and results of your paper be made available in one of the following forms:

1) Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore).

2) Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication.

Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it: Figs 1, 2ABCDE, 3AB, 4, 5, 6A, 7, 8ABC, 9AB, and all of the Supp Figs. NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

IMPORTANT: Please also ensure that figure legends in your manuscript include information on where the underlying data can be found, and ensure your supplemental data file/s has a legend.

Please ensure that your Data Statement in the submission system accurately describes where your data can be found.

------------------------------------------------------------------------

CODE POLICY

Per journal policy, if you have generated any custom code during the course of this investigation, please make it available without restrictions. Please ensure that the code is sufficiently well documented and reusable, and that your Data Statement in the Editorial Manager submission system accurately describes where your code can be found.

Please note that we cannot accept sole deposition of code in GitHub, as this could be changed after publication. However, you can archive this version of your publicly available GitHub code to Zenodo. Once you do this, it will generate a DOI number, which you will need to provide in the Data Accessibility Statement (you are welcome to also provide the GitHub access information). See the process for doing this here: https://docs.github.com/en/repositories/archiving-a-github-repository/referencing-and-citing-content

------------------------------------------------------------------------

DATA NOT SHOWN?

- Please note that per journal policy, we do not allow the mention of "data not shown", "personal communication", "manuscript in preparation" or other references to data that is not publicly available or contained within this manuscript. Please either remove mention of these data or provide figures presenting the results and the data underlying the figure(s).

------------------------------------------------------------------------

REVIEWERS' COMMENTS:

Reviewer #1:

No comments.

Reviewer #2:

Although the authors do address some of my concerns there are some that they don't. My biggest remaining concern is the lack of protein predictions and annotations. Although the authors don't use genome wide protein predictions them in their analysis, a large value of a paper that assembles novel genomes, is that the genomes are used as a resource, and the prediction and annotation of proteins is a large part of that. I also don't understand the data on the protein prediction numbers the authors provide. It seems like AUGUSTUS for many of the genomes is predicting 0 proteins. Presumably for their BUSCO analysis they are relying on AUGUSTUS predictions, so its unclear to me why their predictions are so low. My other remaining concerns are listed below.

1.The authors claim that the genomes are clonal infections but they still don't provide convincing data to support that. Their marker scan approach is relying on 1000 bp long sequences (this is from the cited paper, the authors should include this information in their methods) and if only 10 percent of the genome might be sequenced, this might result in no ribosomal sequence or only a fragmented ribosomal sequence. Their genome scan results also likely wouldn't be able to detect if only 10 percent of the genome was from a different species. Their blob tools analysis could potentially deal with a partial cooccurring genome. The authors state: " A mixed infection of multiple microsporidians would display a distinct pattern in the BlobToolKit plots - especially given that the two (or more) microsporidians would likely differ in read depth (because of different levels of infection)," Here are a couple of examples of non-chromosomal genomes and the range of coverages that they see. Do the authors have evidence that a co-infection that is at 10% of the main infection would not have any contigs included within the low range of coverage?

iyAmbProt1.µ 2.219 - 41.448

ihCicViri2.µ 21.3 - 345.6

Although these genomes may indeed be from single clonal infections, I do not believe the authors have provided evidence to support their new claim "we determined that all of our forty microsporidian genomes are derived from single, clonal infections.""

I do think this is a hard problem and the authors do not necessarily have to solve it, but If the authors are not going to convincingly address this, adding a sentence to the discussion or methods about the limitation of analysing microsporidia genomes from metagenomic data would be helpful.

2. The authors are still rooting their tree based on Metchnikovella, without an explanation why. This doesn't effect their results, but displaying the tree this way will be confusing as Mitosporidium is considered the outgroup to metchnikovella and the microsporidia.

Reviewer #3:

I have evaluated the revision, and I think the authors have done a decent job of addressing the previous comments. I note that the responses are somewhat hard to follow because the line numbers do not jive with the rebuttal.

I think the authors should keep in mind that actually it is very easy to have a situation where you have a completely homozygous diploid. There are many eukaryotes where haploid selfing occurs and generates completely homozygous diploids. They do acknowledge that a completely homozygous diploid is impossible to differentiate from a completely homozygous tetraploid or haploid. I also stick behind my statement about ANI, but we can disagree on this opinion and this doesn't influence the results in this paper.

Congratulations to the authors.

Revision 3

Attachments
Attachment
Submitted filename: Response to Reviewers and Addressing Policy Concerns.pdf
Decision Letter - Roland Roberts, Editor

Dear Dr Khalaf,

Thank you for the submission of your revised Research Article "Forty New Genomes Shed Light on Sexual Reproduction and the Origin of Tetraploidy in Microsporidia" for publication in PLOS Biology. On behalf of my colleagues and the Academic Editor, Joseph Heitman, I'm pleased to say that we can in principle accept your manuscript for publication, provided you address any remaining formatting and reporting issues. These will be detailed in an email you should receive within 2-3 business days from our colleagues in the journal operations team; no action is required from you until then. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have completed any requested changes.

Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS: We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have previously opted in to the early version process, we ask that you notify us immediately of any press plans so that we may opt out on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. 

Sincerely,

Roli Roberts

Roland G Roberts, PhD, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .