Peer Review History

Original SubmissionNovember 28, 2023
Decision Letter - Alberto Amato, Editor
Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

PONE-D-23-39763Diverse patterns of correspondence between protist metabarcodes and protist metagenome-assembled genomesPLOS ONE

Dear Dr. Richter,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Mar 09 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Alberto Amato

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. Thank you for stating the following financial disclosure:

“This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 949745).”

Please state what role the funders took in the study.  If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

If this statement is not correct you must amend it as needed.

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

4. Thank you for stating the following in the Acknowledgments Section of your manuscript:

“We thank Malu Calle for useful advice on statistical analyses of compositional data. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 949745).”

We note that you have provided funding information that is currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

“This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 949745).”

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

5. Please update your submission to use the PLOS LaTeX template. The template and more information on our requirements for LaTeX submissions can be found at http://journals.plos.org/plosone/s/latex.

6. Please amend the manuscript submission data (via Edit Submission) to include authors Daryna Zavadska, Adrià Auladell, Nicolas Henry, and Cédric Berney.

7. We notice that your supplementary figures and tables are included in the manuscript file. Please remove them and upload them with the file type 'Supporting Information'. Please ensure that each Supporting Information file has a legend listed in the manuscript after the references list.

Additional Editor Comments:

Dear Dr. Richter,

I apologise for the very long time we take to have your manuscript reviewed.

Both reviewers find the manuscript very well done and interesting.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The work entitled "Diverse patterns of correspondence between protist metabarcodes and protist

metagenome-assembled genomes" by Daryna Zavadska and co-workers addresses the question of the consistency of large scale genomic studies, based on concurrent approaches, ie, either from marker gene metabarcoding (processed via the recontruction of OTUs or via ASVs) and whole genome analysis through metagenomics (including single MAG reconstructions). The authors question how these concurrent approaches compare, and if they actually lead to similar results. The comparison is made somehow difficult when some of the markers used are not in *MAG reconstructions, but using reasonable hypotheses, well selected methods, absolutely well mastered statistical analyses and real data, authors ended up with four scenarios.

Scenario I - one V9 OTU matches more than one MAG; Scenario II - more than one V9 OTU matches more than one MAG; Scenario III - more than one V9 OTU matches one MAG; Scenario IV - one V9 OTU matches one MAG. The scientific community postulating that scenario IV is the most likely, should now consider the possibility of scenarii I to III, that may leads to divergent conclusions when treating a same complex environmental sample.

This study is therefore novel, consistent, well designed, well analyzed, and extremely clearly written. I have no critics and no comment.

Reviewer #2: In the present work, Zavadska et al. try to match metagenomic assembled genomes and metabarcoding OTUs from the same TARA Oceans samples based on the correspondence of their relative abundances. They first assess which are the best proportionality/correlation metrics to use and build positive and negative controls for validation. Then, after establishing several selection criteria, they obtain 4 different types of MAG-OTU matches, which they discuss. Among other matches, the authors are able to establish reliable one-to-one correspondences in some cases, which represent valuable results to the community.

I think the study is well conducted and methods used are robust, but both methods and results could be better presented in the text and figures (more on this below).

My main concern is that in this study the V9 region (~ 100bp long) of the 18S was used, although samples for the V4 region (~ 380 bp) are also available for the TARA oceans dataset. I think that using the V4 region, which contains nearly 4 times more phylogenetic signal than V9 region, would give different results (and maybe improve the ones already obtained). Thus, I strongly suggest that the authors conduct the same analyses using the V4 dataset.

General comments:

The introduction is quite long and repetitive. A more concise writing and, in particular, avoiding repeating ideas in different paragraphs would help. Introduction and Results have several references to the methods used. Given that methods are central in this manuscript, I do not understand why is there no Methods section in the main text. It could be a good idea to create a Methods section with the main ideas instead of putting everything in the Supplementary Material. This could facilitate the reading and give a clearer structure to the manuscript. Also, it could reduce the length of other sections. For example, some lines from Introduction could go there (e.g., lines 53-55, lines 139-144), as well as some lines from the Results (a great portion of the first section, for example). Following with the structure, the Results section contains discussion parts (the discussion on each of the 4 scenarios, for example, which I think is very well presented), but then Discussion has only 3 small paragraphs. I suggest creating a single Results and Discussion section (but this is just a suggestion). Some parts from the introduction could also be moved and expanded in Discussion.

Specific comments:

· L160: where is Figure S4.4? It is constantly referenced in the main text and supplementary information but I did not find it. When clicked, it links to Figure S8. References in the text to this figure does not seem to match Figure S4 (and within Figure S4 there are no numbers but letters for each subfigure). For example, in line 368: ‘The final system used to subset SMAGs and V9 OTUs by taxonomy is summarized in Figure S4.4’.

· L177: it can be misleading to reference Figure S6 (and Figure S4.4) when Figures S2-S5 are not referenced yet (they first appear in line 181). Change Figure numbering or text order.

· L180: given that the agreement between rho and Spearman’s correlation was used to select candidate matches, these agreeing values need to be in Figure 2 instead of rho proportionality ones. If not, it can be misleading. Also, see specific section below for Figure 2.

· Figure 3: this is a really good summary of the results.

· Discussion: I miss some discussion on how the sequencing depth could affect the results presented. Are iqlr transformations enough to overcome the limitations of compositional data? Another point worth discussing is that the method used is very dependent on good taxonomic assignments. Finally, I would try to stress the results, as they are relevant to the community. Matching OTUs to MAGs seems to be difficult in most cases, but the authors were able to find a good one-to-one correspondence for some of them. This is really valuable, and I would try to emphasize it in the text.

· L311: reference?

· L350: I really appreciate that the authors are sharing all data and code. This should be the way to work in every manuscript, but unfortunately it is not.

· L380: why this filtering?

· L424: which values/thresholds were used to create these categories?

· Figure 2:

I suggest rethinking and redoing Figure 2. This figure tries to summarize so much information in a single page that it is really hard to understand. At least, it was really difficult for me to follow everything that is presented. Given that these are case examples (and not all results), it could be better to just show one representative case per scenario and save the rest for supplementary. With this, plots could be a little bit bigger and overlapping data/lines/points could be separated into different plots.

In a) all top 10 matches are shown equally and only the dots are highlighting the MAGs selected in each scenario. This is hard to interpret, as dots have too much overlap to see a clear picture. For example, the Chloroparvula plot is repeated 4 times and only the dots change (and there is a lot of overlap, so these are hard to see separately). I would make the separation between matches considered and not considered in each scenario much clearer. Having the 2 taxonomic levels mixed in the same plot is also not helpful, I would separate them.

In b), tiles are so little in some cases that are impossible to read. Also, is there some information repeated between a) and b)? This is not clear to me. Why does Florenciella have different taxonomic levels than the rest (Scenario I, section b)? This is hard to find in the text.

In c), fill colors overlap with colors of taxonomic ranks. Also, the outline of the points is hardly visible. And in some cases, the overlap is so high that the plot cannot be interpreted. Could the dots be connected to make it clearer? This needs improvement.

In d), what do the colors represent?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Revision 1

We responded to reviewer and editor comments in the PDF file included in this submission. That document contains appropriate formatting for the response. We include the unformatted version of our response below:

PONE-D-23-39763

Diverse patterns of correspondence between protist metabarcodes and protist metagenome-assembled genomes

Response to Reviewers

We note that all line numbers below refer to the revised manuscript document with tracked changes. This document contains LaTeX code in Microsoft Word format, which we believe is the easiest way to display the differences between the originally submitted version and this revision (despite the fact that it contains some elements of LaTeX code).

Reviewer #1: The work entitled "Diverse patterns of correspondence between protist metabarcodes and protist metagenome-assembled genomes" by Daryna Zavadska and co-workers addresses the question of the consistency of large scale genomic studies, based on concurrent approaches, ie, either from marker gene metabarcoding (processed via the recontruction of OTUs or via ASVs) and whole genome analysis through metagenomics (including single MAG reconstructions). The authors question how these concurrent approaches compare, and if they actually lead to similar results. The comparison is made somehow difficult when some of the markers used are not in *MAG reconstructions, but using reasonable hypotheses, well selected methods, absolutely well mastered statistical analyses and real data, authors ended up with four scenarios.

Scenario I - one V9 OTU matches more than one MAG; Scenario II - more than one V9 OTU matches more than one MAG; Scenario III - more than one V9 OTU matches one MAG; Scenario IV - one V9 OTU matches one MAG. The scientific community postulating that scenario IV is the most likely, should now consider the possibility of scenarii I to III, that may leads to divergent conclusions when treating a same complex environmental sample.

This study is therefore novel, consistent, well designed, well analyzed, and extremely clearly written. I have no critics and no comment.

We appreciate the reviewer’s comments and we thank them for their time in reviewing our manuscript.

Reviewer #2: In the present work, Zavadska et al. try to match metagenomic assembled genomes and metabarcoding OTUs from the same TARA Oceans samples based on the correspondence of their relative abundances. They first assess which are the best proportionality/correlation metrics to use and build positive and negative controls for validation. Then, after establishing several selection criteria, they obtain 4 different types of MAG-OTU matches, which they discuss. Among other matches, the authors are able to establish reliable one-to-one correspondences in some cases, which represent valuable results to the community.

I think the study is well conducted and methods used are robust, but both methods and results could be better presented in the text and figures (more on this below).

We thank the reviewer for their comments, which helped us to improve our manuscript. We believe we have addressed their concerns related to the methods and results, as described in our responses to each specific point below.

My main concern is that in this study the V9 region (~ 100bp long) of the 18S was used, although samples for the V4 region (~ 380 bp) are also available for the TARA oceans dataset. I think that using the V4 region, which contains nearly 4 times more phylogenetic signal than V9 region, would give different results (and maybe improve the ones already obtained). Thus, I strongly suggest that the authors conduct the same analyses using the V4 dataset.

We thank the reviewer for this suggestion. We ourselves also considered the possibility of using the V4 dataset when we initially planned our analyses. Although the V4 region is longer than the V9, we selected the V9 region for our analyses for the following 3 reasons:

The V4 region is more subject to amplification bias than is the V9 (because the V4 has significantly larger variation in length), which could potentially distort the relative abundance values we use as input.

The V4 primers used by Tara Oceans are less universal than the V9 primers used. The V4 primers are known to exclude some groups of eukaryotes and may exclude other, as yet unknown, groups.

At the time we began our analyses, the V4 data had not yet been published. We did not feel it would be appropriate for us to publish a large-scale analysis of unpublished data.

We understand the reviewer’s suggestion that V4 could provide different results or improve the ones already obtained, but we believe that adding V4 is not necessary to support the conclusions we derived from the results in our manuscript, for the following 3 reasons:

The purpose of our study was not to perform an exhaustive comparison of MAGs versus all other data sources. It was to develop a framework for pairwise comparisons of compositional datasets and, as an example case, to compare MAGs vs. V9 OTUs. In this vein, we also did not compare to other Tara Oceans datasets that could provide relative abundance data: metatranscriptomic datasets (within which ribosomal sequences can be collected), or imaging data.

Due to the high computational burden of performing the simulations necessary to produce benchmarks for proportionality and correlation values, we estimate that adding a V4 analysis would take 3-4 months, only considering computational time.

Variable regions in the 18S do not necessarily evolve at strictly the same rate, even between closely related species within a single eukaryotic lineage. In a small minority of eukaryotic lineages, we expect a perfect one-to-one V4-V9 correspondence. However, in most cases we expect that there is either one V4 for multiple V9s or multiple V4s for one V9. Furthermore, in the case where one V4 corresponds to one V9, it may still hide many different “species” based on ITS or other more variable markers. As the reviewer alluded, this would change the nature of our analyses from a pairwise statistical comparison, to a multiple statistical comparison among 3 datasets, and would necessitate a new framework to classify and adjudicate differences between V4 and V9. While we agree that this would be a valuable contribution, it would likely require a significant change to our manuscript, and may not necessarily be useful in the near future, as many projects are already adopting long-read metabarcoding approaches that include both variable regions (e.g., Jamy et al., 2022 https://doi.org/10.1038/s41559-022-01838-4; Gaonkar and Campbell, 2024 https://doi.org/10.1002/ece3.11232).

Therefore, although we agree with the reviewer’s suggestion that adding V4 could contribute to further knowledge on the relationship between MAGs and barcoding datasets, we believe that adding V4 analyses would be out of the scope of our current work. We would instead suggest that future authors perform comparisons to V4 (or to other datasets), for which we believe our methodology could serve as a basis.

General comments:

The introduction is quite long and repetitive. A more concise writing and, in particular, avoiding repeating ideas in different paragraphs would help. Introduction and Results have several references to the methods used. Given that methods are central in this manuscript, I do not understand why is there no Methods section in the main text. It could be a good idea to create a Methods section with the main ideas instead of putting everything in the Supplementary Material. This could facilitate the reading and give a clearer structure to the manuscript. Also, it could reduce the length of other sections. For example, some lines from Introduction could go there (e.g., lines 53-55, lines 139-144), as well as some lines from the Results (a great portion of the first section, for example). Following with the structure, the Results section contains discussion parts (the discussion on each of the 4 scenarios, for example, which I think is very well presented), but then Discussion has only 3 small paragraphs. I suggest creating a single Results and Discussion section (but this is just a suggestion). Some parts from the introduction could also be moved and expanded in Discussion.

We thank the reviewer for these suggestions. To facilitate reading following the reviewer’s advice, we (1) created a new Results and Discussion section, (2) moved parts of the Introduction to the new combined Results and Discussion section, (3) created a Methods section and moved parts of the Introduction and of the Results into the new Methods section, and (4) reorganized the Introduction to remove repetitive sections.

Specific comments:

· L160: where is Figure S4.4? It is constantly referenced in the main text and supplementary information but I did not find it. When clicked, it links to Figure S8. References in the text to this figure does not seem to match Figure S4 (and within Figure S4 there are no numbers but letters for each subfigure). For example, in line 368: ‘The final system used to subset SMAGs and V9 OTUs by taxonomy is summarized in Figure S4.4’.

We thank the reviewer for pointing out this error. This was a bug in the LaTeX file that we used to produce the submitted version of the manuscript. We corrected the error in the resubmitted manuscript.

· L177: it can be misleading to reference Figure S6 (and Figure S4.4) when Figures S2-S5 are not referenced yet (they first appear in line 181). Change Figure numbering or text order.

We corrected these errors in figure numbering.

· L180: given that the agreement between rho and Spearman’s correlation was used to select candidate matches, these agreeing values need to be in Figure 2 instead of rho proportionality ones. If not, it can be misleading. Also, see specific section below for Figure 2.

We tried the reviewer’s suggestion to include Spearman’s correlation along with rho proportionality values in Figure 2, but we could not find a way to present all of the data in a clear and readable manner. Instead, we placed all of the Spearman’s correlation data in the supplementary figures. We also note that all data necessary to select candidate matches (including proportionality and correlation data) is available in a table on the GitHub accompanying our manuscript, here: https://github.com/beaplab/Protist_barcode_MAG_correspondence/blob/main/data/downstream_outputs/Candidate_pairs-for_R_script_scatterplot_corrected_Pairs_automated.csv

Please see below for responses to other specific suggestions on Figure 2.

· Figure 3: this is a really good summary of the results.

Thank you for this comment, we are glad that the figure provides a useful summary of the overall results.

· Discussion: I miss some discussion on how the sequencing depth could affect the results presented. Are iqlr transformations enough to overcome the limitations of compositional data? Another point worth discussing is that the method used is very dependent on good taxonomic assignments. Finally, I would try to stress the results, as they are relevant to the community. Matching OTUs to MAGs seems to be difficult in most cases, but the authors were able to find a good one-to-one correspondence for some of them. This is really valuable, and I would try to emphasize it in the text.

Thank you for the suggestions. We did not test the effects of sequencing depth in our study, since we used data from a previous paper that directly tested the effects of sequencing depth on the recovery of community diversity in the form of OTUs (Figure 1A and associated supplementary figures from DOI: 10.1126/science.1261605). We added a sentence to the Discussion (lines 852-855) to address the reviewer’s suggestion: “A lack of sequencing depth could also result in an underestimate of the sequence diversity present in a sample, although this is routinely tested in metabarcoding studies and is generally not a significant effect”

Regarding whether iqlr transformations are sufficient to overcome the limitations of compositional data: we agree that although iqlr transformations are theoretically intended to overcome these limitations, it is useful to know whether the approach is valid in practice. Following iqlr transformation, we produced results from both shuffling of actual data and using simulated datasets that showed that we were able to distinguish signal from noise (in which the controls represented a noise signal). Nonetheless, the signal that we were able to detect may be due to an actual biological pattern or may be the results of multiple biological biases. Our results matching V9 OTUs with SAGs (as positive controls) suggest that the biological patterns can indeed be detected for some taxonomic groups, but without a larger number of positive control SAGs, we were not able to test this exhaustively. For other taxonomic groups, we were not able to match V9 OTUs with positive control SAGs, as their proportionality or correlation values were below those of artificially generated noise; in these cases, either the underlying biological signal was not strong enough for our method (including the iqlr transformation) to detect it, or the number of stations in which the SAG was present was too low (as few as only 5 stations), or some combination of the two possibilities (see Supplementary Figure 8).

We completely agree with the reviewer’s point that good taxonomic assignments are critical for our method to work properly. To emphasize this, we added a sentence to the Discussion, lines 864-865: “This highlights the importance of accurate taxonomic assignments to produce optimal results within our approach.”

We also appreciate the reviewer’s point that our approach was able to identify potential one-to-one matches between V9 OTUs and MAGs, and that could be valuable to the community. The philosophy guiding how we prepared our manuscript was to present the results, with appropriate caveats, and to allow the reader to draw their own conclusions based on these results. Therefore, we tried to refrain from overemphasizing the potential V9 OTU and MAG matches that we found in the text, as we believe they represent initial hypotheses for potential relationships. We would not want to lead the reader to believe that we presented strong, direct evidence for any individual match, nor that the overall approach is generally applicable to all taxonomic groups (since we hope to encourage researchers working on specific taxonomic groups to investigate the results of our approach within that particular group).

· L311: reference?

Although the idea that either V9 OTUs or MAGs are more appropriate representations of underlying biological entities is implied in various articles in the literature, we were unable to find any references that explicitly assert the superiority of one or the other approach (instead, it is generally implicit, in our interpretation). To reduce the possibility of confusion, we removed this sentence from our article.

· L350: I really appreciate that the authors are sharing all data and code. This should be the way to work in every manuscript, but unfortunately it is not.

We completely agree.

· L380: why this filtering?

We removed V9 OTUs with total abundance less than 100 as they were not likely to contribute any signal to our measurements of proportionality or correlation. This is reflected in the figure below, where total abundance for reads assigned to a given OTU is on the horizontal axis and that OTU’s maximum relative abundance in a single sample is on the vertical axis. As can be observed, OTUs with total abundance less than 100 essentially never contribute substantially to relative abundance in any sample:

To address the reviewer’s comment, we added the following to the supplementary information (lines 1339-1342): “we removed V9 OTUs whose sum of number of reads across all samples was ≤ 100 from subsequent calculations, as these OTUs did not contribute substantially to relative abundance in any sample (data not shown).” We note that we corrected a typo in this sentence, in which the threshold was erroneously written to be ≥ 100, not ≤ 100. We thank the reviewer for pointing out this error.

· L424: which values/thresholds were use

Attachments
Attachment
Submitted filename: barcode_mag_correlation_response_reviewers_revision_1.pdf
Decision Letter - Alberto Amato, Editor

Diverse patterns of correspondence between protist metabarcodes and protist metagenome-assembled genomes

PONE-D-23-39763R1

Dear Dr. Richter,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Alberto Amato

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Dear Dr; Richter,

thank you very much for the great job you and your colleagues did on the manuscript.

I am glad to accept your revised version for publication.

Sincerely

Alberto Amato

Reviewers' comments:

Formally Accepted
Acceptance Letter - Alberto Amato, Editor

PONE-D-23-39763R1

PLOS ONE

Dear Dr. Richter,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Alberto Amato

Academic Editor

PLOS ONE

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .