Peer Review History

Original SubmissionAugust 10, 2025
Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

Decision Letter - Thomas Leitner, Editor

PCOMPBIOL-D-25-01603

WEPP: Phylogenetic Placement Achieves Near-Haplotype Resolution in Wastewater-Based Epidemiology

PLOS Computational Biology

Dear Dr. Turakhia,

Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript within 60 days Dec 28 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below.

* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter

We look forward to receiving your revised manuscript.

Kind regards,

Lin Hou

Academic Editor

PLOS Computational Biology

Thomas Leitner

Section Editor

PLOS Computational Biology

Journal Requirements:

1) Please ensure that the CRediT author contributions listed for every co-author are completed accurately and in full.

At this stage, the following Authors/Authors require contributions: Pranav Gangwar, Pratik Katte, Manu Bhatt, and Yatish Turakhia. Please ensure that the full contributions of each author are acknowledged in the "Add/Edit/Remove Authors" section of our submission form.

The list of CRediT author contributions may be found here: https://journals.plos.org/ploscompbiol/s/authorship#loc-author-contributions

2) We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type u2018LaTeX Source Fileu2019 and leave your .pdf version as the item type u2018Manuscriptu2019.

3) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines:

https://journals.plos.org/ploscompbiol/s/figures

4) We have noticed that you have uploaded Supporting Information files, but you have not included a list of legends. Please add a full list of legends for your Supporting Information files after the references list.

5) Some material included in your submission may be copyrighted. According to PLOSu2019s copyright policy, authors who use figures or other material (e.g., graphics, clipart, maps) from another author or copyright holder must demonstrate or obtain permission to publish this material under the Creative Commons Attribution 4.0 International (CC BY 4.0) License used by PLOS journals. Please closely review the details of PLOSu2019s copyright requirements here: PLOS Licenses and Copyright. If you need to request permissions from a copyright holder, you may use PLOS's Copyright Content Permission form.

Please respond directly to this email and provide any known details concerning your material's license terms and permissions required for reuse, even if you have not yet obtained copyright permissions or are unsure of your material's copyright compatibility. Once you have responded and addressed all other outstanding technical requirements, you may resubmit your manuscript within Editorial Manager.

Potential Copyright Issues:

i) Figure 1. Please confirm whether you drew the images / clip-art within the figure panels by hand. If you did not draw the images, please provide (a) a link to the source of the images or icons and their license / terms of use; or (b) written permission from the copyright holder to publish the images or icons under our CC BY 4.0 license. Alternatively, you may replace the images with open source alternatives. See these open source resources you may use to replace images / clip-art:

- https://commons.wikimedia.org

- https://openclipart.org/.

ii) The following Figure contains screenshots: 1. We are not permitted to publish these under our CC-BY 4.0 license, websites are usually intellectual property and are copyrighted.This includes peripheral graphics of the web browser such as icons and button. We ask that you please remove or replace it.

6) Please amend your detailed Financial Disclosure statement. This is published with the article. It must therefore be completed in full sentences and contain the exact wording you wish to be published.

1) State the initials, alongside each funding source, of each author to receive each grant. For example: "This work was supported by the National Institutes of Health (####### to AM; ###### to CJ) and the National Science Foundation (###### to AM)."

2) State what role the funders took in the study. If the funders had no role in your study, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.".

Note: If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

Reviewers' comments:

Reviewer's Responses to Questions

Reviewer #1: "WEPP: Phylogenetic Placement Achieves Near-Haplotype Resolution in Wastewater-Based Epidemiology" by Pranav Gangwar et al. is an ambitious and timely approach to augment current possibilities of genomic WBE. I think that this contribution does not only serve as a rich inspiration of further research, but is highly relevant to practitioner from the applied public health side. Hence, I would like to strongly support the publication of the manuscript.

Having said this, from my point of view, beside minor issues listed below, the manuscript in its current form suffers from two, probably related issues, I would like to see addressed, prior to publication. (i) I found the use of the terms "lineage", "haplotype", ("genotype",) "allele" confusing. A precise introduction of the terminology in the context of amplicon sequencing of mixed samples would very much improve the readability. (ii) In this context, the term "Potential Haplotypes for unaccounted alleles" is particular difficult. Related to the latter, I also have to admit that I could not fully follow how the "Read-to-Haplotype Mapping and Potential Haplotypes for unaccounted alleles" as described should produce the promised association with confidence, even across amplicon boundaries. I have observed that in the provided examples analysis the list of Potential Haplotypes is long and no direct measure of (un)certainty is provided. Even if I applaud the ambitious goal to deduce genomic background of unaccounted alleles, since this would be very important applications, I would either like to see a more elaborated evaluation, maybe even in the context of recombination events, or at least to tone down the language a bit.

## minor issues and suggestions

* Fig 2: assuming that the points at x = 0 and y = 0 represent FP and FN, respectively, please elaborate how the different tools perform with respect to correct classification (detection instead of quantification)

* I assume that "All MATs were pre-annotated with PANGO lineage annotations." does only apply to the SARS-CoV-2 tree and not to RSV-A and Mpox trees mentioned before this sentence. If so, please correct and specify the lineage annotation regime used for RSV-A and Mpox.

* "For the samples used to mimic realistic cluster emergence scenarios, we first removed the haplotypes found (via ClusterTracker) in the new region from the MAT and simulated 5% of wastewater reads from one of the removed haplotypes, while the remaining 95% were drawn from lineages circulating in the region at that time.". What does "new region" refer to? Please elaborate.

* "For these experiments, we created four base mixtures and simulated each with a novel haplotype at 5%, 10%, or 20% abundance, resulting in 12 samples with 1.6 million reads each." Please specify the nature of simulated haplotypes (number of alleles, distance between alleles).

* please consider to comment (even if it is just in the discussion) how WEPP would perform in the detection of novel recombinant lineages not present in the MAT used.

* what is the time complexity of the algorithm as a function of tree size and read numbers?

* I did not manage to find the views described in Fig 1B(ii-iv) in the dashboard.

Warm regards, Fabian Amman

Reviewer #2: Gangwar et al. introduce a computational pipeline called WEPP, designed to enhance the resolution and analytical capabilities of wastewater-based epidemiology (WBE). By leveraging phylogenetic placement of sequencing reads onto mutation-annotated trees (MATs), WEPP enables the identification of near-exact viral haplotypes directly from wastewater samples. This approach goes beyond lineage-level detection, allowing for the discovery of novel variants, the inference of multiple introductions, and the early identification of emerging pathogens sometimes weeks before clinical confirmation.

The manuscript states that WEPP "uncovered biological insights missed by other tools" and "detected variants up to five weeks before clinical confirmation," but does not present quantitative performance metrics or confidence intervals. I would to suggest include statistically robust benchmarking with synthetic datasets (such as SWAMPy or Freyja benchmark) and real data tested in parallel with Freyja.

In addition, although WEPP has been applied to various pathogens, there is no explicit analysis of the impact of coverage and read quality, which are highly variable factors in environmental samples. I recommend including simulated experiments with different levels of coverage and baseline quality (Q-score), evaluating the algorithm's behavior. This analysis would demonstrate the method's robustness in low-quality contexts, common in WBE.

The claim that WEPP detects variants up to five weeks before clinical confirmation is highly relevant, but lacks epidemiological context. The inclusion of a time-series table or graph (preferred) comparing WBE detection data with corresponding clinical records should demonstrate the magnitude and consistency of the temporal gain.

How does WEPP account for the impact of recombination events in RNA viruses on phylogenetic placement and downstream haplotype inference? Given that recombination can obscure true evolutionary relationships and lead to misleading branch placements in mutation-annotated trees, it would be important to clarify whether WEPP incorporates any correction, detection, or filtering strategy for recombinant genomes, and how such cases might affect its resolution and interpretation of wastewater-derived sequences.

Reviewer #3: This manuscript by Gangwar et al. describes a pathogen-agnostic pipeline that can improve wastewater based pathogen surveillance. It is based on phylogenetic placement of sequencing reads onto mutation-annotated trees. Most of the validation in this manuscript was conducted using SARS-CoV-2 sequecing data, with validation extended to RSV-A and Mpox data. The tool enables identification of haplotypes, reporting the abundance of haplotypes and lineages as well as can flag potential novel variants. Data are integrated in an interactive dashboard for visualization. All of these features combined make a powerful package for future application. However, as presented the manuscript is overly long, repetitive at times and a more critical assessment of what the tool can and can’t do would be greatly beneficial to the reader.

Specific comments (note: there is a lack of page / line numbers):

1. Summary / last sentence: WWE has been used previously to track lineages and report on variants; please revise that this was previously only restricted to clinical surveillance.

2. Abstract: Statement that WEPP detected findings that previous datasets missed does not seem to be true for all datasets.

3. Introduction: Starting at paragraph 4 (nearly half of the introduction) the introduction is a summary of the results (even including reference to figures). While a couple of sentences summarizing the approach and results are ok, this is excessive (and the manuscript ends up being repetitive).

4. Results (and also in intro): prediction of subsequent cases is indicated as 2 weeks. This needs context in terms of sampling scale of WW collection as well as likely organism, lineage, immune status of the population. Other reports have indicated a 1 week lag time

5. The manuscript refers to high-error ONT sequencing in multiple sections. However, this appears to be an outdated statement given the improvements over the past few years in chemistry and basecalling. Please reword.

6. Repetition: the section on intralineage clusters etc introduces the datasets at the beginning; and then later discusses them; this can be much condensed.

7. The discussion needs a limitations section. This should also include a discussion on the importance of epidemiologic data and sampling scales (features the tool itself cannot address).

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy..

Reviewer #1: Yes: Fabian AmmanFabian Amman

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

Figure resubmission:

While revising your submission, we strongly recommend that you use PLOS’s NAAS tool (https://ngplosjournals.pagemajik.ai/artanalysis) to test your figure files. NAAS can convert your figure files to the TIFF file type and meet basic requirements (such as print size, resolution), or provide you with a report on issues that do not meet our requirements and that NAAS cannot fix.

After uploading your figures to PLOS’s NAAS tool - https://ngplosjournals.pagemajik.ai/artanalysis, NAAS will process the files provided and display the results in the "Uploaded Files" section of the page as the processing is complete. If the uploaded figures meet our requirements (or NAAS is able to fix the files to meet our requirements), the figure will be marked as "fixed" above. If NAAS is unable to fix the files, a red "failed" label will appear above. When NAAS has confirmed that the figure files meet our requirements, please download the file via the download option, and include these NAAS processed figure files when submitting your revised manuscript.

Reproducibility:

To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Revision 1

Attachments
Attachment
Submitted filename: Response to Reviewers - WEPP.pdf
Decision Letter - Lin Hou, Editor

Dear Professor Turakhia,

We are pleased to inform you that your manuscript 'WEPP: Phylogenetic Placement Achieves Near-Haplotype Resolution in Wastewater-Based Epidemiology' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology.

Best regards,

Lin Hou

Academic Editor

PLOS Computational Biology

Thomas Leitner

Section Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: During revision of the manuscript "WEPP: Phylogenetic Placement Achieves Near-Haplotype Resolution in Wastewater-Based Epidemiology" by Pranav Gangwar et al. all issues raised during the initial reviewing were adequately addressed.

The only minor issue I have spotted:

"removing its neighboring haplotypes (indicated by ‘Node Removal Radius’, Table 1). As evident" should read

"removing its neighboring haplotypes (indicated by ‘Node Removal Radius’, Table 2). As evident"

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy..

Reviewer #1: No

Formally Accepted
Acceptance Letter - Lin Hou, Editor

PCOMPBIOL-D-25-01603R1

WEPP: Phylogenetic Placement Achieves Near-Haplotype Resolution in Wastewater-Based Epidemiology

Dear Dr Turakhia,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

For Research, Software, and Methods articles, you will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Anita Estes

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .