Peer Review History

Original SubmissionAugust 10, 2021
Decision Letter - Xavier Didelot, Editor, Chengqi YI, Editor

Dear Dr Holmes,

Thank you very much for submitting your Research Article entitled 'HAM-ART: An optimised culture-free Hi-C metagenomics pipeline for tracking antimicrobial resistance genes in complex microbial communities.' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a much-revised version. We cannot, of course, promise publication at that time.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Xavier Didelot

Associate Editor

PLOS Genetics

Chengqi YI

Section Editor: Methods

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Title: HAM-ART: An optimised culture-free Hi-C metagenomics pipeline for tracking antimicrobial resistance genes in complex microbial communities.

Summary:

This is a particularly well prepared and easy to read manuscript. As a consequence, I have zero minor errors to report. As the authors have discussed, their application of Hi-C to generate MAGs and link AMRs containing sequences is not novel per se. However, their approach to Hi-C MAG refinement that leverages external information sources such as GTDBtk determined single-copy genes, is innovative. At the risk of sounding pompous, I would like to commend them for their determination in pursuing the best possible outcome from their data.

That said, there is sufficient new ground being tread within the work that much more explicit methodological detail is required for each stage (from determining consensus-cluster, splitting and refining). Further, and although I realise this is a tall-order, this reviewer would recommend that a section comprising in-silico validation (possibly against a simulated ground-truth) be added. Lastly, I did find any mention of a code repository for this pipeline, which I would regard as mandatory even at the stage of initial submission. This would have been of great assistance and access to the code would remove part of the burden of describing the method. Altogether, this leaves me with concerns for reproducibility.

Comment:

page 17 line 218: The authors may have found that a read-based approach, which targetted resistance genes, would have had greater sensitivity. This would then modify the following discussion contrasting OG/CV presence/absence. Granted, this would not enable any subsequent gene-context analysis. Note: I see this is briefly addressed at line 370.

page 23 line 313: Although this statement is not in hot dispute for this reviewer, I would have liked there to be a wider consideration of available exogenous variables -- that is to say a more complex regression model. I see that there are some words to this effect further down (line 327) mentioning limitations.

page 24 line 341: Although it provides a taxonomic hint as a side-effect of its validation analysis, the authors of checkM would probably be the first to say that it has never been meant for taxonomic assignment. I would consider revising this sentence.

page 31 line 498: Did the authors consider using or did they trial the use an existing Hi-C metagenomic binning software? Modularity based clustering algorithms (such as Louvain) suffer from a resolution limit which may have impacted the authors

capacity to extract less abundant genomes.

page 31 line 507: This is potentially of interest to the tool developers within Hi-C metagenomics community. As such, more detail is required to describe this method. In particular how do the authors reconcile the variation in coverage that will be seen between core and accessory fragments when splitting?

page 31 line 512: As with my previous comment, this step requires greater detail.

page 32 line 526: The detailed description found in this section, along with the brief descriptions of previous steps, highlights the novelty of the author's approach to refine MAGs. Altogether from splitting to this step, there are a number of methodological points at which things could have gone awry. As such, algorithm validation against a ground-truth would have been greatly appreciated by this reviewer.

Reviewer #2: The manuscript by Kalmar et al. entitled “HAM-ART: An optimized culture-free Hi-C metagenomics pipeline for tracking antimicrobial resistance genes in complex microbial communities” describes a laboratory and a computational pipeline using Hi-C data applied to metagenomic samples. Proximity ligation-based methods (3C, Hi-C) applied to environmental samples is a relatively new field with high potential notably to study mobile genetic elements in microbial communities. In the last years several methods and pipelines have been published in this field demonstrating the increasing interest in this technology. The present manuscript takes advantage of the Hi-C data to study AMR genes and their associated MAGs in a longitudinal cohort of pigs. Not surprisingly, the authors found that a higher antibiotics usage is correlated with a higher AMR genes presence. They also develop and propose a new pipeline to analyze Hi-C data ranging from metagenomic samples.

Even if the field will benefit from new pipelines, the manuscript suffers from various problems that will need to be address before publication.

Major comments:

1 - First, the authors say in the abstract that their pipeline is also a laboratory one but I cannot determine the novelty in the preparation of the libraries compare to previous protocols. Moreover, there is no data to illustrate that their Hi-C data are of good quality in terms of 3D signal. Several tools or calculations exist to check the quality of Hi-C data (3D ratio, q3C) and it will be of great importance to know the quality of their data and to compare them to previous published datasets. Moreover, the authors say that they have implemented several quality controls in their computational pipeline (line 581) but they do not provide them for their libraries.

2- The authors describe succinctly a new computational pipeline but the description and the content will need modifications, descriptions as well as comparison with available ones. First, the authors apply an iterative partitioning procedure using the Louvain algorithm, a procedure and a software already used in previous publications and pipeline. Then they keep group of contigs that cluster together during 100 iterations of the algorithm and they call them Consensus Clusters (CCs); a term very close from the Core Communities (CCs) used in a previous publication and in an available pipeline – MetaTOR (Marbouty et al, 2017, Baudry et al. 2019).

Line 507: “The separation of mixed CCs is first addressed using a coverage distribution-based separation algorithm for each CC which splits the CC if the distribution of sequencing coverage was clearly multimodal.” The authors must provide the algorithm used at this step.

Line 512 :” An iterative CC extension step was built into the pipeline at this point to extend clusters based on the Hi-C inter-contig contacts and cautiously identify contigs that should be allocated to multiple CCs.” Same as previously, details are needed here

If the authors want to publish a new pipeline as they claim it [line 305: “ HAM-ART is the first method that is designed to cope with large sample numbers, using the most common Illumina based sequencing platform and delivering results from affordable amounts of sequencing depth.”], they must provide their code to the community and benchmark their approach with already published pipeline (bin3C, MetaTOR, HicZin …) and other datasets.

3- The paper will be easier to understand if the authors provide a summary table with libraries constructed, assemblies generated, binning statistics … for each sample. Moreover, it will be of great interest for the community to have access to the different assemblies as they can reproduce some of the outputs.

4- The analysis on the lnu(A) gene is interesting but lack several controls and need further analysis. Are the contigs detected as plasmids by state of the art tools like PlasFlow or PlasmidFinder ? Can the authors check the topology of the corresponding contigs by using their Hi-C data ? If the contigs are circular, they should be able to detect the 3D signal in the corresponding contact matrices.

Same remark concerning the E. coli analysis and the possible plasmid.

Minor Comments:

- Line 102 : I would replace “assembles” by “bin” as the pipeline do not assemble the genomes but regroup contigs together.

- In general, there is some problems with the terms “assembled”, “scaffolded” and binned and it is difficult to understand, for instance for the E. coli example, the limits of the method. Is it a problem of assembly (shotgun sequencing) ? a problem of binning (not enough Hi-C reads) ?

- MLST is not defined.

- I have problem to understand exactly how the different assemblies were performed. Do the different Hi-C and Shotgun libraries have been mixed to perform them ? or only the shotgun ones ?

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: No: data are not already accessible and i would recommand to also submit the different assemblies generated. Code is also mandatory.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Matthew Z. DeMaere

Reviewer #2: Yes: Martial Marbouty

Revision 1

Attachments
Attachment
Submitted filename: ham_art_responses_final.docx
Decision Letter - Xavier Didelot, Editor, Chengqi YI, Editor

Dear Dr Holmes,

Thank you very much for submitting your Research Article entitled 'HAM-ART: An optimised culture-free Hi-C metagenomics pipeline for tracking antimicrobial resistance genes in complex microbial communities.' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated your efforts to account for their previous comments in this revised manuscript. They agreed that your manuscript has improved significantly. One of the reviewers requested a few more minor changes, which we would like you to address in a revised manuscript.

We therefore ask you to modify the manuscript according to the review recommendations. Your revisions should address the specific points made by each reviewer.

In addition we ask that you:

1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images.

We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

Please let us know if you have any questions while making these revisions.

Yours sincerely,

Xavier Didelot

Associate Editor

PLOS Genetics

Chengqi YI

Section Editor: Methods

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This reviewer agrees with authors in regards to the on-going difficulty in properly validating new methods in Hi-C metagenomics and I accept that they have tried to address this by sequencing targetted isolates.

Besides this frustration with validation, to which I assign ~no~ blame to the authors, I am satisfied with how the authors have addressed my remarks and appreciate their care in doing so.

Reviewer #2: The revised manuscript by Kalmar appears better than the previous version. Codes and data are now available and would help scientists to reproduce their results. Precisions about their computational pipeline have been also added and their approaches appear promising. The manuscript still need minor modifications / precisions:

1- The authors have provided important statistics about their HiC libraries but never discussed about them. It could of great importance as their HiC libraries do not seem to encode more 3D signal compared to a classical 3C library. Is there a real gain in performing HiC that is more expensive than 3C ?

2- I would really appreciate that the authors submit their assemblies to the NCBI or other databases as it will allow others scientist to reproduce their results without this time-consuming assembly step. It is really easy to do and will help the whole community.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Matthew Z DeMaere

Reviewer #2: Yes: Martial Marbouty

Revision 2

Attachments
Attachment
Submitted filename: ham_art_responses_Jan22.docx
Decision Letter - Xavier Didelot, Editor, Chengqi YI, Editor

Dear Dr Holmes,

We are pleased to inform you that your manuscript entitled "HAM-ART: An optimised culture-free Hi-C metagenomics pipeline for tracking antimicrobial resistance genes in complex microbial communities." has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Xavier Didelot

Associate Editor

PLOS Genetics

Chengqi YI

Section Editor: Methods

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-21-00617R2

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Formally Accepted
Acceptance Letter - Xavier Didelot, Editor, Chengqi YI, Editor

PGENETICS-D-21-00617R2

HAM-ART: An optimised culture-free Hi-C metagenomics pipeline for tracking antimicrobial resistance genes in complex microbial communities.

Dear Dr Holmes,

We are pleased to inform you that your manuscript entitled "HAM-ART: An optimised culture-free Hi-C metagenomics pipeline for tracking antimicrobial resistance genes in complex microbial communities." has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Orsolya Voros

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .