Peer Review History

Original SubmissionNovember 24, 2020
Decision Letter - Lotte Søgaard-Andersen, Editor, Kai Papenfort, Editor

Dear Dr Engelmann,

Thank you very much for submitting your Research Article entitled 'Revealing the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a revised version. We cannot, of course, promise publication at that time.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Kai Papenfort

Guest Editor

PLOS Genetics

Lotte Søgaard-Andersen

Section Editor: Prokaryotic Genetics

PLOS Genetics

Dear Dr Engelmann.

Thank you again for submitting your work to PLoS Genetics. Your manuscript has now been seen by three independent reviewers. All the reviewers found the work to be of interest, however, they all raised several concerns (see detailed comments below), which should be addressed in your revised manuscript as well as in your rebuttal letter. Given that your study might well serve as a resource for other researchers working on S. aureus, please make sure that the PRIDE data are made available (see major comment #1 of reviewer #3) and that Table S4 is updated according to the requests made by reviewer #1.

Looking forward to your revised manuscript.

Kai Papenfort

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This is an interesting paper, which presented a strategy combining computational approaches, proteomics, and ribosome profiling to gain knowledge on the missing part of the genome, the small proteins (< 100 aa) and peptides. The authors have made a rigorous study that will be useful for the community working on Staphylococcus aureus and in more general on bacteria. Their study revealed the existence of 178 small proteins and among them, 27 were not predicted by the genome annotation of the Newman strain, with 83% having a size above 50 aa. Many of these small proteins are basic suggesting that they might target nucleic acids or phospholipids. This work opens up many perspectives for further functional studies.

I have only few comments to be addressed:

1- A significant number of the sORF encoding SP100 (almost 20%) does not present translational activity as demonstrated by the ribosome profiling data (Table S4). The sORF (below 100 nts) seems also to be poorly active as no translational activity was detected for almost 65% of them. As the authors mentioned, it is difficult to detect small proteins that are encoded inside other genes as they can be masked by the signal of the annotated gene. However, recent ribosome profiling improvements have been done to enrich ribosome density at the start codons (using for instance retapamulin to block translation initiation or Onc112 which blocks the entry at the elongation step). It would be of interest to mention these works that can improve the detection of small proteins below 50 aa.

2- The authors should better mention the limits of the proteomic approach they used. For instance, because they have used cytosolic protein fraction for proteomics, it might be possible that some of the peptides have been missed because they are either secreted or transported to the membranes.

3- The annotation of Table S4 can be significantly improved. For instance, I think that CspA protein is an RNA chaperone that is primarily involved in post-transcriptional and translational controls but not in transcriptional control. Most of the highly abundant SP100 are factors involved in ribosome function and biogenesis. Several of these genes are given as they belong to several bacterial ribosomal proteins. The authors should precise the meaningful. Does it mean that these sORFs are pseudogenes? It is also mentioned that the ORFSaNew0084 is an RNA binding protein, which binds to sRNAs and mRNAs, it is Hfq protein? The legend of Table S4 should be detailed (ORF type?).

4- Page 13: the authors have mentioned that the sORFs they have identified are classified into three categories: sORFs in intergenic regions, sORFs overlapping with the 5’ or 3’ UTR of different ORFs, and located within a larger ORF but in a different reading frames. It would be of interest to add this information in Table S4 as well as the sequences of the proteins. Did they find sORFs as possible upstream leaders of other genes? Certainly that further work help to decipher the sORFs that are translated within operons from those that specifically control the translation of the downstream gene.

5- The title has to be changed because even if the strategy is powerful, many of the small ORFs (below 50 aa) were not identified here... I would suggest something like "Towards the charactrization of the hidden world of small proteins..."

Reviewer #2: In the report, Revealing the hidden world of small proteins in Staphylococcus aureus, a

proteogenomics approach, Fuchs et al., developed a bioinformatics workflow to identify potential small protein candidates in the genome of Staphylococcus aureus. The workflow combines proteomics and genomic data analysis and is validated by ribosome profiling. The approach is already established; comparable workflows (with certain variations from lab to lab) are described to explore the existence of small proteins in different organisms (e.g. Yang 211, Miravet-verde 2019; Weaver 2019, Venturini 2020). In this regard, the workflow is solid but it does not have a strong component of innovation away from the field of Staphylococcus aureus. Indeed, for S. aureus laboratories will be an interesting tool.

The experimental part combines MS peptidic data coupled to ribosome profiling. The ribo seq experiment detected a number of ribosome-protected mRNA fragments (RPF) that were mapped to 76% of the total putative sORFs in the genome. However, only a few of them were validated with the corresponding MS peptidic signal. sORFs that are embedded within other larger ORFs did not produce a reliable RFPs signal while other sORFs showed no translational activity. Although this integrative work represents an interesting starting point for identifying small proteins in S. aureus, the strength of the experimental dataset does not allow to answer important questions as to whether these peptidic fragments detected by MS are bona fide small proteins and more importantly, whether they have any biological relevance or there is any indication that these peptides play a role in any cellular process (e.g. interaction with other proteins, phenotypes associated with their expression or virulence potential).

The authors obtained MS-based peptidic signal using gel-based and gel-free pre-fractionation methods and compared the two methods for their precision and accuracy to detect small proteins. It is unclear to this referee what is the purpose of this comparison, what are the benefits of one approach in comparison to the other and more importantly, what is the final conclusion of the comparative study. I understood that the authors used these two methods to detect common peptidic signals and used them as trustable signals. So, it seems that one must run the two approaches to obtained a reliable signal to detect small proteins.

Other comments:

1) Regarding ribosomal profiling, this technique does not corroborate the existence of the identified SP100 as this technique operates at the mRNA level. Please rewrite this sentence acordingly.

2) The number and classification of the MS data is hard to follow. I had problems to understand this part of the manuscript. For instance, there are several values of SP100 found using different approaches/sections and is at times difficult to digest.

3) Why do the authors have focused on the strain Newman? It seems that this clinical isolate is not an MRSA and does not cause important infections these days.

Reviewer #3: The authors report proteomic re-annotation of the S. aureus genome, with a combined LC-MS/MS and ribosome profiling approach leading to identification of 27 previously unannotated gene products <100 amino acids. Given the mushrooming number of recent reports of proteogenomic analysis of small proteins, including many bacteria, novelty of new studies is an important consideration. To that point, this paper most importantly offers (1) the first proteogenomic analysis of S. aureus Newman, an important pathogen, and (2) the development of a web interface for 6-frame genome translation database construction for MaxQuant searches. While many methods for 6-frame genomic databases have been previously reported, it seems worthwhile to create an accessible tool for non-specialists. In another more technical note, the application of multienzyme digest in proteogenomics has been underexplored and the demonstration that this approach (modestly) improves small protein detection in this study may be of some interest to proteomics practitioners. Other mass spectrometry methods reported, such as caparison of gel vs. non-gel-based prefractionation, are in line with previous reports using human samples (such as https://pubs.acs.org/doi/abs/10.1021/acs.analchem.6b00191).

Major concern: It is true that many proteogenomics studies rely on single peptide-spectral matches (PSMs) for small protein identification, due to the low occurrence of unique tryptic peptides within short sequences. This, in itself, is not a concern. However, it is best practice to provide the unique PSMs, along with scores, mass errors, etc., for examination in the case of single peptide identifications – for example, see http://pubsapp.acs.org/paragonplus/submission/jprobs/jprobs_proteomics_guidelines.pdf?. This is particularly important for proteins reported for the first time. I may simply have missed this – in which case, I apologize – but I was unable to access the PRIDE data or find a reviewer key, nor was I able to locate reproduced PSMs in the supplementary material. While the authors have made clear that they utilized stringent criteria, manual inspection is still required.

Minor concern:

Pages 13-14: “For the latter two we can additionally distinguish between those located

at the same strand and those located at the complementary strand. The majority (n=13) belongs to the group (iii) of which five are localized at the same strand (Fig 9). This group of proteins is highly interesting as their existence has been ignored for a long time and highlights the power of an unbiased proteogenomics approach as applied here.”

This sentence seems to imply that overlapping same-strand genes have not yet been reported, which is not true and references should be provided. There is certainly precedent for their existence – at least in E. coli and human. For example, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5761644/, https://academic.oup.com/nar/article/48/3/1029/5556081, https://academic.oup.com/nar/article/48/3/1029/5556081, https://www.embopress.org/doi/full/10.15252/embr.202050640, just to provide a few examples out of many.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Revision 1

Attachments
Attachment
Submitted filename: Reviewer comments.docx
Decision Letter - Lotte Søgaard-Andersen, Editor, Kai Papenfort, Editor

Dear Dr Engelmann,

Thank you very much for submitting your Research Article entitled 'Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important topic but identified some concerns that we ask you address in a revised manuscript

We therefore ask you to modify the manuscript according to the review recommendations. Your revisions should address the specific points made by each reviewer.

In addition we ask that you:

1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images.

We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

Please let us know if you have any questions while making these revisions.

Yours sincerely,

Kai Papenfort

Guest Editor

PLOS Genetics

Lotte Søgaard-Andersen

Section Editor: Prokaryotic Genetics

PLOS Genetics

Dear Dr. Engelmann.

Your revised manuscript has now been evaluated and I am happy to tell you that all reviewers agree that the manuscript has significantly improved and, in principle, is suitable for publication. However, reviewer #2 identified two remaining issues that need to be addressed before we can move forward. Please make sure to address these remaining issues in your revised manuscript and your rebuttal letter. Please note that I will first assess your responses and may not need to approach the referee for further comments.

Kind regards

Kai Papenfort

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have taken into account most of the suggestions of the referees, which have improved the manuscript. The work provides a catalog of novel small proteins that will require further analysis to proove their functional importance. Nevertheless this is certainly a useful study.

Reviewer #2: This is a revised version of the manuscript; Revealing the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach by Fuchs et al. In general, I find the authors’ replies to my satisfaction. I just have two remarks related to my previous comments:

1) One of my criticism was related to the limited innovative component of the workflow proposed in this work; proteomics + genomics (salt) validated by MS (pepper). The authors replied that Pepper is the component of innovation; MS data analysis for identification of small proteins, to crosscheck this data to ORF prediction. MS analysis provides a direct detection of small proteins whereas ribosome profiling provides indirect proof. Then, I do not understand why the authors validated MS data analysis with ribosome profiling. I suggest the authors made this innovative component more clear; maybe they can extend a bit the discussion section to explain the benefits of this technique in relation to other, already existing, approaches.

2) My second point was precisely related to pepper. sORFs embedded within other larger ORFs did not produce a reliable RFPs signal whereas other sORFs showed no translational activity. I meant to emphasize that, whereas this work represents an interesting starting point to identify small proteins in S. aureus, the pepper workflow seems to not provide a sufficiently strong experimental proof to validate that all sORF detected are bona fide small proteins. I understand this is beyond the scope of this work; this referee does not mean that the authors must check and validate all the small proteins detected and whether they play a biologically relevant role. Nonetheless, I think it would be a good idea to made a clear note, maybe in the discussion section, about the limitations of the pepper workflow. I think is important to discuss not only the sORF that were validated but also why other sORF were not possible to validate. This would make the workflow more robust to the eyes of the reader.

Reviewer #3: With the clarification and inclusion of PSMs and criteria, which are indeed very high quality, I believe this manuscript well passes the bar for publication.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Revision 2

Attachments
Attachment
Submitted filename: Reviewer comments_210505.docx
Decision Letter - Lotte Søgaard-Andersen, Editor, Kai Papenfort, Editor

Dear Dr Engelmann,

We are pleased to inform you that your manuscript entitled "Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Kai Papenfort

Guest Editor

PLOS Genetics

Lotte Søgaard-Andersen

Section Editor: Prokaryotic Genetics

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

Dear Dr Engelmann.

Thank you again for submitting your revised work, which I think is now suitable for publication. Congratulations on an excellent manuscript.

Kind regards.

Kai Papenfort

(Guest Editor)

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-01793R2

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Formally Accepted
Acceptance Letter - Lotte Søgaard-Andersen, Editor, Kai Papenfort, Editor

PGENETICS-D-20-01793R2

Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach

Dear Dr Engelmann,

We are pleased to inform you that your manuscript entitled "Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Kata Acsay

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .