Peer Review History
| Original SubmissionOctober 27, 2020 |
|---|
|
Dear Dr. Tavazoie, Thank you for submitting your manuscript entitled "Dynamic landscape of protein occupancy across the Escherichia coli chromosome" for consideration as a Research Article by PLOS Biology. Your manuscript, the reviews and your revision plan have now been evaluated by the PLOS Biology editorial staff, as well as by an academic editor with relevant expertise, and I'm writing to let you know that we would like to consider your manuscript further. However, before we can send a "Major Revision" decision, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire. Please re-submit your manuscript within two working days, i.e. by Nov 11 2020 11:59PM. Login to Editorial Manager here: https://www.editorialmanager.com/pbiology During resubmission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF when you re-submit. Once your full submission is complete, your paper will undergo a series of routine checks. Once your manuscript has passed all checks I will be able to send you your formal decision. Given the disruptions resulting from the ongoing COVID-19 pandemic, please expect delays in the editorial process. We apologise in advance for any inconvenience caused and will do our best to minimize impact as far as possible. Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission. Kind regards, Paula --- Paula Jauregui, PhD, Associate Editor PLOS Biology |
| Revision 1 |
|
Dear Dr. Tavazoie, Thank you very much for submitting your manuscript "Dynamic landscape of protein occupancy across the Escherichia coli chromosome" for consideration as a Methods and Resources at PLOS Biology. Your manuscript and the three Review Commons reviews has been evaluated by the PLOS Biology editors, and by an Academic Editor with relevant expertise. In light of the reviews (below), we will not be able to accept the current version of the manuscript, but we would welcome re-submission of a much-revised version that takes into account the reviewers' comments. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent for further evaluation by the reviewers. We expect to receive your revised manuscript within 3 months. In particular, referee #1 thinks that a user-friendly accessible browser format is really needed, asks about promoters with more than one protein bound, asks whether it is possible to substract H-NS ChIP signals, and says that you should make a clear statement about what is missing. Referee #2's main concern is the source code is missing. Referee #2 also thinks that you should re-write the description of protein occupancy and peak calling in the method section for clarity and asks specific questions about it, says that the functional validation of YieP binding site is not convincing enough. Referee #3 has several suggestions regarding the integration and cross referencing of your findings with existing data, and has questions about TF and co-regulation. Taking these comments into account, we think that your plan for revision is reasonable. Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension. At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may end consideration of the manuscript at PLOS Biology. **IMPORTANT - SUBMITTING YOUR REVISION** Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript: 1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript. *NOTE: In your point by point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point. You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response. 2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Related" file type. *Re-submission Checklist* When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record. Please make sure to read the following important policies and guidelines while preparing your revision: *Published Peer Review* Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details: https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/ *PLOS Data Policy* Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5 *Blot and Gel Data Policy* We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements *Protocols deposition* To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosbiology/s/submission-guidelines#loc-materials-and-methods Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Paula --- Paula Jauregui, PhD, Associate Editor, PLOS Biology ***************************************************** REVIEWS (From Review Commons): Reviewer #1: The paper describes a technique that produces an IPOD (in vivo protein occupancy display) across the entire E coli chromosome. By subtracting out signals due to RNA polymerase, this display is HR (high resolution) in the sense that the authors can see signals due to individually bound proteins. The drawback of the method is that the signals seen in the IPOD are anonymous, but the authors describe in some detail how the display can be exploited. The paper is clearly written and presented, and will be accessible to students, with some effort, and easy to grasp for those who have followed the topic, especially those familiar with the previous 2009 Mol Cell Vora et al paper. The key conclusions are convincing. Of course the methodology has inbuilt issues, like all top-down methods that rely on a particular probe, and these will bias the results, but the authors assess the pros- and cons realistically. Concerning the data, I would have liked to see more of the IPOD in the format in Figure 2. I guess this is somewhere in the Supplementary data, but I was unable easily to access it. What is really needed is a userfriendly accessible browser format. The authors make a convincing case that certain 'spikes' in the IPOD correspond to individual proteins bound. So what about promoter regions where several proteins are bound e.g. the lac operon regulatory region where one might expect to see Lac Repressor and CAP? Or the galETK promoter where there could be Gal Repressor, HU and CAP? Readers might like to check their favorite promoters for themselves. At the very least, the authors need to make a clear statement about what is missing. For me the highlight was the observation that EPODs (extended protein occupancy domains) don't change much according to growth conditions. This leads to a very nice view of proteins on the E coli 'landscape' where regulatory features vary but EPODs don't (well, not much, anyway). The non-HNS EPODS are intriguing. Is it possible to do a subtraction of H-NS ChIP signals as was done for RNAP ChIP signals? Significance The opening paragraph of the discussion explains the relative merits of bottom-up versus top-down approaches to understanding bacterial regulatory networks, and carefully explains that IPOD-HR is an addition to the top-down told. This work is of interest to those concerned with bacterial chromosome organisation and the regulation of bacterial gene expression. Reviewer #2: The authors describe a novel approach that allows characterization of the protein-DNA interaction dynamics at the global scale. The method is a further advance on protein occupancy display (IPOD) developed earlier by the same authors. The crucial modification introduced in the present work is the use of rifampicin treatment to distinguish the contribution from sequence-specific transcription factors and RNA polymerase. The new method is referred to as IPOD-HR (for high resolution). They apply this method across different physiological conditions (exponential and stationary phase, rich or minimal medium) and validate the results using a set of well characterized transcriptional regulators (ArgR, LexA, PurR) and corresponding mutants. They then utilize an unsupervised clustering algorithm on the protein occupancy data for annotated transcription factors and show that transcription regulators belonging to the same GO term tend to cluster together across the conditions. As a large number of protein occupancy peaks obtained in this study does not correspond to the annotated transcription factor binding sites in promoter, the authors select one such site and use it to fish out and identify the interacting protein (YieP). They validate the functional importance of this interaction by in vitro DNA-binding assays and RNA-Seq data for the corresponding mutant which show a significant reduction of the regulated transcript (sdaC) level. Building on this result the authors examine the sequences corresponding to IPOD-HR peaks and find that about a half of them do not match any known (or predicted) transcription factor binding sites. They use computational motif discovery algorithm FIRE to IPOD-HR data to find about 200 novel motifs. These account for almost a third of the observed protein occupancy peaks. Importantly, the inferred motifs show significant enrichment in promoter but not ORF regions. The last part of this work explores so-called extended protein occupancy domains (EPODs) which were first reported by the authors in the previous work and apparently correspond to kilobase-scale regions of high level of protein occupancy. The correction for RNA polymerase occupancy levels introduced in the present study allowed the authors to claim that most EPODs are transcriptionally silent, akin to eukaryotic heterochromatin regions. One striking result is that EPODs appear to be almost invariant across the range of physiological conditions used in this study. The pathways enriched in EPODs regions (prophage genes and mobile elements) as well as the strong overlap with H-NS ChIP-Seq data prompt the authors to suggest that the main role of EPODs is the transcriptional silencing of xenogenetic material. Nonetheless, a significant fraction of EPODs does not overlap with H-NS binding and provide an intriguing opportunity for further study. The strength of the method is that no prior knowledge of the DNA interacting protein is required and it can be used to survey the protein occupancy of both transcription regulatory sites and large-scale protein-bound domains of bacterial chromosome. This will be a valuable addition to the existing toolkit of high-throughput methods and has a great potential when applied to other organisms and physiological conditions. However, several points need to be addressed before the work can be published. 1.My main concern is that being heavily dependent on computational analysis this study does not provide the source code for the analysis performed, specifically protein occupancy and feature (peak) calling and transcription factor co-clustering. While I agree that the inclusion of code for routine steps in NGS data processing such as QC and alignment adds little value to the reader, these are computational steps on which the entire method is built and they should be included in the manuscript as a link to the source code. 2.The description of protein occupancy and peak calling in the method section is difficult to parse and probably should be re-written for better clarity. Specific questions that arise while reading this part: - Why is spline used for smoothing abundances? The choice of smoothing function and its parameters needs better explanation. The same goes for the choice of LOESS fit for RNA polymerase normalization. - Where did 1.1 scaling factor come from? - What's the effect of data downsampling on the fit robustness? - Similarly, the selection of parameters used in peak calling in both CWT and EPODs cases needs to be explained. They seem arbitrary without additional clarification. 3.The functional validation of YieP binding site is not convincing enough. The transcript level depletion in mutant RNA-Seq data might be an indirect effect. A reporter gene assay will provide a more direct evidence that the site is indeed a functional transcriptional regulator. 4.Perhaps outside of the scope of this study but it would be great to provide some kind of functional validation to the novel transcription factor motifs reported here. Reviewer #3: **Summary:** In this manuscript, Freddolino and collaborators introduced their improved in vivo protein occupancy display technology (IPOD-HR) for genome-wide profiling of protein occupancy. The IPOD-HR workflow integrates protein occupancy data and RNA polymerase ChIP-seq data to identify all regions of a bacterial genome occupied by DNAbinding proteins. Using the IPOD-HR technology, the authors identified ~19,000 potential TF-DNA interactions in the genome of Escherichia coli across six different conditions/genotypes. Remarkably, protein occupancy profiles generated using IPOD-HR recapitulated a high proportion of previously reported TF binding sites in E. coli chromosome, identified ~90% of the known DNA binding motifs for E. coli TFs, and discovered thousands of novel TF binding sites that await further characterization. The authors showed that protein occupancy profiles (generated using IPOD-HR) in combination with prior knowledge about a transcriptional regulatory network (TRN) of interest may offer valuable information about the regulatory activity of TFs, uncharacterized gene promoters, and binding affinity of uncharacterized putative TFs. Therefore, this new technology may facilitate the reconstruction of condition-specific bacterial TRNs. **Major comments:** The authors have done an excellent job showing, with specific examples, potential applications of their new technology. However, the manuscript would deeply benefit from a more comprehensive analysis of the functional implications of the TF-DNA binding events discovered using IPOD-HR. Authors should use the protein occupancy profiles they have already generated to reconstruct the binding profiles of individual regulators across different conditions (at least for global regulators with dozens of targets). Authors may achieve this goal by genome-wide scanning of the de novo motifs (matched with known TF binding motifs) in the occupied chromosomal regions or using similar approaches. This type of analysis is necessary to assess the full potential of their technology and to uncover the conditional partition of TF regulons, which would offer key insights about the architecture and dynamic of E. coli TRN. Without this level of resolution, ChIP-seq experiments would still be required to compare the binding profiles of any TF of interest across multiple conditions, reducing the value of IPOD-HR. Finally, I encourage the authors to compare the IPOD-HR reconstructed TF regulons with the information available in RegulonDB. **Specific comments** -L141-148: the authors should clearly explain in 2-3 sentences the main changes in IPODHR with respect to IPOD (RNA polymerase occupancy correction, etc). What are the main changes in the analytical framework? It is unclear what the authors meant. -Fig 2E: authors should evaluate the statistical significance of IPOD-HR occupancy. -Fig 3B: do the TF modules indicate co-regulation? In other words, do TFs that cluster together share target genes or they only have similar average occupancy profiles? Moreover, do the authors use any TF regulon size threshold when computing average occupancy score? If not, how do the authors correct for overlap between regulons (i.e. between small regulons and large regulons)? -L331-333 & Supp. Data S1: the authors, if possible, should include information about TFs potentially binding to the identified peaks. This information will increase the value of the data generated in this study. -L360-365: an important piece of information missing in the current version of the manuscript is the recall rate for known TF regulons (i.e. how many of the known targets of each TF was identified by IPOD-HR). This is information is important to validate your results beyond TF motifs. Additionally, did the IPOD-HR data discover novel targets for previously characterized TFs? If yes, the authors should describe these findings. Finally, can the authors evaluate if the reconstructed regulons are supported by transcriptional data (collected in this study) or publicly available data? This would clarify the function of the putative binding events. -L397-406: the authors should evaluate the overlap between regulons associated with newly inferred motifs and previously characterized TF regulons. This comparison may offer information about the potential role of the uncharacterized TFs and combinatorial regulation. -L530-534: It is unclear to me how you would compensate for the lack of prior information when using the IPOD-HR technology for reconstructing TRNs of uncharacterized species? **Minor comments:** -L60-62: and not all binding events are functional. Especially in the case of TF overexpression commonly used in ChIP-seq experiments. -Fig 3A: authors should indicate how many target genes were used for estimating the occupancy of each TF. -Fig 3C: please add the actual correlation score. Is this anti-correlation a general trend (if you choose any random genes) or specific for the ArgR and PurR targets? -L764: please remind the reader why rifampicin is used. -L828: for how many conditions did the authors perform RNA polymerase ChIP-seq with samples that were not from the same culture used for IPOD? Can the authors show that using different samples for IPOD and RNApol ChIP-seq does not change the estimated TF occupancy profiles? -L1142: No access to the data (GEO requires a password) -Table S4: not available. The reviewer field of expertise is reconstruction of gene regulatory networks. |
| Revision 2 |
|
Dear Dr. Tavazoie, Thank you for submitting your revised Methods and Resources entitled "Dynamic landscape of protein occupancy across the Escherichia coli chromosome" for publication in PLOS Biology. I have now obtained advice from two of the original reviewers and have discussed their comments with the Academic Editor. Based on the reviews, we will probably accept this manuscript for publication, provided you satisfactorily address the remaining points raised by the reviewers. Please also make sure to address the following data and other policy-related requests. You will see that reviewer #3 would like you to add a column with the information regarding the closest gene to each motif binding site, and asks why the same genome-wide scan of motif binding sites used to generate Supplementary Data 4 cannot be applied to the non-redundant motifs that matched with known DNA binding motifs for E. coli. Please address the remaining issues from reviewer #3. DATA POLICY: You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797 Note that we do not require all raw data. Rather, we ask that all individual quantitative observations that underlie the data summarized in the figures and results of your paper be made available in one of the following forms: 1) Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore). 2) Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication. Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it: Figures 2B, 2C, 2D, 2E, 2F, 2G, 3A, 3D, 3E, 3F, 3G, 4A, 4E, 5A, 5B, 5C, 5D, 5G, 5H, 6B, 6E, Supplementary figures 3A, and 3B. NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values). Please also ensure that figure legends in your manuscript include information on where the underlying data can be found, and ensure your supplemental data file/s has a legend. Please ensure that your Data Statement in the submission system accurately describes where your data can be found. BLOT AND GEL REPORTING REQUIREMENTS: We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare and upload them now. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements. We will require this for Figure 4C. As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript. We expect to receive your revised manuscript within two weeks. To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following: - a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list - a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable) - a track-changes file indicating any changes that you have made to the manuscript. NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines: https://journals.plos.org/plosbiology/s/supporting-information *Published Peer Review History* Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details: https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/ *Early Version* Please note that an uncorrected proof of your manuscript will be published online ahead of the final version, unless you opted out when submitting your manuscript. If, for any reason, you do not want an earlier version of your manuscript published online, uncheck the box. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us as soon as possible if you or your institution is planning to press release the article. *Protocols deposition* To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols Please do not hesitate to contact me should you have any questions. Sincerely, Paula --- Paula Jauregui, PhD, Associate Editor, PLOS Biology ------------------------------------------------------------------------ Reviewer remarks: Reviewer #2: Evgeny Nudler. Transcription Elongation and Gene Control. Reviewer #3: Reconstruction of gene regulatory networks. Reviewer #2: The authors did a great job addressing every point I made. I support the publication of the manuscript in its present form. Reviewer #3: The revised manuscript describes the IPOD-HR technology for genome-wide profiling of protein occupancy. The authors comprehensively showed applications of the IPOD-HR method using E. coli as their model system. In this revised version, the authors satisfactorily addressed most of the issues I have previously raised. There are two minor analyses that could benefit the manuscript before acceptance to PLOS Biology: 1. As a method & resource article, would it be possible for authors to add a column with the information regarding the closest gene to each motif binding site listed on Supplementary Data 4? I imagine authors already compiled that information in order to perform the GO enrichment analysis shown in Supplementary Data 5. 2. I understand the authors' response about the current limitations to reconstruct individual TF regulons using a limited set of conditions. However, I still don't understand why the same genome-wide scan of motif binding sites used to generate Supplementary Data 4 cannot be applied to the non-redundant motifs that matched with known DNA binding motifs for E. coli? This could offer potential information about individual (partially characterized) TF regulons. Authors could highlight the caveats they have mentioned so readers are aware of any limitation when interpreting the data. Minor point: 3. Fig S2 panel labels are missing. Also, authors referred to TF clusters in their response but those are not shown in the figure S2. |
| Revision 3 |
|
Dear Dr. Tavazoie, On behalf of my colleagues and the Academic Editor, Matthew K. Waldor, I am pleased to say that we can in principle offer to publish your Methods and Resources "Dynamic landscape of protein occupancy across the Escherichia coli chromosome" in PLOS Biology, provided you address any remaining formatting and reporting issues. These will be detailed in an email that will follow this letter and that you will usually receive within 2-3 business days, during which time no action is required from you. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have made the required changes. Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process. PRESS We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have not yet opted out of the early version process, we ask that you notify us immediately of any press plans so that we may do so on your behalf. We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/. Thank you again for supporting Open Access publishing. We look forward to publishing your paper in PLOS Biology. Sincerely, Paula --- Paula Jauregui, PhD Associate Editor PLOS Biology |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .