Peer Review History

Original SubmissionMay 15, 2024
Decision Letter - Suzanne De Bruijn, Editor

Dear Dr Lee,

Thank you for submitting your manuscript entitled "HCNetlas: Human cell network atlas enabling cell type-resolved disease genetics" for consideration as a Methods and Resources by PLOS Biology. Please accept my apologies for the unusual delay incurred while we sought external expert advice.

Your manuscript has now been evaluated by the PLOS Biology editorial staff as well as by an academic editor with relevant expertise and I am writing to let you know that we would like to send your submission out for external peer review.

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. After your manuscript has passed the checks it will be sent out for review. To provide the metadata for your submission, please Login to Editorial Manager (https://www.editorialmanager.com/pbiology) within two working days, i.e. by Jun 08 2024 11:59PM.

If your manuscript has been previously peer-reviewed at another journal, PLOS Biology is willing to work with those reviews in order to avoid re-starting the process. Submission of the previous reviews is entirely optional and our ability to use them effectively will depend on the willingness of the previous journal to confirm the content of the reports and share the reviewer identities. Please note that we reserve the right to invite additional reviewers if we consider that additional/independent reviewers are needed, although we aim to avoid this as far as possible. In our experience, working with previous reviews does save time.

If you would like us to consider previous reviewer reports, please edit your cover letter to let us know and include the name of the journal where the work was previously considered and the manuscript ID it was given. In addition, please upload a response to the reviews as a 'Prior Peer Review' file type, which should include the reports in full and a point-by-point reply detailing how you have or plan to address the reviewers' concerns.

During the process of completing your manuscript submission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF.

Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission.

Kind regards,

Suzanne

Suzanne De Bruijn, PhD,

Associate Editor

PLOS Biology

sbruijn@plos.org

Revision 1
Decision Letter - Richard Hodge, Editor

Dear Dr Lee,

Thank you for your patience while your manuscript "HCNetlas: Human cell network atlas enabling cell type-resolved disease genetics" was peer-reviewed at PLOS Biology as a Methods and Resources Article. Please accept my sincere apologies for the long delays that you have experienced during the peer review process. Your manuscript has now been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by three independent reviewers.

In light of the reviews, which you will find at the end of this email, we would like to invite you to revise the work to thoroughly address the reviewers' reports.

As you will see, the reviewers are generally positive about your HCNetlas resource but raise some overlapping concerns with its general utility. Specifically, Reviewer’s #2 and #3 note that the resource has a limited scope since it is based on only two datasets and that this restricts its applicability as a control resource for disease-related gene analysis. They ask that a revised version expands the database to show that it can replace matched control samples as claimed. In addition, Reviewer #1 raises concerns with the number of cells/samples used for the analyses and asks whether multiple samples were used to construct the CGN’s. After discussions with the Academic Editor, we ask that you please comprehensively address the questions that Reviewer #1 raises.

Given the extent of revision needed, we cannot make a decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is likely to be sent for further evaluation by all or a subset of the reviewers.

We expect to receive your revised manuscript within 3 months. Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension.

At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may withdraw it.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point-by-point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Revised Article with Changes Highlighted" file type.

*Re-submission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Richard

Richard Hodge, PhD

Senior Editor, PLOS Biology

rhodge@plos.org

------------------------------------

REVIEWS:

Reviewer #1: Summary of the paper:

The authors introduced HCNetlas (human cell network atlas), a collection of reference cell type-specific gene networks (CGNs) from a wide array of healthy tissue cells, and used them as the reference network to compare with disease CGNs. For the comparisons of disease vs healthy CGNs, they developed three network analysis methods, differential compactness, differential hubness, and differential pathways, to investigate cell type-specific functions of disease genes. They applied their methods to systemic lupus erythematosus (SLE), Alzheimer's, and lung cancer disease tissue scRNA-seq samples to identify disease-associated cell types, genes, and pathways.

The novelty of this paper lies in the development of reference CGNs and their application to disease data to reveal the cell types most affected by diseases. This could facilitate the development of cell-type-resolved diagnostics and therapeutic strategies for complex human diseases. Overall, the paper is original and of good quality, but needs to address the following questions.

Major Comments:

Q1: An individual's disease network consists of both disease-related gene and network alterations and the individual's biological/genetic differences. Using multiple individual samples may not necessarily eliminate all the biological bias. The paper mentioned 1000 single cells as the cutoff for constructing the CGNS. It is not clear whether the paper used multiple samples to construct the CGNs (both healthy and disease). How many samples are needed to produce a robust network? Some discussion on this will be helpful.

Q2: In evaluating the cell-type-specificity of CGNs (Method), the healthy CGNs are constructed using scHumanNet, a program developed in their previous paper. Please give some basic introduction to the scHumanNet method so readers do not have to go back to read the previous paper.

Q3: (Method) They tested the correctness of CGNs by assuming that genes functionally connected within a CGN reflect the properties of their respective cell type (defined by the GO annotation). They considered the interconnectivity within genes for each cell type as a measure of the cell-type specificity. This is true, but how can this prove the correctness of edges (or each edge) in CGNs? They identified hub genes within each CGN and ranked them using the FindAllHub() function of the scHumanNet. Again, how to prove the correctness of these hub genes?

Q4: (Results) HCNetlas. Figure 1C has many tissue abbreviations. Explanations of these tissues should be displayed somewhere (moved from the supplementary tables to the main text). Why does the CGN network size reach the plateau when the cell count reaches 1000? Why does the network size shrink when the cell count goes over 1000? The network gene expression profiles of the same cell types tend to cluster together in Figure 1B. Aren't the cell types defined by gene expression profiles? Figure 1B can only confirm that the program chose a similar set of genes for the same cell types, but can't prove whether these genes are related to the cell type or not. Can you explain these?

Q5: (Results) HCNetlas as a tool for unraveling cell type specificity of disease genes. They ranked genes in each CNG by network degree centrality and then conducted gene set enrichment analysis using disease-associated genes obtained from DisGeNET and GWAS. For Figure 3A, how are the disease gene sets (rows) ordered? Are they ordered randomly? How are the CGNs (columns) ordered hierarchically? What do the colors/numbers represent? The conclusions drawn from Figure 3A are confusing.

Q6: (Results) What are the definitions of gain and loss of pathways? A pathway consists of multiple genes. How to compute them?

Q7: (Results) How many disease samples were used in the SLE, AD, and Lung Cancer data analysis?

Q8: I found a related paper in the following: Li, Z., Liu, G., Yang, X. et al. An atlas of cell-type-specific interactome networks across 44 human tumor types. Genome Med 16, 30 (2024). https://doi.org/10.1186/s13073-024-01303-w

Reviewer #2: In their manuscript Yu et al describe the setting of a new resource database for the collection of cell type-specific gene networks (CGNs) within specific cell types of various tissues. They want to establish reference CGNs, and then for any given set of disease genes be able to compare to the reference set of CGNs. Thus, the idea behind their project is to make available for researchers a resource of control CGNs that could be used for disease-related gene analysis instead of using matched control samples. Overall, the paper is well-written and include validation with the use of specific disease for assessing differences in CGNs.

Although the idea behind the study is interesting and valuable, I am unsure if at this stage with its application to few datasets, if HCNetlas brings enough for investigators to use it as their controls and what are the plans for extension. Why did the authors limit their CGNs to a few datasets, instead of applying it to all control datasets available in the HCA? It seems that for now HCNetlas is limited to data from 2 published papers, comprising in total 25 organs and the associated 61 cell types identified in these 2 reports. One of these papers is mostly limited to immune cell identification, and thus limiting the whole human being aspect of the analyses and CGNs identified. At this stage, it seems a better fit for any immune-related disorder as most of the cell types included for CGNs are immune cells.

Unless I am misunderstanding, I am unsure why the authors limited their approach to 2 datasets, albeit 2 very large datasets. If the goal is to eliminate running control samples, this approach would need to be widened and seems limited for now as a general database. It is well-known and described by several sc studies performed on human subjects that the donor-donor variability is huge. So, the limitation here seems to reduce the catalog of 'control samples' to a limited number of donors, 28 which may definitely not be enough to cover all ages for both sexes. Is the goal to extend to all available control datasets deposited in the HCA? This may need to be discussed and added as limitations.

At different parts in the paper, the authors mention using the studies from Dominguez Conde 2022 and the Tabula Sapiens from 2022, as well as the Allen Brain atlas data. However, in the text, they mention using HCA data. Could it be clarified for readers?

A minimum of 1,000 cells for a given cell type for CGN identification may remove from analysis some important cell type that may be related to diseases. This is a major limitation of the study.

On a more philosophical point, there are different factors that could alter gene expression and thus CGNs in human cell types that would not be disease-related but may affect disease-related genes. Or they may be specific conditions that may later CGNs without causing a specific disease. Every human is unique, and may have unique sets of CGNs at distinct point in time, that may vary with aging, although not all of these people will present with a specific disease. How is this taken into account in HCNetlas? What are considered controls from non-controls? Do you plan on applying specific threshold for inclusion in your database, like age, BMI, smoker or not, etc…. Healthy for one cell type may not be healthy for another, and the way control tissues are handled at biobank is limited in terms of metadata. For example, obtaining a healthy tissue, may just mean that this tissue did not present with a tumor, but not that the person in which this tissue was derived was healthy, and thus the CGN may be affected. With such variability how can a reference atlas be handled?

Reviewer #3: The authors created and tested co-expression based gene-gene networks for human immune and brain cells. They used these networks to identify disruption of specific cell types based on patient samples from Alzheimer's disease and systemic lupus erythematosus.

Given the word 'atlas' in the title, I assumed this approach would highlight specific cell types body-wide for a given set of disease-associated genes. Instead, this is limited to immune cells from several organs and the brain cell types (which didn't cover the resident immune cells). In this regard, readers might need clarification on how comprehensive this network atlas is.

GCNs created from the control samples from the SLE and Alzheimer's disease datasets were not compared to results from the HCNetlas GCNs. This needs to be tested since the authors claim their approach can 'circumvent the need for matched control samples.' This may highlight that the AD study took samples from the prefrontal cortex while the HCNetlas GCN is derived from the motor cortex, and was also enriched for neurons. This was done for the lung cancer study, but the conclusion that 'lung cancer genes exert their roles in immune cells' doesn't seem supported, given that the HCNetlas GCN was only from immune cells.

Figure 2 suggests that the GCNs are partially driven by expression level, given the high connectivity of the B and T cell marker genes. That leaves the question of whether the 'mere changes in expression levels' would highlight the same cell types.

For Alzheimer's disease, the GWAS-associated genes are expressed highly in microglia cells in the brain, which were not included in HCNetlas. The authors did not find this because they lacked the microglial cells to generate an GCN. In addition, the Alzheimer's GWAS hits are enriched for expression in the spleen and macrophages. This links the peripheral immune system to the disease. These associations were found from body-wide atlases like GTEx, but it doesn't seem HCNetlas would highlight macrophages. So, given its current setup, it would have missed this important characterization of the Alzheimer's disease genes.

Methods

-were the author-provided cell type annotations used? It seems that they were not and CellTypist was used instead. This should be clearly stated and justified, as the author-provided annotations should be trusted over CellTypist predictions. In addition, the CellTypist parameters should be specified.

Minor

-for Figure 2C, please mark which genes are within the top15 of the specific four cell types. It seems there is some overlap since only 50 genes are shown.

-Figure 2 uses GO BP, but KEGG is used later on. GO is more frequently updated and comprehensive and should be used over KEGG.

-the source of the disease-associated gene sets is highly variable. DisGeNET and KEGG collects genes from many sources with different degrees of evidence. This is clear with the AD genes as 17% are NADH:Ubiquinone Oxidoreductases, which lack genetic evidence and probably end up in the list because KEGG links them together.

Revision 2

Attachments
Attachment
Submitted filename: PLoS_response_letter_final.docx
Decision Letter - Richard Hodge, Editor

Dear Dr Lee,

Thank you for your continued patience while we considered your revised manuscript "HCNetlas: Human cell network atlas enabling cell type-resolved disease genetics" for publication as a Methods and Resources Article at PLOS Biology. Please accept my sincere apologies for the delays that you have experienced during this round of the peer review process. This revised version of your manuscript has been evaluated by the PLOS Biology editors, the Academic Editor and two of the original reviewers.

Based on the reviews, I am pleased to say that we are likely to accept this manuscript for publication, provided you satisfactorily address the following data and other policy-related requests that I have provided below (A-E):

(A) We routinely suggest changes to titles to ensure maximum accessibility for a broad, non-specialist readership. In this case, we would suggest a minor edit to the title, as follows. Please ensure you change both the manuscript file and the online submission system, as they need to match for final acceptance:

“HCNetlas: a reference database of human cell-type-specific gene networks to aid disease genetic analyses”

(B) You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797

Note that we do not require all raw data. Rather, we ask that all individual quantitative observations that underlie the data summarized in the figures and results of your paper be made available in one of the following forms:

-Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore).

-Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication.

Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it:

Figure 1B-C, 2A-E, 3A-B, 4A-F, 5A-F, 6A-G, S1A-C, S2A-B, S3, S4A-C, S5

NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

(C) Please also ensure that each of the relevant figure legends in your manuscript include information on *WHERE THE UNDERLYING DATA CAN BE FOUND*, and ensure your supplemental data file/s has a legend.

(D) Please ensure that your Data Statement in the submission system accurately describes where your data can be found and is in final format, as it will be published as written there.

(E) Please note that we cannot accept sole deposition of code in GitHub, as this could be changed after publication. However, you can archive this version of your publicly available GitHub code to Zenodo. Once you do this, it will generate a DOI number, which you will need to provide in the Data Accessibility Statement (you are welcome to also provide the GitHub access information). See the process for doing this here: https://docs.github.com/en/repositories/archiving-a-github-repository/referencing-and-citing-content

------------------------------------------------------------------------

As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript.

We expect to receive your revised manuscript within two weeks.

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following:

- a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list

- a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable, if not applicable please do not delete your existing 'Response to Reviewers' file.)

- a track-changes file indicating any changes that you have made to the manuscript.

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://plos.org/published-peer-review-history/

*Press*

Should you, your institution's press office or the journal office choose to press release your paper, please ensure you have opted out of Early Article Posting on the submission form. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please do not hesitate to contact me should you have any questions.

Best regards,

Richard

Richard Hodge, PhD

Senior Editor, PLOS Biology

rhodge@plos.org

------------------------------------------------------------------------

Reviewer remarks:

Reviewer #2: The authors wrote a very thorough rebuttal and answered all my comments. They modified their text and figures accordingly. I do not have additional comments or questions.

Reviewer #3: I believe the authors have reasonably addressed the issues I raised.

Revision 3

Attachments
Attachment
Submitted filename: PLoS_response_letter_final.docx
Decision Letter - Richard Hodge, Editor

Dear Dr Lee,

On behalf of my colleagues and the Academic Editor, Sui Huang, I am pleased to say that we can accept your manuscript for publication, provided you address any remaining formatting and reporting issues. These will be detailed in an email you should receive within 2-3 business days from our colleagues in the journal operations team; no action is required from you until then. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have completed any requested changes.

Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS

We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have previously opted in to the early version process, we ask that you notify us immediately of any press plans so that we may opt out on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. 

Best wishes, 

Richard

Richard Hodge, PhD

Senior Editor, PLOS Biology

rhodge@plos.org

PLOS

Empowering researchers to transform science

Carlyle House, Carlyle Road, Cambridge, CB4 3DN, United Kingdom

California (U.S.) corporation #C2354500, based in San Francisco

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .