Peer Review History

Original SubmissionJune 5, 2020
Decision Letter - Roland G Roberts, Editor

Dear Dr Cheng,

Thank you for submitting your manuscript entitled "A Network Medicine Approach to Investigation and Population-based Validation of Disease Manifestations and Therapeutics for COVID-19" for consideration as a Preregistered Research Article by PLOS Biology.

IMPORTANT: You have submitted this as a Preregistered Research Article, but actually you have already performed the study, and this is actually a regular Research Article. Please could you change the article type to Research Article when you upload your metadata (see next paragraph).

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Please re-submit your manuscript within two working days, i.e. by Jun 23 2020 11:59PM.

Login to Editorial Manager here: https://www.editorialmanager.com/pbiology

During resubmission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF when you re-submit.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. Once your manuscript has passed all checks it will be sent out for review.

Given the disruptions resulting from the ongoing COVID-19 pandemic, please expect delays in the editorial process. We apologise in advance for any inconvenience caused and will do our best to minimize impact as far as possible.

Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission.

Kind regards,

Roli Roberts

Roland G Roberts, PhD,

Senior Editor

PLOS Biology

Revision 1

Attachments
Attachment
Submitted filename: renamed_9304e.pdf
Decision Letter - Roland G Roberts, Editor

Dear Dr Cheng,

Thank you very much for submitting your manuscript "A Network Medicine Approach to Investigation and Population-based Validation of Disease Manifestations and Therapeutics for COVID-19" for consideration as a Research Article at PLOS Biology. Your manuscript has been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by several independent reviewers.

As you will see in the reviews below, the referees raise a substantial number of concerns with the analyses performed, the reporting and the ultimate relevance of the drug screening results. All of the issues raised are pertinent and should be thoroughly addressed. Given the interest of the topic and approach, we would like to invite you to submit a thoroughly revised manuscript that takes into account all of the reviewers' comments. Please note that we will not be able to make a decision about publication until we have seen the revised manuscript, your response to the reviewers' comments and how they received the revised manuscript.

We expect to receive your revised manuscript within 2 months.

Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension. At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may end consideration of the manuscript at PLOS Biology.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point by point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Related" file type.

*Re-submission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosbiology/s/submission-guidelines#loc-materials-and-methods

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Roli Roberts

Roland G Roberts, PhD,

Senior Editor,

rroberts@plos.org,

PLOS Biology

*****************************************************

REVIEWS:

Reviewer's Responses to Questions

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Reviewer #1: Overall Summary:

Zhou et al developed network-based methodologies to identify SARS-CoV-2 pathogenesis, disease manifestations, and COVID-19 therapies. They incorporated SARSCoV-2 virus-host protein-protein interactions, transcriptomics, and proteomics into the human interactome. Network proximity measure was used to identify the underlying pathogenesis for broad COVID-19-associated manifestations. Multi-modal analyses of single-cell RNA-seq data identified the co-expression pattern of ACE2 and TMPRSS2 in absorptive enterocytes from the inflamed ileal tissues of Crohn's disease patients compared to uninflamed tissues, revealing shared pathobiology by COVID-19 and inflammatory bowel disease. Integrative analyses of metabolomics and transcriptomics (bulk and single-cell) data from asthma patients indicated that COVID-19 shared intermediate inflammatory endophenotypes with asthma. By combing network-based prediction and propensity score matching observation study of 18118 patients from a COVID-19 registry, the authors identified that melatonin was associated with 64% reduced likelihood of a positive lab test for SARS-CoV-2 and can have better efficacy than angiotensin II receptor blockers or angiotensin converting enzyme inhibitors for treating SARS-CoV-2. However, details about how to identify the differential expressed genes, PPI-networks and drug-target network construction, and single-cell RNA-seq analysis are missing; the rationale about the construction of PanCoV-PPI and the network proximity measure need to be clarified. Most of the results are computational discoveries. It may provide valuable insight if the following concerns are addressed.

Major Comments:

1. Details about the RNA-seq data analysis are missing. I am not sure how the authors generated SARS2-DEP. For example, which samples in GSE147507 are used to identify differentially expressed genes? Which samples are used as controls? What kind of software is used to analyze those samples? How to identify differentially expressed genes? The same problem arises elsewhere in the manuscript.

2. In line 194 of page 10, the authors mentioned that "For cancer, the driver 195 genes for pan-cancer and individual cancer types were retrieved from the Cancer Gene 196 Census [34] and a previous study [35]. For autoimmune, pulmonary, neurological, 197 cardiovascular, and metabolic categories, we extracted their associated genes/proteins 198 from the Human Gene Mutation Database" I am not sure how those genes were retrieved from corresponding databases? Did the authors use any keywords for searching?

3. In line 675 of page 31, the authors mentioned that "The final human protein-protein interactome used in this study included 351,444 unique PPIs connecting 17,706 proteins". However, would the authors put those results in their GitHub page for the reader's reference? Several methods adopted are described as "as described in our previous study". Some statistics/overview of the network features/methods, although published somewhere else, should be provided for a better understanding of the proposed methods. For example, how did the authors build human protein-protein interactome? What are active comparator design and PS adjustment? How did the authors obtain the experimentally validated drug-target network?

4. Source code and supporting data cannot be found from the Github link provided by the authors.

5. In line 138-142, the authors mentioned that "PanCoV-PPI (Fig. 2B) and four other data sets (SARS2-139 DEG, SARS2-DEP, HCoV-PPI, and SARS2-PPI) (S6 Fig) were more likely to be highly connected (high degree or connectivity) in the human PPI network, including several hubs, such as JUN, XPO1, MOV10, NPM1, VCP, and HNRNPA1". According to Fig. 2B and S6, I cannot see anything supporting data to explain that JUN, XPO1, MOV10, NPM1, VCP, and HNRNPA1 are the hubs.

6. The authors performed functional enrichment analyses for five different PPIs, i.e., SARS2-DEG, SARS2-DEP, HCoV-PPI, SARS2-PPI, and PanCoV-PPI, generated from transcriptomic and proteomic data of SARS-CoV-2 as well as literature-based virus-host protein-protein interactions. They found different PPIs differ considerably in terms of enriched pathways, and then claimed "These observations suggest that these different SARS-CoV-2 data sets capture complementary aspects of the biological and cellular states of the viral life cycle and host immunity". However, many factors can cause the complementary effects. For example, 1. These data are derived from different cells or tissues, and not representative; 2. How did the authors perform functional enrichment analysis for SARS2-PPI and HCoV-PPI? 3. The differential expressed genes identified from transcriptome profiles can be very different from proteomic data.

7. In Figs 2C, 2D, and 2G, what is the physical meaning of the ratio of nonsynonymous to synonymous substitutions (dN/dS) and evolutionary rate ratio?

8. In line 130 of page 7, the authors mentioned that "further compiled four additional virus-host gene/protein networks identified by different methods for comparisons". However, I am not sure how those virus-host gene/protein networks are identified, details regarding how to generate those networks and relevant references are missing.

9. The network proximity measure was used to measure the distance of two genes/proteins in a protein-protein interaction network, however, the importance of the centrality of the nodes/proteins in the PPI network was ignored.

10. The authors mentioned that "We next performed network-based drug repurposing using the existing knowledge of the drug-target network." and "Using our state-of-the-art network proximity framework, we measured the "closest" proximities of nearly 3,000 drugs". However, details about the drug-target network, the network proximity framework, and the nearly 3,000 drugs are missing. The authors then computationally found 34 drugs that are associated with SARS-CoV-2 data sets, how did the authors rank those 3,000 drugs? how many of those 34 drugs are being tested, or have been tested in clinical trials and have positive effects for COVID-19 patients? The authors validated the efficacy of melatonin, one of those 34 drugs, on COVID-19 patients using their medical records. I am not sure if melatonin was ranked as the top-one among those 34 drugs? Not sure what are the differences between the drug-repositioning method proposed in this paper and in the authors' previous publication (ref 27), which used a similar network-based approach.

11. I am not sure how cell types in Figs 5C, 5F, 6C, and 6D are annotated? Are there any control samples in the single-cell RNA-seq analysis?

12. In Fig7.B there are four different bars with different colors, what does the PS-matched model 1 and PS-matched model 2 mean? There are two different OR model1s indicated by different colors, however, are these OR model1s independent? Besides, are four different models (variables) in Fig7.B independent?

Reviewer #2: Review report on Zhou et al PLoS Biology paper: PBIOLOGY-D-20-01653R1

Comments for the authors:

This paper addresses the network interpretation of higher risk of morbidity and mortality of COVID-19 patients with one or more other common diseases utilizing integrative network analysis of transcriptomics, proteomics, and human interactome. Utilizing bulk and single cell RNA-seq data together with differential metabolite information (only for asthma patients), the authors provided insights on shared pathobiology of COVID-19 patients with asthma and inflammatory bowel diseases. The authors of this paper utilized their earlier developed in-silico drug repurposing approaches on COVID-19 clinical registry database and prioritized existing FDA-approved drugs as potential therapeutic candidates.

Overall authors utilized all possible data sources and network-inference state of the art methods in their integrative analysis. The question remains however, with regard to whether these methods are good enough to yield substantial predictions. Below are the major concerns that need to be carefully further addressed before this paper can be considered for publication:

Major Comments:

1. One intrinsic limitation of the authors' method is that directionalities are in general not being taken into account in the various networks they built and/or used. For example, it seems that whether a viral protein activates or inhibits a host protein, or whether a gene is upregulated or downregulated in a disease is not being considered, and this can make the interpretation of results difficult or give rise to ambiguities. As an example of this issue, although the authors have identified proximity between the SARS-CoV-2 network and the asthma network with several shared nodes (Fig. 5A), when comparing the differential expression (DE) profiles in asthma to that in SARS-CoV-2 infection (Fig. 5B), there does not seem to be significant concordance in terms of the direction of DE. Notably, IL6 increased in SARS-CoV-2 infection but decreased in asthma. Will the same findings still hold if the directionality is taken into account properly? We think that this is an important issue that should be addressed appropriately.

2. The different coronavirus datasets were from different cell types, for example the SARS2-DEG was based on data in primary bronchial epithelial cells, while the SARS2-DEP was based on Caco-2 colorectal cancer cell line, and the SARS2-PPI was based on HEK273T cell line. Further, the genes associated with each disease should all act in a context-specific manner in the corresponding tissue types where each of the diseases manifest. It's questionable whether the network within one context can still largely hold if transferred to a different context (e.g. tissue type). The authors have implicitly studied the related issue of cell/tissue type-specific expression. If the tissue type-specific expression information is explicitly used to refine the various SARS2 network separately and specifically for each tissue type, will the findings on proximity with disease networks still hold? This potentially serious confounding factor cannot be ignored and needs to be carefully addressed.

3. In the 'Validating drug-outcome relationships on COVID-19 using patient data' under Results section, authors utilized their earlier developed approaches of network proximity, GSEA analysis and PS-score matching methods to prioritize the drug candidates. The basis for selection of the final drug melatonin seems weak. The authors didn't mention anywhere how many drugs they finally considered in their analysis and with what frequency each of them was used by the patients. It is not clear whether the drugs (including melatonin) are being taken by the patients before or at the time of being tested positive, if before then what is the time interval between the drug consumption and testing, for how long have the patient been taking the drug, and how such information are being used in terms of selecting the samples to include in the analysis, as well as in the analytical model during the analysis. Such details are critical for the interpretation of the results, and an overview should be provided in the main text (Results or Methods) with comprehensive additional details in the Supplementary Materials. Whatever drug list authors provided in Figure 7A, it is very obvious that all drugs were not used by all the patients in similar frequency. Melatonin is a very common drug compared to other drugs listed in that figure. So all these statistical tests are not at all applicable for all these drugs. In other words, this validation is not very useful in the COVID-19 patients' context, where intake of medicines is not homogeneous among patients.

4. Can the authors prioritize drugs in a more context-specific manner rather than one drug for all? Likewise, can they prioritize drugs in a more similar group of patients, like for asthma patients or for IBD patients or for hypertension patients?

5. In many places, the authors just provided some numbers without any biological implication. There is no explanation of such variability in numbers or what are the biological consequences of those. For example, in Figure 2, the authors presented the data for PanCoV-PPI which is a combination of HCoV-PPI and SARS2-PPI. It is well known that SARS2-PPI and HCoV-PPI networks are not very similar. In that case, all these numbers are not very useful towards the overall theme of the paper.

6. The authors utilized the network proximity measure to evaluate the connectivity and closeness of other diseases with COVID-19. It is now well known a variety of underlying health conditions are risk factors for covid-19 patients, including children with rare diseases, like cerebral palsy or mental conditions. In such scenarios, network proximity is not a very useful measure for identifying this high morbidity and mortality risk. What is the authors take on that?

Minor Comments:

We feel that many parts of the writing are inaccurate or confusing and can be improved (examples given below). We recommend the authors to further refine the writing so that it is easier for the readers to understand:

(a) Line 214, the authors write "these diseases can be targeted directly or interact with the 215 targets of SARS-CoV-2 or other HCoVs." We think it is actually meant that "the disease genes can interact with the viral proteins either directly or indirectly via another host protein".

(b) Line 228, authors can explicitly mention which 8 comorbidities they are referring to here.

(c) Line 293, the term "endophenotype", which has a strict definition, may not be appropriate here, the authors may intend to write "molecular profile" or perhaps "molecular program".

(d) Line 297, in multi-modal analysis, there is no clear explanation what authors exactly did here? There is no clear methodology for their multi-modal analysis in the Methods section.

(e) Line 308, "matching the enzymes of the differential metabolites and the proteins in the PPI network", it's not clear whether enzymes for the synthesis, or degradation, or any transformation, or transportation, etc. of the metabolites were considered, and it seems that this is not explained elsewhere either.

(f) Line 318 "Utilizing two bulk RNA-Seq data sets from asthma patients and healthy controls, we identified elevated expression of IRAK3 and ADRB2 in SARS-CoV-2 infected human bronchial epithelial cells." -- this is confusing.

(g) Line 428 "Validating drug-outcome relationships on COVID-19 using patient data", it seems to mean "evidence from the COVID-19 registry data that supports the predicted drug repurposing strategies".

Reviewer #3: The manuscript by Zhou and colleagues is submitted for consideration for publication to PLOS Biology. In this manuscript the authors tried to investigate pathogenesis, clinical manifestations and therapies COVID-19 using network medicine approach on clinical and multi-omics data. The reason for this study is to understand molecular mechanisms of SARS-CoV-2 infection, to compare with other not-infectious diseases and to identify FDA-approved drugs as potential COVID-19 drug candidates through network-medicine findings and clinical data from a large COVID-19 clinical registry database. The paper is well structured and exhaustive in every parts, while the integrated approach, the authors have used, is original and very interesting. Importantly, this theoretical findings might have practical significance via guiding both pharmaceutical and diagnostic research with the prospect to identify potential new biological targets. It can be recommended for publication upon addressing several concerns into some not clear parts.

The main concerns are:

1. in introduction they report numbers of pandemics, but it needs to insert one or more references (e.g. John Hopkins University), while at row 54 they should add other references about network based-approach model based on comparative PPI interactomes with other HCoV and concept of Disease Map applied to COVID19 .

2. in Results they talk about S2 Table, containing "additional virus-host gene / protein networks identified by different methods for comparisons", but it is not clear how they selected these genes, where these data is from and what purpose it would serve .

3. at pag 19 they wrote: "These observations reveal common network relationship between COVID-19 and human diseases". In my opinion, this phrase could result obvious, due to wide previous literature about COVID19 produced up this moment and the pathogenic and molecular similarity with SARS-CoV.

4. at pag 20 they talk about the network-based relationships of the 64 diseases across the 6 categories to COVID-19, shown in Fig 4A, taking into account the proximity, Z scores and P values, as significance test. However this part results hard to understand: firstly, they should report the absolute numbers of how many genes they used for each Z-score test and p-value, because different sample size of genes provide the streghtness of associations. Secondly, they must explain to biological function of genes tested and the pathway involved, because it is very difficult to figure out why they found strong significant network proximity with attention-deficit / hyperactivity disorder and not with other cardiovascular diseases, since vascular damage is one of most featured manifestations in severe COVID19 cases, or asthma. Moreover, for the networks in this figure, it is not clear why they chose sepsis and respiratory distress syndroms as example and it could result misleading.

5. at pag 28 they analyzed data by a large-scale patient data from the Cleveland Clinic COVID-19 patient registry, evaluating the charateristics of melatonin and carvedilol. I noted that in SARS-CoV-2 positive patients there was a wide diversity in sample size between cases and controls (cases are ~ 2% of controls). I understand it was due to availability of COVID19 cases tested, but it should much more report in the limitation section.

Revision 2

Attachments
Attachment
Submitted filename: Response_to_Reviewers_Sep2.pdf
Decision Letter - Roland G Roberts, Editor

Dear Dr Cheng,

Thank you for submitting your revised Research Article entitled "A network medicine approach to investigation and population-based validation of disease manifestations and drug repurposing for COVID-19" for publication in PLOS Biology. I have now obtained advice from the original reviewers and have discussed their comments with the Academic Editor.

Based on the reviews, we will probably accept this manuscript for publication, assuming that you will modify the manuscript to address the following points:

a) Please address my Data Policy requests (see further down).

b) Please clarify the funding information (it currently appears that you had no specific funding for this project, but this is unclear).

c) Please go very carefully through the text and especially the Abstract, ensuring that you do *not* overstate your claims regarding melatonin or any other treatments herein. This is particularly important given the high-profile disinformation that has occurred during recent months. You must only claim what is supported by the evidence presented. The final clause ("and identifying melatonin for potential prevention and treatment of COVID-19.") seems excessively strong. Also, given that melatonin is an endogenous chemical, the mentions of melatonin (e.g. "melatonin is associated with a reduced likelihood....") presumably should read "melatonin usage" or similar, to avoid the impression that endogenous levels were measured.

d) Please update any assessment of the current state of COVID-19 treatments to ensure that any fast-moving developments are captured.

We expect to receive your revised manuscript within two weeks. Your revisions should address the specific points made by each reviewer. In addition to the remaining revisions and before we will be able to formally accept your manuscript and consider it "in press", we also need to ensure that your article conforms to our guidelines. A member of our team will be in touch shortly with a set of requests. As we can't proceed until these requirements are met, your swift response will help prevent delays to publication.

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following:

- a cover letter that should detail your responses to any editorial requests, if applicable

- a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable)

- a track-changes file indicating any changes that you have made to the manuscript.

*Copyediting*

Upon acceptance of your article, your final files will be copyedited and typeset into the final PDF. While you will have an opportunity to review these files as proofs, PLOS will only permit corrections to spelling or significant scientific errors. Therefore, please take this final revision time to assess and make any remaining major changes to your manuscript.

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*Early Version*

Please note that an uncorrected proof of your manuscript will be published online ahead of the final version, unless you opted out when submitting your manuscript. If, for any reason, you do not want an earlier version of your manuscript published online, uncheck the box. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosbiology/s/submission-guidelines#loc-materials-and-methods

Please do not hesitate to contact me should you have any questions.

Sincerely,

Roli Roberts

Roland G Roberts, PhD,

Senior Editor,

rroberts@plos.org,

PLOS Biology

------------------------------------------------------------------------

DATA POLICY:

You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797

Note that we do not require all raw data. Rather, we ask that all individual quantitative observations that underlie the data summarized in the figures and results of your paper be made available in one of the following forms:

1) Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore).

2) Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication.

Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it: All main and supplementary figure panels except Fig 1. NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

IMPORTANT: I note that your supplementary files (Tables S1-S7) and Github deposition contain a significant amount of data; however their relationship to the specific Figure panels is unclear; please either supply the data that immediately underlies the Figures, or specify where it can be found; each main and supplementary Figure should have a clear citation to the underlying data (e.g. "The data underlying this Figure can be found in https://github.com/ChengF-Lab/COVID-19_Map"). It may be more appropriate to re-name Tables S2-S7 as Supplementary Data files.

Please also ensure that figure legends in your manuscript include information on where the underlying data can be found, and ensure your supplemental data file/s has a legend.

Please ensure that your Data Statement in the submission system accurately describes where your data can be found.

------------------------------------------------------------------------

REVIEWERS' COMMENTS:

Reviewer #2:

We thank the authors for addressing our comments. We think the manuscript has now been largely improved and is suitable for publication.

Reviewer #3:

No comments submitted.

Revision 3

Attachments
Attachment
Submitted filename: Response_to Editors comments.docx
Decision Letter - Roland G Roberts, Editor

Dear Dr Cheng,

On behalf of my colleagues and the Academic Editor, Nicole Soranzo, I am pleased to inform you that we will be delighted to publish your Research Article in PLOS Biology.

PRODUCTION PROCESS

Before publication you will see the copyedited word document (within 5 business days) and a PDF proof shortly after that. The copyeditor will be in touch shortly before sending you the copyedited Word document. We will make some revisions at copyediting stage to conform to our general style, and for clarification. When you receive this version you should check and revise it very carefully, including figures, tables, references, and supporting information, because corrections at the next stage (proofs) will be strictly limited to (1) errors in author names or affiliations, (2) errors of scientific fact that would cause misunderstandings to readers, and (3) printer's (introduced) errors. Please return the copyedited file within 2 business days in order to ensure timely delivery of the PDF proof.

If you are likely to be away when either this document or the proof is sent, please ensure we have contact information of a second person, as we will need you to respond quickly at each point. Given the disruptions resulting from the ongoing COVID-19 pandemic, there may be delays in the production process. We apologise in advance for any inconvenience caused and will do our best to minimize impact as far as possible.

EARLY VERSION

The version of your manuscript submitted at the copyedit stage will be posted online ahead of the final proof version, unless you have already opted out of the process. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

PRESS

We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have not yet opted out of the early version process, we ask that you notify us immediately of any press plans so that we may do so on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for submitting your manuscript to PLOS Biology and for your support of Open Access publishing. Please do not hesitate to contact me if I can provide any assistance during the production process.

Kind regards,

Alice Musson

Publishing Editor,

PLOS Biology

on behalf of

Roland Roberts,

Senior Editor

PLOS Biology

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .