Convergent perturbation of the human domain-resolved interactome by viruses and mutations inducing similar disease phenotypes

Yangchun Frank Chen; Yu Xia

doi:10.1371/journal.pcbi.1006762

Abstract

An important goal of systems medicine is to study disease in the context of genetic and environmental perturbations to the human interactome network. For diseases with both genetic and infectious contributors, a key postulate is that similar perturbations of the human interactome by either disease mutations or pathogens can have similar disease consequences. This postulate has so far only been tested for a few viral species at the level of whole proteins. Here, we expand the scope of viral species examined, and test this postulate more rigorously at the higher resolution of protein domains. Focusing on diseases with both genetic and viral contributors, we found significant convergent perturbation of the human domain-resolved interactome by endogenous genetic mutations and exogenous viral proteins inducing similar disease phenotypes. Pan-cancer, pan-oncovirus analysis further revealed that domains of human oncoproteins either physically targeted or structurally mimicked by oncoviruses are enriched for cancer driver rather than passenger mutations, suggesting convergent targeting of cancer driver pathways by diverse oncoviruses. Our study provides a framework for high-resolution, network-based comparison of various disease factors, both genetic and environmental, in terms of their impacts on the human interactome.

Author summary

Cellular function and behaviour are driven by highly coordinated biomolecular interaction networks. A prime example is the protein-protein interaction network, often simply referred to as the “interactome”. Recent advances in systems biology have spawned the view of human disease as a manifestation of genetic and environmental perturbations to the human interactome, a key postulate being that similar perturbation patterns lead to similar disease phenotypes. Here, we took a structural systems biology approach to compare mutation-induced and virus-induced perturbations of the human interactome in diseases with both genetic and viral contributors. Specifically, we constructed a domain-resolved human-virus protein interactome and characterized the distribution of genetic disease mutations with respect to human domains either physically targeted or structurally mimicked by virus. Overall, we found significant convergent perturbation of the human domain-resolved interactome by viruses and mutations inducing similar disease phenotypes. Structure-guided, integrated analysis of host genetic variation and host-pathogen protein interaction data may help elucidate the molecular mechanisms of infection and reveal its connections to genetic diseases such as cancer, autoimmunity, and neurodegeneration. On a broader note, our finding implies that similar perturbations of the human interactome at the domain level can have similar phenotypic consequences, regardless of the source of perturbation.

Citation: Chen YF, Xia Y (2019) Convergent perturbation of the human domain-resolved interactome by viruses and mutations inducing similar disease phenotypes. PLoS Comput Biol 15(2): e1006762. https://doi.org/10.1371/journal.pcbi.1006762

Editor: Andrey Rzhetsky, University of Chicago, UNITED STATES

Received: September 17, 2018; Accepted: January 7, 2019; Published: February 13, 2019

Copyright: © 2019 Chen, Xia. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All protein-protein interaction data are available from IntAct (https://www.ebi.ac.uk/intact/), HPIDB (http://hpidb.igbb.msstate.edu/downloads/hpidb2.mitab.zip) and HIV-1 Human Interaction Database (https://www.ncbi.nlm.nih.gov/genome/viruses/retroviruses/hiv-1/interactions/). All disease variant data are available from ClinVar (ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/) and UniProtKB (ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/variants/humsavar.txt). All mutation data from cancer genome sequences are available from COSMIC (https://cancer.sanger.ac.uk/cosmic).

Funding: This work was supported by Natural Sciences and Engineering Research Council of Canada grant RGPIN-2014-03892, Canada Foundation for Innovation grant JELF-33732, Canada Foundation for Innovation grant IF-33122, and Canada Research Chairs program to Y.X., and McGill Engineering Doctoral Awards program to Y.F.C. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Cellular function and behaviour are driven by highly coordinated biomolecular interaction networks. A prime example is the protein-protein interaction (PPI) network, also known as the protein “interactome” or interactome for short. A central focus of disease systems biology is to use interactome networks to study genotype-phenotype relationships in complex diseases [1]. The idea of using interactome networks to infer gene function and gene-disease association comes from the well-validated principle of “guilt by association”, which states that physically interacting proteins tend to share similar functions and, by extension, tend to be involved in similar disease processes [1–4]. Recent advances in systems biology have spawned the view of human disease as a manifestation of genetic and environmental perturbations to the human interactome, a key postulate being that similar perturbation patterns lead to similar disease phenotypes [5–8]. A corollary is that, for diseases with both genetic and infectious contributors, similar perturbations of the human interactome by either disease mutations or pathogens can have similar disease consequences. This corollary has been tested for several viral species at the level of whole proteins [9, 10]. For example, Gulbahce et al. used yeast two-hybrid screens to map binary interactions between Epstein-Barr virus (EBV) and human papillomavirus (HPV) proteins and human proteins, and transcriptionally profiled human cell lines exogenously expressing HPV oncoproteins E6 and E7 [9]. They found that human genes associated with EBV- and HPV-implicated genetic diseases were often either directly targeted by the virus or transcriptionally regulated by viral targets. This finding led to the idea that oncoviral proteins may preferentially target host proto-oncogenes and tumour suppressors, which was experimentally validated in four families of DNA oncoviruses [10].

Despite insights from these studies on the etiology of virally-implicated genetic diseases, there has yet to be a systematic, structure-based comparison of mutation-induced and pathogen-induced perturbations of the human interactome. A high-resolution, structurally-resolved network biology approach is important for unravelling complex genotype-phenotype relationships, because mutations occurring in different PPI-mediating interfaces on the same protein often have distinct functional impacts and phenotypic consequences [5–8]. In this regard, structural systems biology has proved useful in uncovering evolutionary properties of single- and multi-interface PPI network hubs, systems-level principles governing human-virus interactions, and systems properties of disease variants [6, 11, 12]. For instance, by constructing atomic-resolution human-virus and within-human protein interactomes, Franzosa and Xia discovered that viral proteins tend to target existing endogenous PPI interfaces in the human interactome, rather than creating exogenous interfaces de novo, thereby efficiently perturbing multiple endogenous PPIs involved in cell regulation [12]. In a follow-up study, Garamszegi et al. expanded the coverage of the human-virus interactome using domain-resolved models of PPIs, and found that viral proteins tend to deploy short linear motifs to bind a variety of human protein domains [13]. The economical and pleiotropic nature of “host domain-viral motif” interactions reflects the efficiency with which viruses rewire the human interactome given limited genomic resources at their disposal. Meanwhile, Wang et al. constructed a domain-resolution within-human interactome where protein domains are annotated with disease variant information [6]. They found that mutations occurring in different PPI-mediating domains within the same protein tend to be associated with different disorders (“gene pleiotropy”). By contrast, mutations occurring in the domains of two different but interacting proteins, where the interaction is mediated by said domains, tend to be associated with the same disorder (“locus heterogeneity”). These studies attest to the utility of structural systems biology in the study of infectious and genetic diseases.

Here, we apply structural systems biology to the study of virally-implicated genetic diseases (VIDs), and rigorously test the postulate that endogenous genetic mutations and exogenous viral proteins give rise to similar disease phenotypes by inducing similar perturbations of the human interactome at the level of protein domains. Specifically, we constructed a domain-resolved human-virus protein interactome and characterized the distribution of genetic disease mutations with respect to human domains targeted by virus. Overall, we found that viral proteins and VID mutations induce similar perturbations of the human domain-resolved interactome, for individual viruses with clearly defined VIDs and sufficient numbers of host-virus PPIs (including EBV, HPV and HIV), for oncoviruses, as well as for all viruses combined. We first analyzed the disease associations of host proteins targeted by viral proteins and confirmed that virus-targeted proteins tend to be causally associated with VIDs rather than non-VIDs. We then analyzed the domain-level distribution of disease mutations in virus-targeted proteins and found that virus-targeted domains are significantly enriched for mutations causing VIDs rather than non-VIDs. Using a pooled analysis of all oncoviruses and all oncomutations, we found oncovirus-targeted domains to be significantly enriched for mutations causing cancer rather than other diseases. Furthermore, domains of oncoproteins either physically targeted or structurally mimicked by oncoviruses are significantly enriched for cancer driver mutations rather than passenger mutations, which implies convergent perturbation of cancer driver pathways by diverse oncoviruses. Finally, we also assessed the extent to which viral proteins and VID mutations perturb the same domain-domain interactions (DDIs) in the human interactome. We found that viruses preferentially target DDI partners of domains harbouring VID mutations, regardless of whether the DDI partners themselves are susceptible to known disease mutations. By correlating the equivalent pathogenicity of viral proteins and VID mutations with their convergent perturbation of the human domain-resolved interactome, we provide a framework for high-resolution, network-based comparison of the functional impacts of both genetic and environmental disease factors. On a broader note, our finding implies that similar perturbations of the human interactome at the domain level can have similar phenotypic consequences, regardless of the source of perturbation.

Results

Disease-annotated, domain-resolved human-virus protein interaction network

We first acquired human-endogenous and human-virus binary PPI data from IntAct, HPIDB 3.0, and the HIV-1 Human Interaction Database [14–18]. Only PPIs supported by at least one PubMed ID were included in the whole-protein resolution human-virus interactome, which consists of 173830 PPIs between 15995 human proteins, and 28531 PPIs between 7761 human proteins and 624 viral proteins. 7211 human proteins participate in both endogenous and exogenous PPIs. To build homology models of PPIs, we collected high-confidence domain-domain interaction (DDI) and domain-motif interaction (DMI) templates derived from 3D structures of protein complexes in the Protein Data Bank, and scanned protein sequences for the occurrence of Pfam domains and domain-binding linear motifs [19–23]. Structural models were assigned to each PPI by extracting all DDIs and DMIs possibly mediating the PPI. The resulting domain-resolved human-virus structural interaction network (hvSIN) consists of 61041 PPIs between 11596 human proteins, and 4654 PPIs between 1590 human proteins and 405 viral proteins. 1517 human proteins participate in both endogenous and exogenous portions of hvSIN.

We then obtained manually-curated disease variant data from UniProtKB and ClinVar [24, 25], selecting missense variants located inside Pfam domains for our analyses. Overall, 19047 mutations associated with 5383 diseases were mapped to 3585 domains of 2622 proteins. 14720 mutations associated with 4185 diseases were mapped to 2642 domains of 1864 human proteins in hvSIN. Table 1 lists the number of mutations by the type of domain in which they occur. Incidentally, 1272 domains of 957 human proteins in hvSIN are susceptible to disease mutations, but lack interacting domains or motifs. 850 of these 1272 domains harbour a total of 4154 mutations associated with 1381 diseases that are not accounted for by mutations occurring in PPI-mediating domains in hvSIN. Because the completeness of a domain’s PPI profile depends largely on the interactome search space and availability of 3D structures of protein complexes, and domains often have important biological functions besides mediating PPIs (e.g. enzymatic or nucleotide-binding activity), we included all domains of virus-targeted host proteins in a comprehensive analysis of the domain-level distribution of disease mutations.

Download:

Table 1. Number of disease mutations mapped to human protein domains in the human-virus structural interaction network (hvSIN).

https://doi.org/10.1371/journal.pcbi.1006762.t001

Virus-targeted host domains are enriched for virally-implicated disease mutations

To relate the equivalent pathogenicity of viral proteins and VID mutations to their equivalent perturbation of the host interactome, we first characterized the mutational landscape of human proteins targeted by EBV, HPV and HIV, three viruses with clearly defined VIDs and sufficient numbers of host-virus PPIs. Since most oncoviruses are causally implicated in only a few site-specific malignancies (e.g. HBV/HCV in hepatocellular carcinoma, KSHV in Kaposi’s sarcoma, and HTLV in adult T-cell lymphoma), and various types of cancer share common molecular hallmarks [26, 27], to increase the statistical power of our analysis and establish whether a general equivalence exists between endogenous and exogenous perturbagens of oncogenic pathways, we also performed a pooled analysis of host proteins targeted by diverse oncoviruses, by considering all types of cancer as interchangeable diseases, all oncomutations as interchangeable endogenous perturbagens, and all oncoviral proteins as interchangeable exogenous perturbagens. We found that for EBV, HIV, HPV and a broad spectrum of oncoviruses, virus-targeted host proteins tend to be causally associated with VIDs (Fig 1), and virus-targeted host domains tend to harbour mutations causally associated with VIDs (Fig 2). We discuss our findings for each type of virus below. A full list of VIDs and disease-associated proteins for EBV, HPV and HIV can be found in S1 Table.

Download:

Fig 1. Virus-targeted host proteins tend to be causally associated with virally-implicated diseases (VIDs).

“VID proteins” have at least one missense variant that is causally associated with a VID, whereas all missense variants of “non-VID proteins” are exclusively associated with non-VIDs. Literature-curated, virus-specific diseases for EBV, HPV and HIV are listed in S1 Table. For pooled analysis of oncoviruses, VIDs include all types of cancer (Methods). For pooled analysis of all viruses, VIDs include all proliferative and immunological diseases (Methods). Error bars represent 95% confidence intervals.

https://doi.org/10.1371/journal.pcbi.1006762.g001

Download:

Fig 2. Virus-targeted host domains tend to harbour mutations causally associated with virally-implicated diseases (VIDs).

“VID mutations” are causally associated with at least one VID, whereas “non-VID mutations” are exclusively associated with non-VIDs. Error bars represent 95% confidence intervals.

https://doi.org/10.1371/journal.pcbi.1006762.g002

EBV.

EBV is involved in lymphomas of the B, T, and NK-cell lineages as well as in adenocarcinomas of epithelial cells [28–32]. EBV hijacks cellular signaling processes by encoding viral homologues of cellular proteins that play key roles in apoptosis and proliferation. Examples include EBNA2 (mimics Notch signaling), LMP1 (mimics CD40 receptor signaling), LMP2 (mimics IgG receptor signaling), BALF1 and BHRF1 (homologues of cellular Bcl-2), and BCRF1 (homologue of cellular IL-10) [27]. All EBV homologues share at least one PPI partner with their cellular counterparts. Overall, EBV targets 11/99 (11.1%) host proteins associated with EBV diseases, and 51/2523 (2%) host proteins associated with non-EBV diseases, i.e. EBV tends to directly target host proteins causally associated with EBV-implicated diseases (Fisher’s exact test, two-tailed P = 1 × 10⁻⁵) (Fig 1). Analysis of the domain-level distribution of disease mutations found that 35/43 (81.4%) EBV-disease mutations and 62/856 (7.2%) non-EBV disease mutations occur in EBV-targeted domains, suggesting that EBV-targeted domains are significantly enriched for EBV-disease mutations (Fisher’s exact test, two-tailed P < 2.2 × 10⁻¹⁶) (Fig 2). Fig 3A shows the exclusive localization of mutations causing lung cancer, an EBV-implicated disease, in EBV-targeted tyrosine kinase domain (PF07714) of EGFR protein, while mutations causing other diseases such as brain cancer are evenly distributed among all domains of EGFR.

Download:

Fig 3. Exclusive localization or enrichment of VID mutations in virus-targeted domains.

(A) Exclusive localization of mutations causing lung cancer, an EBV-implicated disease, in EBV-targeted tyrosine kinase domain of EGFR protein. (B) Exclusive localization of mutations causing vulvar cancer and lung cancer, both HPV-implicated diseases, in HPV-targeted B domain of RB protein. (C) Exclusive localization of mutations causing cervical cancer, an HIV-implicated disease, in HIV-targeted PI3-kinase domain of MTOR protein, while mutations causing other diseases such as focal cortical dysplasia and Smith-Kingsmore syndrome are evenly distributed among all domains of MTOR. (D) Moderate enrichment of oncomutations in KSHV-targeted SH2 domain of PTPN11 protein, compared to mutations causing Noonan syndrome. Most of the oncomutations cause juvenile myelomonocytic leukemia, a disease although not caused by KSHV, is mimicked clinically and morphologically by other human herpesvirus infections, including EBV, CMV and HHV-6. VID mutations are shown as dark green diamonds. Non-VID mutations are shown as orange diamonds. Amino acid residues in virus-targeted domains are shown as light green squares. Residues in domains not targeted by virus are shown as yellow squares.

https://doi.org/10.1371/journal.pcbi.1006762.g003

HPV.

High-risk human papillomaviruses (HPV16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, 68), as defined by the Centers for Disease Control and Prevention (CDC) and International Agency for Research on Cancer (IARC), are established etiological agents for cervical, oropharyngeal and anogenital cancers [33–35]. Several studies have also reported an association between HPV and cancers of the bladder [36], breast [37], lung [38], and prostate [39]. Overall, HPV targets 5/79 (6.3%) host proteins associated with HPV diseases, and 17/2543 (0.7%) host proteins associated with non-HPV diseases, i.e. HPV tends to directly target host proteins causally associated with HPV-implicated diseases (Fisher’s exact test, two-tailed P = 3 × 10⁻⁴) (Fig 1). Analysis of the domain-level distribution of disease mutations found that 117/119 (98.3%) HPV-disease mutations and 94/150 (62.7%) non-HPV disease mutations occur in HPV-targeted domains, suggesting that HPV-targeted domains are significantly enriched for HPV-disease mutations (Fisher’s exact test, two-tailed P = 2 × 10⁻¹⁴) (Fig 2). Fig 3B shows the exclusive localization of mutations causing vulvar cancer and lung cancer, both HPV-implicated diseases, in HPV-targeted B domain (PF01857) of RB protein, while mutations causing other diseases such as retinoblastoma are evenly distributed among all domains of RB.

HIV.

HIV substantially raises the risk of Kaposi’s sarcoma, non-Hodgkin’s lymphoma and cervical cancer [40], as well as cancers of the anus, liver, lung, oropharynx and testes [41]. Although HIV-encoded accessory proteins such as Tat and Nef have demonstrated oncogenic properties on their own [42–44], HIV-associated cancers are mostly attributed to opportunistic infections with oncoviruses such as KSHV, EBV, HPV, and Hepatitis B/C virus. In addition, other HIV-associated complications such as cardiomyopathy and neurocognitive disorders have become increasingly common in the post-HAART era [45–50]. Overall, HIV targets 23/132 (17.4%) host proteins associated with HIV diseases, and 120/2490 (4.8%) host proteins associated with non-HIV diseases, i.e. HIV tends to directly target host proteins causally associated with HIV-implicated diseases (Fisher’s exact test, two-tailed P = 3 × 10⁻⁷) (Fig 1). Analysis of the domain-level distribution of disease mutations found that 103/158 (65.2%) HIV-disease mutations and 479/898 (53.3%) non-HIV disease mutations occur in HIV-targeted domains, suggesting that HIV-targeted domains are significantly enriched for HIV-disease mutations (Fisher’s exact test, two-tailed P = 7 × 10⁻³) (Fig 2). Fig 3C shows the exclusive localization of mutations causing cervical cancer, an HIV-implicated disease, in HIV-targeted PI3-kinase domain (PF00454) of MTOR protein, while mutations causing other diseases such as focal cortical dysplasia and Smith-Kingsmore syndrome are evenly distributed among all domains of MTOR. In addition to offering general insights on human-HIV interaction, our domain-resolved PPI models also provide useful information about specific HIV proteins. For instance, our model for the interaction between human Akt1 and HIV Nef involves the protein kinase domain (PF00069) of Akt1 and a region of Nef matching three overlapping motifs: MOD_NEK2_1 (residues 100–105), DOC_MAPK_gen_1 (residues 105–112) and DOC_MAPK_MEF2A_6 (residues 105–114). Notably, our predicted Akt1-binding region of Nef (residues 100–114) is consistent with the experimentally determined Akt1-binding region of Nef (residues 55–210) [51]. hvSIN also reveals a previously unreported similarity between the host interaction profiles of HIV Nef and the EBV oncoprotein LMP2, in that both can bind the SH2 domain (PF00017) of Src family kinases (Lck, Lyn, Src) and Syk family kinases (Syk, ZAP70), as well as the WW domain (PF00397) of the Nedd4 family of E3 ubiquitin ligases (Itch, Nedd4), possibly revealing disease modules perturbed in common by HIV and EBV in AIDS-related lymphoma [52, 53].

Oncoviruses.

Oncoviruses contribute to 12% of human cancers worldwide and can activate in a cancer cell the same molecular hallmarks shared among cancers of non-viral origin [27, 54]. In fact, some of the most potent oncogenes were first discovered in retroviruses [55]. Oncoviruses in hvSIN include human herpesviruses (HHV-4/EBV, HHV-5/CMV, HHV-8/KSHV), high-risk HPVs, human polyomaviruses (BKV, JCV, MCV), hepatitis B and C viruses, human T cell lymphotropic virus (HTLV) and oncogenic retroviruses. Some oncoviruses, although not directly infectious to human, are tumorigenic in other species, can transform human cells in vitro, and serve as models for studying viral oncogenesis in human (e.g. murid herpesvirus 4) [56, 57]. Despite HIV being classified by IARC as a Group 1 carcinogen and the in vitro oncogenicity of HIV-encoded accessory proteins, we excluded it from the pooled analysis of oncoviruses, because there is insufficient data on HIV prevalence and cancer incidence among HIV-infected individuals to accurately assess the independent contribution of HIV to infection-attributable cancers [58]. Pooled analysis of all oncovirus-targeted host proteins found that oncoviruses target 34/194 (17.5%) oncoproteins and 119/2428 (4.9%) proteins associated with non-cancer diseases, i.e. oncoviruses tend to directly target oncoproteins (Fisher’s exact test, two-tailed P = 1 × 10⁻⁹) (Fig 1). Analysis of the domain-level distribution of disease mutations found that 314/413 (76%) oncomutations and 371/1322 (28.1%) other disease mutations occur in oncovirus-targeted domains (OVTDs), i.e. the odds of finding cancer-causing over other disease-causing mutations in OVTDs is 8 times as high as that in non-OVTDs (Fisher’s exact test, two-tailed P < 2.2 × 10⁻¹⁶) (Fig 2). Fig 3D shows a moderate enrichment of oncomutations in KSHV-targeted SH2 domain (PF00017) of PTPN11 protein, compared to mutations causing Noonan syndrome. Most of the oncomutations cause juvenile myelomonocytic leukemia, a disease although not caused by KSHV, is mimicked clinically by other human herpesvirus infections, including EBV, CMV and HHV-6 [59, 60]. Finally, we also assessed the mutational landscape of 107 oncovirus-targeted pleiotropic proteins that are susceptible to both oncomutations and other disease mutations. Overall, 88/113 (77.9%) oncomutations and 110/179 (61.5%) other disease mutations were mapped to the OVTDs of these pleiotropic proteins, suggesting that enrichment of oncomutations in OVTDs holds even at the level of individual proteins involved in both cancer and other diseases (Fisher’s exact test, two-tailed P = 4 × 10⁻³).

Viruses in proliferative and immunological diseases.

All viruses have evolved sophisticated mechanisms to subvert host transcriptional and signaling machineries for replication and persistence. Viruses are known to encode homologues of cellular proteins to mimic mutant oncoproteins (Fig 4A) or antagonize mutant cytokine receptors (Fig 4B). Viruses have also been shown to abuse peptide motifs to modulate host signaling pathways, potentially mimicking the effects of disease-causing mutations (Fig 4C). We suspect that viruses and mutations causing proliferative and immunological diseases (PIDs) target similar human domains involved in cell cycle progression, apoptosis, DNA repair and immune homeostasis. Proliferative diseases include various neoplasms, both benign and malignant. Examples include lung cancer (Fig 3A), vulvar and lung cancer (Fig 3B), cervical cancer (Fig 3C), juvenile myelomonocytic leukemia (Fig 3D), glioblastoma multiforme and non-small-cell lung cancer (Fig 4A), lung cancer, breast cancer and lymphoma (Fig 4C). Immunological diseases include autoimmune diseases, hypersensitivity, and immunodeficiency disorders. One example of an immunological disease, inflammatory bowel disease (IBD), is given in Fig 4B, where we show convergent perturbation of the IL10-binding domain of IL-10R1 by both viral homologues of IL-10 and IBD mutations.

Download:

Fig 4. Viral and mutational perturbations of host domains are mechanistically similar.

(A) Viruses encode homologues of human proteins to mimic mutations in oncoproteins that cause uncontrolled cell proliferation. Top: EGFRvIII deletion mutation, frequently detected in glioblastoma multiforme (GBM) patients, and v-ErbB, encoded by avian leukosis virus, both lack the EGFR ligand-binding domain. Meanwhile, an L858R missense mutation in the EGFR kinase domain is frequently found in non-small-cell lung cancer (NSCLC). These alterations lead to conformational changes that result in ligand-independent, constitutive kinase activity [61, 62]. (B) Viruses encode homologues of human proteins to antagonize mutations in cytokine receptors that cause hypersensitivity. Human IL-10 functions both as an immunosuppressant in the inhibition of proinflammatory cytokines, and as an immunostimulant in the induction of MHC II expression on B cells. Mutations in the IL10-binding domain of IL-10R1 abrogate hIL10-induced phosphorylation, leading to loss of immunosuppression and inflammatory bowel disease [63]. In contrast, viral IL-10 homologues encoded by Epstein-Barr virus (EBV) and human cytomegalovirus (HCMV) retain and amplify the immunosuppressive properties of hIL-10, thus facilitating viral persistence after lytic infection [64]. ebvIL-10 selectively retains only the immunosuppressive properties of hIL-10. cmvIL-10 binds with greater affinity to IL-10R1 than hIL-10, while co-opting other IL10-associated pathways to amplify the immunosuppressive properties of hIL-10. Interestingly, transgenic expression of vIL-10 has been tested in animal models as an immunosuppressant option for transplant recipients [65]. In addition, abnormal expression levels of IL-10, IL10-R1 and IL10-R2 has been suggested as a mechanism for diffuse large B-cell lymphoma, a disease with clear EBV involvement [66]. (C) Viruses abuse peptide motifs to modulate host signaling pathways, potentially mimicking the effects of disease-causing mutations. Left: Kaposi’s sarcoma-associated herpesvirus (KSHV) protein K15-M uses a “PPLP” motif to bind the SH3 domain (PF00018) of Src [67], which possibly induces conformational opening of the Src kinase domain, thereby mimicking activating mutations such as Y527F [68]. Interestingly, a W121C mutation in the KSHV-targeted SH3 domain of Src has been identified in lung cancer [69]. Middle: Murine polyomavirus (MPyV) Middle T antigen (MT) uses a tyrosine-phosphorylated motif to recruit host Shc1, thereby promoting cell cycle progression [70]. Interestingly, a R175Q mutation in the MPyV-targeted PTB domain (PF00640) of Shc1 has been found to regulate tumorigenesis in mouse models of breast cancer [71]. Right: HIV protein gag uses the late-budding domain to sequester host PTPN23 and facilitate viral budding [72]. The phosphatase domain (PF00102) of PTPN23 regulates cell migration via dephosphorylation of FAK and is often mutated in cancer and developmental disorders [73, 74].

https://doi.org/10.1371/journal.pcbi.1006762.g004

To establish whether a general equivalence exists between endogenous and exogenous perturbagens of pathways associated with PIDs, we performed a pooled analysis of all virus-targeted host proteins by considering all PIDs as a unique category of diseases with both genetic and viral contributors, all PID mutations as interchangeable endogenous perturbagens, and all viral proteins as interchangeable exogenous perturbagens. We found that overall, viruses tend to target host proteins associated with PIDs (85/338, 25.1%) rather than non-PIDs (213/2284, 9.3%) (Fisher’s exact test odds ratio = 3.3, two-tailed P = 1 × 10⁻¹⁴) (Fig 1), and virus-targeted domains are enriched for mutations causing PIDs (525/737, 71.2%) rather than non-PIDs (803/2003, 40%) (Fisher’s exact test odds ratio = 3.7, two-tailed P < 2.2 × 10⁻¹⁶) (Fig 2). Since the equivalence between oncoviruses and oncomutations has already been established in the previous section, we excluded proliferative diseases from consideration and further tested the equivalence between viral proteins and mutations in causing immunological diseases. Again, we found that viruses tend to target host proteins associated with immunological diseases (31/151, 20.5%) rather than other diseases (267/2471, 10.8%) (Fisher’s exact test odds ratio = 2.1, two-tailed P = 8 × 10⁻⁴), and virus-targeted domains are enriched for mutations causing immunological diseases (101/179, 56.4%) rather than other diseases (1227/2561, 47.9%) (Fisher’s exact test odds ratio = 1.4, two-tailed P = 0.03). Finally, we tested the equivalence between viral proteins and mutations in causing proliferative, but not immunological diseases. Overall, viruses tend to target host proteins associated with proliferative diseases (56/199, 28.1%) rather than other diseases (242/2423, 10%) (Fisher’s exact test odds ratio = 3.5, two-tailed P = 8 × 10⁻¹²), and virus-targeted domains are enriched for mutations causing proliferative diseases (431/571, 75.5%) rather than other diseases (897/2169, 41.4%) (Fisher’s exact test odds ratio = 4.4, two-tailed P < 2.2 × 10⁻¹⁶).

Oncovirus-targeted host domains are enriched for cancer driver mutations

A main challenge in cancer research is to distinguish mutations which confer clonal growth advantage (i.e. drivers), from mutations that do not cause clonal expansion (i.e. passengers) [75]. Large-scale cancer genome sequencing projects have enabled systematic identification of cancer driver proteins and mutations [76]. Rozenblatt-Rosen et al. previously constructed an oncovirus-human interactome and demonstrated, at the whole-protein level, comparability between oncoviral perturbation and conventional functional genomics approaches to cancer gene discovery [10]. However, by representing proteins and PPIs as generic nodes and edges, their approach is not sensitive enough to distinguish driver mutations from passenger mutations occurring in the same oncoprotein. As we demonstrated earlier in the case of pleiotropic oncoproteins, the oncogenicity or “driver-ness” of a mutation is often correlated with its occurrence in oncovirus-targeted domains (OVTDs).

To confirm that oncoviruses can help identify driver proteins, we first cross-classified human proteins in hvSIN by whether they are oncoviral targets, and whether they are curated by the Cancer Gene Census (CGC) as being causally implicated in cancer, i.e. driver proteins [76]. Out of 727 oncoviral targets, 93 (12.8%) are in CGC, whereas out of 10897 remaining human proteins in hvSIN, 514 (4.7%) are in CGC. In other words, there is a 3-fold enrichment of driver proteins among oncoviral targets (Fisher’s exact test, two-tailed P = 3 × 10⁻¹⁶) (Fig 5A). Next, to find out if oncoviruses can also help identify driver mutations, we cross-classified mutations in oncoproteins by whether they are drivers or passengers, and by whether they map to OVTDs. Oncogenic and resistance mutations with a ClinVar clinical significance value of “pathogenic” or “likely pathogenic” are considered drivers, while passengers include all other missense mutations in oncoproteins that are catalogued by ClinVar and COSMIC. Out of 194 oncoproteins with annotated driver mutations, we identified 30 oncoproteins as having at least one OVTD. Pooled analysis of all 30 oncoproteins mapped 340/398 (85.4%) driver mutations and 3673/7177 (51.2%) passenger mutations to OVTDs. In other words, the odds of finding a driver mutation in OVTDs is 5 times as high as that in non-OVTDs (Fisher’s exact test, two-tailed P < 2.2 × 10⁻¹⁶) (Fig 5B). Closer inspection identified 19 candidates for focused investigations into the common basis of viral and mutational oncogenesis (Table 2): (I) 7 oncoproteins where all domains are OVTDs, and the driver:passenger ratio is higher than the average ratio across all oncoproteins; (II) 8 oncoproteins where some domains are OVTDs, and driver mutations are exclusively found in OVTDs; and (III) 4 oncoproteins where some domains are OVTDs, and driver mutations are significantly enriched in OVTDs (Fisher’s exact test, two-tailed P < 0.05). An example of each type of candidate is given in Fig 6.

Download:

Fig 5. Oncovirus-targeted proteins are enriched for driver proteins, and oncovirus-targeted or mimicked domains are enriched for driver mutations.

(A) There is a 3-fold enrichment of Cancer Gene Census proteins in oncovirus-targeted proteins. (B) There are 5-fold and 3-fold enrichments of driver mutations in oncovirus-targeted domains (OVTDs) and oncoviral homology domains (OVHDs), respectively.

https://doi.org/10.1371/journal.pcbi.1006762.g005

Download:

Fig 6. Oncoproteins having at least one oncovirus-targeted domain (OVTD), where driver mutations are either exclusively found or enriched.

(A) driver:passenger ratio in oncovirus-targeted PF00605 domain of IRF1 is higher than the mean driver:passenger ratio for all oncoproteins; (B) driver mutations are exclusively found in oncovirus-targeted PF07714 domain of PDGFRA; (C) driver mutations are enriched in oncovirus-targeted PF00104 domain of AR.

https://doi.org/10.1371/journal.pcbi.1006762.g006

Download:

Table 2. Oncoproteins having at least one oncovirus-targeted domain (OVTD), where driver mutations are either exclusively found or enriched.

https://doi.org/10.1371/journal.pcbi.1006762.t002

Oncovirus-mimicked host domains are enriched for cancer driver mutations

Viruses are known to encode structural homologues that mimic host domains in order to modulate the biological activities of host targets. Such viral homology domains (VHDs) play key roles in mediating immune response (e.g. PF00048 in CMV and KSHV), apoptosis (e.g. PF00452 in EBV and KSHV), cell differentiation (e.g. PF07684 in feline leukemia virus), and protein phosphorylation (e.g. PF06734 in CMV), among other cellular processes involved in virally-implicated diseases. VHDs often compete with cellular counterparts for interaction partners, thereby rewiring host signaling networks to the virus’s advantage. Table 3 lists instances of human proteins convergently targeted by human domains and oncoviral homology domains in hvSIN.

Download:

Table 3. Human proteins convergently targeted by human domains and oncoviral homology domains (OVHDs) in hvSIN.

https://doi.org/10.1371/journal.pcbi.1006762.t003

The preceding section established that oncovirus-targeted host domains are enriched for cancer driver mutations. Here, we test the hypothesis that oncovirus-mimicked host domains are also enriched for cancer driver mutations, independent of whether they are physically targeted by the virus. To this end, we identified 21 oncoproteins having at least one oncovirus-targeted domain (OVTD) and at least one viral homology domain (VHD). We further classified viral homology domains (VHDs) into those enriched in oncogenic viruses (oncoviral homology domains, or OVHDs), versus those enriched in non-oncogenic, i.e. “generic” viruses (generic viral homology domains, or GVHDs) (Methods, S2 Table). We found that domains structurally mimicked by oncoviruses (OVHDs) are more likely to harbour driver mutations, compared to domains structurally mimicked by generic viruses (GVHDs), independent of whether the domain is physically targeted by oncoviruses (OVTD) (CMH test, common odds ratio = 2.2, P = 5 × 10⁻⁵).

We then analyzed the mutational landscape of 44 oncoproteins having at least one oncoviral homology domain (OVHD) but not physically targeted by the virus, i.e. having no OVTDs. Pooled analysis of all 44 oncoproteins mapped 245/298 (82.2%) driver mutations and 5422/9554 (56.8%) passenger mutations to OVHDs. In other words, the odds of finding a driver mutation in OVHDs is 3 times as high as that in non-OVHDs (Fisher’s exact test, two-tailed P < 2.2 × 10⁻¹⁶) (Fig 5B). Closer inspection identified 23 candidates for focused investigations into the common basis of viral and mutational oncogenesis (Table 4): (I) 4 oncoproteins where all domains are OVHDs, and the driver:passenger ratio is higher than the average ratio across all oncoproteins; (II) 16 oncoproteins where some domains are OVHDs, and driver mutations are exclusively found in OVHDs; and (III) 3 oncoproteins where some domains are OVHDs, and driver mutations are significantly enriched in OVHDs (Fisher’s exact test, two-tailed P < 0.05). An example of each type of candidate is given in Fig 7. In summary, oncovirus-mimicked host domains are enriched for cancer driver mutations, regardless of whether these domains are physically targeted by the virus.

Download:

Fig 7. Oncoproteins having no oncovirus-targeted domain (OVTD) but at least one oncoviral homology domain (OVHD), where driver mutations are either exclusively found or enriched.

(A) driver:passenger ratio in oncovirus-mimicked PF00178 and PF02198 domains of ETV6 is higher than the mean driver:passenger ratio for all oncoproteins; (B) driver mutations are exclusively found in oncovirus-mimicked PF07714 domain of MET; (C) driver mutations are enriched in oncovirus-mimicked PF07714 domain of FGFR2.

https://doi.org/10.1371/journal.pcbi.1006762.g007

Download:

Table 4. Oncoproteins having no oncovirus-targeted domain (OVTD) but at least one oncoviral homology domain (OVHD), where driver mutations are either exclusively found or enriched.

https://doi.org/10.1371/journal.pcbi.1006762.t004

Viral proteins and virally-implicated disease mutations tend to perturb the same domain-domain interactions in the human interactome

Gulbahce et al. previously hypothesized, and established at the whole-protein level, that viruses and VID mutations induce similar perturbations of the human interactome [9]. Here, we test the same hypothesis at the higher resolution of protein domains, by examining whether viruses and VID mutations perturb the same domain-domain interactions (DDIs) in the human interactome. In other words, do viruses tend to target DDI partners of domains harbouring VID mutations (viral disease domain-interacting domains, or VDDiDs), rather than DDI partners of domains harbouring non-VID mutations (non-viral disease domain-interacting domains, or nVDDiDs) (Fig 8A)? As some domains can interact with both VID domains and non-VID domains, we define VDDiDs as domains that interact with at least one VID domain, and nVDDiDs as domains that exclusively interact with non-VID domains. We found that EBV and HPV exhibit a slight preference for targeting VDDiDs, although the effect sizes are not statistically significant (42/62 VDDiDs vs. 58/104 nVDDiDs for EBV, and 20/29 VDDiDs vs. 41/69 nVDDiDs for HPV). HIV targets 218/309 (70.6%) VDDiDs and 193/346 (55.8%) nVDDiDs, representing a 1.9-fold enrichment of VDDiDs among HIV-targeted domains (Fisher’s exact test, two-tailed P = 1 × 10⁻⁴). Similarly, oncoviruses target 204/285 (71.6%) VDDiDs and 164/291 (56.4%) nVDDiDs, i.e. a 1.9-fold enrichment of VDDiDs among oncovirus-targeted domains (Fisher’s exact test, two-tailed P = 1 × 10⁻⁴). Finally, a meta-analysis on the common effect of all viral proteins and all mutations causing proliferative and immunological diseases found that viruses target 424/599 (70.8%) VDDiDs and 350/551 (63.5%) nVDDiDs, i.e. a 1.4-fold enrichment of VDDiDs among virus-targeted domains (Fisher’s exact test, two-tailed P = 0.01) (Fig 8B).

Download:

Fig 8. Viral proteins and VID mutations perturb the same domain-domain interactions in the human interactome.

(A) From left to right, domains are cross-classified as: interacting with a domain harbouring at least one VID mutation (VDDiD) and targeted by virus, VDDiD not targeted by virus, interacting with a domain harbouring only non-VID mutations (nVDDiD) and targeted by virus, and nVDDiD not targeted by virus. (B) Viruses tend to target VDDiDs rather than nVDDiDs, regardless of whether the VDDiDs and nVDDiDs are susceptible to known disease mutations. The results for EBV and HPV are not statistically significant, possibly due to small sample sizes.

https://doi.org/10.1371/journal.pcbi.1006762.g008

Virus’s preferential targeting of VDDiDs may be confounded by the tendency for viruses to target VID domains (Fig 2), and the tendency for VID domains to interact among themselves. We therefore excluded domains susceptible to known disease mutations and examined the extent to which virus targets “non-disease” domains that interact with VID domains. We found that HIV targets 179/250 (71.6%) VDDiDs and 164/285 (57.5%) nVDDiDs that do not harbour any known disease mutation (Fisher’s exact test odds ratio = 1.9, two-tailed P = 8 × 10⁻⁴). Similarly, oncoviruses target 165/230 (71.7%) VDDiDs and 137/237 (57.8%) nVDDiDs that do not harbour any known disease mutation (Fisher’s exact test odds ratio = 1.8, two-tailed P = 2 × 10⁻³). Pooled analysis of all viruses found that overall, viruses target 345/481 (71.7%) VDDiDs and 295/456 (64.7%) nVDDiDs that do not harbour any known disease mutation (Fisher’s exact test odds ratio = 1.4, two-tailed P = 0.02). Virus’s preferential targeting of VDDiDs supports our hypothesis that viruses and VID mutations inducing similar disease phenotypes convergently perturb the host domain interactome, possibly unveiling core disease modules underlying clinically heterogeneous virally-implicated diseases (Fig 9).

Download:

Fig 9. Viral proteins and VID mutations convergently perturb dense regions of the human domain interactome.

Examples are given for EBV (left) and HIV (right).

https://doi.org/10.1371/journal.pcbi.1006762.g009

Discussion

Structural interaction networks serve as a valuable tool for understanding the molecular mechanisms of genetic diseases, as well as the fundamental differences between endogenous and exogenous PPI networks. As experimental determination of protein structure remains an arduous task, homology modelling offers an efficient alternative for the structural annotation of protein complexes. This is based on the observation that PPIs are often mediated by evolutionarily conserved structural modules, such as domains and short linear motifs [77]. Here, we reassess the role of viral proteins as surrogates for human disease variants in relating interactome network perturbation to disease phenotypes, using a domain-resolved human-virus protein interactome where human domains are annotated with disease variant information. Compared to previous work demonstrating general proximity between viral targets and VID proteins in the human interactome, our results provide a structural explanation for the equivalent pathogenicity of viral proteins and VID mutations. Whereas previous studies merely recognized the existence of viral homologues of cellular domains, we delve deeper into the functional implications of oncoviral domain homology. Our approach can readily identify domains convergently targeted or mimicked by diverse oncoviruses for focused screening of driver mutations across various types of cancer. Further characterization of cellular domains and motifs interacting with domains targeted or mimicked by viruses may uncover immune evasion strategies exploited in common by cancer cells and pathogens, and shed light on pathways dysregulated in other virally-implicated disorders.

Although most of our findings are statistically significant, there are notable differences in the enrichment of VID mutations in virus-targeted domains, both among individual viruses (EBV, HPV and HIV), as well as between single-virus analysis and pooled analysis on multiple viruses. For single-virus analysis, enrichment effect size and significance are impacted by the number of virus-host protein-protein interactions and virus-specific diseases, which ultimately determine the statistical power. Pooled analysis on all oncoviruses detected trends in the same direction as analysis on single oncoviruses (EBV and HPV), but with higher statistical power. In addition to investigator bias resulting in some viruses having a higher number of mapped virus-host PPIs, it is also possible that certain viruses prefer to perturb host regulatory network, rather than host PPI network, which is beyond the scope of this work. Compared to direct targeting of VID domains (a “first-degree” effect), viral targeting of the interaction partners of VID domains is expected to have a weaker, “second-degree” effect on the VID domains. This partly explains why results of the “first-degree” analysis on EBV and HPV (Fig 2) are stronger than those of the “second-degree” analysis (Fig 8B).

Our pooled analysis of all oncoviral targets and all oncomutations is motivated by the assumption of convergent evolution and mimicry of endogenous oncogenic mechanisms by diverse oncoviruses. There is compelling evidence of different oncoviruses complementing each other’s replication and persistence strategies, thus eliciting multiple cellular responses associated with the hallmarks of cancer. One example is primary effusion lymphoma, a disease causally linked to KSHV but also having an EBV component. While expression of KSHV lytic genes such as vIL-6 and K1 promote VEGF secretion and angiogenesis, concomitant expression of EBV latent genes confers additional anti-apoptotic properties to infected cells in the initial phase of lymphomagenesis [78, 79]. Given the paucity of context-dependent (i.e. tissue- and disease-specific) host-endogenous and host-pathogen PPI data, here we focus on establishing viral proteins and genetic mutations that induce similar disease phenotypes as generally equivalent perturbagens of the human interactome. Future work will also consider the diversity of host range and tissue tropism among different viruses, and the potentially distinct functional impacts of the same mutation in different cell types and diseases.

One potential caveat of our interactome perturbation model is its incompleteness, due to the following reasons. Firstly, current mapping of the host-virus protein interactome is far from exhaustive. Secondly, some bona fide host-virus PPIs cannot be modelled by existing domain-based interaction templates. Thirdly, virus may not interact with a host protein via PPI, but rather regulate its expression via transcriptional or epigenetic mechanisms. Lastly, our study only considers missense mutations, because domain-based analysis of interactome perturbation requires precise positioning of mutations with respect to protein domains. Missense mutations can be unambiguously mapped to individual domains, whereas other types of mutations (e.g. nonsense or frameshift) may cause more drastic changes in the protein structure and are more difficult to map to individual domains. We are aware, however, of literature suggesting that nonsense and frameshift mutations tend to occur more frequently in tumour suppressor genes than in oncogenes [80]. Effects of these mutations on the integrity of the human interactome warrant further investigation. Still, despite the incompleteness of our model, we observed significant convergent perturbation of the human domain-resolved interactome by viruses and mutations inducing similar disease phenotypes.

The advent of high-throughput biotechnology has made it possible to comprehensively characterize genomic variations in and interspecies interactions between human and microbes, which play important roles in health and disease. As more data on pathogen-implicated diseases and host-pathogen interactions emerge, our approach may be extended to the study of bacterial diseases and co-infections involving multiple pathogenic species, such as the co-pathogenesis of HIV and Mycobacterium tuberculosis. By combining these data within the framework of structural systems biology, our work sets the stage for multi-scale, integrative investigations into endogenous and exogenous perturbagens of the human interactome, thus helping to elucidate the molecular mechanisms of infection and its possible connections to genetic diseases such as cancer, autoimmunity, and neurodegeneration.

Methods

Construction of disease-annotated human-virus structural interaction network

Human-endogenous and human-virus binary PPI data were obtained from IntAct [14], HPIDB [15], and the HIV-1 Human Interaction Database [16–18]. Structural templates for domain-domain and domain-motif interactions were obtained from 3did [19], iPfam [21] and ELM [20]. Protein sequences were scanned for Pfam domains using InterProScan under default settings (version 5.30–69.0) [23, 81], and for the occurrence of domain-binding motifs as defined by 3did and ELM. Domain-based interaction models were assigned to each PPI by extracting all DDIs and DMIs possibly mediating the PPI. Disease association and clinical significance of variants were obtained from UniProtKB, ClinVar, and COSMIC [24, 25, 76]. Ensembl Variant Effect Predictor (VEP v93.0) was used for extracting variant genomic location, variation class, reference allele, HGVS notations, amino acid position, overlapping Pfam domains, among other features [82]. To facilitate counting of mutational events, variants are annotated with RefSNP IDs using VEP’s check_existing flag. Variants not co-located with any known variant are merged based on identical genomic location, variation class, and shared alleles, as per NCBI guidelines for merging submitted SNPs into RefSNP clusters (https://www.ncbi.nlm.nih.gov/books/NBK44417/). Only missense mutations located inside Pfam domains were retained for analyses. Assignment of each virally-implicated disease (VID) to EBV, HPV and HIV was based on at least two literature sources (S1 Text). To minimize redundancy in disease annotation, UMLS and OMIM IDs given to subtypes of the same disease were merged into the more general Disease Ontology [83], Orphanet [84] and MeSH IDs.

Pooled analysis of viral proteins and disease mutations

Oncoviruses are as classified by CDC, IARC, and MeSH (https://www.ncbi.nlm.nih.gov/mesh/68009858). Cancer is defined as any disease whose parent terms include “DOID:162”, “ORPHA:250908”, or MeSH IDs beginning with “C04.557|C04.588|C04.619|C04.626|C04.651|C04.666|C04.682|C04.692|C04.697|C04.700|C04.730|C04.834|C04.850”. Diseases without Disease Ontology, Orphanet or MeSH IDs are manually labelled as “cancer” if their names match the following regular expression: “blastoma|cancer|carcino*|glioma|leukemia|leukaemia|lymphoma|melanoma|neoplas*|sarcoma|tumour|tumor”. Proliferative diseases have parent terms “DOID:14566”, “ORPHA:250908”, or MeSH IDs beginning with “C04”. Immunological diseases have parent terms “DOID:2914”, “ORPHA:98004”, or MeSH IDs beginning with “C20”. All statistical analyses were conducted in R [85]. Plots of domain-level distribution of disease mutations were created with Protter [86].

Classification of viral homology domains

Pfam domain annotation for all human and viral proteins in UniProt was retrieved from InterPro (Release 69.0) [87]. We define viral homology domains (VHDs) as Pfam domains conserved between human and viral proteins. For each VHD, the likelihood of it occurring in oncoviruses was calculated as the number of oncoviruses encoding the VHD, divided by the total number of unique oncoviral species in UniProt. Similarly, the likelihood of a VHD occurring in “generic” (i.e. non-oncogenic) viruses was calculated as the number of generic viruses encoding the VHD divided by the total number of unique generic viral species in UniProt. The observed likelihood ratio (LR) of an oncovirus vs. a generic virus encoding the VHD is then the ratio of the two likelihoods. We then permuted the label “oncovirus” and “generic virus” 10000 times among viruses encoding the VHD, thereby obtaining a null distribution for the LR. An empirical p-value for the enrichment or depletion of a VHD in oncoviral proteomes was calculated according to [88]. VHDs whose observed LR > 1 and Benjamini-Hochberg adjusted p-values (q-values) < 0.1 are considered enriched in oncoviral proteomes. These VHDs and other VHDs exclusively occurring in oncoviruses are called oncoviral homology domains (OVHDs). Likewise, VHDs whose observed LR < 1 and q-values < 0.1 are considered enriched in generic viral proteomes. These VHDs and other VHDs exclusively occurring in generic viruses are called generic viral homology domains (GVHDs).

Supporting information

S1 Text. References for virally implicated diseases.

https://doi.org/10.1371/journal.pcbi.1006762.s001

(DOCX)

S1 Table. Virally implicated diseases and disease-associated proteins for EBV, HPV and HIV.

Disease proteins are shown in brackets if they are targeted by virus, but the human-virus PPI does not have a domain-based structural model. Disease proteins are in bold if the domain harbouring disease mutation is targeted by virus.

https://doi.org/10.1371/journal.pcbi.1006762.s002

(XLSX)

S2 Table. Human domains exclusively occurring or enriched in oncoviruses vs. generic viruses.

For each Pfam domain, a human instance (UniProt ID) and a viral instance (Taxonomy ID|Taxonomy Name|UniProt ID) are selected, with preference given to proteins in hvSIN. Mappings of Pfam domains to Gene Ontology terms are obtained from http://geneontology.org/external2go/pfam2go.

https://doi.org/10.1371/journal.pcbi.1006762.s003

(CSV)

References

1. Vidal M., Cusick M.E., and Barabasi A.L., Interactome networks and human disease. Cell, 2011. 144(6): p. 986–98. pmid:21414488
- View Article
- PubMed/NCBI
- Google Scholar
2. Oliver S., Guilt-by-association goes global. Nature, 2000. 403(6770): p. 601–3. pmid:10688178
- View Article
- PubMed/NCBI
- Google Scholar
3. Barabasi A.L., Network medicine—from obesity to the "diseasome". N Engl J Med, 2007. 357(4): p. 404–7. pmid:17652657
- View Article
- PubMed/NCBI
- Google Scholar
4. Goh K.I., et al., The human disease network. Proc Natl Acad Sci U S A, 2007. 104(21): p. 8685–90. pmid:17502601
- View Article
- PubMed/NCBI
- Google Scholar
5. Zhong Q., et al., Edgetic perturbation models of human inherited disorders. Mol Syst Biol, 2009. 5: p. 321. pmid:19888216
- View Article
- PubMed/NCBI
- Google Scholar
6. Wang X., et al., Three-dimensional reconstruction of protein networks provides insight into human genetic disease. Nat Biotechnol, 2012. 30(2): p. 159–64. pmid:22252508
- View Article
- PubMed/NCBI
- Google Scholar
7. Sahni N., et al., Edgotype: a fundamental link between genotype and phenotype. Curr Opin Genet Dev, 2013. 23(6): p. 649–57. pmid:24287335
- View Article
- PubMed/NCBI
- Google Scholar
8. Sahni N., et al., Widespread macromolecular interaction perturbations in human genetic disorders. Cell, 2015. 161(3): p. 647–60. pmid:25910212
- View Article
- PubMed/NCBI
- Google Scholar
9. Gulbahce N., et al., Viral perturbations of host networks reflect disease etiology. PLoS Comput Biol, 2012. 8(6): p. e1002531. pmid:22761553
- View Article
- PubMed/NCBI
- Google Scholar
10. Rozenblatt-Rosen O., et al., Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins. Nature, 2012. 487(7408): p. 491–5. pmid:22810586
- View Article
- PubMed/NCBI
- Google Scholar
11. Kim P.M., et al., Relating three-dimensional structures to protein networks provides evolutionary insights. Science, 2006. 314(5807): p. 1938–41. pmid:17185604
- View Article
- PubMed/NCBI
- Google Scholar
12. Franzosa E.A. and Xia Y., Structural principles within the human-virus protein-protein interaction network. Proc Natl Acad Sci U S A, 2011. 108(26): p. 10538–43. pmid:21680884
- View Article
- PubMed/NCBI
- Google Scholar
13. Garamszegi S., Franzosa E.A., and Xia Y., Signatures of pleiotropy, economy and convergent evolution in a domain-resolved map of human-virus protein-protein interaction networks. PLoS Pathog, 2013. 9(12): p. e1003778. pmid:24339775
- View Article
- PubMed/NCBI
- Google Scholar
14. Orchard S., et al., The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res, 2014. 42(Database issue): p. D358–63. pmid:24234451
- View Article
- PubMed/NCBI
- Google Scholar
15. Ammari M.G., et al., HPIDB 2.0: a curated database for host-pathogen interactions. Database (Oxford), 2016. 2016.
- View Article
- Google Scholar
16. Fu W., et al., Human immunodeficiency virus type 1, human protein interaction database at NCBI. Nucleic Acids Res, 2009. 37(Database issue): p. D417–22. pmid:18927109
- View Article
- PubMed/NCBI
- Google Scholar
17. Ptak R.G., et al., Cataloguing the HIV type 1 human protein interaction network. AIDS Res Hum Retroviruses, 2008. 24(12): p. 1497–502. pmid:19025396
- View Article
- PubMed/NCBI
- Google Scholar
18. Pinney J.W., et al., HIV-host interactions: a map of viral perturbation of the host system. AIDS, 2009. 23(5): p. 549–54. pmid:19262354
- View Article
- PubMed/NCBI
- Google Scholar
19. Mosca R., et al., 3did: a catalog of domain-based interactions of known three-dimensional structure. Nucleic Acids Res, 2014. 42(Database issue): p. D374–9. pmid:24081580
- View Article
- PubMed/NCBI
- Google Scholar
20. Dinkel H., et al., ELM 2016—data update and new functionality of the eukaryotic linear motif resource. Nucleic Acids Res, 2016. 44(D1): p. D294–300. pmid:26615199
- View Article
- PubMed/NCBI
- Google Scholar
21. Finn R.D., et al., iPfam: a database of protein family and domain interactions found in the Protein Data Bank. Nucleic Acids Res, 2014. 42(Database issue): p. D364–73. pmid:24297255
- View Article
- PubMed/NCBI
- Google Scholar
22. Berman H.M., et al., The Protein Data Bank. Nucleic Acids Res, 2000. 28(1): p. 235–42. pmid:10592235
- View Article
- PubMed/NCBI
- Google Scholar
23. Finn R.D., et al., The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res, 2016. 44(D1): p. D279–85. pmid:26673716
- View Article
- PubMed/NCBI
- Google Scholar
24. Famiglietti M.L., et al., Genetic variations and diseases in UniProtKB/Swiss-Prot: the ins and outs of expert manual curation. Hum Mutat, 2014. 35(8): p. 927–35. pmid:24848695
- View Article
- PubMed/NCBI
- Google Scholar
25. Landrum M.J., et al., ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res, 2016. 44(D1): p. D862–8. pmid:26582918
- View Article
- PubMed/NCBI
- Google Scholar
26. Hanahan D. and Weinberg R.A., The hallmarks of cancer. Cell, 2000. 100(1): p. 57–70. pmid:10647931
- View Article
- PubMed/NCBI
- Google Scholar
27. Mesri E.A., Feitelson M.A., and Munger K., Human viral oncogenesis: a cancer hallmarks analysis. Cell Host Microbe, 2014. 15(3): p. 266–82. pmid:24629334
- View Article
- PubMed/NCBI
- Google Scholar
28. Epstein M.A., Achong B.G., and Barr Y.M., Virus Particles in Cultured Lymphoblasts from Burkitt's Lymphoma. Lancet, 1964. 1(7335): p. 702–3. pmid:14107961
- View Article
- PubMed/NCBI
- Google Scholar
29. Wang S., et al., Identification and Characterization of Epstein-Barr Virus Genomes in Lung Carcinoma Biopsy Samples by Next-Generation Sequencing Technology. Sci Rep, 2016. 6: p. 26156. pmid:27189712
- View Article
- PubMed/NCBI
- Google Scholar
30. Kimura H., et al., EBV-associated T/NK-cell lymphoproliferative diseases in nonimmunocompromised hosts: prospective analysis of 108 cases. Blood, 2012. 119(3): p. 673–86. pmid:22096243
- View Article
- PubMed/NCBI
- Google Scholar
31. Lung M.L., et al., The interplay of host genetic factors and Epstein-Barr virus in the development of nasopharyngeal carcinoma. Chin J Cancer, 2014. 33(11): p. 556–68. pmid:25367335
- View Article
- PubMed/NCBI
- Google Scholar
32. Fukayama M., Hino R., and Uozaki H., Epstein-Barr virus and gastric carcinoma: virus-host interactions leading to carcinoma. Cancer Sci, 2008. 99(9): p. 1726–33. pmid:18616681
- View Article
- PubMed/NCBI
- Google Scholar
33. Schiffman M., et al., Human papillomavirus and cervical cancer. Lancet, 2007. 370(9590): p. 890–907. pmid:17826171
- View Article
- PubMed/NCBI
- Google Scholar
34. Nulton T.J., et al., Analysis of The Cancer Genome Atlas sequencing data reveals novel properties of the human papillomavirus 16 genome in head and neck squamous cell carcinoma. Oncotarget, 2017. 8(11): p. 17684–17699. pmid:28187443
- View Article
- PubMed/NCBI
- Google Scholar
35. Moscicki A.B., et al., Updating the natural history of human papillomavirus and anogenital cancers. Vaccine, 2012. 30 Suppl 5: p. F24–33.
- View Article
- Google Scholar
36. Li N., et al., Human papillomavirus infection and bladder cancer risk: a meta-analysis. J Infect Dis, 2011. 204(2): p. 217–23. pmid:21673031
- View Article
- PubMed/NCBI
- Google Scholar
37. Li N., et al., Human papillomavirus infection and sporadic breast carcinoma risk: a meta-analysis. Breast Cancer Res Treat, 2011. 126(2): p. 515–20. pmid:20740311
- View Article
- PubMed/NCBI
- Google Scholar
38. Klein F., Amin Kotb W.F., and Petersen I., Incidence of human papilloma virus in lung cancer. Lung Cancer, 2009. 65(1): p. 13–8. pmid:19019488
- View Article
- PubMed/NCBI
- Google Scholar
39. Singh N., et al., Implication of high risk human papillomavirus HR-HPV infection in prostate cancer in Indian population—a pioneering case-control analysis. Sci Rep, 2015. 5: p. 7822. pmid:25592643
- View Article
- PubMed/NCBI
- Google Scholar
40. Monforte A., et al., HIV-induced immunodeficiency and mortality from AIDS-defining and non-AIDS-defining malignancies. AIDS, 2008. 22(16): p. 2143–53. pmid:18832878
- View Article
- PubMed/NCBI
- Google Scholar
41. Grulich A.E., et al., Incidence of cancers in people with HIV/AIDS compared with immunosuppressed transplant recipients: a meta-analysis. Lancet, 2007. 370(9581): p. 59–67. pmid:17617273
- View Article
- PubMed/NCBI
- Google Scholar
42. De Falco G., et al., Interaction between HIV-1 Tat and pRb2/p130: a possible mechanism in the pathogenesis of AIDS-related neoplasms. Oncogene, 2003. 22(40): p. 6214–9. pmid:13679860
- View Article
- PubMed/NCBI
- Google Scholar
43. Nunnari G., Smith J.A., and Daniel R., HIV-1 Tat and AIDS-associated cancer: targeting the cellular anti-cancer barrier? J Exp Clin Cancer Res, 2008. 27: p. 3. pmid:18577246
- View Article
- PubMed/NCBI
- Google Scholar
44. Briggs S.D., et al., SH3-mediated Hck tyrosine kinase activation and fibroblast transformation by the Nef protein of HIV-1. J Biol Chem, 1997. 272(29): p. 17899–902. pmid:9218412
- View Article
- PubMed/NCBI
- Google Scholar
45. Barbaro G., Cardiovascular manifestations of HIV infection. Circulation, 2002. 106(11): p. 1420–5. pmid:12221062
- View Article
- PubMed/NCBI
- Google Scholar
46. Pugliese A., et al., Impact of highly active antiretroviral therapy in HIV-positive patients with cardiac involvement. J Infect, 2000. 40(3): p. 282–4. pmid:10908024
- View Article
- PubMed/NCBI
- Google Scholar
47. Yunis N.A. and Stone V.E., Cardiac manifestations of HIV/AIDS: a review of disease spectrum and clinical management. J Acquir Immune Defic Syndr Hum Retrovirol, 1998. 18(2): p. 145–54. pmid:9637579
- View Article
- PubMed/NCBI
- Google Scholar
48. Hersh B.P., Rajendran P.R., and Battinelli D., Parkinsonism as the presenting manifestation of HIV infection: improvement on HAART. Neurology, 2001. 56(2): p. 278–9. pmid:11160977
- View Article
- PubMed/NCBI
- Google Scholar
49. Koutsilieri E., et al., Parkinsonism in HIV dementia. J Neural Transm (Vienna), 2002. 109(5–6): p. 767–75.
- View Article
- Google Scholar
50. Mirsattari S.M., Power C., and Nath A., Parkinsonism with HIV infection. Mov Disord, 1998. 13(4): p. 684–9. pmid:9686775
- View Article
- PubMed/NCBI
- Google Scholar
51. Kumar A., et al., Tuning of AKT-pathway by Nef and its blockade by protease inhibitors results in limited recovery in latently HIV infected T-cell line. Sci Rep, 2016. 6: p. 24090. pmid:27076174
- View Article
- PubMed/NCBI
- Google Scholar
52. Portis T., Dyck P., and Longnecker R., Epstein-Barr Virus (EBV) LMP2A induces alterations in gene transcription similar to those observed in Reed-Sternberg cells of Hodgkin lymphoma. Blood, 2003. 102(12): p. 4166–78. pmid:12907455
- View Article
- PubMed/NCBI
- Google Scholar
53. Lamers S.L., et al., HIV-1 Nef in macrophage-mediated disease pathogenesis. Int Rev Immunol, 2012. 31(6): p. 432–50. pmid:23215766
- View Article
- PubMed/NCBI
- Google Scholar
54. Plummer M., et al., Global burden of cancers attributable to infections in 2012: a synthetic analysis. Lancet Glob Health, 2016. 4(9): p. e609–16. pmid:27470177
- View Article
- PubMed/NCBI
- Google Scholar
55. Vogt P.K., Retroviral oncogenes: a historical primer. Nat Rev Cancer, 2012. 12(9): p. 639–48. pmid:22898541
- View Article
- PubMed/NCBI
- Google Scholar
56. Stevenson P.G., Simas J.P., and Efstathiou S., Immune control of mammalian gamma-herpesviruses: lessons from murid herpesvirus-4. J Gen Virol, 2009. 90(Pt 10): p. 2317–30. pmid:19605591
- View Article
- PubMed/NCBI
- Google Scholar
57. Parada L.F., et al., Human EJ bladder carcinoma oncogene is homologue of Harvey sarcoma virus ras gene. Nature, 1982. 297(5866): p. 474–8. pmid:6283357
- View Article
- PubMed/NCBI
- Google Scholar
58. de Martel C., et al., Cancers attributable to infections among adults with HIV in the United States. AIDS, 2015. 29(16): p. 2173–81. pmid:26182198
- View Article
- PubMed/NCBI
- Google Scholar
59. Manabe A., et al., Viral Infections in Juvenile Myelomonocytic Leukemia: Prevalence and Clinical Implications. J Pediatr Hematol Oncol, 2004. 26(10): p. 636–641.
- View Article
- Google Scholar
60. Koike K. and Matsuda K., Recent advances in the pathogenesis and management of juvenile myelomonocytic leukaemia. Br J Haematol, 2008. 141(5): p. 567–75. pmid:18422786
- View Article
- PubMed/NCBI
- Google Scholar
61. Kaplan M., et al., EGFR Dynamics Change during Activation in Native Membranes as Revealed by NMR. Cell, 2016. 167(5): p. 1241–1251 e11. pmid:27839865
- View Article
- PubMed/NCBI
- Google Scholar
62. Purba E.R., Saita E.I., and Maruyama I.N., Activation of the EGF Receptor by Ligand Binding and Oncogenic Mutations: The "Rotation Model". Cells, 2017. 6(2).
- View Article
- Google Scholar
63. Glocker E.O., et al., Inflammatory bowel disease and mutations affecting the interleukin-10 receptor. N Engl J Med, 2009. 361(21): p. 2033–45. pmid:19890111
- View Article
- PubMed/NCBI
- Google Scholar
64. Slobedman B., et al., Virus-encoded homologs of cellular interleukin-10 and their control of host immune function. J Virol, 2009. 83(19): p. 9618–29. pmid:19640997
- View Article
- PubMed/NCBI
- Google Scholar
65. DeBruyne L.A., et al., Lipid-mediated gene transfer of viral IL-10 prolongs vascularized cardiac allograft survival by inhibiting donor-specific cellular and humoral immune responses. Gene Ther, 1998. 5(8): p. 1079–87. pmid:10326031
- View Article
- PubMed/NCBI
- Google Scholar
66. Beguelin W., et al., IL10 receptor is a novel therapeutic target in DLBCLs. Leukemia, 2015. 29(8): p. 1684–94. pmid:25733167
- View Article
- PubMed/NCBI
- Google Scholar
67. Pietrek M., et al., Role of the Kaposi's sarcoma-associated herpesvirus K15 SH3 binding site in inflammatory signaling and B-cell activation. J Virol, 2010. 84(16): p. 8231–40. pmid:20534855
- View Article
- PubMed/NCBI
- Google Scholar
68. Myoui A., et al., C-SRC tyrosine kinase activity is associated with tumor colonization in bone and lung in an animal model of human breast cancer metastasis. Cancer Res, 2003. 63(16): p. 5028–33. pmid:12941830
- View Article
- PubMed/NCBI
- Google Scholar
69. Imielinski M., et al., Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell, 2012. 150(6): p. 1107–20. pmid:22980975
- View Article
- PubMed/NCBI
- Google Scholar
70. Campbell K.S., et al., Polyoma middle tumor antigen interacts with SHC protein via the NPTY (Asn-Pro-Thr-Tyr) motif in middle tumor antigen. Proc Natl Acad Sci U S A, 1994. 91(14): p. 6344–8. pmid:8022784
- View Article
- PubMed/NCBI
- Google Scholar
71. Ahn R., et al., The ShcA PTB domain functions as a biological sensor of phosphotyrosine signaling during breast cancer progression. Cancer Res, 2013. 73(14): p. 4521–32. pmid:23695548
- View Article
- PubMed/NCBI
- Google Scholar
72. Dussupt V., et al., The nucleocapsid region of HIV-1 Gag cooperates with the PTAP and LYPXnL late domains to recruit the cellular machinery necessary for viral budding. PLoS Pathog, 2009. 5(3): p. e1000339. pmid:19282983
- View Article
- PubMed/NCBI
- Google Scholar
73. Castiglioni S., Maier J.A., and Mariotti M., The tyrosine phosphatase HD-PTP: A novel player in endothelial migration. Biochem Biophys Res Commun, 2007. 364(3): p. 534–9. pmid:17959146
- View Article
- PubMed/NCBI
- Google Scholar
74. Manteghi S., et al., Haploinsufficiency of the ESCRT Component HD-PTP Predisposes to Cancer. Cell Rep, 2016. 15(9): p. 1893–900. pmid:27210750
- View Article
- PubMed/NCBI
- Google Scholar
75. Stratton M.R., Campbell P.J., and Futreal P.A., The cancer genome. Nature, 2009. 458(7239): p. 719–24. pmid:19360079
- View Article
- PubMed/NCBI
- Google Scholar
76. Forbes S.A., et al., COSMIC: somatic cancer genetics at high-resolution. Nucleic Acids Res, 2017. 45(D1): p. D777–D783. pmid:27899578
- View Article
- PubMed/NCBI
- Google Scholar
77. Aloy P. and Russell R.B., Structural systems biology: modelling protein interactions. Nat Rev Mol Cell Biol, 2006. 7(3): p. 188–97. pmid:16496021
- View Article
- PubMed/NCBI
- Google Scholar
78. Haddad L., et al., KSHV-transformed primary effusion lymphoma cells induce a VEGF-dependent angiogenesis and establish functional gap junctions with endothelial cells. Leukemia, 2008. 22(4): p. 826–34. pmid:18094712
- View Article
- PubMed/NCBI
- Google Scholar
79. Keller S.A., et al., NF-kappaB is essential for the progression of KSHV- and EBV-infected lymphomas in vivo. Blood, 2006. 107(8): p. 3295–302. pmid:16380446
- View Article
- PubMed/NCBI
- Google Scholar
80. Mort M., et al., A meta-analysis of nonsense mutations causing human genetic disease. Hum Mutat, 2008. 29(8): p. 1037–47. pmid:18454449
- View Article
- PubMed/NCBI
- Google Scholar
81. Jones P., et al., InterProScan 5: genome-scale protein function classification. Bioinformatics, 2014. 30(9): p. 1236–40. pmid:24451626
- View Article
- PubMed/NCBI
- Google Scholar
82. McLaren W., et al., The Ensembl Variant Effect Predictor. Genome Biol, 2016. 17(1): p. 122. pmid:27268795
- View Article
- PubMed/NCBI
- Google Scholar
83. Kibbe W.A., et al., Disease Ontology 2015 update: an expanded and updated database of human diseases for linking biomedical knowledge through disease data. Nucleic Acids Res, 2015. 43(Database issue): p. D1071–8. pmid:25348409
- View Article
- PubMed/NCBI
- Google Scholar
84. Rath A., et al., Representation of rare diseases in health information systems: the Orphanet approach to serve a wide range of end users. Hum Mutat, 2012. 33(5): p. 803–8. pmid:22422702
- View Article
- PubMed/NCBI
- Google Scholar
85. Team, R.C., R: A language and environment for statistical computing. 2018, R Foundation for Statistical Computing: Vienna, Austria.
86. Omasits U., et al., Protter: interactive protein feature visualization and integration with experimental proteomic data. Bioinformatics, 2014. 30(6): p. 884–6. pmid:24162465
- View Article
- PubMed/NCBI
- Google Scholar
87. Finn R.D., et al., InterPro in 2017-beyond protein family and domain annotations. Nucleic Acids Res, 2017. 45(D1): p. D190–D199. pmid:27899635
- View Article
- PubMed/NCBI
- Google Scholar
88. Phipson B. and Smyth G.K., Permutation P-values should never be zero: calculating exact P-values when permutations are randomly drawn. Stat Appl Genet Mol Biol, 2010. 9: p. Article39.
- View Article
- Google Scholar

[ref1] 1. Vidal M., Cusick M.E., and Barabasi A.L., Interactome networks and human disease. Cell, 2011. 144(6): p. 986–98. pmid:21414488
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Oliver S., Guilt-by-association goes global. Nature, 2000. 403(6770): p. 601–3. pmid:10688178
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Barabasi A.L., Network medicine—from obesity to the "diseasome". N Engl J Med, 2007. 357(4): p. 404–7. pmid:17652657
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Goh K.I., et al., The human disease network. Proc Natl Acad Sci U S A, 2007. 104(21): p. 8685–90. pmid:17502601
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Zhong Q., et al., Edgetic perturbation models of human inherited disorders. Mol Syst Biol, 2009. 5: p. 321. pmid:19888216
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Wang X., et al., Three-dimensional reconstruction of protein networks provides insight into human genetic disease. Nat Biotechnol, 2012. 30(2): p. 159–64. pmid:22252508
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Sahni N., et al., Edgotype: a fundamental link between genotype and phenotype. Curr Opin Genet Dev, 2013. 23(6): p. 649–57. pmid:24287335
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Sahni N., et al., Widespread macromolecular interaction perturbations in human genetic disorders. Cell, 2015. 161(3): p. 647–60. pmid:25910212
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Gulbahce N., et al., Viral perturbations of host networks reflect disease etiology. PLoS Comput Biol, 2012. 8(6): p. e1002531. pmid:22761553
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Rozenblatt-Rosen O., et al., Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins. Nature, 2012. 487(7408): p. 491–5. pmid:22810586
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Kim P.M., et al., Relating three-dimensional structures to protein networks provides evolutionary insights. Science, 2006. 314(5807): p. 1938–41. pmid:17185604
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Franzosa E.A. and Xia Y., Structural principles within the human-virus protein-protein interaction network. Proc Natl Acad Sci U S A, 2011. 108(26): p. 10538–43. pmid:21680884
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. Garamszegi S., Franzosa E.A., and Xia Y., Signatures of pleiotropy, economy and convergent evolution in a domain-resolved map of human-virus protein-protein interaction networks. PLoS Pathog, 2013. 9(12): p. e1003778. pmid:24339775
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref14] 14. Orchard S., et al., The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res, 2014. 42(Database issue): p. D358–63. pmid:24234451
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref15] 15. Ammari M.G., et al., HPIDB 2.0: a curated database for host-pathogen interactions. Database (Oxford), 2016. 2016.
View Article
Google Scholar

[58] View Article

[59] Google Scholar

[ref16] 16. Fu W., et al., Human immunodeficiency virus type 1, human protein interaction database at NCBI. Nucleic Acids Res, 2009. 37(Database issue): p. D417–22. pmid:18927109
View Article
PubMed/NCBI
Google Scholar

[61] View Article

[62] PubMed/NCBI

[63] Google Scholar

[ref17] 17. Ptak R.G., et al., Cataloguing the HIV type 1 human protein interaction network. AIDS Res Hum Retroviruses, 2008. 24(12): p. 1497–502. pmid:19025396
View Article
PubMed/NCBI
Google Scholar

[65] View Article

[66] PubMed/NCBI

[67] Google Scholar

[ref18] 18. Pinney J.W., et al., HIV-host interactions: a map of viral perturbation of the host system. AIDS, 2009. 23(5): p. 549–54. pmid:19262354
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref19] 19. Mosca R., et al., 3did: a catalog of domain-based interactions of known three-dimensional structure. Nucleic Acids Res, 2014. 42(Database issue): p. D374–9. pmid:24081580
View Article
PubMed/NCBI
Google Scholar

[73] View Article

[74] PubMed/NCBI

[75] Google Scholar

[ref20] 20. Dinkel H., et al., ELM 2016—data update and new functionality of the eukaryotic linear motif resource. Nucleic Acids Res, 2016. 44(D1): p. D294–300. pmid:26615199
View Article
PubMed/NCBI
Google Scholar

[77] View Article

[78] PubMed/NCBI

[79] Google Scholar

[ref21] 21. Finn R.D., et al., iPfam: a database of protein family and domain interactions found in the Protein Data Bank. Nucleic Acids Res, 2014. 42(Database issue): p. D364–73. pmid:24297255
View Article
PubMed/NCBI
Google Scholar

[81] View Article

[82] PubMed/NCBI

[83] Google Scholar

[ref22] 22. Berman H.M., et al., The Protein Data Bank. Nucleic Acids Res, 2000. 28(1): p. 235–42. pmid:10592235
View Article
PubMed/NCBI
Google Scholar

[85] View Article

[86] PubMed/NCBI

[87] Google Scholar

[ref23] 23. Finn R.D., et al., The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res, 2016. 44(D1): p. D279–85. pmid:26673716
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref24] 24. Famiglietti M.L., et al., Genetic variations and diseases in UniProtKB/Swiss-Prot: the ins and outs of expert manual curation. Hum Mutat, 2014. 35(8): p. 927–35. pmid:24848695
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref25] 25. Landrum M.J., et al., ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res, 2016. 44(D1): p. D862–8. pmid:26582918
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref26] 26. Hanahan D. and Weinberg R.A., The hallmarks of cancer. Cell, 2000. 100(1): p. 57–70. pmid:10647931
View Article
PubMed/NCBI
Google Scholar

[101] View Article

[102] PubMed/NCBI

[103] Google Scholar

[ref27] 27. Mesri E.A., Feitelson M.A., and Munger K., Human viral oncogenesis: a cancer hallmarks analysis. Cell Host Microbe, 2014. 15(3): p. 266–82. pmid:24629334
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

[ref28] 28. Epstein M.A., Achong B.G., and Barr Y.M., Virus Particles in Cultured Lymphoblasts from Burkitt's Lymphoma. Lancet, 1964. 1(7335): p. 702–3. pmid:14107961
View Article
PubMed/NCBI
Google Scholar

[109] View Article

[110] PubMed/NCBI

[111] Google Scholar

[ref29] 29. Wang S., et al., Identification and Characterization of Epstein-Barr Virus Genomes in Lung Carcinoma Biopsy Samples by Next-Generation Sequencing Technology. Sci Rep, 2016. 6: p. 26156. pmid:27189712
View Article
PubMed/NCBI
Google Scholar

[113] View Article

[114] PubMed/NCBI

[115] Google Scholar

[ref30] 30. Kimura H., et al., EBV-associated T/NK-cell lymphoproliferative diseases in nonimmunocompromised hosts: prospective analysis of 108 cases. Blood, 2012. 119(3): p. 673–86. pmid:22096243
View Article
PubMed/NCBI
Google Scholar

[117] View Article

[118] PubMed/NCBI

[119] Google Scholar

[ref31] 31. Lung M.L., et al., The interplay of host genetic factors and Epstein-Barr virus in the development of nasopharyngeal carcinoma. Chin J Cancer, 2014. 33(11): p. 556–68. pmid:25367335
View Article
PubMed/NCBI
Google Scholar

[121] View Article

[122] PubMed/NCBI

[123] Google Scholar

[ref32] 32. Fukayama M., Hino R., and Uozaki H., Epstein-Barr virus and gastric carcinoma: virus-host interactions leading to carcinoma. Cancer Sci, 2008. 99(9): p. 1726–33. pmid:18616681
View Article
PubMed/NCBI
Google Scholar

[125] View Article

[126] PubMed/NCBI

[127] Google Scholar

[ref33] 33. Schiffman M., et al., Human papillomavirus and cervical cancer. Lancet, 2007. 370(9590): p. 890–907. pmid:17826171
View Article
PubMed/NCBI
Google Scholar

[129] View Article

[130] PubMed/NCBI

[131] Google Scholar

[ref34] 34. Nulton T.J., et al., Analysis of The Cancer Genome Atlas sequencing data reveals novel properties of the human papillomavirus 16 genome in head and neck squamous cell carcinoma. Oncotarget, 2017. 8(11): p. 17684–17699. pmid:28187443
View Article
PubMed/NCBI
Google Scholar

[133] View Article

[134] PubMed/NCBI

[135] Google Scholar

[ref35] 35. Moscicki A.B., et al., Updating the natural history of human papillomavirus and anogenital cancers. Vaccine, 2012. 30 Suppl 5: p. F24–33.
View Article
Google Scholar

[137] View Article

[138] Google Scholar

[ref36] 36. Li N., et al., Human papillomavirus infection and bladder cancer risk: a meta-analysis. J Infect Dis, 2011. 204(2): p. 217–23. pmid:21673031
View Article
PubMed/NCBI
Google Scholar

[140] View Article

[141] PubMed/NCBI

[142] Google Scholar

[ref37] 37. Li N., et al., Human papillomavirus infection and sporadic breast carcinoma risk: a meta-analysis. Breast Cancer Res Treat, 2011. 126(2): p. 515–20. pmid:20740311
View Article
PubMed/NCBI
Google Scholar

[144] View Article

[145] PubMed/NCBI

[146] Google Scholar

[ref38] 38. Klein F., Amin Kotb W.F., and Petersen I., Incidence of human papilloma virus in lung cancer. Lung Cancer, 2009. 65(1): p. 13–8. pmid:19019488
View Article
PubMed/NCBI
Google Scholar

[148] View Article

[149] PubMed/NCBI

[150] Google Scholar

[ref39] 39. Singh N., et al., Implication of high risk human papillomavirus HR-HPV infection in prostate cancer in Indian population—a pioneering case-control analysis. Sci Rep, 2015. 5: p. 7822. pmid:25592643
View Article
PubMed/NCBI
Google Scholar

[152] View Article

[153] PubMed/NCBI

[154] Google Scholar

[ref40] 40. Monforte A., et al., HIV-induced immunodeficiency and mortality from AIDS-defining and non-AIDS-defining malignancies. AIDS, 2008. 22(16): p. 2143–53. pmid:18832878
View Article
PubMed/NCBI
Google Scholar

[156] View Article

[157] PubMed/NCBI

[158] Google Scholar

[ref41] 41. Grulich A.E., et al., Incidence of cancers in people with HIV/AIDS compared with immunosuppressed transplant recipients: a meta-analysis. Lancet, 2007. 370(9581): p. 59–67. pmid:17617273
View Article
PubMed/NCBI
Google Scholar

[160] View Article

[161] PubMed/NCBI

[162] Google Scholar

[ref42] 42. De Falco G., et al., Interaction between HIV-1 Tat and pRb2/p130: a possible mechanism in the pathogenesis of AIDS-related neoplasms. Oncogene, 2003. 22(40): p. 6214–9. pmid:13679860
View Article
PubMed/NCBI
Google Scholar

[164] View Article

[165] PubMed/NCBI

[166] Google Scholar

[ref43] 43. Nunnari G., Smith J.A., and Daniel R., HIV-1 Tat and AIDS-associated cancer: targeting the cellular anti-cancer barrier? J Exp Clin Cancer Res, 2008. 27: p. 3. pmid:18577246
View Article
PubMed/NCBI
Google Scholar

[168] View Article

[169] PubMed/NCBI

[170] Google Scholar

[ref44] 44. Briggs S.D., et al., SH3-mediated Hck tyrosine kinase activation and fibroblast transformation by the Nef protein of HIV-1. J Biol Chem, 1997. 272(29): p. 17899–902. pmid:9218412
View Article
PubMed/NCBI
Google Scholar

[172] View Article

[173] PubMed/NCBI

[174] Google Scholar

[ref45] 45. Barbaro G., Cardiovascular manifestations of HIV infection. Circulation, 2002. 106(11): p. 1420–5. pmid:12221062
View Article
PubMed/NCBI
Google Scholar

[176] View Article

[177] PubMed/NCBI

[178] Google Scholar

[ref46] 46. Pugliese A., et al., Impact of highly active antiretroviral therapy in HIV-positive patients with cardiac involvement. J Infect, 2000. 40(3): p. 282–4. pmid:10908024
View Article
PubMed/NCBI
Google Scholar

[180] View Article

[181] PubMed/NCBI

[182] Google Scholar

[ref47] 47. Yunis N.A. and Stone V.E., Cardiac manifestations of HIV/AIDS: a review of disease spectrum and clinical management. J Acquir Immune Defic Syndr Hum Retrovirol, 1998. 18(2): p. 145–54. pmid:9637579
View Article
PubMed/NCBI
Google Scholar

[184] View Article

[185] PubMed/NCBI

[186] Google Scholar

[ref48] 48. Hersh B.P., Rajendran P.R., and Battinelli D., Parkinsonism as the presenting manifestation of HIV infection: improvement on HAART. Neurology, 2001. 56(2): p. 278–9. pmid:11160977
View Article
PubMed/NCBI
Google Scholar

[188] View Article

[189] PubMed/NCBI

[190] Google Scholar

[ref49] 49. Koutsilieri E., et al., Parkinsonism in HIV dementia. J Neural Transm (Vienna), 2002. 109(5–6): p. 767–75.
View Article
Google Scholar

[192] View Article

[193] Google Scholar

[ref50] 50. Mirsattari S.M., Power C., and Nath A., Parkinsonism with HIV infection. Mov Disord, 1998. 13(4): p. 684–9. pmid:9686775
View Article
PubMed/NCBI
Google Scholar

[195] View Article

[196] PubMed/NCBI

[197] Google Scholar

[ref51] 51. Kumar A., et al., Tuning of AKT-pathway by Nef and its blockade by protease inhibitors results in limited recovery in latently HIV infected T-cell line. Sci Rep, 2016. 6: p. 24090. pmid:27076174
View Article
PubMed/NCBI
Google Scholar

[199] View Article

[200] PubMed/NCBI

[201] Google Scholar

[ref52] 52. Portis T., Dyck P., and Longnecker R., Epstein-Barr Virus (EBV) LMP2A induces alterations in gene transcription similar to those observed in Reed-Sternberg cells of Hodgkin lymphoma. Blood, 2003. 102(12): p. 4166–78. pmid:12907455
View Article
PubMed/NCBI
Google Scholar

[203] View Article

[204] PubMed/NCBI

[205] Google Scholar

[ref53] 53. Lamers S.L., et al., HIV-1 Nef in macrophage-mediated disease pathogenesis. Int Rev Immunol, 2012. 31(6): p. 432–50. pmid:23215766
View Article
PubMed/NCBI
Google Scholar

[207] View Article

[208] PubMed/NCBI

[209] Google Scholar

[ref54] 54. Plummer M., et al., Global burden of cancers attributable to infections in 2012: a synthetic analysis. Lancet Glob Health, 2016. 4(9): p. e609–16. pmid:27470177
View Article
PubMed/NCBI
Google Scholar

[211] View Article

[212] PubMed/NCBI

[213] Google Scholar

[ref55] 55. Vogt P.K., Retroviral oncogenes: a historical primer. Nat Rev Cancer, 2012. 12(9): p. 639–48. pmid:22898541
View Article
PubMed/NCBI
Google Scholar

[215] View Article

[216] PubMed/NCBI

[217] Google Scholar

[ref56] 56. Stevenson P.G., Simas J.P., and Efstathiou S., Immune control of mammalian gamma-herpesviruses: lessons from murid herpesvirus-4. J Gen Virol, 2009. 90(Pt 10): p. 2317–30. pmid:19605591
View Article
PubMed/NCBI
Google Scholar

[219] View Article

[220] PubMed/NCBI

[221] Google Scholar

[ref57] 57. Parada L.F., et al., Human EJ bladder carcinoma oncogene is homologue of Harvey sarcoma virus ras gene. Nature, 1982. 297(5866): p. 474–8. pmid:6283357
View Article
PubMed/NCBI
Google Scholar

[223] View Article

[224] PubMed/NCBI

[225] Google Scholar

[ref58] 58. de Martel C., et al., Cancers attributable to infections among adults with HIV in the United States. AIDS, 2015. 29(16): p. 2173–81. pmid:26182198
View Article
PubMed/NCBI
Google Scholar

[227] View Article

[228] PubMed/NCBI

[229] Google Scholar

[ref59] 59. Manabe A., et al., Viral Infections in Juvenile Myelomonocytic Leukemia: Prevalence and Clinical Implications. J Pediatr Hematol Oncol, 2004. 26(10): p. 636–641.
View Article
Google Scholar

[231] View Article

[232] Google Scholar

[ref60] 60. Koike K. and Matsuda K., Recent advances in the pathogenesis and management of juvenile myelomonocytic leukaemia. Br J Haematol, 2008. 141(5): p. 567–75. pmid:18422786
View Article
PubMed/NCBI
Google Scholar

[234] View Article

[235] PubMed/NCBI

[236] Google Scholar

[ref61] 61. Kaplan M., et al., EGFR Dynamics Change during Activation in Native Membranes as Revealed by NMR. Cell, 2016. 167(5): p. 1241–1251 e11. pmid:27839865
View Article
PubMed/NCBI
Google Scholar

[238] View Article

[239] PubMed/NCBI

[240] Google Scholar

[ref62] 62. Purba E.R., Saita E.I., and Maruyama I.N., Activation of the EGF Receptor by Ligand Binding and Oncogenic Mutations: The "Rotation Model". Cells, 2017. 6(2).
View Article
Google Scholar

[242] View Article

[243] Google Scholar

[ref63] 63. Glocker E.O., et al., Inflammatory bowel disease and mutations affecting the interleukin-10 receptor. N Engl J Med, 2009. 361(21): p. 2033–45. pmid:19890111
View Article
PubMed/NCBI
Google Scholar

[245] View Article

[246] PubMed/NCBI

[247] Google Scholar

[ref64] 64. Slobedman B., et al., Virus-encoded homologs of cellular interleukin-10 and their control of host immune function. J Virol, 2009. 83(19): p. 9618–29. pmid:19640997
View Article
PubMed/NCBI
Google Scholar

[249] View Article

[250] PubMed/NCBI

[251] Google Scholar

[ref65] 65. DeBruyne L.A., et al., Lipid-mediated gene transfer of viral IL-10 prolongs vascularized cardiac allograft survival by inhibiting donor-specific cellular and humoral immune responses. Gene Ther, 1998. 5(8): p. 1079–87. pmid:10326031
View Article
PubMed/NCBI
Google Scholar

[253] View Article

[254] PubMed/NCBI

[255] Google Scholar

[ref66] 66. Beguelin W., et al., IL10 receptor is a novel therapeutic target in DLBCLs. Leukemia, 2015. 29(8): p. 1684–94. pmid:25733167
View Article
PubMed/NCBI
Google Scholar

[257] View Article

[258] PubMed/NCBI

[259] Google Scholar

[ref67] 67. Pietrek M., et al., Role of the Kaposi's sarcoma-associated herpesvirus K15 SH3 binding site in inflammatory signaling and B-cell activation. J Virol, 2010. 84(16): p. 8231–40. pmid:20534855
View Article
PubMed/NCBI
Google Scholar

[261] View Article

[262] PubMed/NCBI

[263] Google Scholar

[ref68] 68. Myoui A., et al., C-SRC tyrosine kinase activity is associated with tumor colonization in bone and lung in an animal model of human breast cancer metastasis. Cancer Res, 2003. 63(16): p. 5028–33. pmid:12941830
View Article
PubMed/NCBI
Google Scholar

[265] View Article

[266] PubMed/NCBI

[267] Google Scholar

[ref69] 69. Imielinski M., et al., Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell, 2012. 150(6): p. 1107–20. pmid:22980975
View Article
PubMed/NCBI
Google Scholar

[269] View Article

[270] PubMed/NCBI

[271] Google Scholar

[ref70] 70. Campbell K.S., et al., Polyoma middle tumor antigen interacts with SHC protein via the NPTY (Asn-Pro-Thr-Tyr) motif in middle tumor antigen. Proc Natl Acad Sci U S A, 1994. 91(14): p. 6344–8. pmid:8022784
View Article
PubMed/NCBI
Google Scholar

[273] View Article

[274] PubMed/NCBI

[275] Google Scholar

[ref71] 71. Ahn R., et al., The ShcA PTB domain functions as a biological sensor of phosphotyrosine signaling during breast cancer progression. Cancer Res, 2013. 73(14): p. 4521–32. pmid:23695548
View Article
PubMed/NCBI
Google Scholar

[277] View Article

[278] PubMed/NCBI

[279] Google Scholar

[ref72] 72. Dussupt V., et al., The nucleocapsid region of HIV-1 Gag cooperates with the PTAP and LYPXnL late domains to recruit the cellular machinery necessary for viral budding. PLoS Pathog, 2009. 5(3): p. e1000339. pmid:19282983
View Article
PubMed/NCBI
Google Scholar

[281] View Article

[282] PubMed/NCBI

[283] Google Scholar

[ref73] 73. Castiglioni S., Maier J.A., and Mariotti M., The tyrosine phosphatase HD-PTP: A novel player in endothelial migration. Biochem Biophys Res Commun, 2007. 364(3): p. 534–9. pmid:17959146
View Article
PubMed/NCBI
Google Scholar

[285] View Article

[286] PubMed/NCBI

[287] Google Scholar

[ref74] 74. Manteghi S., et al., Haploinsufficiency of the ESCRT Component HD-PTP Predisposes to Cancer. Cell Rep, 2016. 15(9): p. 1893–900. pmid:27210750
View Article
PubMed/NCBI
Google Scholar

[289] View Article

[290] PubMed/NCBI

[291] Google Scholar

[ref75] 75. Stratton M.R., Campbell P.J., and Futreal P.A., The cancer genome. Nature, 2009. 458(7239): p. 719–24. pmid:19360079
View Article
PubMed/NCBI
Google Scholar

[293] View Article

[294] PubMed/NCBI

[295] Google Scholar

[ref76] 76. Forbes S.A., et al., COSMIC: somatic cancer genetics at high-resolution. Nucleic Acids Res, 2017. 45(D1): p. D777–D783. pmid:27899578
View Article
PubMed/NCBI
Google Scholar

[297] View Article

[298] PubMed/NCBI

[299] Google Scholar

[ref77] 77. Aloy P. and Russell R.B., Structural systems biology: modelling protein interactions. Nat Rev Mol Cell Biol, 2006. 7(3): p. 188–97. pmid:16496021
View Article
PubMed/NCBI
Google Scholar

[301] View Article

[302] PubMed/NCBI

[303] Google Scholar

[ref78] 78. Haddad L., et al., KSHV-transformed primary effusion lymphoma cells induce a VEGF-dependent angiogenesis and establish functional gap junctions with endothelial cells. Leukemia, 2008. 22(4): p. 826–34. pmid:18094712
View Article
PubMed/NCBI
Google Scholar

[305] View Article

[306] PubMed/NCBI

[307] Google Scholar

[ref79] 79. Keller S.A., et al., NF-kappaB is essential for the progression of KSHV- and EBV-infected lymphomas in vivo. Blood, 2006. 107(8): p. 3295–302. pmid:16380446
View Article
PubMed/NCBI
Google Scholar

[309] View Article

[310] PubMed/NCBI

[311] Google Scholar

[ref80] 80. Mort M., et al., A meta-analysis of nonsense mutations causing human genetic disease. Hum Mutat, 2008. 29(8): p. 1037–47. pmid:18454449
View Article
PubMed/NCBI
Google Scholar

[313] View Article

[314] PubMed/NCBI

[315] Google Scholar

[ref81] 81. Jones P., et al., InterProScan 5: genome-scale protein function classification. Bioinformatics, 2014. 30(9): p. 1236–40. pmid:24451626
View Article
PubMed/NCBI
Google Scholar

[317] View Article

[318] PubMed/NCBI

[319] Google Scholar

[ref82] 82. McLaren W., et al., The Ensembl Variant Effect Predictor. Genome Biol, 2016. 17(1): p. 122. pmid:27268795
View Article
PubMed/NCBI
Google Scholar

[321] View Article

[322] PubMed/NCBI

[323] Google Scholar

[ref83] 83. Kibbe W.A., et al., Disease Ontology 2015 update: an expanded and updated database of human diseases for linking biomedical knowledge through disease data. Nucleic Acids Res, 2015. 43(Database issue): p. D1071–8. pmid:25348409
View Article
PubMed/NCBI
Google Scholar

[325] View Article

[326] PubMed/NCBI

[327] Google Scholar

[ref84] 84. Rath A., et al., Representation of rare diseases in health information systems: the Orphanet approach to serve a wide range of end users. Hum Mutat, 2012. 33(5): p. 803–8. pmid:22422702
View Article
PubMed/NCBI
Google Scholar

[329] View Article

[330] PubMed/NCBI

[331] Google Scholar

[ref85] 85. Team, R.C., R: A language and environment for statistical computing. 2018, R Foundation for Statistical Computing: Vienna, Austria.

[ref86] 86. Omasits U., et al., Protter: interactive protein feature visualization and integration with experimental proteomic data. Bioinformatics, 2014. 30(6): p. 884–6. pmid:24162465
View Article
PubMed/NCBI
Google Scholar

[334] View Article

[335] PubMed/NCBI

[336] Google Scholar

[ref87] 87. Finn R.D., et al., InterPro in 2017-beyond protein family and domain annotations. Nucleic Acids Res, 2017. 45(D1): p. D190–D199. pmid:27899635
View Article
PubMed/NCBI
Google Scholar

[338] View Article

[339] PubMed/NCBI

[340] Google Scholar

[ref88] 88. Phipson B. and Smyth G.K., Permutation P-values should never be zero: calculating exact P-values when permutations are randomly drawn. Stat Appl Genet Mol Biol, 2010. 9: p. Article39.
View Article
Google Scholar

[342] View Article

[343] Google Scholar

Figures

Abstract

Author summary

Introduction

Results

Disease-annotated, domain-resolved human-virus protein interaction network

Virus-targeted host domains are enriched for virally-implicated disease mutations

EBV.

HPV.

HIV.

Oncoviruses.

Viruses in proliferative and immunological diseases.

Oncovirus-targeted host domains are enriched for cancer driver mutations

Oncovirus-mimicked host domains are enriched for cancer driver mutations

Viral proteins and virally-implicated disease mutations tend to perturb the same domain-domain interactions in the human interactome

Discussion

Methods

Construction of disease-annotated human-virus structural interaction network

Pooled analysis of viral proteins and disease mutations

Classification of viral homology domains

Supporting information

S1 Text. References for virally implicated diseases.

S1 Table. Virally implicated diseases and disease-associated proteins for EBV, HPV and HIV.

S2 Table. Human domains exclusively occurring or enriched in oncoviruses vs. generic viruses.

References