Endemic Burkitt lymphoma (eBL) is primarily found in children in equatorial regions and represents the first historical example of a virus-associated human malignancy. Although Epstein-Barr virus (EBV) infection and MYC translocations are hallmarks of the disease, it is unclear whether other factors may contribute to its development. We performed RNA-Seq on 20 eBL cases from Uganda and showed that the mutational and viral landscape of eBL is more complex than previously reported. First, we found the presence of other herpesviridae family members in 8 cases (40%), in particular human herpesvirus 5 and human herpesvirus 8 and confirmed their presence by immunohistochemistry in the adjacent non-neoplastic tissue. Second, we identified a distinct latency program in EBV involving lytic genes in association with TCF3 activity. Third, by comparing the eBL mutational landscape with published data on sporadic Burkitt lymphoma (sBL), we detected lower frequencies of mutations in MYC, ID3, TCF3 and TP53, and a higher frequency of mutation in ARID1A in eBL samples. Recurrent mutations in two genes not previously associated with eBL were identified in 20% of tumors: RHOA and cyclin F (CCNF). We also observed that polyviral samples showed lower numbers of somatic mutations in common altered genes in comparison to sBL specimens, suggesting dual mechanisms of transformation, mutation versus virus driven in sBL and eBL respectively.
Burkitt lymphoma is endemic in sub-Saharan Africa and affects primarily children of age 4–7 years. Historically, it was one of the first tumors associated with a virus (EBV) and bearing a translocation involving an oncogene, i.e. MYC. There are three distinct clinical variants of Burkitt lymphoma according to the World Health Organization: sporadic, endemic and immunodeficiency-related. Although there has been some recent work on the molecular characterization of sporadic Burkitt lymphomas, little is known about the pathogenesis of endemic cases. In this work, we analyzed 20 samples of RNASeq from Burkitt lymphoma collected in Lacor Hospital (Uganda, Africa) and validated in an extension panel of 73 samples from Uganda and Kenya. We identify the presence in the adjacent non-neoplastic tissue of other herpesviridae family members in 53% of the cases, namely cytomegalovirus (CMV) and Kaposi sarcoma herpesvirus (KSHV). We also demonstrate expression of EBV lytic genes in primary tumor samples and find an inverse association between EBV lytic expression and TCF3 activity. When studying the mutational profile of endemic Burkitt tumors, we find recurrent alterations in genes rarely mutated in sporadic Burkitt lymphomas, i.e. ARID1A, CCNF and RHOA, and lower numbers of mutations in genes previously reported to be commonly mutated in sporadic cases, i.e. MYC, ID3, TCF3, TP53. Together, these results illustrate a distinct genetic and viral profile of endemic Burkitt lymphoma, suggesting a dual mechanism of transformation (mutation versus virus driven in sBL and eBL respectively).
Citation: Abate F, Ambrosio MR, Mundo L, Laginestra MA, Fuligni F, Rossi M, et al. (2015) Distinct Viral and Mutational Spectrum of Endemic Burkitt Lymphoma. PLoS Pathog 11(10): e1005158. https://doi.org/10.1371/journal.ppat.1005158
Editor: Paul M. Lieberman, Wistar Institute, UNITED STATES
Received: March 30, 2015; Accepted: August 19, 2015; Published: October 15, 2015
Copyright: © 2015 Abate et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: RNA-Seq data have been deposited at the NCBI SRA service (accession number PRJNA292327).
Funding: This work was supported by grants from the Italian Association for Cancer Research (AIRC 5x1000, n. 10007) and Programma Strategico “Innovative approaches to the diagnosis and pharmacogenetic based therapies of primary hepatic tumours, peripheral B and T – cell lymphomas and lymphoblastic leukaemias”, Programma di Ricerca Regione-Università 2010–2012: Area 1 “Ricerca Innovativa” (SP); MIUR 2012 (LL); Steward Foundation and NIH 1 U54 CA121852-05 (RR). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Burkitt lymphoma (BL) is the first human cancer to be associated with the Epstein-Barr virus (EBV), the first tumor to exhibit a chromosomal translocation activating an oncogene (MYC), and the first lymphoma to be associated with human immunodeficiency virus (HIV) infection. The World Health Organization classification describes three clinical variants of BL: endemic, sporadic, and immunodeficiency-related. These variants are similar in morphology, immunophenotype, and genetics. While the sporadic variant (sBL) occurs outside of Africa and is rarely associated with EBV infection, the endemic variant (eBL) arises mainly in Africa and is associated with malaria endemicity and EBV infection in almost all cases. Epidemiological studies have shown that malaria and EBV combined do not fully explain the distribution of eBL in high risk regions. Malaria and EBV are in fact ubiquitous within the lymphoma belt of Africa, suggesting that other etiologic agents may be involved. However, it is unclear what other epidemiological factors could play a role in the genesis of eBLs.
Three types of EBV latency have been described in EBV-related lymphomas according to the pattern of EBV nuclear antigen (EBNA) and the latent membrane protein (LMP) expression, namely latency I, II, and III. Specifically, latency I is usually associated with eBL and it denotes a transcriptional program in which an EBV infection does not produce virions and expresses a single protein, EBNA-1. While the latency I program has been extensively characterized in vitro, a different form of latency has been recently reported in 15% of eBL that uses a different set of promoters. Termed Wp-restricted latency, this program shows a homogeneous host expression signature characterized by down-regulation of BCL-6 and up-regulation of IRF-4 and BLIMP-1. Other reports have described latency program heterogeneity at single cell level and low expression of LMP genes in a fraction of cases[8,9]. Heterogeneous EBV transcription profiles with LMP expression have been recently reported in some cases of AIDS-related and sporadic BL, but extensive data on endemic cases are not available yet. These studies indicate that the transcriptional EBV programs of primary eBL could be more complex than expected across cases and within individuals. Therefore, the exact role of EBV has remained elusive and further investigation is required.
The genetic hallmark of all three clinical variants of BL is the t(8;14) translocation involving the juxtaposition of the immunoglobulin heavy chain locus (IGH) with the MYC oncogene. However, although transgenic mice expressing MYC under the control of the intronic IGH enhancer (Eμ) develop B cell lymphomas, successive molecular characterization demonstrated that this model does not fully recapitulate the human disease. The comparison between the gene expression profile (GEP) of BL and diffuse large B-cell lymphoma (DLBCL) highlighted a distinct signature of BL characterized by the expression of both MYC targets and germinal-center B-cell genes. Furthermore, hypermutation and different breakpoint patterns of IGH/MYC translocation[14,15] suggests that the origin of human BL derives from aberrant class switching in the germinal center (GC), while transgenic IGH/MYC mice typically arise from precursor/naive B-cells. The more accurate PI3K/MYC transgenic mouse model by Sander et al better recapitulates the human phenotype of BL and highlights the importance of the PI3K pathway in the disease. Moreover, GEP analysis has demonstrated that the transcriptional profile of eBL is different from that observed in sBL. Recent studies have unveiled the genetic landscape of sBL characterized by mutations affecting the B-cell receptor (BCR) pathway and in particular the transcription factor TCF3, its negative regulator ID3, the cell-cycle G1/S regulator CCND3[18,19], and the chromatin-remodeling gene ARID1A. On the contrary, very little is known about the spectrum of alterations in eBL, how it might differ from that of sBL, the correlations between host mutation and viral infection, and the specific viral/host transcriptional programs.
In this study, we aim to characterize the presence of other potential agents, to define the EBV transcriptional profile and to link these profiles to the mutational status of new and previously reported genes. We provide a characterization of the mutational and viral landscape of eBL using 20 cases from Uganda. RNA-Seq, in combination with targeted sequencing technology on a larger cohort of cases, allows the identification, validation and assessment of the recurrence of new somatic mutations. In addition, in contrast with earlier microarray-based expression studies, RNA-Seq provides the opportunity to identify and associate microbial and tumor mutational and expression profiles.
Endemic BL is associated with multiple viral infections
To identify new pathogens in eBL, we applied Pandora, a new pipeline for the characterization of tumor microbiomes, to a discovery cohort of 20 RNA-Seq samples. We established a read cutoff on the basis of those samples that tested positive for RNA in situ hybridization (ISH) of the EBER transcript. Since ISH validated all the RNA-Seq samples as positive, we established the threshold to call a virus present in a particular sample as the minimal number of reads detecting EBV (S1 Fig). Next, we established the EBV subtype by aligning RNA-Seq reads to the genomes of both EBV type I and type II and deduced type I as the closest genotype. In addition to EBV, RNA-Seq revealed the presence of other viruses. In particular, 5/20 cases contained human herpesvirus 5 (HHV5, cytomegalovirus, CMV), 4/20 human herpesvirus 8 (HHV8, Kaposi sarcoma herpes virus, KSHV), and 1/20 human T-lymphotropic virus 1 (HTLV-1) (Fig 1A, S2 Fig and S3 Fig). Human immunodeficiency virus (HIV) was not detected in any case, confirming that pediatric eBL is rarely associated with the immunodeficiency syndrome. Nested PCR and immunohistochemical (IHC) analysis performed on all 20 original samples confirmed the presence of all the viruses in the discovery cohort (S1A Table). To assess whether RNA-Seq findings generalize for EBV, CMV, KSHV, and HTLV-1, we assayed for the presence of these four viruses in 20 additional cases from western Kenya by IHC (S1B Table). In this Kenyan cohort, EBV was detected in 20/20 samples, CMV in 8/20 samples (Fig 1B and S4 Fig), KSHV in 7/20 samples (Fig 1C and 1D, and S5 Fig), and HTLV-1 in 0/20 samples. Therefore, over the 40 cases, we report the overall viral infection frequencies of 40/40 (100%) for EBV, 13/40 (32.5%) for CMV, 11/40 (27.5%) for KSHV, and 1/40 (2.5%) for HTLV-1. IHC analysis demonstrated the presence of CMV in the stromal cells and macrophages localized within the tumors and in the adjacent reactive lymphoid tissue (Fig 1B, S2 Fig and S4 Fig). KSHV was identified not only in normal B-lymphocytes and endothelial cells from the adjacent reactive lymphoid tissue (S3 Fig and S5 Fig), but also in one case in about 5–10% of neoplastic cells (Fig 1C and 1D). HTLV-1 was detected in reactive T-lymphocytes in the only positive case of the discovery cohort. Sections of the samples incubated with the secondary antibody alone and sections of reactive lymphoid tissue were used as negative controls. Sections of lymph nodes with infectious mononucleosis were used as positive control for EBV. Next, we compared the viral landscape of endemic and sporadic cases by analyzing 27 RNA-Seq sBL samples from Schmitz et al. with Pandora. The analysis showed the presence of EBV and HIV respectively in 4/27 (15%) and in 1/27 (4%) cases, consistent with several literature sources.
Epstein-Barr virus latent and lytic infection in BL
Beyond identification of EBV presence, RNA-Seq enabled us to quantitatively analyze the viral transcriptional program. In addition to EBER-1 and EBER-2 transcripts, expression analysis of the viral genes showed the expression of EBNA-1, a gene associated to latency I type, in 18/20 cases (Fig 2A). We also detected either LMP-1 or LMP-2A, characterizing the latency II type, in 13/20 samples (65%), and also EBNA-2 in 1/20 cases (5%). Interestingly, 2/20 cases (10%) were characterized by the expression of EBNA-3A/B/C/LP, together with the lytic gene BHRF-1, suggesting a Wp-restricted program. However, the specific analysis of EBV isoforms showed the presence of H2-HF splicing event, which is hallmark of lytic BHRF-1 expression[24–26](S6 Fig). Unsupervised hierarchical clustering of expressed EBV genes demonstrated two main clusters distinguished largely by gene products involved in EBV replication (BALF-2, BCRF-1, BHRF-1, BILF-1, BMRF-1, BNLF-2a, BZLF-1). The expression of these genes suggests a non-canonical latency program of the virus with a subset of viral episomes initiating lytic reactivation.
Due to the heterogeneity of the viral transcriptional programs, we aimed to validate the latency type by performing RT-qPCR for the EBNA-1, LMP-1, LMP-2A, EBNA-2, EBNA-3C, and BHRF-1 transcripts across an additional series of 26 cases from an extended cohort of samples from Kenya. EBNA-1 was detected in 26/26 (100%), LMP-1 and LMP-2 in respectively 5/26 (20%) and 20/26 (75%) cases (S3 Table and S7A Fig), EBNA-2 in 0/26 (0%), and the combination of EBNA-3C and BHRF-1 in 4/26 (15%). These results are largely consistent with the RNA-Seq data with the exception of LMP-1 that has been detected at higher frequency in RNA-Seq (S2 Table). Next, we evaluated the lytic cycle activation and found BILF-1, BALF-4, and LF-2 in all 26 cases, whereas we observed the expression of BALF-2 in 23/26 (90%), BHRF-1 in 20/26 (80%), BZLF-1 and BMRF-1 in 15/26 (60%), BNLF-2a in 13/26 (50%), and BCRF-1 in 11/26 (45%) of the cases (S3 Table and S7B–S7D Fig).
We then validated the expression of all the available encoded-proteins by IHC using stringent positive and negative controls as reported in Materials and Methods. Overall, IHC evaluation confirmed a non-canonical latency associated program with the expression of some proteins characterizing latency II (i.e. LMP-1 in 2/26 and LMP-2A in 17/26 of the cases); however, there was heterogeneity in the intensity of protein staining and in the proportion of positive tumor cells. LMP-1 was detected in few cells, whereas LMP-2A was identified in a proportion of cells ranging from 25% to 50% (Fig 2B and 2C). EBV replication was assessed by nuclear expression of the immediate-early BZLF-1/ZEBRA and early BMRF-1/Ea-D, BHRF-1/Ea-R lytic proteins (Fig 2D–2I). There was positive staining in the neoplastic cells for BZLF1, BHRF-1/Ea-R and BMRF-1/Ea-D, respectively in 11/26 (40%), 16/26 (60%), and 13/26 (50%) of the cases (S3 Table).
Finally, we compared the patterns of latent and lytic gene expression between endemic and sporadic BLs using the 4 EBV-positive sBLs of the 27 RNA-Seq samples from Schmitz et al.16 We observed the expression of BHRF-1 and BMRF-1 in 1 case; BZLF-1 was present in 2 cases and LMP-2A in 4 cases.
Mutational landscape of eBL and correlation with EBV presence
To identify the genes that are somatically mutated in eBL, we applied the SAVI algorithm to the cohort of 20 RNA-Seq samples (see Material and Methods for gene selection criteria). Our analysis identified 13 genes recurrently mutated in more than 4 samples. We confirmed the presence of mutations in genes previously reported in BL literature[18,19,28] (Fig 3A–3C and S4 Table), including MYC in 10/20 (50%), DDX3X in 7/20 (35%), ID3 in 6/20 (30%), ARID1A in 5/20 (25%), RHOA in 4/20 (20%), TCF3 and TP53 in 3/20 (15%), CCND3 in 1/20 (5%) of the cases. In addition, we found recurrent mutations in one gene not reported so far: CCNF, detected in 4 out of the 20 cases (20%). Since RHOA mutations have not been previously detected in eBL and CCNF mutation was a new discovery, their prevalence as specific mutations was further assessed using Sequenom technology on an extended panel of 66 neoplastic samples plus 7 cases with matched normal controls (S8A and S8B Fig). Recurrent mutations in RHOA were found in 6/73 eBL cases (8%), and in 0/7 normal samples. Two of the 6 RHOA mutations occurred in paired eBL/normal cases, confirming that the alterations are somatic (S5 Table). Recurrent mutations in codon 451 of CCNF were found in 14/73 eBL cases (19%), and in 0/7 normal samples. One of the 14 CCNF mutations occurred in a paired eBL/normal case, showing that also CCNF alteration is somatic. Direct sequencing of genomic DNA confirmed all the mutations identified by Sequenom tecnnology and RNAseq (S9 Fig and S10 Fig).
The distribution of somatic mutations and viral presence across both eBL and sBL samples exhibit two interesting features (Fig 3B). First, in eBL samples we observed lower mutational frequencies in the genes MYC, ID3, TCF3, DDX3X, CCND3 and TP53, as compared to their reported recurrence in sBL, and higher mutational frequencies in ARID1A, RHOA, and CCNF[18,19]. Second, in sBL cases an almost mutual exclusivity can be seen between EBV presence and mutations in TCF3/ID3 both known to be driver genes in sBL (p-value < 0.02, Fisher exact test). To explore this hypothesis, we performed a hierarchical clustering of both endemic and sporadic cases on TCF3 target genes (previously reported in Schmitz et al.) and we demonstrated that the first bifurcation of the dendrogram classifies the samples into EBV-positive and EBV-negative BL independently on the specific subtype with an accuracy of 96% (45/47). (Fig 4A). The results show that the TCF3 pathway is more activated in EBV-negative cases, as indicated by the significant negative enrichment of TCF3 target genes in EBV-positive samples. Furthermore, we observe that when considering the overall panel of both endemic and sporadic BL samples, the mutually exclusivity between TCF3/ID3 mutations and EBV infection yields a more significant effect (p-value < 0.0008, Fisher exact test). To further investigate the host transcriptional programming related to EBV presence, we performed GSEA C2 analysis on genes differentially expressed between EBV-positive and EBV-negative cases. Interestingly, we detected a significant enrichment for the LMP-1 gene set signature, reported by Sengupta et al.  in nasopharyngeal carcinoma, which is consistent with the detected LMP-1 expression in RNA-Seq data (Fig 4B). Moreover, since 13/20 RNA-Seq cases were positive for LMP2A, we investigated the role of this viral gene in the context of eBL and GSEA C2 analysis has been performed on gene differentially expressed between LMP-2A positive and LMP-2A negative samples. Interestingly, the E2F, E2F3 and cell cycle G1/S gene sets presented the highest significant enrichment score (see S11 Fig), together with the down-regulation of retinoblastoma pathway. These results can be explained as an effect of the interaction between MYC and LMP2A. In fact, previous studies showed that LMP-2A promotes MYC-induced lymphomagenesis, and E2F is a know target of MYC during cell division and proliferation. Moreover, several works associate LMP-2A expression to the PI3K/Akt pathway activation[32–35] and the study from Brennan et al. (Oncogene, 2002 ) shows that the activation of PI3K pathway in lymphoblastoid cell lines can promote E2F transcription activity to affect cell cycle and cellular proliferation.
Over the past few years, the concept that many diseases can be etiologically linked to infection by more than one pathogen has drawn increased attention[37–40]. Whether endemic Burkitt lymphoma should also be considered a polymicrobial disease and what role genetic alterations play in the tumor are still open questions. In this paper, we analyzed the presence of pathogens other than EBV in 40 eBL primary tumors by RNA sequencing, PCR, and immunohistochemistry, and found the presence of CMV and KSHV. We detected these viruses, which are frequently reported in the African population, primarily in the surrounding non-neoplastic tissue. Their prevalence in areas endemic for EBV, along with their absence in the sporadic cases, suggests that CMV or KSHV could contribute to the chronic antigenic stimulation in which eBL occurs.
The presence of these additional cofactors may also induce EBV lytic cycle through B-cell reactivation and spreading EBV infection out of its natural niche of memory B-cells, characterized by a latency 0/I program[42,43]. In fact, in our samples we showed a non-canonical latency program of the virus characterized by a large number of cases expressing LMP-1/-2A/-2B in a significant proportion of cells along with lytic reactivation. Our results are in agreement with recent studies showing more complex EBV protein expression in Akata and Mutu cell lines, commonly used to study the role of EBV in Burkitt lymphoma. By using an alternative approach based on RT-QPCR array platform, Tierney et al. report a quantitative characterization of EBV transcripts in different experimental infection models that were validated in endemic Burkitt lymphoma samples. Interestingly, in this study a significant expression of LMP-2 gene was revealed. Moreover, our results are in accordance with a previously published study in primary AIDS-related lymphomas (ARL) by Arvey and colleagues, although a rigorous comparison is limited by the small number of ARL BLs. All together, our findings confirm recent evidence that LMP-2A cooperates in reprogramming the function of normal B-lymphocytes and enhance MYC driven lymphomagenesis through the activation of PI3K-pathway[45,46]. This pathway is a crucial to MYC mediated transformation as shown by PI3K/MYC transgenic mouse that produces a model that represent a phenocopy of human tumors in terms of histology, gene and protein marker expression, and somatic hypermutations. This scenario suggests that LMP-2A activation of PI3K is an alternative/convergent mechanism to the one driven by TCF3/ID3 mutations.
The expression of genes characterizing the lytic phase of EBV found by RNA-Seq was confirmed by IHC staining for the three main genes involved in the initiation of the lytic phase, BZLF-1/ZEBRA, BMRF-1/Ea-D and BHRF-1/Ea-R. In early latent infection, EBV can be induced to enter the lytic cycle by a variety of causes including B-cell receptor stimulation, Toll-like receptor-9 activation, hypoxia, and growth factors[48,49]. Although lytic infection kills the host cell, it also allows horizontal spread of EBV from cell to cell and may increase the pool of latently infected B-lymphocytes from which transformed cells arise. Additionally, lytically infected B-cells secrete factors that may promote tumorigenesis, including growth and angiogenesis factors and immunosuppressive cytokines. Recent evidence has challenged the view that only the latency phase of EBV infection is significant for the development of EBV-associated malignancies, proposing that lytic EBV replication may be of pathogenic relevance. Humanized mice infected with lytic active viral strains develop more lymphomas than animals infected with replication-defective strains, suggesting that lytic EBV infection may be of importance also in the context of an active immune response. In the present study we gave evidence for the first time that this occurs in vivo in the neoplastic cells of the primary tumors. Physiologically, lytic gene products are expressed in three consecutive stages: immediate-early, early, and late. Immediate-early lytic gene products initiate the process by inducing the activation of transcription of the other genes. Early genes control replication and metabolism of neoplastic cells. Fatty acid synthase expression is induced by the BRLF-1 immediate–early protein, and interestingly BL tumors are characterized by altered lipid metabolism. Late gene products code for viral capsid antigens and proteins involved in immune evasion. BNLF-2A, detected in a significant number of our cases, may protect infected B-cells from immune recognition and elimination. Finally, the EBV transcriptome during the reactivation may involve the contribution of a wide array of other virus-encoded RNAs, such as BARF-0, BARF-1, BcLF-1, and RPMSI-1, that are not translated and may function as non-coding RNA molecules which could participate in regulating gene expression. Heterogeneity in lytic/latent expression programs can be observed not only between patients but also within individual tumors, on a cell-to-cell basis. Intra-patient heterogeneity might be related to the activation of the immune response following the expression of the viral genes. Therefore, the tumor is under selective pressure and needs alternative mechanisms to survive and proliferate.
Our data on the mutational landscape of eBL seems to support this hypothesis. In fact, eBL samples were characterized by a lower number of point mutations in genes previously found altered in sBL, including MYC, ID3, TCF3, DDX3X, CCND3, and TP53. These results are consistent with previous studies by Schmitz et al. in which TCF3/ID3 mutations were more common in sBL (70% of the cases) than eBL (40%). In particular, we observed a near mutual exclusivity between TCF3/ID3 mutations and the presence of EBV, indicating that TCF3 pathway is more significantly activated in EBV-negative cases.
The inverse correlation we observed between the presence and expression of EBV and the number of cellular mutations in the different BL cases, may represent an in vivo picture of the dynamic process by which a neoplastic cell, initially dependent upon EBV, switches-off viral genes and switches-on cellular mutated genes to survive and proliferate. These results are consistent with previous analysis of pediatric BL. Based on our findings, one should infer that eBL may arise from pathogenic pathways that are partially distinct from those driving sBL, suggesting dual mechanisms of transformation in BL, mutationally versus virally driven. On the other hand, ARID1A and RHOA were more often mutated in eBL than in sBL. ARID1A is one of the subunits of the Switch/Sucrose Non-Fermentable (SWI/SNF) chromatin remodeling complex and is currently thought to behave like a tumor suppressor gene. Consistently, ARID1A mutations frequently occur as insertion/deletion, and in most of our cases involved the amino acid G1630. This gene has been reported as frequently mutated in the context of pediatric BL, with a significant association to EBV negative cases, suggesting that the high prevalence in eBL compared to the sBL may be due to the pediatric nature of the endemic case. However, other EBV-associated cancer types show frequent deregulation of ARID1A[57–61]. In particular, in EBV-associated gastric cancer a strong correlation between ARID1A deactivation and EBV presence has been reported[62–64]. RHOA, which belongs to the Ras homolog family, is a small GTP-ase protein recently found to be mutated in three tumors associated with EBV infection, namely peripheral T-cell lymphoma (where it relates to follicular helper T-cells[65–67]), diffuse gastric carcinoma and paediatric sBL. The distribution of RHOA mutations in our cohort overlaps with the already reported mutations (codons 5, 17, 42 and 69) suggesting a similar functional role.
Finally we identified recurrent mutations involving the amino acid R451C in one gene not previously detected in endemic or sporadic BLs, CCNF, altered in 20% of our cases. CCNF encodes a member of the cyclin family belonging to the F-box protein family; it acts as an inhibitor of centrosome reduplication during G2 phase and protects the cell from genome instability. Therefore, it is reasonable that CCNF mutations may cooperate in inducing lymphomagenesis by promoting chromosome instability and a hypermutator phenotype[70,71].
Understanding the mechanisms regulating EBV lymphomagenesis will hopefully lead to the development of highly specific therapies. To avoid the tumor evasion from the already available therapies, we need to identify and target the multiplicity of pathways that are deregulated in the neoplastic cells and decrease tumor survival and proliferation.
Materials and Methods
A total of 20 BL samples preserved in RNAlater (RNA stabilization Reagent-QIAGEN, Valencia, CA) were collected from the Department of Human Pathology of the Lacor Hospital (Uganda, Africa), in endemic areas. For all of them, formalin-fixed and paraffin-embedded (FFPE) samples have been available. All diagnoses were reviewed by 2 expert hematopathologists and were formulated according to the 2008 WHO classification. The clinical and histopathologic characteristics of the 20 BL cases are summarized in S6 Table. Briefly, all cases were t(8;14)-positive, and the immunophenotype was consistent with the diagnosis of BL (CD20 positive, CD10 positive, BCL-6 positive, Ki67> 98%, BCL-2 negative). Epstein-Barr virus was detected by using in situ hybridization with EBER probes (INFORM EBER, Roche Diagnostics, Basel, Switzerland). EBV infection in tumor cells was observed in 100% of the samples, assessed by strong nuclear expression of small EBV-encoded RNA genes, EBER-1 and -2. These cases have been previously studied for gene expression profile analysis and showed a molecular profile consistent with molecular BL.
We used two distinct series of cases for validation of RNA-Seq results. The first included 26 primary tumors collected at the Moi University, Eldoret (Western Kenya). Of these, 20 were used for virus data validation and 26 for EBV latency validation. The second was comprised of 66 neoplastic samples plus 7 cases for which matched normal controls were available (1 liver, 6 lymph nodes) collected from endemic area in Africa, and was used for Sequenom validation.
Total RNA extraction was perfomed by RNeasy Plus Mini Kit(QIAGEN, Valencia, CA) according to the manufacture instructions. The amount and quality of RNA were evaluated by measuring the optical density (OD) at 260 nm, the 260/230 and the 260/280 ratios using a Nanodrop spectrophotometer (ND-100, Nanodrop, Thermo Scientific, Celbio, Italy).
Paired-end libraries (2x75 base pair) were prepared according to the TruSeq RNA sample preparation v2 protocol (Illumina, San Diego, USA). Briefly, 2 μg of Poly(A)+ RNA was purified from total RNA using poly-T oligo attached magnetic beads and then used for fragmentation into 130–290 bp fragments. First, single stranded cDNA was synthesized using reverse transcriptase (SuperScript II, Invitrogen, Life Technologies,USA) and random hexamer priming, followed by generation of double-stranded cDNA. AmpureXP beads (Beckman Coulter, Brea CA) were used to purify the ds cDNA and end repair step was performed to convert the overhangs, resulting from fragmentation, into blunt ends by 3’ to 5’ exonuclease activity. A single “A” nucleotide was added to the 3’ ends of the blunt fragments to prevent them from ligating to one another during the adapter ligation reaction. This approach was adopted to ensure a low rate of chimera (concatenated template) formation. Subsequently, sequencing adapters were added to the ends of the ds cDNA fragment and a PCR reaction was used to selectively enrich those ds cDNA fragments that had adapter molecules on both ends, amplifying the amount of ds cDNA in the final libraries. Lastly, PCR library products were purified by AmpureXP beads and quality control analysis was assessed using a DNA-1000 (Agilent, USA). The quantification was performed by Quant-it PicoGreendsDNA Assay Kit according to manufacturer’s protocol (Invitrogen, Life Technologies,USA). The resulting libraries were sequenced on an Illumina HiScan SQ (Illumina, San Diego, USA) following the manufacturer's instructions.
Point mutation identification using RNASeq
Sequence variants were obtained using the SAVI (Statistical Algorithm for Variant Identification)[72,73] algorithm independently for each sample. Candidate somatic mutations were obtained by eliminating common germline variants (dbSNP 132 and variants from 10 reactive lymph nodes). Genes recurrently mutated in more than 4 samples and expressing the corresponding transcript with RPKM>3 were selected. Mutations occurring in the exact same position in more than 4 samples have been discarded. Conversely, genes previously reported in BL[19,74,75] were selected, even at low recurrence, to allow the comparison between endemic and sporadic subtypes (S3 Table). Sanger sequencing was used for technical validation.
Pandora: A pipeline for pathogen discovery using RNASeq
Characterization of the tumor microbiome is accomplished with Pandora, a new RNA-Seq pipeline for pathogen identification and discovery (S12 Fig). The algorithm takes raw RNA-Seq data as input and outputs annotated microbial spectra present in the tumor sample. Pandora implements a subtractive algorithm consisting of discrete modules. First, the Host Removal phase sequentially aligns the input reads to the host reference using bowtie2, blastN and Megablast, and filters out the data originating from the host. Second, the unaligned (non-host) reads are passed as input to the Microbe Identification phase where the reads are aligned to curated sets of NCBI microbial sequences representing viruses/viroids, bacteria, fungi, and select taxa of eukaryotic parasites. Third, the NCBI records matching each non-host read are input to the Reporting phase where microbial load, gene expression, and relevant clinical parameters are computed as the final output. The microbial load is computed as the number of reads mapping to the organism or virus normalized by the genome length. Gene expression quantification is computed as transcript per million (TPM), which provides a more accurate relative quantification of mRNA abundance compared to other normalization methods such as RPKM.
EBV genome analysis and expression
RNA-Seq reads were aligned to the GRCh37/hg19 reference genome using Bowtie2, Blastn and Megablast. Reads not aligning to homo sapiens (non-host reads) were mapped to human herpesvirus 4, type I (NCBI accession number NC_007605.1) using TopHat, a splicing aware alignment program  (S9 Table and S10 Table). EBV viral gene expression was normalized as transcripts per million (TPM). For each viral product the TPM expression was normalized by the expression of A73 genes, which is consistently expressed in all the HHV4 positive BL samples. Hierarchical clustering was computed with Pearson distance and Ward’s linkage method.
Gene expression analysis was performed on both endemic and sporadic RNA-Seq samples of Burkitt Lymphoma. All the reads were aligned to human reference genome (GRCh37/hg19) by means of TopHat version 1.3.3. Transcript abundance quantification was computed as FPKM using Cufflinks, Cuffquant and Cuffnorm version 2.2.1. Hierarchical clustering was performed with Pearson distance and Ward’s linkage method. Gene set enrichment analysis was obtained by running GSEA software on pre-ranked list of log2 ratio of the FPKM mean fold change between two conditions.
The DNA was extracted from formalin-fixed paraffin embedded (FFPE) of the original neoplastic samples using NucleoSpin Tissue (Machery-Nagel, Italy) following manufacture’s instructions. The amount and quality of DNA were evaluated by measuring the optic density (OD) at 260 nm, the 260/230 and the 260/280 ratios using a Nanodrop spectrophotometer (ND-100, Nanodrop, Thermo Scientific, Celbio, Italy).
To detect the presence of HTLV-1, CMV and KSHV, a nested PCR assay was performed on DNA of original tumor samples as previously reported, (S7 Table and S8 Table). DNA from HTLV-1-positive cells, CMV-positive cells, and KSHV-positive cells were used as positive controls, whereas DNA from HeLa293 cells was used as negative control. Several precautions have been taken to prevent false-positive PCR results: (a) rooms for pre- and post-PCR procedures were physically separated; (b) reagents were prepared in large batches and stored in small aliquots; (c) equipment such as the microcentrifuge, water baths, pipettes, tube racks, and other small equipment was designated for PCR work only; (d) gloves were changed frequently; and (e) aerosol-barrier pipette tips, PCR tubes, and autoclaved, diethylpyrocarbonate-treated water were sterilized by UV irradiation prior to PCR. Finally, 15 μl aliquots of the PCR mixture were electrophoresed on a 2% agarose gel and directly visualized by ethidium bromide staining under ultraviolet light.
Validation in an extended panel
The MassARRAY Assay Design Suite software was used to design 8 different multiplex reactions for investigating 115 SNPs. Genotyping was performed using iPLEX Gold technology 57 MassARRAY high-throughput DNA analysis with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (Sequenom), according to the manufacturer’s protocol. 66 neoplastic cases plus 7 samples with matched normal controls (1 liver, 6 lymph nodes) were analysed.
Real-time quantitative reverse transcription PCR (RT-qPCR)
The expression of EBV-encoded genes (EBNA-1, EBNA-2, EBNA-3c, BALF-2, BALF-4, BCRF-1, BHRF-1, BILF-1, BNLF-2a, BMRF-1, BZLF-1, LMP-1, LMP-2A LF-2), which characterize the different latency programs, has been investigated on an additional series of 26 samples by RT-qPCR using the QuantiTect SYBR Green PCR Kit (Qiagen, CA) as previously reported (S7 Table). All samples were run in triplicate. The stably expressed housekeeping gene hypoxanthine-guanine phosphoribosyltransferase (HPRT) was used as an endogenous control and reference gene for relative quantification of each target gene. The relative expression is expressed as 2ΔCt, where ΔCt is defined as the difference in mean cycle thresholds of the gene of interest and HPRT. The samples were defined as “not expressed” if the ΔCt value exceeded 50 cycles.
To further validate the presence of HTLV-1 and HHV-8, immunohistochemistry for viral products (HTLV-1-TAX 1: 70, Abcam, Cambridge, United Kingdom; HHV8-LANA 1: 50, Leica Biosystems, Newcastle Lid, United Kingdom; HHV8 clone AT4C11 1:50, Abnova, Taipey City, Taiwan) was performed on formalin-fixed paraffed-embedded (FFPE) sections of the original samples and in an additional series of 20 cases. CMV was detected using in situ hybridization (ISH) with Bond ISH Probe.
The protein expression of EBNA-1 (1:150, AbCam, Italy), EBNA-2 (1:100, AbCam, Italy), LMP-1 (1:100, Novus Biologicals, Italy), LMP-2A (1:100, AbCam, Italy), BZLF-1/ZEBRA (1:100, Novus Biologicals, Italy), BMRF-1/Ea-D (1:150, AbCam, Italy), BHRF-1/Ea-R (1:150, Novus Biologicals, Italy), was assessed by immunohistochemistry on FFPE sections of the original samples and on an additional series of 26 primary tumors. Sections of the samples incubated with the secondary antibody alone and sections of reactive lymphoid tissue were used as negative controls. Sections of lymph nodes with infectious mononucleosis were used as positive control. Immunoreactivity was performed on Bond Max automated immunostainer (Leica Microsystem, Bannockburn, IL, USA), with controls in parallel. No epitope retrieval was used. Ultravision Detection System using anti-Polyvalent HRP (LabVision, Fremont, CA, USA) and diaminobenzidine (DAB, Dako, Milan-Italy) as chromogen was used. Two independent investigators assessed immunoreactivity. Case were considered positive when more than 20% of the cells were stained for latent gene products and when more than 5% of the cells were stained for lytic gene products.
Ethics approval for this study was obtained from the Institutional Review Board at the University of Siena (Italy), from the Ethics and Research Committee at the Lacor Hospital (Uganda) and from the Ethics and Research Committee at Moi University, Eldoret (Kenya). Written permission and informed consent have been obtained before sample collection in accordance with the Declaration of Helsinki.
S1 Fig. Number of viral reads per million of human reads across the 20 eBL of the discovery cohort.
The red line indicates the minimal number of viral reads to detect any of the viruses in the corresponding sample.
S2 Fig. CMV detection on discovery cohort samples by IHC.
S3 Fig. KSHV detection on discovery cohort samples by IHC.
S4 Fig. CMV detection on Kenyan cohort samples by IHC.
S5 Fig. KSHV detection on Kenyan cohort samples by IHC.
S6 Fig. The specific analysis of EBV isoforms showed the presence of H2-HF splicing event.
S7 Fig. Validation of latency and lytic genes expression by RT-qPCR for LMP-2A (A), BZLF-1 (B), BMRF-1 (C), BHRF-1 (D) in additional series of 26 cases is shown.
S8 Fig. A-B Results of Sequenom analysis on an extended panel of 66 neoplastic samples plus 7 cases with matched normal controls are demonstrated.
S9 Fig. A-C Examples of mutations in RHOA (A-B) and in CCNF (C) detected by Sequenom technology and validated by Sanger sequencing are shown.
S10 Fig. A-F Examples of mutations in ARID1A (A), DDX3X (B), CCNF (C), RHOA (D), ID3 (E), TCF3 (F) detected by RNA-Seq technique and validated by Sanger sequencing are shown.
S11 Fig. A-B The E2F, E2F3 and cell cycle G1/S gene sets presented the highest significant enrichment score together with the down-regulation of retinoblastoma pathway.
S12 Fig. Characterization of the tumor microbiome by Pandora.
S1 Table. (A) PCR Nested and IHC validation results on original samples. (B) IHC validation results on extended cohort from Kenya.
S2 Table. Validation of latency type by RT-qPCR in comparison with RNA-sequencing results.
S3 Table. Immunohistochemical results in single case.
S4 Table. List of selected mutations detected in RNA-Seq.
S5 Table. List of variants detected through Sequenom technology.
S6 Table. Clinical and histopathologic features of discovery cohort BL patients.
S7 Table. Primers sequences for detection and typing of human lymphotropic herpesviruses.
S8 Table. Primers sequences for detection of EBV-encoded genes.
S9 Table. Complete list of mutations detected in RNA-Seq.
The authors would like to acknowledge Laura Pasqualucci, Riccardo Dalla Favera, and Adolfo Ferrando for helpful discussions.
Conceived and designed the experiments: FA RR SZ LL SP PPP. Performed the experiments: LM MRA SG GDF CB TA SL BJR ME. Analyzed the data: FA SZ RR. Contributed reagents/materials/analysis tools: SP LL. Wrote the paper: FA RR LL SP SZ. Contributed clinical samples: MO VC IN KP DC. Analyzed Sequenom data: PPP MAL MR FF EM.
- 1. World Health O (2008) Report of the ninth meeting of the WHO Technical Advisory Group on Leprosy Control: Cairo, Egypt, 6–7 March 2008. Lepr Rev 79: 452–470. pmid:19274996
- 2. Ogwang MD, Bhatia K, Biggar RJ, Mbulaiteye SM (2008) Incidence and geographic distribution of endemic Burkitt lymphoma in northern Uganda revisited. Int J Cancer 123: 2658–2663. pmid:18767045
- 3. van den Bosch C (2012) A Role for RNA Viruses in the Pathogenesis of Burkitt's Lymphoma: The Need for Reappraisal. Advances in hematology 2012: 494758. pmid:22550493
- 4. Thorley-Lawson DA, Hawkins JB, Tracy SI, Shapiro M (2013) The pathogenesis of Epstein-Barr virus persistent infection. Curr Opin Virol 3: 227–232. pmid:23683686
- 5. Kelly G, Bell A, Rickinson A (2002) Epstein-Barr virus-associated Burkitt lymphomagenesis selects for downregulation of the nuclear antigen EBNA2. Nature medicine 8: 1098–1104. pmid:12219084
- 6. Kelly GL, Stylianou J, Rasaiyaah J, Wei W, Thomas W, et al. (2013) Different patterns of Epstein-Barr virus latency in endemic Burkitt lymphoma (BL) lead to distinct variants within the BL-associated gene expression signature. Journal of virology 87: 2882–2894. pmid:23269792
- 7. Kelly GL, Milner AE, Baldwin GS, Bell AI, Rickinson AB (2006) Three restricted forms of Epstein-Barr virus latency counteracting apoptosis in c-myc-expressing Burkitt lymphoma cells. Proceedings of the National Academy of Sciences of the United States of America 103: 14935–14940. pmid:17001014
- 8. Bell AI, Groves K, Kelly GL, Croom-Carter D, Hui E, et al. (2006) Analysis of Epstein-Barr virus latent gene expression in endemic Burkitt's lymphoma and nasopharyngeal carcinoma tumour cells by using quantitative real-time PCR assays. The Journal of general virology 87: 2885–2890. pmid:16963746
- 9. Niedobitek G, Agathanggelou A, Rowe M, Jones EL, Jones DB, et al. (1995) Heterogeneous expression of Epstein-Barr virus latent proteins in endemic Burkitt's lymphoma. Blood 86: 659–665. pmid:7605996
- 10. Arvey A, Ojesina AI, Pedamallu CS, Ballon G, Jung J, et al. (2015) The tumor virus landscape of AIDS-related lymphomas. Blood 125: e14–e22. pmid:25827832
- 11. Dalla-Favera R, Bregni M, Erikson J, Patterson D, Gallo RC, et al. (1982) Human c-myc onc gene is located on the region of chromosome 8 that is translocated in Burkitt lymphoma cells. Proc Natl Acad Sci U S A 79: 7824–7827. pmid:6961453
- 12. Adams JM, Harris AW, Pinkert CA, Corcoran LM, Alexander WS, et al. (1985) The c-myc oncogene driven by immunoglobulin enhancers induces lymphoid malignancy in transgenic mice. Nature 318: 533–538. pmid:3906410
- 13. Dave SS, Fu K, Wright GW, Lam LT, Kluin P, et al. (2006) Molecular diagnosis of Burkitt's lymphoma. N Engl J Med 354: 2431–2442. pmid:16760443
- 14. Neri A, Barriga F, Knowles DM, Magrath IT, Dalla-Favera R (1988) Different regions of the immunoglobulin heavy-chain locus are involved in chromosomal translocations in distinct pathogenetic forms of Burkitt lymphoma. Proc Natl Acad Sci U S A 85: 2748–2752. pmid:2833750
- 15. Cario G, Stadt UZ, Reiter A, Welte K, Sykora KW (2000) Variant translocations in sporadic Burkitt's lymphoma detected in fresh tumour material: analysis of three cases. Br J Haematol 110: 537–546. pmid:10997962
- 16. Sander S, Calado DP, Srinivasan L, Kochert K, Zhang B, et al. (2012) Synergy between PI3K signaling and MYC in Burkitt lymphomagenesis. Cancer Cell 22: 167–179. pmid:22897848
- 17. Piccaluga PP, De Falco G, Kustagi M, Gazzola A, Agostinelli C, et al. (2011) Gene expression analysis uncovers similarity and differences among Burkitt lymphoma subtypes. Blood 117: 3596–3608. pmid:21245480
- 18. Schmitz R, Ceribelli M, Pittaluga S, Wright G, Staudt LM (2014) Oncogenic mechanisms in Burkitt lymphoma. Cold Spring Harb Perspect Med 4. pmid:24492847
- 19. Schmitz R, Young RM, Ceribelli M, Jhavar S, Xiao W, et al. (2012) Burkitt lymphoma pathogenesis and therapeutic targets from structural and functional genomics. Nature 490: 116–120. pmid:22885699
- 20. Giulino-Roth L, Wang K, MacDonald TY, Mathew S, Tam Y, et al. (2012) Targeted genomic sequencing of pediatric Burkitt lymphoma identifies recurrent alterations in antiapoptotic and chromatin-remodeling genes. Blood 120: 5181–5184. pmid:23091298
- 21. Naresh KN, Raphael M, Ayers L, Hurwitz N, Calbi V, et al. (2011) Lymphomas in sub-Saharan Africa—what can we learn and how can we help in improving diagnosis, managing patients and fostering translational research? Br J Haematol 154: 696–703. pmid:21707579
- 22. Gutierrez MI, Bhatia K, Barriga F, Diez B, Muriel FS, et al. (1992) Molecular epidemiology of Burkitt's lymphoma from South America: differences in breakpoint location and Epstein-Barr virus association from tumors in other world regions. Blood 79: 3261–3266. pmid:1317726
- 23. Kelly GL, Long HM, Stylianou J, Thomas WA, Leese A, et al. (2009) An Epstein-Barr virus anti-apoptotic protein constitutively expressed in transformed cells and implicated in burkitt lymphomagenesis: the Wp/BHRF1 link. PLoS pathogens 5: e1000341. pmid:19283066
- 24. Tierney RJ, Shannon-Lowe CD, Fitzsimmons L, Bell AI, Rowe M (2015) Unexpected patterns of Epstein-Barr virus transcription revealed by a high throughput PCR array for absolute quantification of viral mRNA. Virology 474: 117–130. pmid:25463610
- 25. Oudejans JJ, van den Brule AJ, Jiwa NM, de Bruin PC, Ossenkoppele GJ, et al. (1995) BHRF1, the Epstein-Barr virus (EBV) homologue of the BCL-2 protooncogene, is transcribed in EBV-associated B-cell lymphomas and in reactive lymphocytes. Blood 86: 1893–1902. pmid:7655018
- 26. Lear AL, Rowe M, Kurilla MG, Lee S, Henderson S, et al. (1992) The Epstein-Barr virus (EBV) nuclear antigen 1 BamHI F promoter is activated on entry of EBV-transformed B cells into the lytic cycle. J Virol 66: 7461–7468. pmid:1331531
- 27. Vladimir Trifonov LP, Enrico Tiacci, Brunangelo Falini, Raul Rabadan (2013) Statistical Algorithm for Variant Frequency Identification. BMC Systems Biology In Press.
- 28. Rohde M, Richter J, Schlesner M, Betts MJ, Claviez A, et al. (2014) Recurrent RHOA mutations in pediatric Burkitt lymphoma treated according to the NHL-BFM protocols. Genes Chromosomes Cancer 53: 911–916. pmid:25044415
- 29. Sengupta S, den Boon JA, Chen IH, Newton MA, Dahl DB, et al. (2006) Genome-wide expression profiling reveals EBV-associated inhibition of MHC class I expression in nasopharyngeal carcinoma. Cancer Res 66: 7999–8006. pmid:16912175
- 30. Bultema R, Longnecker R, Swanson-Mungerson M (2009) Epstein-Barr virus LMP2A accelerates MYC-induced lymphomagenesis. Oncogene 28: 1471–1476. pmid:19182823
- 31. Bouchard C, Staller P, Eilers M (1998) Control of cell proliferation by Myc. Trends in Cell Biology 8: 202–206. pmid:9695840
- 32. Portis T, Longnecker R (2004) Epstein-Barr virus (EBV) LMP2A mediates B-lymphocyte survival through constitutive activation of the Ras/PI3K/Akt pathway. Oncogene 23: 8619–8628. pmid:15361852
- 33. Scholle F, Bendt KM, Raab-Traub N (2000) Epstein-Barr virus LMP2A transforms epithelial cells, inhibits cell differentiation, and activates Akt. J Virol 74: 10681–10689. pmid:11044112
- 34. Swart R, Ruf IK, Sample J, Longnecker R (2000) Latent membrane protein 2A-mediated effects on the phosphatidylinositol 3-Kinase/Akt pathway. J Virol 74: 10838–10845. pmid:11044134
- 35. Fukuda M, Longnecker R (2004) Latent membrane protein 2A inhibits transforming growth factor-beta 1-induced apoptosis through the phosphatidylinositol 3-kinase/Akt pathway. J Virol 78: 1697–1705. pmid:14747535
- 36. Brennan P, Mehl AM, Jones M, Rowe M (2002) Phosphatidylinositol 3-kinase is essential for the proliferation of lymphoblastoid cells. Oncogene 21: 1263–1271. pmid:11850846
- 37. Hatton OL, Harris-Arnold A, Schaffert S, Krams SM, Martinez OM (2014) The interplay between Epstein-Barr virus and B lymphocytes: implications for infection, immunity, and disease. Immunologic research 58: 268–276. pmid:24619311
- 38. Chene A, Donati D, Orem J, Mbidde ER, Kironde F, et al. (2009) Endemic Burkitt's lymphoma as a polymicrobial disease: new insights on the interaction between Plasmodium falciparum and Epstein-Barr virus. Seminars in cancer biology 19: 411–420. pmid:19897039
- 39. Moormann AM, Snider CJ, Chelimo K (2011) The company malaria keeps: how co-infection with Epstein-Barr virus leads to endemic Burkitt lymphoma. Current opinion in infectious diseases 24: 435–441. pmid:21885920
- 40. Dolcetti R, Dal Col J, Martorelli D, Carbone A, Klein E (2013) Interplay among viral antigens, cellular pathways and tumor microenvironment in the pathogenesis of EBV-driven lymphomas. Seminars in cancer biology 23: 441–456. pmid:23917255
- 41. Biryahwaho B, Dollard SC, Pfeiffer RM, Shebl FM, Munuo S, et al. (2010) Sex and geographic patterns of human herpesvirus 8 infection in a nationally representative population-based sample in Uganda. J Infect Dis 202: 1347–1353. pmid:20863232
- 42. Kenney SC, Mertz JE (2014) Regulation of the latent-lytic switch in Epstein-Barr virus. Semin Cancer Biol 26: 60–68. pmid:24457012
- 43. Chene A, Donati D, Guerreiro-Cacais AO, Levitsky V, Chen Q, et al. (2007) A molecular link between malaria and Epstein-Barr virus reactivation. PLoS Pathog 3: e80. pmid:17559303
- 44. Lin Z, Wang X, Strong MJ, Concha M, Baddoo M, et al. (2013) Whole-genome sequencing of the Akata and Mutu Epstein-Barr virus strains. J Virol 87: 1172–1182. pmid:23152513
- 45. Fish K, Chen J, Longnecker R (2014) Epstein-Barr virus latent membrane protein 2A enhances MYC-driven cell cycle progression in a mouse model of B lymphoma. Blood 123: 530–540. pmid:24174629
- 46. Dittmer DP (2014) Not like a wrecking ball: EBV fine-tunes MYC lymphomagenesis. Blood 123: 460–461. pmid:24458272
- 47. Sander S, Calado DP, Srinivasan L, Kochert K, Zhang BC, et al. (2012) Synergy between PI3K Signaling and MYC in Burkitt Lymphomagenesis. Cancer Cell 22: 167–179. pmid:22897848
- 48. Kenney SC, Mertz JE (2014) Regulation of the latent-lytic switch in Epstein-Barr virus. Seminars in cancer biology 26: 60–68. pmid:24457012
- 49. Sivachandran N, Wang X, Frappier L (2012) Functions of the Epstein-Barr virus EBNA1 protein in viral reactivation and lytic infection. Journal of virology 86: 6146–6158. pmid:22491455
- 50. Xue SA, Labrecque LG, Lu QL, Ong SK, Lampert IA, et al. (2002) Promiscuous expression of Epstein-Barr virus genes in Burkitt's lymphoma from the central African country Malawi. International journal of cancer Journal international du cancer 99: 635–643. pmid:12115495
- 51. Li Y, Webster-Cyriaque J, Tomlinson CC, Yohe M, Kenney S (2004) Fatty acid synthase expression is induced by the Epstein-Barr virus immediate-early protein BRLF1 and is required for lytic viral gene expression. Journal of virology 78: 4197–4206. pmid:15047835
- 52. Ambrosio MR, Piccaluga PP, Ponzoni M, Rocca BJ, Malagnino V, et al. (2012) The alteration of lipid metabolism in Burkitt lymphoma identifies a novel marker: adipophilin. PloS one 7: e44315. pmid:22952953
- 53. Jochum S, Moosmann A, Lang S, Hammerschmidt W, Zeidler R (2012) The EBV immunoevasins vIL-10 and BNLF2a protect newly infected B cells from immune recognition and elimination. PLoS pathogens 8: e1002704. pmid:22615564
- 54. Jang BG, Jung EJ, Kim WH (2011) Expression of BamHI-A Rightward Transcripts in Epstein-Barr Virus-Associated Gastric Cancers. Cancer research and treatment: official journal of Korean Cancer Association 43: 250–254.
- 55. O'Grady T, Cao S, Strong MJ, Concha M, Wang X, et al. (2014) Global bidirectional transcription of the Epstein-Barr virus genome during reactivation. Journal of virology 88: 1604–1616. pmid:24257595
- 56. Vereide DT, Sugden B (2011) Lymphomas differ in their dependence on Epstein-Barr virus. Blood 117: 1977–1985. pmid:21088132
- 57. Shain AH, Giacomini CP, Matsukuma K, Karikari CA, Bashyam MD, et al. (2012) Convergent structural alterations define SWItch/Sucrose NonFermentable (SWI/SNF) chromatin remodeler as a central tumor suppressive complex in pancreatic cancer. Proc Natl Acad Sci U S A 109: E252–259. pmid:22233809
- 58. Trifonov V, Pasqualucci L, Dalla Favera R, Rabadan R (2013) MutComFocal: an integrative approach to identifying recurrent and focal genomic alterations in tumor samples. BMC systems biology 7: 25. pmid:23531283
- 59. Wang K, Kan J, Yuen ST, Shi ST, Chu KM, et al. (2011) Exome sequencing identifies frequent mutation of ARID1A in molecular subtypes of gastric cancer. Nature genetics 43: 1219–1223. pmid:22037554
- 60. Wiegand KC, Shah SP, Al-Agha OM, Zhao Y, Tse K, et al. (2010) ARID1A mutations in endometriosis-associated ovarian carcinomas. The New England journal of medicine 363: 1532–1543. pmid:20942669
- 61. Abe H, Maeda D, Hino R, Otake Y, Isogai M, et al. (2012) ARID1A expression loss in gastric cancer: pathway-dependent roles with and without Epstein-Barr virus infection and microsatellite instability. Virchows Arch 461: 367–377. pmid:22915242
- 62. Wang K, Kan J, Yuen ST, Shi ST, Chu KM, et al. (2011) Exome sequencing identifies frequent mutation of ARID1A in molecular subtypes of gastric cancer. Nat Genet 43: 1219–1223. pmid:22037554
- 63. Gulley ML (2015) Genomic assays for Epstein-Barr virus-positive gastric adenocarcinoma. Exp Mol Med 47: e134. pmid:25613731
- 64. Wang K, Yuen ST, Xu J, Lee SP, Yan HH, et al. (2014) Whole-genome sequencing and comprehensive molecular profiling identify new driver mutations in gastric cancer. Nat Genet 46: 573–582. pmid:24816253
- 65. Palomero T, Couronne L, Khiabanian H, Kim MY, Ambesi-Impiombato A, et al. (2014) Recurrent mutations in epigenetic regulators, RHOA and FYN kinase in peripheral T cell lymphomas. Nat Genet 46: 166–170. pmid:24413734
- 66. Sakata-Yanagimoto M, Enami T, Yoshida K, Shiraishi Y, Ishii R, et al. (2014) Somatic RHOA mutation in angioimmunoblastic T cell lymphoma. Nat Genet 46: 171–175. pmid:24413737
- 67. Yoo HY, Sung MK, Lee SH, Kim S, Lee H, et al. (2014) A recurrent inactivating mutation in RHOA GTPase in angioimmunoblastic T cell lymphoma. Nat Genet 46: 371–375. pmid:24584070
- 68. Kakiuchi M, Nishizawa T, Ueda H, Gotoh K, Tanaka A, et al. (2014) Recurrent gain-of-function mutations of RHOA in diffuse-type gastric carcinoma. Nat Genet 46: 583–587. pmid:24816255
- 69. D'Angiolella V, Esencay M, Pagano M (2013) A cyclin without cyclin-dependent kinases: cyclin F controls genome stability through ubiquitin-mediated proteolysis. Trends Cell Biol 23: 135–140. pmid:23182110
- 70. D'Angiolella V, Donato V, Vijayakumar S, Saraf A, Florens L, et al. (2010) SCF(Cyclin F) controls centrosome homeostasis and mitotic fidelity through CP110 degradation. Nature 466: 138–142. pmid:20596027
- 71. D'Angiolella V, Donato V, Forrester FM, Jeong YT, Pellacani C, et al. (2012) Cyclin F-mediated degradation of ribonucleotide reductase M2 controls genome integrity and DNA repair. Cell 149: 1023–1034. pmid:22632967
- 72. Tiacci E, Trifonov V, Schiavoni G, Holmes A, Kern W, et al. (2011) BRAF mutations in hairy-cell leukemia. The New England journal of medicine 364: 2305–2315. pmid:21663470
- 73. Trifonov V, Pasqualucci L, Tiacci E, Falini B, Rabadan R (2013) SAVI: a statistical algorithm for variant frequency identification. BMC systems biology 7 Suppl 2: S2. pmid:24564980
- 74. Love C, Sun Z, Jima D, Li G, Zhang J, et al. (2012) The genetic landscape of mutations in Burkitt lymphoma. Nat Genet 44: 1321–1325. pmid:23143597
- 75. Richter J, Schlesner M, Hoffmann S, Kreuz M, Leich E, et al. (2012) Recurrent mutation of the ID3 gene in Burkitt lymphoma identified by integrated genome, exome and transcriptome sequencing. Nat Genet 44: 1316–1320. pmid:23143595
- 76. Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9: 357–359. pmid:22388286
- 77. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215: 403–410. pmid:2231712
- 78. Morgulis A, Coulouris G, Raytselis Y, Madden TL, Agarwala R, et al. (2008) Database indexing for production MegaBLAST searches. Bioinformatics 24: 1757–1764. pmid:18567917
- 79. Wagner GP, Kin K, Lynch VJ (2012) Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory in Biosciences 131: 281–285. pmid:22872506
- 80. Trapnell C, Pachter L, Salzberg SL (2009) TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25: 1105–1111. pmid:19289445
- 81. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, et al. (2010) Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol 28: 511–515. pmid:20436464
- 82. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, et al. (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102: 15545–15550. pmid:16199517
- 83. Pozo F, Tenorio A (1999) Detection and typing of lymphotropic herpesviruses by multiplex polymerase chain reaction. J Virol Methods 79: 9–19. pmid:10328531
- 84. Laurentino RV, Lopes IG, Azevedo VN, Machado LF, Moreira MR, et al. (2005) Molecular characterization of human T-cell lymphotropic virus coinfecting human immunodeficiency virus 1 infected patients in the Amazon region of Brazil. Mem Inst Oswaldo Cruz 100: 371–376. pmid:16113884
- 85. Pellett PE, Spira TJ, Bagasra O, Boshoff C, Corey L, et al. (1999) Multicenter comparison of PCR assays for detection of human herpesvirus 8 DNA in semen. J Clin Microbiol 37: 1298–1301. pmid:10203474
- 86. Qiu J, Smith P, Leahy L, Thorley-Lawson DA (2015) The Epstein-Barr virus encoded BART miRNAs potentiate tumor growth in vivo. PLoS Pathog 11: e1004561. pmid:25590614
- 87. Bell AI, Groves K, Kelly GL, Croom-Carter D, Hui E, et al. (2006) Analysis of Epstein-Barr virus latent gene expression in endemic Burkitt's lymphoma and nasopharyngeal carcinoma tumour cells by using quantitative real-time PCR assays. J Gen Virol 87: 2885–2890. pmid:16963746