Extranodal marginal zone lymphomas (EMZL) are the most common lymphomas in the ocular adnexa. The etiology and potential role for antigenic stimulation in these lymphomas are still controversial. We have examined IGHV gene usage and mutations in 67 Chlamydophila psittaci-negative ocular adnexal EMZL. Clonal IGHV gene sequences were identified in 43 tumors originating from the orbit (19), conjunctivae (18) and lacrimal gland (6). Forty four potentially functional clonal IGHV gene sequences were detected with overrepresentation of the IGHV4 family and IGHV4-34 gene. All but 3 sequences were mutated with the average percent homology to the germ line of 93.5±6.1. Multinomial model and Focused binomial test demonstrated evidence for positive and/or negative antigen selection in 59% of the potentially functional IGHV genes. Intraclonal variation was detected in 8 of 11 tumor specimens. Overall our findings demonstrate that C. psittaci-negative ocular adnexal EMZL exhibit biased usage of IGHV families and genes with evidence for intraclonal heterogeneity and antigen selection in multiple tumors, implicating B-cell receptor-mediated antigen stimulation in the pathogenesis of these lymphomas.
Citation: Zhu D, Lossos C, Chapman-Fredricks JR, Matthews JM, Ikpatt OF, Ruiz P, et al. (2011) Biased Use of the IGHV4 Family and Evidence for Antigen Selection in Chlamydophila psittaci-Negative Ocular Adnexal Extranodal Marginal Zone Lymphomas. PLoS ONE 6(12): e29114. https://doi.org/10.1371/journal.pone.0029114
Editor: Jose Angel Martinez Climent, University of Navarra, Center for Applied Medical Research, Spain
Received: September 21, 2011; Accepted: November 21, 2011; Published: December 27, 2011
Copyright: © 2011 Zhu et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: ISL is supported by National Institutes of Health (NIH) grants NIH CA109335 and NIH CA122105, and the Dwoskin Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Extranodal marginal zone lymphomas (EMZLs) are a distinct subtype of Non-Hodgkin's lymphoma (NHL) typically arising in extranodal sites devoid of significant lymphoid tissue, such as the gastrointestinal tract, thyroid and salivary glands, lungs, skin and the ocular adnexa (lacrimal gland, orbit, conjunctiva and eyelid) , . EMZLs are frequently characterized by an indolent clinical course and often remain localized at their sites of origin for many years. In some locations, these lymphomas are preceded by the acquisition of reactive localized inflammatory infiltrates caused by long-standing chronic infection (e. g. Helicobacter pylori in the stomach) or autoimmune diseases (e.g. Hashimoto's thyroiditis in the thyroid and Sjogren's syndrome in the salivary glands) suggesting pathogenetic dependence on antigen stimulation and/or help provided by the local immune reactions , , , , . Remission of early gastric EMZLs following Helicobacter pylori-eradicating antibiotics supports this mechanism of lymphoma pathogenesis , .
In contrast, the etiology and pathogenesis of ocular adnexal extranodal marginal zone lymphomas (OAEMZLs) are still controversial . In a prospective case-controlled study from Italy, a significant association was demonstrated between exposure to household animals, rural residence, and history of chronic conjunctivitis in patients with OAEMZLs . Furthermore, the Italian group and investigators from several other geographic regions demonstrated Chlamydophila psittaci (C. psittaci) DNA and bacteria present in OAEMZLs , , , , , , suggesting that this pathogen may be implicated in the development of these lymphomas analogously to Helicobacter pylori in gastric EMZLs. However, similar studies performed by us and other investigators on US and non-US-based patients have failed to corroborate the finding of C. psittaci in OAEMZLs, potentially suggesting geographic differences , , , , . Furthermore, DNA from other bacteria was also not detected, which supports a non-bacterial etiology in Florida US patients .
The majority of B-cell lymphomas express a unique clonal surface B-cell receptor containing an immunoglobulin (Ig) heavy chain variable region (IGHV) that may have important implications for tumor diagnosis and pathogenesis. The following approaches can be useful to trace the developmental stage at which neoplastic B cell transformation occurs and provide support for B-cell receptor-mediated direct antigen stimulation of lymphoma cells without prior antigen identification: (1) examining possible bias in IGHV-gene usage; (2) analyzing the distribution of silent (S) and replacement (R) mutations in framework regions (FR) and complementary determining regions (CDR); (3) evaluating the CDR3 sequences; and (4) investigating the presence of intraclonal heterogeneity as a marker of ongoing somatic mutations. Several groups have analyzed the IGHV gene in small cohorts of patients (range 8–26) with OAEMZLs , , , , ; however, these studies have not yielded consistent results. To further elucidate the potential bias in IGHV gene usage and more robustly examine evidence for antigen selection, we analyzed IGHV gene sequences derived from fresh tumor samples in a large cohort of C. psittaci-negative, untreated OAEMZLs.
Usage of IGHV genes in OAEMZLs
DNA extracted from fresh OAEMZL tumors from 67 patients (median age 63, range 24–92; 39 females and 28 males) was used for PCR amplification of the IGHV gene. The tumors originated in the orbit (29), conjunctiva (26), lacrimal gland (11) and eyelid (1). All specimens were negative for C. psittaci. Tumors from 34 of these 67 patients were used in our previous studies evaluating the presence of C. psittaci and other bacteria in OAEMZL , . Agarose gel examination detected a PCR amplicon of the appropriate size for amplified IGHV in all 67 tumors. However, in 24 (36%) tumors, a clonal IGHV-IGHJ PCR sequence could not be identified, despite successful actin amplification, which served as a control for DNA integrity.
A total of 45 clonal IGHV sequences were detected in 43 patients, 23 females and 20 males, with a median age of 63 (range 39–92), representing 64% of the analyzed cohort. In two tumors two clonal sequences were detected in each tumor. Tumors with detected clonal sequences originated from the orbit (19), conjunctiva (18) and lacrimal gland (6). No differences in clinical features and tumor localization between patients with or without clonal IGHV sequences were detected. In 14 (32.6%) cases the PCR product could be sequenced directly, whereas in 29 (67.4%) cases, PCR amplicons had to be subcloned to identify the IGHV-IGHJ gene sequence.
A total of 44 identified clonal IGHV sequences were potentially functional (Table 1), while one sequence encoded by IGHV2-26*1 harbored an out-of-frame junction with a stop codon. This nonfunctional sequence was identified in a tumor that did not harbor another potentially functional clonal IGHV sequence. The 44 potentially functional IGHV were derived from 4 of the 7 human IGHV gene families with the following distribution: IGHV1, 11.4%; IGHV2, 2.2%; IGHV3, 40.9% and IGHV4, 45.5%. In comparison to the relative complexity of functional germline IGHV genes within each family and to the use of IGHV families in peripheral and lymph node lymphocytes in healthy donors , , ,  (Table 2), the IGHV gene family usage by the OAEMZLs was biased with overrepresentation of the IGHV4 gene family (p = 0.001). The IGHJ segments in the potentially functional amplicons were derived from 5 of the 6 families with the following distribution: IGHJ2, 11.4%; IGHJ3, 4.5%; IGHJ4, 43.2%; IGHJ5, 22.7% and IGHJ6, 18.2%. Their usage was also significantly different from the reported repertoire in peripheral blood lymphocytes (p<0.001), with an over-representation of IGHJ2 and IGHJ5 .
The most frequently encountered genes were IGHV4-34 (n = 8), IGHV3-30 (n = 6) and IGHV3-23 (n = 5) (Table 1) which are known to frequently encode autoantibodies , , . IGHV4-34 and IGHV3-30 were used at higher frequency than expected in normal individuals . Specifically, the most commonly used gene, IGHV4-34, represented 18.2% of all the potentially functional IGHV genes identified in this study in contrast to its 3–9% prevalence in adult peripheral B lymphocytes , . Similarly, IGHV4-34 was also detected once (3.2%) among 31 IGHV genes sequenced from normal adult marginal zone lymphocytes .
Mutation Pattern and CDR3 Analyses of Potentially Functional IGHV genes
A total of 41 potentially functional clonal IGHV gene sequences harbored mutations, with 32 exhibiting more than a 2% difference from the most similar germline gene sequences and 9 sequences harboring a 2% or less difference. A total of 3 clonal IGHV gene sequences exhibited a germline sequence without mutations. The average percent homology to the germ line sequence in the 41 potentially functional mutated sequences was 93.5% (range 71.5–99.6). Eight of the 44 (18.2%) potentially functional IGHV gene isolates differed by more than 10% from the most similar germline counterpart. In the two tumors which each had two potentially functional IGHV gene isolates there was a 5 and 14% difference in the mutation load between the two sequences from the same tumor, respectively. Two of the three IGHV3-30 sequences harbored a 2% or less difference from the most similar germline genes, while all the IGHV3-23 and most IGHV4-34 sequences had more than a 2% difference from the most similar germline genes. OAEMZLs did not exhibit excessive accumulation of acceptor sequence motifs for N-glycan addition.
We next examined if the observed mutations in the OAEMZL IGHV regions tend to occur in the RGYW/WRCY sequences usually targeted by AID  and WA/TW sequences usually arising as a polymerase error during repair of the AID-generated lesions . Overall, 51.7% of mutations observed in the OAEMZL cases occurred in AID target/error prone repair regions (30% and 21.7%, respectively), while only 23.4% and 17.7% of the germline genes consisted of areas subjected to AID action and error-prone repair, respectively. Overall, 7.9% of polymerase error-prone repair targets and 8.3% of AID targets were mutated as compared to 5.3% for the rest of the germline sequences, indicating a bias in mutation patterns toward mutational hotspots, as previously reported , .
To analyze for potential antigen selection pressure on the IGHV genes, we applied two algorithms: the Multinomial Model and Focused binomial test (Table 1). Both algorithms were used since presently there is no biologically confirmed method that precisely estimates antigen selection and each of these two most commonly used methods is associated with potential limitations: interaction between positive and negative selections in the multinomial model and underestimation of positive selection in the CDR regions by the Focused binomial test . The Multinomial algorithm for antigen selection revealed evidence for positive selection manifested by an excess of replacement (R) mutations in the CDR exceeding that expected to occur by chance in 15 sequences. Negative selection manifested by a scarcity of R mutations in the FR was detected in 17 sequences. In 7 OAEMZL tumors a concomitant scarcity of R mutations in FRs and excess of R mutations in CDRs were observed. Selection analysis using the Focused binomial test demonstrated evidence of selection in the FR in 13 sequences and in the CDR in 4 sequences. The markedly smaller number of cases with selection in the CDR by the Focused binomial test as compared to the Multinomial model could be predicted, since the former test is known to underestimate positive selection in the CDR . Selection was detected in all 5 IGHV3-23 and 6 of the 8 functional IGHV4-34 sequences by either of the two models.
We performed a search for similar R mutations within the CDRs of isolates belonging to the same IGHV genes (Table 3). Recurrent amino acid changes were observed in the CDR1 and CDR2 of the IGHV4-34, IGHV3-30 and IGHV3-23 sequences. Repeated mutations in the CDR2 region resulting in a loss of serine from the germline region were observed in all three genes, and were replaced with aspartic acid, glycine, alanine, threonine, and asparagine, respectively. Additional observed mutations included changes of isoleucine to valine, asparagine to serine, and threonine to alanine. The prevalence of recurrent amino acid changes was higher in the CDR2 region than the CDR1 region, suggesting that the CDR2 region may markedly contribute to the paratope of the tumor-derived Ig. Next we compared mutation patterns present in the CDR2 of the IGHV4-34 from our 8 OAEMZL tumors to 70 sequences originating from normal peripheral blood, tonsils and marginal zone B cells identified in GenBank (accession numbers are shown in the Table S1). Except for recurrent mutations of serine in the second residue that were observed in both normal B cell and OAEMZL sequences, recurrent mutations of isoleucine in the first residue to valine, serine in the forth and sixth residues to aspartic acid and glycine, respectively, and threonine in the seventh residue to alanine were not observed in normal B cells (data not shown), suggesting that the mutations observed in the OALMZL may be tumor antigen specific. However, future analysis of additional sequences will be required to confirm this observation.
Analysis of the tumor-derived CDR3 sequences revealed low similarity and an absence of stereotyped sequences with no homology to antibacterial and other previously published antibodies. The average CDR3 isoelectric point was 5.95±1.92 (SD). The average CDR3 length was 15.75±3.67 (SD) amino acids, with 19 sequences harboring 10–14 amino acids, 18 sequences harboring 15–19 amino acids and 7 sequences of 20 or more amino acids.
In 22 patients, consisting of 6 cases with 2% or less difference from the most similar germline gene sequences and 16 cases with more than 2% difference from the most similar germline gene sequences, information on clinical presentation and outcome following localized radiation to the eyes was available with a median follow up of 31 months (range 2–121). There was no difference in overall survival and progression free survival between these two groups of patients (not shown); however, the small number of cases and clinical events prevents firm conclusions.
Analysis of intraclonal heterogeneity
To assess the presence of intraclonal heterogeneity, extensive molecular cloning of 11 randomly selected potentially functional IGHV gene isolates with percent homology to the germ line sequence ranging from 81.0 to 94.7 was performed (Table 4 and Figure 1). In three of the tested samples, the extensively mutated clonal IGHV gene isolates did not show intraclonal heterogeneity. In 8 samples intraclonal heterogeneity was detected as manifested by the presence of molecular clones harboring confirmed mutations not observed in the most abundant clonal IGHV gene sequence. While single nucleotide mutations accounted for most of the observed ongoing mutations, deletions were also observed. The extent of intraclonal heterogeneity was limited in 5 of these cases with 1 or 2 additional confirmed mutations, similar to our previous observation in a subset of diffuse large B-cell lymphoma cases . In two tumors (4694 and 13299), 16 and 56 additional confirmed mutations were observed, thus leading to extensive variation between the subclones harboring identical CDR3 regions, similar to intraclonal heterogeneity which is typically observed in follicular lymphoma .
HC- hypothetical molecular clone.
OAEMZLs account for up to 55% of all orbital tumors and 8% of extranodal NHLs . While antigen stimulation, which may be directly mediated by surface B-cell receptor and/or indirectly mediated by T cells, is implicated in the pathogenesis of EMZLs in the thyroid, salivary gland and stomach, the etiology of OAEMZL is still controversial. C. psittaci may function as the antigenic stimulus of OAEMZLs in several geographic regions , , , ; however, it is not detected in the majority of US patient tumors, in which the antigen is unknown , , . Overall, there is only scarce published data on the potential role of antigenic stimulation in this lymphoma.
The present study was undertaken to clarify if there is potential antigen stimulation in C. psittaci-negative OAEMZLs. To this end, molecular analysis of the IGHV region was performed in 67 untreated primary OAEMZL patients. The study demonstrates a biased usage of IGHV families and genes with evidence for intraclonal heterogeneity and antigen selection in multiple tumors, implicating B-cell receptor-mediated antigen stimulation in the pathogenesis of these lymphomas.
In our cohort of C. psittaci-negative OAEMZL patients there was overrepresentation of the IGVH4 family and specifically the IGVH4-34 gene segment. Similar biased usage of the IGVH4-34 gene segment was previously suggested by Bahler et al. in a small cohort of C. psittaci-negative OAEMZL patients from the US . In this study the IGVH4-34 gene segment was detected in 3 of the 10 analyzed cases. In contrast, 4 small studies from Germany and Japan, analyzing a total of 56 tumors (range 8–26) did not demonstrate biased usage of the IGVH4 family and IGVH4-34 gene segment , , , . In these 4 studies the IGVH3 family (64%) was most commonly used. Overrepresentation of IGHV3-30, seen in our cohort, was not observed in these studies. IGHV3-23, commonly used in our cohort, was detected in 7 of these 56 cases but was overrepresented only in a study by Adam et al., accounting for 3 out of the 8 cases . While all the analyzed US cases in both studies were negative for C. psittaci, its presence was not examined in any of the 4 non-US studies , , , . In a recently accepted, but still not published manuscript that mostly analyzed patients from European countries, Dagklis et al. also did not observe biased usage of the IGVH4 family and IGVH4-34 gene segment . Careful re-evaluation of the data in this manuscript discloses that IGVH4-34 gene segment was not detected in any of the 15 C. psittaci-positive OAEMZL tumors, but was observed in 3 of the 21 of the tested C. psittaci-negative OAEMZL. The observed differences in the use of the IGVH families and genes between the US and non-US studies most probably stem from different antigens driving the lymphomagenesis of OAEMZL in distinct geographic locations, but the small number of analyzed cases, usage of archived material as a DNA source and differences in PCR methodologies might also contribute to the observed discrepancies. Further studies of IGHV in OAEMZL patients with and without C. psittaci from different geographic regions using DNA extracted from fresh or frozen tumors will help to clarify the observed discrepancy.
The gene most commonly used by the C. psittaci-negative OAEMZL in our patients as well as patients reported by Bahler et al  was IGVH4-34, accounting for 18.2% of the functional IGVH genes in our cohort. In healthy individuals, IGVH4-34 expressing cells are predominantly identified in naïve lymphocytes and are underrepresented in the germinal center and memory compartments , being used by 3–9% of adult peripheral B lymphocytes , ,  and in one of 31 IGHV genes sequenced from normal adult marginal zone lymphocytes . In lymphomas, the IGVH4-34 gene is overrepresented in chronic lymphocytic leukemia cases with mutated IGVH genes (20%)  and in primary central nervous system diffuse large B-cell lymphomas (DLBCL) (60%) . It is not overrepresented in patients with non- OAEMZLs. In its germline configuration, without contribution of somatic mutation and to a large extent independently of the CDR3 region and of associated light chains, IGVH4-34 frequently encodes intrinsically autoreactive antibodies that recognize the I/i erythrocyte determinants constituting the antigenic target of pathogenic autoantibodies in cold agglutinin disease . The IGVH4-34 gene has also been reported to encode antibodies recognizing auto- allo- and exogenous antigens, such as DNA, Rh, cardiolipin and lipid-A and is overrepresented in patients with systemic lupus erythematosus , . Two other genes (IGVH3-30 and IGVH3-23) frequently used by OAEMZLs analyzed in the present study also frequently encode autoantibodies .
The erythrocyte I/i antigens bind to the FR1 region of the IGVH4-34 , a characteristic attributed to B-cell superantigens, which are supposed to directly activate B cells , , . Among the IGVH4-34 genes cloned from the OAEMZLs analyzed in the present study, one sequence was not mutated, 3 acquired similar mutations in the CDR2 and 3 preserved FR sequences, suggesting a possible role of a yet unknown B cell superantigen that might drive proliferation and lymphoma development.
The majority (93%) of the potentially functional IGVH genes isolated from the OAEMZLs analyzed in this study were mutated as compared to their germline counterparts, suggesting their origin from a cell that had experienced antigen selection during the germinal center reaction. This finding is concordant with previous reports, which also demonstrated somatic mutations in the majority of EMZLs irrespective of the anatomical site of origin , . The observed level of mutations (mean percent homology to germline of 93.5±6.1) is also similar to the mutational load previously reported in both ocular adnexal and other EMZLs , , , , , . However, 15% of the potentially functional IGHV gene isolates in the current study differed by more than 12% from the most similar germline counterparts- an uncommon observation in EMZLs irrespective of the anatomical site of origin. This extensive mutational load together with the evidence for positive and/or negative antigen selection (Table 1) in 59% of the potentially functional IGHV genes by either Multinomial Model and/or Focused binomial test further suggests that antigens play a role in the pathogenesis of these lymphomas. The prevalence of IGHV genes with negative selection may suggest that Ig preservation may be required for preventing the disruption of antigen-mediated signaling that could be harmful to lymphoma cell survival. Observation of intraclonal heterogeneity in the majority of the analyzed OAEMZLs using a very conservative definition in this study that required the presence of confirmed ongoing mutations, may further point to the existence of continuous antigen stimulation in their pathogenesis. Since EMZLs are in most cases considered to be of post germinal center origin, the presence of intraclonal heterogeneity in these tumors may reflect aberrant activation of the somatic mutational machinery or re-entry into germinal centers for further mutations. Indeed, colonization of germinal centers is commonly observed in EMZL biopsies  and may reflect re-entry of malignant cells into germinal centers for additional rounds of antigen selection.
Our methodological approach failed to identify clonal IGHV-IGHJ PCR product in 36% of tumors that were clinically and pathologically similar to the tumors in which clonality was determined. To prevent misidentification of the lymphoma IGHV genes, we used very strict criteria to define clonality which may account for some of the observed failures. However, previous studies reported PCR detection of a monoclonal population in only 25–70% of EMZL cases . The inability to detect monoclonal IGHV-IGHJ gene sequences in these cases may result from the absence of Ig rearrangement, as is rarely observed in lymphomas . We could not amplify the IGHV-IGHJ gene sequences of some of these tumors using the BIOMED-2 primers, while tumor clonality was confirmed by light chain amplification using the same protocol. Alternatively, somatic mutations in the region to which the PCR primers are designed to hybridize may lower amplification efficiency and lead to possible false-negative results.
Overall our findings demonstrate that C. psittaci-negative OAEMZLs exhibit biased usage of IGHV families and genes with evidence for intraclonal heterogeneity and antigen selection in multiple tumors, implicating B-cell receptor-mediated antigen stimulation in the pathogenesis of these lymphomas. The nature of the antigens that potentially play a role in these processes is currently unknown and requires further studies.
Materials and Methods
DNA was extracted from a total of 67 patient fresh biopsy samples at the time of diagnosis (between 1991 and 2011) of OAEMZLs using a commercially available kit (QIAamp; Qiagen, Valencia, CA, USA), as described by the manufacturer. All pathologic specimens were classified according to the WHO 2008 classification on the basis of the morphologic features observed on routinely prepared hematoxylin and eosin–stained slides of formalin-fixed, paraffin-embedded tissues along with immunophenotypic and genotypic results . Flow cytometry immunophenotyping was performed in most cases and clonality was confirmed by either Southern blot analysis or polymerase chain reaction (PCR) for immunoglobulin heavy or light chains in accordance with the BIOMED-2 recommendations . This study was approved by the University of Miami Institutional Review Board (IRB) and written informed consent was obtained according to the approved protocol.
Detection of C. psittaci DNA
Touchdown enzyme time-release polymerase chain reaction (PCR) for detection of C. psittaci DNA was performed as previously reported by us . Blank reactions were always run concomitantly with the DNA from patients' specimens to monitor for possible contamination of PCR reagents and to rule out false-positive results. All reactions were repeated twice.
PCR amplification, cloning and sequencing of IGHV genes
To amplify the IGHV-IGHJ gene sequences, 50–200 ng of DNA were amplified by GoTaq Green Master Mix (Promega, Madison, WI) in a final volume of 50 µl containing 10 pmol of a specific 5′ primer corresponding to one of the 6 human variable immunoglobulin heavy chain family leaders (IGHV1 through IGHV6) and 10 pmol of 3′ antisense JH consensus primer , . IGHV1 leader primer also amplifies sequences from the closely related IGHV7 family. The PCR conditions were: 96°C for 5 minutes, 55°C for 1 minute, 72°C for 3 minutes, 1 cycle; 94°C for 30 seconds, 55°C for 30 seconds, 72°C for 30 seconds, 30 to 35 cycles; and 72°C for 7 minutes. A control with no added template was used in each PCR reaction to exclude the possibility of contamination. DNA integrity from patient samples was verified by amplification of ß-actin using specific PCR primers yielding a 597 base pair (bp) amplicon, as described previously . PCR products were analyzed by 2% agarose gel electrophoresis and stained with ethidium bromide. All bands of the appropriate size were excised from the gels and purified by adsorption to a silica matrix (QIAquick Gel Extraction Kit, Qiagen). Direct DNA sequencing of PCR amplicons was performed on a 373 automatic DNA sequencer (Applied Biosystems, Foster City, CA) using the ABI Prism Big Dye Terminator Kit (Perkin Elmer, Foster City, CA), as recommended by the manufacturer. The same primers used for the PCR were used for sequencing. If a direct DNA sequencing attempt of the PCR amplicon failed to recover an unambiguous sequence, the PCR amplicons were cloned using a TOPO TA cloning kit and OneShot TOP10' chemically competent E. coli cells (Invitrogen, Carlsbad, CA) according to the manufacturer instructions. The colony direct PCR assay was used to determine whether colonies included the correct PCR insert. At least 4 colonies from two independent PCRs were sequenced. Sequences were analyzed with the ImMunoGeneTics V-Quest (IMGT V-QUEST) software (http://www.imgt.org). The sequences were defined as clonal if identical CDR3 sequences were obtained from 2 independent PCR reactions by direct sequencing. If cloning was performed, at least two sequences with identical CDR3 from each of the two independent PCRs (a total of 4 identical sequences) were required for definition of clonal sequence. All the clonal sequences were deposited in GenBank (JN646046-JN646089).
Analysis of the IGHV sequences
Sequences were compared with known germline genes and assigned IGHV and IGHJ germline sequence based on the highest percentage of sequence homology . The length of the CDR3 region and number of somatic mutations were determined. Mutations at the last nucleotide position of the sequenced fragment were excluded from the mutational analysis because they might result from nucleotide deletion at the joining sites . The percent of sequence identity was calculated from the aligned sequences from the beginning of FR1 to the end of FR3. The total number of mutations per variable region (FR1-FR3) were added and normalized to the respective length of each region according to the IMGT numbering scheme to analyze the relative frequency of mutations in the different variable regions. The number of coding (replacement-R) and non-coding (silent-S) mutations were obtained from IMGT V-QUEST tabulated data. Antigen selection analysis was performed using the Multinomial Model  and the Z-Test version of the Focused binomial test . For analysis of recurrent amino acid mutations in the CDR1 and CDR2 regions, sequences derived from the same germline IGVH gene in 4 or more specimens were aligned to their corresponding IMGT assigned germline CDR1 and CDR2 regions by ClustalW (http://www.ebi.ac.uk/Tools/msa/clustalw2/). To determine the CDR3 isoelectric point, the pI/MW application of the Swiss-Prot/TrEMBL software was used (http://web.expasy.org/compute_pi/).
Somatic hypermutations tend to occur at hotspots, such as RGYW/WRCY sequences usually targeted by the mutagenic enzyme AID  and WA/TW sequences usually associated with mutations in the A:T pairs considered to arise as copying errors introduced by a DNA polymerase during repair of AID-generated lesions . To analyze for potential mutation enrichment in these sequences, we have 1) calculated the percentage of mutations at these hotspots relative to the total number of mutations observed in all the analyzed sequences, and 2) examined the prevalence of mutations in these motifs by normalizing to the relative length of these hypermutational hotspots across all the clonal germline sequences detected in the analyzed cohort.
To determine whether homology existed between patients' CDR3 to the CDR3 of previously reported Ig genes, the amino acid sequence of patients' CDR3 region was run through IgBLAST (http://www.ncbi.nlm.nih.gov/igblast/) and TEIRESIAS algorithm (http://cbcsrv.watson.ibm.com/download.phtml.html). The previously reported parameters, enabling the connection of all pairs of sequences that shared at least 50% amino-acid identity and 70% similarity, were used for this analysis .
Analysis of intraclonal heterogeneity
Intraclonal heterogeneity was examined in 11 ocular adnexa MALT lymphoma specimens by repeated cloning and sequencing of at least 9 tumor-derived molecular clones from each specimen. For evaluation of intraclonal heterogeneity, we used previously proposed definitions , : confirmed mutation- a mutation observed more than once in the IGHV gene molecular clones from the same tumor specimen; unconfirmed mutation- a substitution mutation observed in only 1 of the IGHV gene molecular clones from the same tumor specimen. Only the confirmed mutations were considered as evidence of intraclonal heterogeneity while the unconfirmed mutations, which may result from Taq polymerase errors were disregarded.
The usage of the IGHV and IGHJ gene segments in the ocular adnexa MALT lymphoma specimens was compared to their utilization in normal peripheral blood lymphocytes by the X2 test, with p<0.05 defined as statistically significant. The Multinomial Model and the Focused binomial test were applied as previously reported , .
Conceived and designed the experiments: ISL. Performed the experiments: DZ JMM. Analyzed the data: ISL CL DZ JRC-F OFI. Contributed reagents/materials/analysis tools: ISL PR. Wrote the paper: ISL CL JRC-F.
- 1. Swerdlow SH, Campo E, Harris NL, Jaffe ES, Pileri SA, et al. (2008) WHO Classification of Tumours of Haematopoietic and Lymphoid Tissues. Geneva, Switzerland: World Health Organization.
- 2. Isaacson PG, Du MQ (2004) MALT lymphoma: from morphology to molecules. Nat Rev Cancer 4: 644–653.
- 3. Parsonnet J, Isaacson PG (2004) Bacterial infection and MALT lymphoma. N Engl J Med 350: 213–215.
- 4. Wotherspoon AC, Ortiz-Hidalgo C, Falzon MR, Isaacson PG (1991) Helicobacter pylori-associated gastritis and primary B-cell gastric lymphoma. Lancet 338: 1175–1176.
- 5. Schmid U, Helbron D, Lennert K (1982) Development of malignant lymphoma in myoepithelial sialadenitis (Sjogren's syndrome). Virchows Arch A Pathol Anat Histol 395: 11–43.
- 6. Hyjek E, Isaacson PG (1988) Primary B cell lymphoma of the thyroid and its relationship to Hashimoto's thyroiditis. Hum Pathol 19: 1315–1326.
- 7. Stefanovic A, Lossos IS (2009) Extranodal marginal zone lymphoma of the ocular adnexa. Blood 114: 501–510.
- 8. Ferreri AJ, Guidoboni M, Ponzoni M, De Conciliis C, Dell'Oro S, et al. (2004) Evidence for an association between Chlamydia psittaci and ocular adnexal lymphomas. J Natl Cancer Inst 96: 586–594.
- 9. Ferreri AJ, Dolcetti R, Dognini GP, Malabarba L, Vicari N, et al. (2008) Chlamydophila psittaci is viable and infectious in the conjunctiva and peripheral blood of patients with ocular adnexal lymphoma: results of a single-center prospective case-control study. Int J Cancer 123: 1089–1093.
- 10. Decaudin D, Dolcetti R, de Cremoux P, Ponzoni M, Vincent-Salomon A, et al. (2008) Variable association between Chlamydophila psittaci infection and ocular adnexal lymphomas: methodological biases or true geographical variations? Anticancer Drugs 19: 761–765.
- 11. Chanudet E, Zhou Y, Bacon CM, Wotherspoon AC, Muller-Hermelink HK, et al. (2006) Chlamydia psittaci is variably associated with ocular adnexal MALT lymphoma in different geographical regions. J Pathol 209: 344–351.
- 12. Yoo C, Ryu MH, Huh J, Park JH, Kang HJ, et al. (2007) Chlamydia psittaci infection and clinicopathologic analysis of ocular adnexal lymphomas in Korea. Am J Hematol 82: 821–823.
- 13. Rosado MF, Byrne GE Jr, Ding F, Fields KA, Ruiz P, et al. (2006) Ocular adnexal lymphoma: a clinicopathologic study of a large cohort of patients with no evidence for an association with Chlamydia psittaci. Blood 107: 467–472.
- 14. Daibata M, Nemoto Y, Togitani K, Fukushima A, Ueno H, et al. (2006) Absence of Chlamydia psittaci in ocular adnexal lymphoma from Japanese patients. Br J Haematol 132: 651–652.
- 15. Mulder MM, Heddema ER, Pannekoek Y, Faridpooya K, Oud ME, et al. (2006) No evidence for an association of ocular adnexal lymphoma with Chlamydia psittaci in a cohort of patients from the Netherlands. Leuk Res 30: 1305–1307.
- 16. Ruiz A, Reischl U, Swerdlow SH, Hartke M, Streubel B, et al. (2007) Extranodal marginal zone B-cell lymphomas of the ocular adnexa: multiparameter analysis of 34 cases including interphase molecular cytogenetics and PCR for Chlamydia psittaci. Am J Surg Pathol 31: 792–802.
- 17. Matthews JM, Moreno LI, Dennis J, Byrne GE Jr, Ruiz P, et al. (2008) Ocular Adnexal Lymphoma: no evidence for bacterial DNA associated with lymphoma pathogenesis. Br J Haematol 142: 246–249.
- 18. Bahler DW, Szankasi P, Kulkarni S, Tubbs RR, Cook JR, et al. (2009) Use of similar immunoglobulin VH gene segments by MALT lymphomas of the ocular adnexa. Mod Pathol 22: 833–838.
- 19. Coupland SE, Foss HD, Anagnostopoulos I, Hummel M, Stein H (1999) Immunoglobulin VH gene expression among extranodal marginal zone B-cell lymphomas of the ocular adnexa. Invest Ophthalmol Vis Sci 40: 555–562.
- 20. Hara Y, Nakamura N, Kuze T, Hashimoto Y, Sasaki Y, et al. (2001) Immunoglobulin heavy chain gene analysis of ocular adnexal extranodal marginal zone B-cell lymphoma. Invest Ophthalmol Vis Sci 42: 2450–2457.
- 21. Mannami T, Yoshino T, Oshima K, Takase S, Kondo E, et al. (2001) Clinical, histopathological, and immunogenetic analysis of ocular adnexal lymphoproliferative disorders: characterization of malt lymphoma and reactive lymphoid hyperplasia. Mod Pathol 14: 641–649.
- 22. Adam P, Haralambieva E, Hartmann M, Mao Z, Ott G, et al. (2008) Rare occurrence of IgVH gene translocations and restricted IgVH gene repertoire in ocular MALT-type lymphoma. Haematologica 93: 319–320.
- 23. Cook GP, Tomlinson IM (1995) The human immunoglobulin VH repertoire. Immunol Today 16: 237–242.
- 24. Guigou V, Cuisinier AM, Tonnelle C, Moinier D, Fougereau M, et al. (1990) Human immunoglobulin VH and VK repertoire revealed by in situ hybridization. Mol Immunol 27: 935–940.
- 25. Brezinschek HP, Brezinschek RI, Lipsky PE (1995) Analysis of the heavy chain repertoire of human peripheral B cells using single-cell polymerase chain reaction. J Immunol 155: 190–202.
- 26. Bessudo A, Rassenti L, Havlir D, Richman D, Feigal E, et al. (1998) Aberrant and unstable expression of immunoglobulin genes in persons infected with human immunodeficiency virus. Blood 92: 1317–1323.
- 27. Pugh-Bernard AE, Silverman GJ, Cappione AJ, Villano ME, Ryan DH, et al. (2001) Regulation of inherently autoreactive VH4-34 B cells in the maintenance of human B cell tolerance. J Clin Invest 108: 1061–1070.
- 28. Pascual V, Capra JD (1992) VH4-21, a human VH gene segment overrepresented in the autoimmune repertoire. Arthritis Rheum 35: 11–18.
- 29. Kraj P, Friedman DF, Stevenson F, Silberstein LE (1995) Evidence for the overexpression of the VH4-34 (VH4.21) Ig gene segment in the normal adult human peripheral blood B cell repertoire. J Immunol 154: 6406–6420.
- 30. Tierens A, Delabie J, Michiels L, Vandenberghe P, De Wolf-Peeters C (1999) Marginal-zone B cells in the human lymph node and spleen show somatic hypermutations and display clonal expansion. Blood 93: 226–234.
- 31. Martin A, Bardwell PD, Woo CJ, Fan M, Shulman MJ, et al. (2002) Activation-induced cytidine deaminase turns on somatic hypermutation in hybridomas. Nature 415: 802–806.
- 32. Neuberger MS, Di Noia JM, Beale RC, Williams GT, Yang Z, et al. (2005) Somatic hypermutation at A.T pairs: polymerase error versus dUTP incorporation. Nat Rev Immunol 5: 171–178.
- 33. MacDonald CM, Boursier L, D'Cruz DP, Dunn-Walters DK, Spencer J (2010) Mathematical analysis of antigen selection in somatically mutated immunoglobulin genes associated with autoimmunity. Lupus 19: 1161–1170.
- 34. Lossos IS, Alizadeh AA, Eisen MB, Chan WC, Brown PO, et al. (2000) Ongoing immunoglobulin somatic mutation in germinal center B cell-like but not in activated B cell-like diffuse large cell lymphomas. Proc Natl Acad Sci U S A 97: 10209–10213.
- 35. Cleary ML, Meeker TC, Levy S, Lee E, Trela M, et al. (1986) Clustering of extensive somatic mutations in the variable region of an immunoglobulin heavy chain gene from a human B cell lymphoma. Cell 44: 97–106.
- 36. Zhang GS, Winter JN, Variakojis D, Reich S, Lissner GS, et al. (2007) Lack of an association between Chlamydia psittaci and ocular adnexal lymphoma. Leuk Lymphoma 48: 577–583.
- 37. Dagklis A, Ponzoni M, Govi S, Giulia Cangi M, Pasini E, et al. (2011) Immunoglobulin gene repertoire in ocular adnexal lymphomas: hints on the nature of the antigen stimulation. Leukemia. In press.
- 38. Hamblin TJ, Davis Z, Gardiner A, Oscier DG, Stevenson FK (1999) Unmutated Ig V(H) genes are associated with a more aggressive form of chronic lymphocytic leukemia. Blood 94: 1848–1854.
- 39. Thompsett AR, Ellison DW, Stevenson FK, Zhu D (1999) V(H) gene sequences from primary central nervous system lymphomas indicate derivation from highly mutated germinal center B cells with ongoing mutational activity. Blood 94: 1738–1746.
- 40. Pascual V, Victor K, Lelsz D, Spellerberg MB, Hamblin TJ, et al. (1991) Nucleotide sequence analysis of the V regions of two IgM cold agglutinins. Evidence that the VH4-21 gene segment is responsible for the major cross-reactive idiotype. J Immunol 146: 4385–4391.
- 41. Bhat NM, Bieber MM, Hsu FJ, Chapman CJ, Spellerberg M, et al. (1997) Rapid cytotoxicity of human B lymphocytes induced by VH4-34 (VH4.21) gene-encoded monoclonal antibodies, II. Clin Exp Immunol 108: 151–159.
- 42. Levinson AI, Kozlowski L, Zheng Y, Wheatley L (1995) B-cell superantigens: definition and potential impact on the immune response. J Clin Immunol 15: 26S–36S.
- 43. Domiati-Saad R, Lipsky PE (1998) Staphylococcal enterotoxin A induces survival of VH3-expressing human B cells by binding to the VH region with low affinity. J Immunol 161: 1257–1266.
- 44. Domiati-Saad R, Attrep JF, Brezinschek HP, Cherrie AH, Karp DR, et al. (1996) Staphylococcal enterotoxin D functions as a human B cell superantigen by rescuing VH4-expressing B cells from apoptosis. J Immunol 156: 3608–3620.
- 45. Kurosu K, Yumoto N, Furukawa M, Kuriyama T, Mikata A (1998) Low-grade pulmonary mucosa-associated lymphoid tissue lymphoma with or without intraclonal variation. Am J Respir Crit Care Med 158: 1613–1619.
- 46. Thiede C, Alpen B, Morgner A, Schmidt M, Ritter M, et al. (1998) Ongoing somatic mutations and clonal expansions after cure of Helicobacter pylori infection in gastric mucosa-associated lymphoid tissue B-cell lymphoma. J Clin Oncol 16: 3822–3831.
- 47. Bertoni F, Cotter FE, Zucca E (1999) Molecular genetics of extranodal marginal zone (MALT-type) B-cell lymphoma. Leuk Lymphoma 35: 57–68.
- 48. Lossos IS, Okada CY, Tibshirani R, Warnke R, Vose JM, et al. (2000) Molecular analysis of immunoglobulin genes in diffuse large B-cell lymphomas. Blood 95: 1797–1803.
- 49. van Dongen JJ, Langerak AW, Bruggemann M, Evans PA, Hummel M, et al. (2003) Design and standardization of PCR primers and protocols for detection of clonal immunoglobulin and T-cell receptor gene recombinations in suspect lymphoproliferations: report of the BIOMED-2 Concerted Action BMH4-CT98-3936. Leukemia 17: 2257–2317.
- 50. Campbell MJ, Zelenetz AD, Levy S, Levy R (1992) Use of family specific leader region primers for PCR amplification of the human heavy chain variable region gene repertoire. Mol Immunol 29: 193–203.
- 51. Lossos IS, Tibshirani R, Narasimhan B, Levy R (2000) The inference of antigen selection on Ig genes. J Immunol 165: 5122–5126.
- 52. Uduman M, Yaari G, Hershberg U, Stern JA, Shlomchik MJ, et al. (2011) Detecting selection in immunoglobulin sequences. Nucleic Acids Res 39: W499–504.
- 53. Darzentas N, Hadzidimitriou A, Murray F, Hatzi K, Josefsson P, et al. (2010) A different ontogenesis for chronic lymphocytic leukemia cases carrying stereotyped antigen receptors: molecular and computational evidence. Leukemia 24: 125–132.