Antibodies (Abs) produced during HIV-1 infection rarely neutralize a broad range of viral isolates; only eight broadly-neutralizing (bNt) monoclonal (M)Abs have been isolated. Yet, to be effective, an HIV-1 vaccine may have to elicit the essential features of these MAbs. The V genes of all of these bNt MAbs are highly somatically mutated, and the VH genes of five of them encode a long (≥20 aa) third complementarity-determining region (CDR-H3). This led us to question whether long CDR-H3s and high levels of somatic mutation (SM) are a preferred feature of anti-HIV bNt MAbs, or if other adaptive immune responses elicit them in general.
Methodology and Principal Findings
We assembled a VH-gene sequence database from over 700 human MAbs of known antigen specificity isolated from chronic (viral) infections (ChI), acute (bacterial and viral) infections (AcI), and systemic autoimmune diseases (SAD), and compared their CDR-H3 length, number of SMs and germline VH-gene usage. We found that anti-HIV Abs, regardless of their neutralization breadth, tended to have long CDR-H3s and high numbers of SMs. However, these features were also common among Abs associated with other chronic viral infections. In contrast, Abs from acute viral infections (but not bacterial infections) tended to have relatively short CDR-H3s and a low number of SMs, whereas SAD Abs were generally intermediate in CDR-H3 length and number of SMs. Analysis of VH gene usage showed that ChI Abs also tended to favor distal germline VH-genes (particularly VH1-69), especially in Abs bearing long CDR-H3s.
Citation: Breden F, Lepik C, Longo NS, Montero M, Lipsky PE, Scott JK (2011) Comparison of Antibody Repertoires Produced by HIV-1 Infection, Other Chronic and Acute Infections, and Systemic Autoimmune Disease. PLoS ONE 6(3): e16857. https://doi.org/10.1371/journal.pone.0016857
Editor: Pere-Joan Cardona, Fundació Institut Germans Trias i Pujol; Universitat Autònoma de Barcelona CibeRES, Spain
Received: September 1, 2010; Accepted: January 16, 2011; Published: March 30, 2011
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Funding: This work was supported by National Institutes of Health grant AI49111 (J.K.S.; http://www.nih.gov/), Canada Research Chairs (J.K.S.; http://www.chairs-chaires.gc.ca/), the Michael Smith Foundation for Health Research (M.M.; http://www.msfhr.org/), the Natural Sciences and Engineering Research Council of Canada (C.L., M.M., and F.B.; http://www.nserc-crsng.gc.ca/index_eng.asp), and the IRMACS Centre at Simon Fraser University (http://www.irmacs.ca/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
A highly diverse repertoire of antibodies (Abs) is a prerequisite for the adaptive immune system to recognize a vast array of antigens (Ags) and distinguish self from non-self. Three processes contribute to the production of this diverse repertoire: (i) somatic recombination of germline V, D and J genes, (ii) addition and deletion of nucleotides at the V-D, D-J, and V-J junctions, and (iii) somatic hypermutation after Ag stimulation , . The third complementarity-determining region of the Ab heavy chain (CDR-H3) is encoded by the DH gene, parts of the VH and JH genes, and nucleotides added at the junctions between these; it is the most variable region in the Ab, and typically is central to contact with cognate Ag .
A major goal for an HIV vaccine is to elicit Abs that neutralize a broad range of HIV-1 primary isolates. To this end, efforts have been made to identify and use broadly (b) neutralizing (Nt) monoclonal (M) Abs with this activity for epitope-targeted vaccine design . The bNt MAbs identified so far are rare and most of them bear unusually long CDR-H3s. Despite intensive effort, only eight bNt MAbs have been discovered (b12, 2F5, 4E10, 2G12, 447-52D, PG9/PG16, VRC01/02, and HJ16 , , , , ; 5 of which bear CDR-H3s of 20 aa or more, based on the IMGT numbering system). Consistent with this, most HIV-1-infected individuals produce strong strain-specific Nt Ab responses against HIV-1 envelope (Env) soon after initial infection; yet rarely do they develop broad neutralization , , and then only after a year or more .
While high levels of SM have been noted for all bNt MAbs, starting with Kunert et al. , a number of authors have proposed a connection between the length of the CDR-H3 region, and the broad neutralization of these MAbs , , , , , , , , , . Mutagenesis experiments and/or X-ray crystal structures of Fab bound to protein or peptide Ag have implicated their CDR-H3s as being required for neutralization: b12 , , 2F5 , , , 447-52D , and 4E10 , . This has been observed even in cases in which CDR-H3 appears to make minimal or no contact with envelope protein Ag , . It has been speculated that in these cases, CDR-H3 may contact other sites on HIV-1, such as the viral membrane , , , , , , . Nevertheless, it is not clear whether long CDR-H3s are required, in general, for broad neutralization; certainly the exceptions, MAbs 2G12 and VRC01/02, disprove this as an absolute rule for broad Nt activity.
While it is generally acknowledged that high levels of SM are produced by T-cell driven processes in germinal centers, the conditions under which long CDR-H3 Abs appear in adaptive immune responses are less well understood, and this could help explain the origin of bNt Abs during HIV infection. Importantly, the long CDR-H3s have been associated with anti-protein Abs  and anti-viral Abs . Long CDR-H3s found among polyreactive natural Abs ,  and autoreactive Abs produced by naïve B cells in SLE patients  do not carry SMs; yet, those found among the memory B cells in healthy people do . However, analyses to directly associate different types of adaptive immune response with CDR-H3 length have not been reported. Our purpose was to identify the circumstances under which Abs bearing some of the features of bNt MAbs (viz., long CDR-H3s and high levels of SM) appear during adaptive immune responses in humans.
As an initial approach, we examined heavy chain variable (VH) genes expressed by individuals undergoing Ag-specific immune responses, by compiling a database of expressed human VH genes of MAbs for which the Ag specificity of the MAb was known. The MAbs were taken from individuals with chronic infections (ChI), acute infections (AcI), or following immunization (included with AcI MAbs), and systemic autoimmune diseases (SAD). CDR-H3 length, level of SM, and VH-gene usage were compared among MAbs with specificity against self vs. non-self Ag, with specificity against protein vs. non-protein Ag, and/or from different conditions (ChI MAbs, SAD MAbs and AcI MAbs).
Both long CDR-H3s and SMs were strongly associated with protein Ag. Long CDR-H3s were at their highest frequency among ChI MAbs, and less so among SAD and AcI MAbs, whereas SMs were more prevalent in ChI and SAD anti-protein Abs and greatly reduced among anti-protein AcI Abs. Both ChI and AcI Abs tended to use distal VH genes; the use of VH1-69 was especially high among anti-HIV Abs, and was associated with high levels of SM and long CDR-H3s. The picture emerging from this analysis is that Abs bearing high numbers of SMs, long CDR-H3s and the distal gene VH1-69 appear to be selected in chronic vs. acute viral infections. Thus different biological processes, and perhaps different B-cell subsets, such as marginal zone vs. conventional B2 B cells , , (see Baumgarth  for review) could be involved in the earlier vs. later stages of viral infection, respectively.
CDR-H3 length of expressed VH genes in adaptive Ab responses
Table 1 summarizes the analysis of expressed VH genes for several MAb categories with regard to CDR-H3 length, number of SMs relative to the predicted germline VH gene, and the distance in the IgH locus between an Ab's germline VH gene and the VH6-1 gene, the VH gene closest to the DH region. As expected, the bNt HIV MAbs had the longest CDR-H3s with an average length of 20.9 aa. Fig. 1 compares the CDR-H3 length distributions for the bNt HIV MAbs, the non-bNt HIV MAbs (i.e., excluding the bNt HIV MAbs), and the remaining ChI MAbs (excluding all HIV MAbs). It shows that both the bNt and non-bNt HIV MAbs have long CDR-H3s; the difference between these two MAb groups was only marginally significant (Table 1A; unadjusted p = 0.0501), and not significant compared to the Bonferroni corrected value of 0.0018. Thus, CDR-H3 length does not appear to be restricted to broad neutralization. In addition, the CDR-H3s of all of the anti-protein HIV MAbs (mean 17.8 aa) were not significantly longer than those of non-HIV ChI MAbs (16.5 aa). Although our data set has only 34 non-HIV ChI MAbs, 12 of these have CDR-H3s of 19 aa or longer (Fig. 1), placing them in the upper quartile of the 427 Ag-specific MAbs. Thus, long CDR-H3s were associated with all types of ChI MAb, including the anti-HIV MAbs.
HIV Mabs are divided into three categories: bNt HIV MAbs, all HIV MAbs with the bNt HIV MAbs removed (i.e., non-bNt HIV MAbs), and ChI MAbs with the HIV MAbs removed (i.e., non-HIV ChI MAbs). Arrows indicate the mean for each category. Distribution of CDR-H3 length for more than 4000 Abs compiled from IMGT and Kabat databases  is included in Figs. 1–3, as a control comparison (blue line). The average CDR-H3 length of the 425 Ag-specific MAbs was 16.3 aa, which is higher than the 15.2-aa mean reported for 4751 expressed VH sequences compiled from the Kabat and IMGT databases by Zemlin et al. ; this may reflect that our data set included a higher proportion of ChI Abs, both HIV and non-HIV.
Fig. 2 shows the distribution CDR-H3 length for anti-HIV MAbs, partitioned according the region of Env bound; CDR-H3 length was longest for MAbs against the CD4i site (19.6 aa), intermediate for those against the V3 loop (18.5 aa) and CD4bs (18.3 aa), and shortest for the anti-gp41 MAbs (15.9 aa). These distributions were significantly different (χ2 test, p<0.05, PROC FREQ, SAS), indicating that, while the anti-HIV MAbs as a whole bear long CDR-H3s, epitope specificity also shapes CDR-H3 length.
Env sites are categorized as: CD4bs, CD4i, V3 loop, gp41, and control MAbs (blue line) . Arrows indicate the mean for each category.
The observation that long CDR-H3s are associated with chronic viral infections in general led us to compare MAbs from such infections to those from other immune responses, namely SAD and AcI. Categorization of MAbs by these types of immune response showed that the average length of the ChI MAbs (17.6 aa, Table 1B and Fig. 3A) was most different from the AcI MAbs (14.7 aa), with the SAD MAbs being intermediate (15.1 aa). This trend, of the AcI MAbs being the most different from the ChI MAbs, persisted when the categories were further divided into Abs against protein vs. non-protein Ags (Fig. 3B and Table 1B). For example, among anti-protein MAbs, those from the ChI group had significantly longer CDR-H3s than did those from the AcI or SAD groups (p<0.0001). As virtually all of the ChI Abs (except bNt MAb 2G12) and all of the anti-protein AcI Abs are anti-viral, it is striking that a large difference in CDR-H3 length exists between the Abs elicited by the two types of viral infection. In addition, anti-protein MAbs had significantly longer CDR-H3s than MAbs against non-protein Ags (p<0.0001, Table 1C), even with the Bonferroni correction; while the difference between self and non-self Abs was not as great (p<0.005, not significant with the Bonferroni correction). Thus autoimmune status (self vs. non-self) had a much lower effect on CDR-H3 length than did protein vs. non-protein Ag.
A. Distribution of CDR-H3 length for ChI, SAD, and AcI MAbs. B. Distribution of CDR-H3 length for ChI anti-protein, SAD anti-protein, AcI anti-protein, SAD anti-non-protein, and AcI anti-non-protein MAbs. Arrows indicate the means for each category. Control MAbs (blue line) are included for comparison .
SM of expressed VH genes in adaptive Ab responses
That long CDR-H3s were found mainly among anti-protein Abs suggests that Abs with long CDR-H3s are selected by protein Ag, and hence, by T-cell-driven processes; such responses typically occur in germinal centers and involve SMs introduced by activation-induced cytidine deaminase (AID) . Predicting that SMs should increase among Abs bearing long CDR-H3s, we compared the patterns of SM for the same categories of MAb as for CDR-H3 length in Table 1. For many comparisons, the patterns observed for SMs paralleled those observed for CDR-H3 length. For example, the eight bNt HIV MAbs had both the highest average CDRH3 length and the highest average level of SM in VH (mean 53.3). Given the small number of bNt MAbs, these were significantly longer than the non-bNt HIV MAbs (mean 27.3, p = 0.0024), but not when the Bonferroni correction for multiple tests was used. In comparing MAbs from the different types of immune response, the ChI MAbs (SM mean 27.3) were, again, most different from the AcI MAbs (mean 10.9), with the SAD MAbs being intermediate (mean 17.9). The SAD and AcI MAbs showed opposite patterns when they were partitioned according to protein vs. non-protein Ag; for SAD, the SM level among anti-protein MAbs was higher than that of MAbs against non-protein Ags, whereas for AcI MAbs, the level of SM was lower for protein vs. non-protein Ags. The average number of SMs for AcI MAbs against non-protein Ags, which were mainly against streptococcal capsular polysaccharide, was 15.5, intermediate between ChI and SAD non-protein MAbs. However, the level of SM was extremely low for AcI MAbs against protein Ags (mean 7.9), many of which were against rotavirus (more than 40%). This reduction in SMs among the anti-protein AcI MAbs was not restricted to anti-rotavirus MAbs, as when they were excluded from the AcI MAb category, the anti-protein AcI MAbs still had a low number of SMs (mean 7.3). In summary, there was a significant difference between Abs from chronic and acute viral infections, with the latter consistently having much shorter CDR-H3s and far fewer SMs. Little difference in SMs was observed between anti-protein Abs produced by chronic processes (ChI and SAD); both had long CDR-H3s and high levels of SM, and both involve persistent exposure to Ag.
To further analyze the relationship between CDR-H3 length and levels of SM, the MAb dataset was divided into quartiles according to CDR-H3 length, with the shortest quartile (S) having lengths of 13 aa or less, the longest quartile (L) 19 aa or more, and the two middle quartiles comprising the M class (between 14 and 18 aa inclusive, Table 2). While the differences in SM levels observed in Table 1B among the disease conditions or types of Ag specificities generally held across the corresponding quartiles in Table 2, some differences were observed. Table 2B shows little difference in the number of SMs between the anti-protein Abs for all disease categories and little difference in SM levels between the quartiles for medium-length and the longest CDR-H3s; however, the short CDR-H3 quartile tended to have lower levels of SM than did the longer two groups. Furthermore, the trend for MAbs against non-protein Ags was reversed: MAbs in the shortest CDR-H3 quartile tended to have the highest SM levels for both the SAD and AcI categories. Although long CDR-H3 MAbs against non-protein Ags are uncommon, their SM level was lower than that of their short CDR-H3 counterparts. This is consistent with the hypothesis that MAbs against non-protein Ags (even those with long CDR-H3s) may derive from different B-cell subsets and/or different immune processes than the MAbs against protein Ags.
Germline VH-gene usage in adaptive Ab responses
Early reports of VH gene family usage in anti-HIV Abs reported that VH3 was over-utilized and VH4 was under-utilized compared to the naïve repertoire , . Given that our dataset includes Abs of known Ag specificity for several disease conditions, we compared the use of V-gene families and of the specific gene VH1-69 among these conditions (Table 3 and Figure S1). Table 3 shows that there were significant differences when the proportions were tested across each gene family. The proportion of anti-HIV MAbs that use family VH3 genes (29%) was lower than that for SAD MAbs (52%) or AcI MAbs (53%). Concomitantly, use of family VH1 increased for HIV MAbs (38% for HIV compared to 23% for SAD MAbs and 22% for AcI MAbs), whereas the proportion of MAbs using family VH4 was not significantly different among the major categories, ranging from 18 to 25%. Thus, HIV infection was related to an increase in VH1 and decrease in VH3 gene usage that was not apparent for ChI Abs (perhaps because this sample size is small) or other conditions.
We were particularly interested in usage patterns for the VH1-69 germline gene, as it is not commonly used in the naïve repertoire (e.g., Wardemann et al. 2003 ), but is characteristic of Ab repertoires in several disease states (see Discussion). As shown in Table 3, analysis of the 193 HIV MAbs having verifiable germline VH genes revealed that 43 (22%) used VH1-69, whereas only five of 87 (6%) SAD MAbs and three of 113 (3%) AcI MAbs used this VH gene; VH1-69 usage by (non-HIV) ChI MAbs was intermediate, being four of 34 (12%). Thus, HIV MAbs, and, perhaps, ChI MAbs use VH1-69 at a higher frequency than do SAD or AcI MAbs.
Given this distinct difference, we directly compared the features of MAbs that use VH1-69 to those that do not (Table 4). Among ChI MAbs, those that used VH1-69 had significantly longer CDR-H3s (means of 20.1 vs. 17.0, P<0.001), whereas their SMs were not significantly different. Very few SAD or AcI MAbs used VH1-69, so no statistical comparisons could be made within those groups. Among MAbs that use VH genes other than VH1-69, the patterns among the disease categories of ChI, SAD and AcI were similar to those observed in the full data set; the CDR-H3 length of ChI MAbs remained significantly longer than those of SAD and AcI MAbs, and the SMs of all three groups were different. Thus, ChI Abs encoded by VH1-69 appear to have longer CDR-H3s but not more SMs than their counterparts that do not use this germline gene.
VH1-69 is a fairly distal gene, being approximately 764 Kb from VH6-1 ; only five of the approximately 40 functional VH-genes are more distal. Thus we wondered if the use of VH1-69 among Abs bearing long CDR-H3s in HIV infection could be part of a larger trend toward using distal genes in ChI. Table 1A shows that ChI MAbs use the most distal VH genes, consistent with this class having the longest CDR-H3s and the highest frequency of SMs. However, inconsistent with CDR-H3 length and SMs, anti-protein (anti-viral) AcI Abs had intermediate VH gene distances whereas SAD Abs had the most proximal ones. Importantly, the AcI MAbs, only three of which use VH1-69, used distal genes overall; the average distance for 69 anti-protein (antiviral) AcI MAbs is 450 kb, suggesting that distal genes besides VH1-69 may be selected in viral infections of all types.
Table 2 further analyzes the relationship between CDR-H3 length and VH gene distance, dividing the MAbs into short, medium and long. Table 2 shows that the pattern is also not consistent within disease condition. Analyzing CDR-H3 length by quartile, the trend between CDR-H3 length and VH gene distance held for ChI and SAD but not for AcI Abs. The medium and long quartiles of ChI and SAD MAbs used the most distal VH genes, whereas they were used by the short and medium quartiles of anti-protein AcI MAbs. Thus, within the viral infection groups, and following similar trends for SM, distal gene usage was related to CDR-H3 length for the ChI, but not the AcI, MAbs.
The primary motivation for this analysis was to determine if other types of Ab share features with the bNt HIV MAbs, which might help explain the rarity of bNt MAbs in HIV-1 infection. We found that the bNt MAbs most closely resemble the other anti-HIV MAbs, and ChI MAbs as a group, in being enriched for long CDR-H3s and high numbers of SMs; this indicates that these features are not limited to broad neutralization, but appear to be common characteristics of the Abs involved in chronic viral infections. That all anti-HIV MAbs, including the bNt MAbs, share similar features indicates that unusual immunological processes, such as breaking of tolerance , are probably not responsible for the rarity of the bNt Abs during chronic HIV infection. Instead, processes involved in chronic viral infections in general may be at play in shaping the repertoire of Abs available for selection after viral persistence and/or multiple rounds of viral escape; such processes are probably linked to broadening of the Ab response beyond those involved in the response to initial infection . Our results are consistent with the view that Abs having the features of the bNt MAbs are not rare, but arise as a result of chronic viral infection.
Strikingly, the Abs from acute viral infections (anti-protein AcI Abs) had significantly shorter CDR-H3s and lower numbers of SMs than did the ChI Abs. In addition, while both types of Ab tended to use distal VH genes, Abs from acute viral infections bearing long CDR-H3s tended not to use the most distal VH genes, nor did they use VH1-69 to the same extent as the ChI (and HIV) Abs. We speculate that, if the Abs involved in acute viral infections reflect those produced during the early phases of chronic viral infection, a shift in expressed VH gene composition (i.e., CDR-H3 length and VH gene usage) must occur over time, along with an increase in SMs.
High SMs, but not long CDR-H3s nor use of distal VH genes, were also found among SAD-related anti-protein MAbs. This lack of shared features between the SAD and bNt MAbs (or ChI MAbs in general) suggests that the bNt MAbs against HIV are probably not drawn from an initial pool of autoimmune B cells bearing long CDR-H3s, as previously hypothesized , , ; were that the case, then the bNt MAbs would be expected to be similar to the SAD MAbs but not the ChI ones. Clearly, Abs having the features of the bNt MAbs are not rare, and are routinely produced during ChIs. Thus, it seems more likely that the rarity of the bNt HIV Abs results from the cryptic, flexible and/or transient nature of conserved epitopes on the neutralization-competent structure of HIV Env; such epitopes are not immunodominant on the virus, nor on the envelope “debris” shed by infected cells, and as such, multiple rounds of viral escape are likely required before the immune system can mount an effective Ab response against them.
This study is to our knowledge the first to explicitly compare gene family usage in MAbs from HIV with those from other types of immune response. We observed a bias toward family VH1 and against family VH3 genes in the HIV and other ChI MAbs. The increased usage of family VH1 agrees with Scheid et al. ; and removal of the large number of Abs from Scheid et al. did not affect this conclusion (analysis not shown). A deficit in family VH3 usage associated with HIV infection has been reported in several studies , , , , . This deficit is consistent with the suggestion that HIV-1 gp120 acts as a superAg that specifically deletes B cells bearing Abs encoded by genes from family VH3 , . In addition, and in contrast to some previous findings , we did not observe over-utilization of the VH4 family, which remained mostly constant across all MAb categories.
We observed an overabundance of VH1-69 gene usage in the HIV MAbs and among ChI MAbs in general, compared to the naïve repertoire reported from other studies , and to the other MAb categories in our database. Our results extend the observations of Huang et al. , who noted that nine of twelve MAbs against the CD4i site of gp120, used VH1-69, and those of Gorny et al. , who showed that MAbs against all HIV Env epitopes, except the V3 loop, are enriched for VH1-69 usage. In addition, three studies have noted almost exclusive use of VH1-69 among cross-protective MAbs against influenza virus , , .
Both AcI and ChI MAbs tended to use distal VH genes, but only in the latter group were long CDR-H3s present in Abs encoded by distal VH genes, which tended to be VH1-69. Three mechanisms can produce long CDR-H3s: (i) longer VH, DH, or JH genes can be used preferentially, (ii) CDR-H3 can be lengthened by insertions induced by activation-induced cytidine deaminase , , and (iii) secondary rearrangement (or receptor editing or revision , ) can result in N and P additions at the N1 junction. Secondary VH-gene rearrangement necessarily involves the use of distal VH and JH genes, because once a VH gene is somatically recombined with a DH gene (i.e., after primary V-D-J rearrangement), only genes more distal to the DH region are available for further joining. The features of the ChI Abs alone are consistent with secondary rearrangement model, in both using distal VH genes and having long CDR-H3s for the same Ab population. Thus, viral infection appears to select for distal VH genes, but if secondary rearrangement is playing a role in lengthening CDR-H3, it appears to be doing so only for the ChI Abs.
Many of our conclusions should be interpreted with caution, given that they are based on a limited dataset that may be biased in several ways. For example, there was significant bias related to Ag specificity for several categories of MAb; many of the anti-HIV MAbs were against the gp120 CD4bs, and were obtained via phage-displayed Ab libraries; most of the SAD MAbs were against the non-protein Ags DNA and phospholipid/cardiolipin; whereas most of the AcI MAbs were against streptococcal capsular polysaccharide and rotavirus. Another potential bias is related to the limited number of SADs we studied, with the preponderance being SLE and anti-phospholipid syndrome. Since we are studying MAbs, it is important to realize that particular antibodies are often selected for further study based on characteristics such as strength of binding, isotype or epitope, and thus the data set is not random with respect to these parameters. This is one reason why we adopted a conservative approach, and emphasize those results that satisfy a Bonferroni-adjusted p value based on the total number of tests conducted in Table 1. A larger dataset, indexed by disease and clinical condition, would overcome many of these potential biases. One roadblock to such a compilation is that many researchers do not routinely submit expressed sequences to public databases; this will become especially critical as high-throughput methods are employed to survey large sets of disease-specific MAbs.
This analysis of 427 Ag-specific MAbs should directly inform vaccine research. For example, the result that long CDR-H3s are associated with chronic and persistent Ag and anti-protein Abs motivates several questions. Are Abs bearing long CDR-H3s present at the beginning of an immune response, or do they “evolve” over time? If they accumulate over time, then are they directly selected from a pre-existing minor compartment within the naïve B-cell populations, do they comprise a specially recruited B-cell subset, and/or do they evolve by secondary processes (e.g., VH-gene replacement, DNA insertion, or gene conversion)? All of the bNt MAbs against HIV are heavily mutated and five of the eight have long CDR-H3s. This line of reasoning raises the possibility that long CDR-H3s are not required to bind conserved epitopes on the HIV-1 envelope, but arise instead through processes that come into play during long-term persistence of protein Ag and viral escape. If so, an effective HIV vaccine may produce bNt Abs via "normal" immunization processes, by virtue of enhancing the immunogenicity of Nt sites on Env. Given this scenario, it remains unknown if Abs bearing the features of acute antiviral Abs (which we expect to be similar to the features of Abs elicited by a traditional vaccine) can act as bNt Abs. While bNt Abs have yet to be elicited by vaccines meant to mimic the epitopes on Env that mediate neutralization by the bNt MAbs, that should not be taken as evidence that they cannot be so produced. Our results indicate that HIV vaccine research should continue to follow “reverse vaccinology” approaches  that attempt to make the sites recognized by the bNt MAbs immunodominant , . Progress in this approach has recently been observed with an influenza vaccine that elicts broadly protective Abs . Conversely, it is also possible, but not proven, that “chronic” type Abs bearing the features of the bNt MAbs will be required for broad neutralization. If this is the case, then research into the cellular and genetic origins of such Abs is required. Thus our second recommendation is for research efforts to be expanded in this area, with the goal of developing vaccination strategies that stimulate key features of these chronic processes, and in so doing, elicit bNt Abs.
Materials and Methods
Heavy chain sequences of expressed MAbs were retrieved from the IMGT/LIGM-DB on-line database (http://imgt.cines.fr/), from the literature, and from direct contacts with researchers (see Table S1). Our goal was to collect VH sequences for all of the available human HIV MAbs. The Ag targets of the HIV MAbs included gp120, its CD4 binding site (CD4bs) and CD4 inducible site (CD4i), the gp120 V3 loop, gp41, Rev, Tat, p24, and p25. This MAb dataset was expanded to include human MAbs associated with other chronic infections (the ChI MAbs), including those against Epstein Barr virus, hepatitis B and C virus, herpes simplex virus and human cytomegalovirus. (Note that all of these MAbs are from viral infections.) For comparison, a similar group of MAbs from Systemic Autoimmune Disease (SAD) was assembled, including from systemic lupus erythematosus (SLE), anti-phospholipid syndrome, mixed connective tissue disease, rheumatoid arthritis, Sjögren's disease, and cold agglutin disease, and against the Ags, cardiolipin (serum dependent and serum independent), phospholipids, DNA, beta-2-glycoprotein, Sm ribonucleoprotiens, myelin basic protein, myelin-associated glycoprotein, achetylcholine receptor, Ro/SSA and La/SSB. We concentrated on SAD MAbs based on the hypothesis that the bNt anti-HIV MAbs were derived from autoAb/autoreactive precursors , , . In addition, VH sequences were collected for MAbs associated with acute infections (Pseudomonas aeruginosa, rotavirus, Pneumococcus pneumoniae, Ebola virus, Neisseria meningitidis, hepatitis A virus), and from vaccinated individuals (Haemophilus influenzae Type b conjugate vaccine, 23-valent pneumococcal polysaccharide vaccine, Streptococcus pneumonia, tetanus toxoid, hepatitis B surface Ag), reported as the AcI MAbs. For each MAb, we attempted to obtain information on its isotype, Ag specificity, the methods used to obtain it (e.g., phage display, B-cell sorting, etc.), clinical data on the source-subject, and the bibliographic reference and GenBank accession number for the original MAb sequence. This information was entered into an Excel database by hand.
Nucleotide sequences were analyzed using a recent version of JoinSolver (http://joinsolver.niams.nih.gov/index.htm; ), which provides the closest-matched VH, DH and JH genes, determines the limits of the CDR-H3 region, the length (in amino acids) of CDR-H3 region, the contributions of P and N nucleotides at both the V-D and D-J junctions, and the number of SMs in the MAbs relative to the predicted germline genes (this number is defined as of the number of base pair substitutions relative to a predicted germline gene). These results, including nucleotide sequence for CDR-H3 region, were also entered into the Excel database. For a few HIV MAbs, only CDR-H3 length, and not the expressed VH sequence, was available (e.g., Ditzel et al. ; see Table S1). Results from JoinSolver were compared to those produced by the V-QUEST and JunctionAnalysis algorithms of the IMGT system , which also analyzes VH sequences for gene usage and somatic mutations. In addition, assignments of each MAb to predicted germline VH, DH and JH genes were confirmed visually. Results from IMGT and JoinSolver differed systematically. For example, the size of the region of CDR-H3 contributed by the germline DH gene was consistently estimated to be greater using IMGT V-QUEST. This result can be explained by the fact that standard parameters for V-QUEST allow more mutations in the DH-gene core. However, differences between classes of MAbs were similar whether the comparisons were calculated using V-QUEST or JoinSolver, and these relative differences (e.g., the average CDR-H3 length of self vs. non-self MAbs) are the important parameters in our study.
JoinSolver results were used to screen for clonal expansions, which were identified as those Abs that used the same sets of VH, DH and JH regions with similar patterns of N and P nucleotides. A single, randomly chosen MAb was retained to represent each clonally-expanded set. Two recently reported bNt MAbs  are expansions of the same B-cell lineage, so we randomly selected one, PG16, for analysis; taking the same approach we selected VRC01 from the set of two bNt MAbs reported by Wu et al. . Thus, the final set that was statistically analyzed does not include all reported HIV MAbs, but only those representing independent clonal lineages (see below, and Tables S1 and S2).
In summary, the entire database consists of over 700 MAbs (Table S1), which underwent two screens to produce the final dataset (Table S2) for analysis of CDR-H3 length, SM and gene usage. In the first screen, each MAb had to have a specified Ag, and to be associated with a particular immune response. In the second screen, clones from the same clonal expansion were deleted from the dataset, resulting in a 427-MAb dataset comprising 227 ChI MAbs (including 193 HIV MAbs), 87 SAD MAbs and 113 AcI MAbs, which was exported to SAS (Rel. 8.2, 2001; SAS Institute Inc., Cary, NC) for statistical analysis. Of these 427 MAbs, 318 were identified as to IgM or IgG; 90% of these were IgG (Table S2).
For all MAb categories, PROC UNIVARIATE (SAS) was used to test the distributions of CDR-H3 length, total VH-gene mutations, distance of predicted VH gene used in the MAb relative to VH6-1, the V-gene most proximal to the DH region (VH-distance), and distance of predicted JH gene from JH6, the JH gene most distal to the DH region (JH-distance), against the normal distribution. Most of the distributions were non-normal, even after log transformation, so a non-parametric Kruskall-Wallis Test was used to test for differences among sets of MAbs (PROC NPAR1WAY, SAS). To avoid zero values, the natural log of (3*CDR-H3 length in aa + 0.1) was used in tests for differences in CDR-H3 length. All results from the non-parametric tests were compared to one-way ANOVA (PROC GLM, SAS), and in all cases the results were similar in terms of levels of significance. When more than two categories were compared, (i.e., comparisons among ChI, AcI and SAD MAbs), Tukey a posteriori tests were used to determine what groups were statistically different, and these different groups were denoted by different letters (PROC GLM, SAS). In Table 1 we present the p values for the main statistical tests of this study. This Table reports 9 hypothesis tests for each of CDR-H3 length, number of SMs and VH-distance, for a total of 27 tests; therefore, to be conservative, all tests that passed a Bonferroni-corrected P value of 0.05/27 = 0.0018 were highlighted in bold. Given the many confounding factors in this data base, these probability values should be interpreted as indicators of strong differences among categories rather than strictly interpreted statistical tests (see Discussion). Distributions of CDR-H3 length for CD4bs, CD4i, V3 loop, and anti-gp41 MAbs presented in Figure 2 were tested for heterogeneity by χ2. JH-distance did not vary among MAb categories and is not reported. The difficulty of assigning germline DH genes to expressed Ab sequences, especially for highly mutated HIV MAbs, precluded a comprehensive analysis of DH gene usage or the number of P and N nucleotides.
VH gene family usage in anti-protein and non anti-protein MAbs for 3 disease conditions. See Table 3 for sample sizes; there is only 1 ChI Mab that is not anti-protein. MAbs utilizing VH1 family were separated into those using VH1-69 and others.
Database of antigen-specific expressed human MAbs.
R. Kunert and H. Katinger (University of Natural Resources and Life Sciences, Vienna) kindly provided the sequences of MAbs 2F5, 2G12 and 4E10 for this analysis, and J. Scheid (Charité Universitätsmedizin, Berlin) and M. Nussenzwieg (Rockefeller University) provided a large number of anti-HIV-1 expressed Ab sequences . We are grateful to Ralph Pantophlet (Simon Fraser University) for comments on the manuscript and Kevin Henry (Simon Fraser University) for help with graphics.
Conceived and designed the experiments: FB JKS. Performed the experiments: FB CL MM NSL PEL JKS. Analyzed the data: FB CL MM NSL PEL JKS. Contributed reagents/materials/analysis tools: PEL NSL. Wrote the paper: FB JKS NSL PEL.
- 1. Tonegawa S (1983) Somatic generation of antibody diversity. Nature 302: 575–581.
- 2. Market E, Papavasiliou FN (2003) V(D)J recombination and the evolution of the adaptive immune system. PLoS Biol 1: E16.
- 3. Wu TT, Johnson G, Kabat EA (1993) Length distribution of CDRH3 in antibodies. Proteins 16: 1–7.
- 4. Schief WR, Ban YE, Stamatatos L (2009) Challenges for structure-based HIV vaccine design. Curr Opin HIV AIDS 4: 431–440.
- 5. Burton DR, Desrosiers RC, Doms RW, Koff WC, Kwong PD, et al. (2004) HIV vaccine design and the neutralizing antibody problem. Nat Immunol 5: 233–236.
- 6. Binley JA, Wrin T, Korber B, Zwick MB, Wang M, et al. (2004) Comprehensive cross-clade neutralization analysis of a panel of anti-human immunodeficiency virus type 1 monoclonal antibodies. Journal of Virology 78: 13232–13252.
- 7. Walker LM, Phogat SK, Chan-Hui PY, Wagner D, Phung P, et al. (2009) Broad and potent neutralizing antibodies from an African donor reveal a new HIV-1 vaccine target. Science 326: 285–289.
- 8. Corti D, Langedijk JP, Hinz A, Seaman MS, Vanzetta F, et al. (2010) Analysis of memory B cell responses and isolation of novel monoclonal antibodies with neutralizing breadth from HIV-1-infected individuals. PLoS One 5: e8805.
- 9. Wu X, Yang ZY, Li Y, Hogerkorp CM, Schief WR, et al. (2010) Rational design of envelope identifies broadly neutralizing human monoclonal antibodies to HIV-1. Science 329: 856–861.
- 10. Simek MD, Rida W, Priddy FH, Pung P, Carrow E, et al. (2009) Human immunodeficiency virus type 1 elite neutralizers: individuals with broad and potent neutralizing activity identified by using a high-throughput neutralization assay together with an analytical selection algorithm. J Virol 83: 7337–7348.
- 11. Doria-Rose NA, Klein RM, Daniels MG, O'Dell S, Nason M, et al. (2010) Breadth of human immunodeficiency virus-specific neutralizing activity in sera: clustering analysis and association with clinical variables. J Virol 84: 1631–1636.
- 12. Richman DD, Wrin T, Little SJ, Petropoulos CJ (2003) Rapid evolution of the neutralizing antibody response to HIV type 1 infection. Proc Natl Acad Sci U S A 100: 4144–4149.
- 13. Kunert R, Ruker F, Katinger H (1998) Molecular characterization of five neutralizing anti-HIV type 1 antibodies: Identification of nonconventional D segments in the human monoclonal antibodies 2G12 and 2F5. AIDS Research and Human Retroviruses 14: 1115–1128.
- 14. Saphire EO, Parren PW, Pantophlet R, Zwick MB, Morris GM, et al. (2001) Crystal structure of a neutralizing human IGG against HIV-1: a template for vaccine design. Science 293: 1155–1159.
- 15. Zwick MB, Parren PWHI, Saphire EO, Church S, Wang M, et al. (2003) Molecular features of the broadly neutralizing immunoglobulin G1 b12 required for recognition of Human Immunodeficiency Virus Type 1 gp120 10.1128/JVI.77.10.5863-5876.2003. J Virol 77: 5863–5876.
- 16. Ofek G, Tang M, Sambor A, Katinger H, Mascola JR, et al. (2004) Structure and mechanistic analysis of the anti-human immunodeficiency virus type 1 antibody 2F5 in complex with its gp41 epitope. J Virol 78: 10724–10737.
- 17. Stanfield RL, Gorny MK, Williams C, Zolla-Pazner S, Wilson IA (2004) Structural rationale for the broad neutralization of HIV-1 by human monoclonal antibody 447-52D. Structure 12: 193–204.
- 18. Cardoso RM, Zwick MB, Stanfield RL, Kunert R, Binley JM, et al. (2005) Broadly neutralizing anti-HIV antibody 4E10 recognizes a helical conformation of a highly conserved fusion-associated motif in gp41. Immunity 22: 163–173.
- 19. Haynes BF, Fleming J, St Clair EW, Katinger H, Stiegler G, et al. (2005) Cardiolipin polyspecific autoreactivity in two broadly neutralizing HIV-1 antibodies. Science 308: 1906–1908.
- 20. Nabel GJ (2005) Immunology. Close to the edge: neutralizing the HIV-1 envelope. Science 308: 1878–1879.
- 21. Hioe CE, Wrin T, Seaman MS, Yu X, Wood B, et al. (2010) Anti-V3 monoclonal antibodies display broad neutralizing activities against multiple HIV-1 subtypes. PLoS One 5: e10254.
- 22. Zhou T, Xu L, Dey B, Hessell AJ, Van Ryk D, et al. (2007) Structural definition of a conserved neutralization epitope on HIV-1 gp120. Nature 445: 732–737.
- 23. Zwick MB, Komori HK, Stanfield RL, Church S, Wang M, et al. (2004) The long third complementarity-determining region of the heavy chain is important in the activity of the broadly neutralizing anti-Human Immunodeficiency Virus Type 1 antibody 2F5 10.1128/JVI.78.6.3155-3161.2004. J Virol 78: 3155–3161.
- 24. Julien JP, Huarte N, Maeso R, Taneva SG, Cunningham A, et al. (2010) Ablation of the complementarity-determining region H3 apex of the anti-HIV-1 broadly neutralizing antibody 2F5 abrogates neutralizing capacity without affecting core epitope binding. J Virol 84: 4136–4147.
- 25. Scherer EM, Leaman DP, Zwick MB, McMichael AJ, Burton DR (2010) Aromatic residues at the edge of the antibody combining site facilitate viral glycoprotein recognition through membrane interactions. Proc Natl Acad Sci U S A 107: 1529–1534.
- 26. Alam SM, McAdams M, Boren D, Rak M, Scearce RM, et al. (2007) The role of antibody polyspecificity and lipid reactivity in binding of broadly neutralizing anti-HIV-1 envelope human monoclonal antibodies 2F5 and 4E10 to glycoprotein 41 membrane proximal envelope epitopes. Journal of Immunology 178: 4424–4435.
- 27. Johnson G, Wu TT (1998) Preferred CDRH3 lengths for antibodies with defined specificities. Int Immunol 10: 1801–1805.
- 28. Collis AVJ, Brouwer AP, Martin ACR (2003) Analysis of the antigen combining site: Correlations between length and sequence composition of the hypervariable loops and the nature of the antigen. Journal of Molecular Biology 325: 337–354.
- 29. Baxendale HE, Johnson M, Stephens RC, Yuste J, Klein N, et al. (2008) Natural human antibodies to pneumococcus have distinctive molecular characteristics and protect against pneumococcal disease. Clin Exp Immunol 151: 51–60.
- 30. Notkins AL (2004) Polyreactivity of antibody molecules. Trends in Immunology 25: 174–179.
- 31. Yurasov S, Tiller T, Tsuiji M, Velinzon K, Pascual V, et al. (2006) Persistent expression of autoantibodies in SLE patients in remission. J Exp Med 203: 2255–2261.
- 32. Tiller T, Tsuiji M, Yurasov S, Velinzon K, Nussenzweig MC, et al. (2007) Autoreactivity in human IgG+ memory B cells. Immunity 26: 205–213.
- 33. Song H, Cerny J (2003) Functional heterogeneity of marginal zone B cells revealed by their ability to generate both early antibody-forming cells and germinal centers with hypermutation and memory in response to a T-dependent antigen. J Exp Med 198: 1923–1935.
- 34. Phan TG, Gardam S, Basten A, Brink R (2005) Altered migration, recruitment, and somatic hypermutation in the early response of marginal zone B cells to T cell-dependent antigen. J Immunol 174: 4567–4578.
- 35. Baumgarth N (2011) The double life of a B-1 cell: self-reactivity selects for protective effector functions. Nat Rev Immunol 11: 34–46.
- 36. Tangye SG, Good KL (2007) Human IgM+CD27+ B cells: memory B cells or "memory" B cells? J Immunol 179: 13–19.
- 37. Berberian L, Goodglick L, Kipps TJ, Braun J (1993) Immunoglobulin VH3 gene products: natural ligands for HIV gp120. Science 261: 1588–1591.
- 38. David D, Demaison C, Bani L, Zouali M, Theze J (1995) Selective variations in vivo of VH3 and VH1 gene family expression in peripheral B cell IgM, IgD and IgG during HIV infection. Eur J Immunol 25: 1524–1528.
- 39. Wardemann H, Yurasov S, Schaefer A, Young JW, Meffre E, et al. (2003) Predominant autoantibody production by early human B cell precursors. Science 301: 1374–1377.
- 40. Matsuda F, Ishii K, Bourvagnet P, Kuma K, Hayashida H, et al. (1998) The complete nucleotide sequence of the human immunoglobulin heavy chain variable region locus. J Exp Med 188: 2151–2162.
- 41. Mahalanabis M, Jayaraman P, Miura T, Pereyra F, Chester EM, et al. (2009) Continuous viral escape and selection by autologous neutralizing antibodies in drug-naive human immunodeficiency virus controllers. J Virol 83: 662–672.
- 42. CHAVI CfH-AVI (2006) https://chavi.org/modules/chavi_mem_pols/index.php?id=1.
- 43. Scheid JF, Mouquet H, Feldhahn N, Seaman MS, Velinzon K, et al. (2009) Broad diversity of neutralizing antibodies isolated from memory B cells in HIV-infected individuals. Nature 458: 636–640.
- 44. Wisnewski A, Cavacini L, Posner M (1996) Human antibody variable region gene usage in HIV-1 infection. J Acquir Immune Defic Syndr Hum Retrovirol 11: 31–38.
- 45. Scamurra RW, Miller DJ, Dahl L, Abrahamsen M, Kapur V, et al. (2000) Impact of HIV-1 infection on VH3 gene repertoire of naive human B cells. J Immunol 164: 5482–5491.
- 46. Gorny MK, Wang X, Jiang X, Williams C, Volsky B, et al. (2008) Immunoglobulin gene usage by neutralizing human anti-V3 HIV-1 monoclonal antibodies derived from clade B and non-B HIV-1 infected individuals. AIDS Research and Human Retroviruses 24: 46–46.
- 47. Muller S, Kohler H (1997) B cell superantigens in HIV-1 infection. Int Rev Immunol 14: 339–349.
- 48. Viau M, Veas F, Zouali M (2007) Direct impact of inactivated HIV-1 virions on B lymphocyte subsets. Molecular Immunology 44: 2124–2134.
- 49. Huang CC, Venturi M, Majeed S, Moore MJ, Phogat S, et al. (2004) Structural basis of tyrosine sulfation and V-H-gene usage in antibodies that recognize the HIV type 1 coreceptor-binding site on gp120. Proceedings of the National Academy of Sciences of the United States of America 101: 2706–2711.
- 50. Throsby M, van den Brink E, Jongeneelen M, Poon LLM, Alard P, et al. (2008) Heterosubtypic neutralizing monoclonal antibodies cross-protective against H5N1 and H1N1 recovered from human IgM+ memory B cells. PLoS ONE 3: e3942.
- 51. Sui J, Hwang WC, Perez S, Wei G, Aird D, et al. (2009) Structural and functional bases for broad-spectrum neutralization of avian and human influenza A viruses. Nat Struct Mol Biol 16: 265–273.
- 52. Kashyap AK, Steel J, Oner AF, Dillon MA, Swale RE, et al. (2008) Combinatorial antibody libraries from survivors of the Turkish H5N1 avian influenza outbreak reveal virus neutralization strategies. Proc Natl Acad Sci U S A 105: 5986–5991.
- 53. de Wildt RM, van Venrooij WJ, Winter G, Hoet RM, Tomlinson IM (1999) Somatic insertions and deletions shape the human antibody repertoire. J Mol Biol 294: 701–710.
- 54. Zouali M (2008) Receptor editing and receptor revision in rheumatic autoimmune diseases. Trends in Immunology 29: 103–109.
- 55. Klonowski KD, Monestier M (2001) Ig heavy-chain gene revision: leaping towards autoimmunity. Trends in Immunology 22: 400–405.
- 56. Walker LM, Burton DR (2010) Rational antibody-based HIV-1 vaccine design: current approaches and future directions. Curr Opin Immunol 22: 358–366.
- 57. Ofek G, Guenaga FJ, Schief WR, Skinner J, Baker D, et al. (2010) Elicitation of structure-specific antibodies by epitope scaffolds. Proc Natl Acad Sci U S A 107: 17880–17887.
- 58. Correia BE, Ban YE, Holmes MA, Xu H, Ellingson K, et al. (2010) Computational design of epitope-scaffolds allows induction of antibodies specific for a poorly immunogenic HIV vaccine epitope. Structure 18: 1116–1126.
- 59. Wang TT, Tan GS, Hai R, Pica N, Petersen E, et al. (2010) Broadly protective monoclonal antibodies against H3 influenza viruses following sequential immunization with different hemagglutinins. PLoS Pathog 6: e1000796.
- 60. Souto-Carneiro MM, Longo NS, Russ DE, Sun HW, Lipsky PE (2004) Characterization of the human Ig heavy chain antigen binding complementarity determining region 3 using a newly developed software algorithm, JOINSOLVER. J Immunol 172: 6790–6802.
- 61. Ditzel HJ, Parren PW, Binley JM, Sodroski J, Moore JP, et al. (1997) Mapping the protein surface of human immunodeficiency virus type 1 gp120 using human monoclonal antibodies from phage display libraries. J Mol Biol 267: 684–695.
- 62. Lefranc MP (2005) IMGT, the international ImMunoGeneTics information system: a standardized approach for immunogenetics and immunoinformatics. Immunome Res 1: 3.
- 63. Zemlin M, Klinger M, Link J, Zemlin C, Bauer K, et al. (2003) Expressed murine and human CDR-H3 intervals of equal length exhibit distinct repertoires that differ in their amino acid composition and predicted range of structures. J Mol Biol 334: 733–749.