Understanding the relationship between protein sequence and molecular recognition selectivity remains a major challenge. The antibody fragment scFv1F4 recognizes with sub nM affinity a decapeptide (sequence 6TAMFQDPQER15) derived from the N-terminal end of human papilloma virus E6 oncoprotein. Using this decapeptide as antigen, we had previously shown that only the wild type amino-acid or conservative replacements were allowed at positions 9 to 12 and 15 of the peptide, indicating a strong binding selectivity. Nevertheless phenylalanine (F) was equally well tolerated as the wild type glutamine (Q) at position 13, while all other amino acids led to weaker scFv binding. The interfaces of complexes involving either Q or F are expected to diverge, due to the different physico-chemistry of these residues. This would imply that high-affinity binding can be achieved through distinct interfacial geometries. In order to investigate this point, we disrupted the scFv–peptide interface by modifying one or several peptide positions. We then analyzed the effect on binding of amino acid changes at the remaining positions, an altered susceptibility being indicative of an altered role in complex formation. The 23 starting variants analyzed contained replacements whose effects on scFv1F4 binding ranged from minor to drastic. A permutation analysis (effect of replacing each peptide position by all other amino acids except cysteine) was carried out on the 23 variants using the PEPperCHIP® Platform technology. A comparison of their permutation patterns with that of the wild type peptide indicated that starting replacements at position 11, 12 or 13 modified the tolerance to amino-acid changes at the other two positions. The interdependence between the three positions was confirmed by SPR (Biacore® technology). Our data demonstrate that binding selectivity does not preclude the existence of alternative high-affinity recognition modes.
Citation: Nominé Y, Choulier L, Travé G, Vernet T, Altschuh D (2015) Antibody Binding Selectivity: Alternative Sets of Antigen Residues Entail High-Affinity Recognition. PLoS ONE 10(12): e0143374. https://doi.org/10.1371/journal.pone.0143374
Editor: Chandra Verma, Bioinformatics Institute, SINGAPORE
Received: July 15, 2015; Accepted: November 4, 2015; Published: December 2, 2015
Copyright: © 2015 Nominé et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by the Centre National de la Recherche Scientifique (CNRS) and the University of Strasbourg (UdS). This work used the platforms of the Grenoble Instruct centre (ISBG; UMS 3518 CNRS-CEA-UJF-EMBL) with support from FRISBI (ANR-10-INSB-05-02) and GRAL (ANR-10-LABX-49-01) within the Grenoble Partnership for Structural Biology (PSB).
Competing interests: The authors have declared that no competing interests exist.
The determinants of binding selectivity in protein-protein interactions remain largely unexplored even though their knowledge is crucial for understanding binding events that underlie biological phenomena or for developing new drugs. Selectivity is the ability of a molecule to discriminate between interaction partners. A selective binder shows little cross-reactivity: it recognizes a given partner with much higher affinity than other partners. A non-selective binder is highly cross-reactive: it recognizes a range of molecules with similar affinities. The description of selectivity is necessarily operational since it depends on the number and nature of molecules analyzed. Ideally it should be based on the quantitative characterization of a very large number of interactions, but this task is seldom expanded beyond a few dozen, typically alanine variants of interfacial residues.
Alanine scanning mutagenesis (Ala-scan) experiments have shown that a small proportion of all residues that compose protein interfaces play a major role in binding [1–2]. They were called hot spots and defined as residues whose Ala replacement decreases the binding free energy (ΔΔG) by more than 2 kcal/mol. Hot spots are generally clustered at the center of the binding site and are surrounded by energetically less important residues, initially proposed to shield hot spots from the solvent (O-ring hypothesis ). Hot spots are enriched in Trp, Tyr and Arg, which was attributed to the capacity of these amino-acids for multiple interaction types (aromatic-π, h-bond, hydrophobic) [2–3]. Amino-acid preferences at binding sites were also investigated using structural data for protein complexes. The conclusions differ somewhat depending on the data set and definition of interface residues, but a preferential contribution to binding sites of hydrophobic, aromatic and Arg residues was observed [4–7]. These amino-acids were proposed to be well suited for making contacts because of stickiness, flexibility and mixed physico-chemical properties, allowing them to interact with different residues via different contact types. It was suggested that hydrophobic residues (generally identified as hot spots and located at the center of the binding site) mainly provide affinity, while surrounding polar residues contribute to specificity [6, 8–9].
Beyond the identification of general rules, the compilation of Ala-scan experiments and the analysis of binding site architectures stress the complex relationship between sequence/structure and binding [10–12]. Furthermore the hypothesis that hot spot residues present some recognition adaptability raises the question of the basis of selectivity. Statistical studies must therefore be complemented with in-depth studies of the relation between structure and binding in individual interactions. Because experimental mutational studies are laborious and time-consuming, much effort was invested in the computational prediction of residue contributions to binding energy from the 3D structure of complexes [13–16]. While reasonable success was achieved in the identification of hot spots, precise prediction of the variation in binding energy from in silico Ala-scan remains problematic. Predicting the effect on binding energy of replacements other than Ala or of multiple replacements, which is crucial for a description of selectivity, is even more elusive. One of the reasons for the moderate success of quantitative predictions is that protein plasticity and dynamics play a significant role in binding [17–20]. Differences between free and bound molecules include structural re-organizations, changes in internal dynamics, ordering of mobile regions and desolvation effects, with complex consequences on enthalpy and entropy variations. The description of these phenomena, of their influence on binding energy, and of the way they change upon interface modifications, is out of reach with current knowledge and methods used for computational approaches.
Comprehensive mutational studies, including not only single variants other than Ala, but also multiple variants, are inescapable to unravel the relationship between structure and affinity . Such studies are scarce [22–23]. They cannot be exhaustive due to the extremely large size of the mutational landscape. Furthermore an increase in library size is generally achieved at the expense of detailed activity quantifications. Illustrations of large-scale studies are provided by the combinatorial mutagenesis of the interface between the human growth hormone and its receptor , the mutational analysis of a PDZ domain in the cellular context  and the high resolution mapping of the interaction between the WW domain and a peptide using protein display combined with high-throughput sequencing . While double mutant behavior could generally be predicted from that of single mutants in the latter study, the two former studies stress the context-dependent effect of some interface mutations.
Residue interdependences (defined here as those cases when a mutation at position A modifies the impact on binding affinity of a mutation at position B) surmise a modified role of the wild-type (WT) and replacing residues in the binding mechanism, such as the existence of different contact types or geometries. Here we search for residue interdependences within the peptidic epitope recognized by the single chain antibody fragment (scFv) 1F4 , previously proposed to be capable of different recognition modes . ScFv1F4 binds with high affinity to peptides corresponding to the N-terminus of Human Papilloma Virus 16 oncoprotein E6 (HPV16-E6) . In a previous work, a full permutation analysis of the WT decapeptide 6TAMFQDPQER15 allowed us to identify a well-defined epitope composed of positions 8 to 15, among which six showed a restricted tolerance to amino-acid changes and were considered as essential epitope residues . Only the WT amino-acid or conservative changes were allowed at five of these essential positions (9–12 and 15). In contrast Phe and Gln, two amino-acids that differ in physicochemical properties, were allowed at position 13 while conservative replacements led to weaker binding signals. The observation that position 13 is essential for binding but can accommodate residues with a different bonding potential led us to propose that scFv1F4 uses alternative peptide binding mechanisms. If several residues are implicated in the alternative mechanisms, residue interdependences should be observed: the susceptibility of the binding affinity to amino-acid replacements at a given position should depend on the nature of the amino-acid present at another position. Such occurrences are readily identified from comparing the permutation patterns (effect on binding of replacing each position by all amino-acids) of variant peptides with that of the WT peptide. Here we present the permutation analysis of the WT peptide and of 23 starting variants performed using the PEPperPRINT GmbH facility (PEPperCHIP® Platform technology, Heidelberg, Germany). A subset of the peptide–scFv interactions was characterized by SPR (Biacore®, GE Healthcare Biacore, Uppsala, Sweden). Our data indicate that replacements at peptide positions 11, 12 or 13 modify the impact on the binding affinity of amino-acid changes at the other two positions, demonstrating a dependence between the three residues for scFv1F4 binding, and supporting the existence of alternative binding modes. Furthermore the alternative modes entailed high-affinity peptide recognition.
Materials and Methods
Materials and methods were described in detail previously  and will be briefly summarized here.
Soluble peptides and recombinant antibody fragments
ScFv1F4  and scFv1F4-QL34S (scFv1F4 with a Q34S replacement in the light chain ) were expressed and purified as described previously . Soluble peptides derived from residues 6–15 (6TAMFQDPQER15) of oncoprotein E6 of human papilloma virus 16 (HPV16-E6) were purchased from ProteoGenix (Oberhausbergen, France). The variants will be denoted either by their sequence, with the modified position in bold, or by the replacement (for example 6TAMFQDPFER15 or Q13F).
Peptide array synthesis and analysis
ScFv1F4-QL34S labeling with DyLight680, peptide array synthesis and binding detection were performed by PEPperPRINT GmbH (PEPperCHIP® Platform Technology, Heidelberg, Germany). The permutation scans were carried out on the WT decapeptide 6TAMFQDPQER15 and on its variants. In a permutation scan, the effect on binding of replacing each of the ten peptide positions by all amino-acids except cysteine is analyzed, which requires the synthesis of 190 spot peptides per starting peptide (19 amino-acids at 10 positions). Two arrays were prepared, each containing the permutation analysis of the WT peptide and of 19 variants. All peptides were synthesized in duplicate (7,600 peptide spots per array). The peptide arrays were stained with the DyLight680-labeled scFv1F4-QL34S. The staining pattern of array B is shown in Fig 1. The permutation scans of the WT peptide and of variant 6TAMFQDPFER15 are highlighted by white frames. The staining pattern of array A (S1 Fig) was published previously .
The permutation scans of the WT peptide (upper left) and of variant 6TAMFQDPFER15 are highlighted by white frames. Replaced positions together with the WT amino-acid are indicated on the left of the upper left frame. Amino-acids introduced at each position are indicated on top of the frame. Duplicated spots are easily identified.
SPR kinetic experiments were carried out using a Biacore T200 instrument (GE-Healthcare Biacore, Uppsala, Sweden) as described previously . Soluble peptides containing a C-terminal cysteine were covalently coupled to the sensor surfaces using the thiol coupling chemistry. The running buffers for kinetic measurements were HBS (HEPES buffered saline: 10 mM HEPES, 150 mM NaCl, 3.4 mM EDTA, pH 7.4) or HBS-200 (10 mM HEPES, 200 mM NaCl, pH 7.4), both supplemented with 0.005% (v/v) surfactant P20. Three-fold dilutions of a 45 or 30 nM scFv1F4 sample were injected over the surfaces in multiple cycle kinetic experiments. Injection and post-injection times were 180 s and 600 s, respectively. The flow rate was 40 μL/min. The double-referenced binding curves were fit to the one:one Langmuir binding model, using the Biacore T200 evaluation software (GE Healthcare Biacore).
Circular dichroism experiments
Circular dichroism (CD) experiments were recorded on a Jasco J-815 spectropolarimeter (Easton, MD) equipped with an automatic 6- position Peltier thermostated cell holder. Samples (65 μL) at 400 μM were prepared in HBS buffer. Far-UV CD data were collected in the 190–270 nm range using a 0.1 mm pathlength cell (Quartz-Suprasil, Hellma UK Ltd) at 25.0°C ± 0.1°C. Spectra were acquired using a continuous scan rate of 100 nm/min and averaged over 10 successive scans. The response time and the bandwidth were 1.0 s and 1 nm, respectively. The solvent spectrum was subtracted from spectra obtained under identical conditions. Far UV data are deconvoluted with the CDPro package  using the CONTINLL algorithm with the SDP48 protein database, and means and standard deviations calculated over the results obtained for the validated structures.
Peptide array design
The objective was to investigate whether a starting amino-acid replacement modifies the effect on binding of amino-acid changes at neighboring positions. For example the effect of a deleterious starting change could be compensated by a replacement elsewhere. Residue interdependences may also be found in the case of non-deleterious starting replacements. They can be identified by comparing the permutation patterns of the WT and variant peptides. Arrays A  (S1 Fig) and B (Fig 1) each contained the permutation scans of the WT decapeptide 6TAMFQDPQER15 and of nineteen variants. Data recorded for the WT and 14 variant peptides of array A were described previously . The data presented here include all permutation scans in array B (WT and 19 variants), together with those of 5 variants in array A. One of these was performed twice in array A (peptide 6TRMFQDVQER15). Therefore the data include the permutations scans of the WT and of 23 variant decapeptides (Table 1), each of them being recorded at least in duplicate.
A search for interdependences between all possible residue pairs is unfeasible. We therefore focused on the consequences on the permutation patterns of replacing essential residues. Thirteen variants presented a single change at one of the six essential positions (9 to 13 and 15), nine presented multiple changes including at least one at an essential position, and one was replaced at non-essential positions (A7G-M8S). The effect on binding of the replacements ranged from minor to drastic, as emphasized when the averaged fluorescence signals are expressed as percentage of that recorded with the WT decapeptide (column 6 in Table 1).
Array data analysis
The repeatability of fluorescence measurements can be assessed because: i/ all peptides were synthesized in duplicate (Fig 1), ii/ the duplicated scan of peptide 6TRMFQDVQER15 was repeated in array A and iii/ several peptides occur repeatedly within a same or in different families. Array B contained 3,222 different peptides, with 105 peptides repeated between 6 and 46 times. The standard deviation on the mean intensity calculated for these repetitions was < 2,100 (Table 1), which is 12% of the WT intensity reading (18,100), demonstrating an excellent signal repeatability, as was also observed for array A .
The comparison of the WT permutation scans, performed in both arrays A and B, further demonstrated data quality. The signal intensity ranges differed in the two arrays: 6,800 and 18,100 for the scFv—WT decapeptide interaction in arrays A and B, respectively (Table 1). However normalization of the intensity values with respect to the mean value of the scFv1F4-WT peptide intensity in each panel, produced well-superimposed patterns (Fig 2). The data confirm the conclusions that positions 9 to 13 and 15 show a restricted tolerance to amino-acid changes (F9, Y9—Q10—D11, N11—P12—Q13, F13 and H15, K15, R15) and represent essential epitope positions . In addition, assuming that T6 is not part of the epitope, the variability observed for this position (standard deviation = 0.12) provides insight into the statistical significance of the normalized intensities.
Intensities were normalized with respect to the mean intensity recorded for the scFv1F4—WT peptide interaction. Black and grey bars correspond to the normalized intensities in arrays A and B, respectively. White bars represent the WT sequence (normalized value = 1). Stars indicate single replacements in starting variants (see Table 1).
Comparison of WT and variant permutation patterns
The averaged fluorescence signals recorded for fourteen starting peptides were below 10% of that recorded with the WT peptide (Table 1). These peptides carried one or several replacements at essential positions 9, 10, 11, 12 and 15. In contrast, four single replacements (P12V, Q13Y, Q13F and R15Q) had a milder effect with signals at 16, 37, 68 and 24% of the WT signal, respectively. Furthermore five double variants showed signals that exceeded 10% of the WT signal. Four of these included the Q13F or Q13Y change (P12V-Q13Y, P12V-Q13F, D11S-Q13Y, A8Q-Q13Y) and one was modified at non-essential positions 7 and 8 (A7G-M8S).
In order to compare the permutation patterns of the variants with that of the WT, the mean fluorescence signals were normalized as % of the signal recorded with each starting peptide. Figs 3, 4 and S2 Fig show the superimpositions of the normalized WT (blue) and variant (red line) patterns. Positions 6 and 7 are not represented because they were tolerant to all replacements in all peptides analyzed. The replacement patterns at positions modified in the starting sequence are not relevant to the modified sequence context. Therefore they are not represented in single variants, and represented in green in multiple variants. For the cluster of the fourteen peptides displaying strongest effect upon mutation (averaged fluorescence signals below 10% of the WT signal), scFv1F4 binding could not be restored by non-WT replacements, as illustrated by the permutation patterns of four of the fourteen peptides (S2 Fig). Among the nine variant binders displaying milder effect, two (A7G-M8S, R15Q) showed WT-like patterns (Fig 3). In contrast the permutation patterns of positions 11 and 12 were modified in variants presenting the Q13Y or Q13F replacements. Indeed, in the context of these starting peptides, S at position 11 was consistently compatible with binding in addition to D11 and N11 (Fig 4A–4F), while I, L and V were allowed at position 12 in addition to the WT residue P12 (Fig 4A–4D). Concomitantly, F13 was preferred over the WT residue Q13 in the context of starting peptides containing the P12V mutation (Fig 4G) or the D11S mutation (green traces in Fig 4D). Data from seven distinct permutation scans in two different arrays were thus fully consistent. They indicate interdependence between positions 11, 12 and 13.
The patterns of variants R15Q (A) and A7G-M8S (B) are represented as red lines and superimposed with the WT pattern (blue). Fluorescence signals were normalized with respect to that recorded with each starting peptide. Patterns at replaced positions are not shown because they are not relevant to the modified sequence context.
Variant patterns are represented as red lines and superimposed with the WT pattern (blue). Fluorescence signals were normalized with respect to that recorded with each starting peptide. Patterns at the replaced positions are in green for double variants and not shown for single variants.
Eight peptides were synthesized for SPR characterization of their interaction with scFv1F4 (Table 2). The variant P12Y was used as a control of weak binding, its fluorescence signal being < 10% that of the WT (Fig 2). The seven other peptides were designed to investigate the interdependence between positions 11, 12 and 13 by comparing the effect on binding of a given replacement in various sequence contexts. Fig 5 shows the normalized kinetic curves (signal expressed as % of the maximal binding capacity of the surface) recorded when injecting a 10 nM purified scFv1F4 sample in HBS over surfaces carrying these seven peptides. The kinetic and affinity constants are listed in Table 2, together with ΔG (cal / mol) calculated as RT x log KD. Association rates were similar for all complexes (ka in the range 1 to 5 x 106 M-1 s-1), but dissociation rate constants differed significantly. The kd values measured in HBS and HBS-200 (HBS containing 200 instead of 150 mM NaCl) were similar (S1 Table) and correlated well with the fluorescence signals measured in the spot approach (Fig 6A and 6B). The outlier is the WT peptide. The correlation between binding free energies (in HBS) and fluorescence signals is shown in Fig 6C and 6D.
The purified scFv1F4 (10 nM in HBS) was injected over peptide surfaces. The WT curve is shown in red, single variants in light grey, double variants in dark grey and the triple variant in black. The curves were superimposed using the software TraceDrawer (Ridgeview Instruments AB, Uppsala, Sweden).
The kd (A, B) and ΔG (C, D) values are plotted against fluorescence signals from arrays A (A,C) and B (B,D). The kd as measured in HBS (grey markers) and HBS-200 (white markers) is shown in log scale. Peptide names are indicated in A and B.
Structural characterization of the peptides
To investigate if differences in dissociation rate constants are related to structural changes in the free peptide, far-UV circular dichroism (CD) spectra were recorded for a subset of peptides (Fig 7A). The CD spectra present profiles that are typical of disordered structures. This observation is confirmed by deconvolution of the CD spectra indicating a very low prevalence of alpha and beta secondary structure content (always below 25%) (Fig 7B and S2 Table). These results imply that the mutations, even involving the proline residue at position 12, did not induce drastic changes in the peptide conformation as the proportion of secondary structure elements stayed rather constant. To better characterize the conformation of the free WT peptide in solution, NMR experiments were also conducted (Fig 7C). Both 1H-NMR TOCSY and NOESY spectra exhibit a rather narrow amide proton chemical shift dispersion limited to 0.7 ppm. Such a range is characteristic of a lack of structural organization of the peptide backbone as well. Note that region 8 to 15 in the NMR structure of the free N-terminal domain of E6 is highly flexible (PDB ID 2LJX ).
A) CD spectra of 6 of the peptides listed in Table 2. The CD signal is expressed as mean residue ellipticity. B) Representative analysis of far UV data deconvoluted with the CDPro package using the CONTINLL algorithm with the SDP48 protein database (See S2 Table for details). The CD signal is expressed as mean residue CD extinction coefficient. C) Superimposition of NMR 1H-1H TOCSY (green) and NOESY (red) spectra recorded at 700 MHz on a 1 mM sample of WT peptide. All spectra were measured at 25°C in HBS buffer. Sequential assignment is reported here according to the peptide numbering. Note that P12 is missing due to the lack of NH in the proline residue.
The analysis of peptide–protein interactions is of general relevance as a number of interactions that mediate key cellular processes involve peptides or short segments present in disordered protein regions or in protein loops . Furthermore bioactive peptides, or peptides able to interfere with protein-protein interactions, are actively developed for biomedical applications [34,35]. In particular peptides are developed as vaccines  and their antibody binding modes have been investigated in structural or functional studies . Multiple binding modes attributed to peptide or paratope plasticity have been observed in the case of weak-affinity complexes formed by peptides with anti-protein antibodies or primary repertoire antibodies [38–41]. Affinity and selectivity have been improved by constraining the peptides [42–45]. In contrast, we describe here short linear peptides, likely to be largely unstructured in solution according to CD and NMR experiments (Fig 7), that display < nM antibody binding affinity.
We demonstrate that scFv1F4 recognizes with strong affinity peptides that possess alternative three-residue motifs at essential positions 11 to 13 (WT motif 11DPQ13, variant motifs 11SVF13 and 11SPF13), within a common motif (F9—Q10 and R15). Moreover the large-scale mutational study also provided information on non-binders, demonstrating that the antibody fragment is selective. Information on non-binders is generally scarce due to the technical challenges of producing large numbers or variants and of quantifying their binding properties. Nevertheless it is essential as the ability of a protein to bind its partner(s) is as important as its inability to interact with non-wanted homologues to ensure correct biological function.
Ala-replacements of the WT peptide identify six hot-warm spots
We define hot-warm spots as residues whose Ala replacement results in an increase in binding free energy > 1.5 kcal/mol. The ΔG of the WT peptide–scFv1F4 interaction, as deduced from SPR kinetic constants, was -12.7 kcal/mol (KD = 0.5 nM, Table 2). Interactions with ΔG > -11.2 kcal/mol (KD > 6 nM) thus display an increase in binding free energy > 1.5 kcal/mol. From the relation between fluorescence intensities and ΔG in HBS (Fig 6C and 6D), we can roughly approximate intensities < 250 in array A or < 3000 in array B with ΔG > -11.2 kcal/mol. Four Ala variants showed fluorescence readings below these limits: F9A, Q10A, P12A and R15A (signals of 100, 54, 84 and 86, respectively, in array A, and of 633, 425, 1644 and 1057, respectively, in array B). These observations are consistent with the ΔΔG > 2 kcal/mol previously measured by SPR for the P12A (2.7 kcal/mol) and Q13A (2.4 kcal/mol) replacements . Variants Q13A and D11A showed intensities slightly above these limits (intensities of 252 and 305, respectively, in array A, and of 2524 and 4298, respectively, in array B). In conclusion the Ala replacement data, together with the restricted tolerance to amino-acid changes at six positions (Fig 2), point to the strong peptide recognition selectivity of scFv1F4.
Substitutional analysis of variants identifies both independent and interdependent effects of replacements
Residue interdependences were detected within the set of positions 11–13, but not elsewhere. For example the replacement R15Q decreased fluorescence intensity (24% of the WT intensity, Table 1) but the permutation pattern of positions 6 to 14 was nearly unaffected (Fig 3B). The conserved tolerance to amino-acid changes in variant R15Q, despite a decreased fluorescence signal compared to the WT (Table 1), indicates independent mutational effects: the starting change does not influence the effect of replacements at other peptide positions. The R15Q replacement is likely to modify the scFv-peptide interface only locally. In contrast a replacement at one of the hot-warm spots 11–13 affected the tolerance to changes at the two other positions (Fig 4), as also confirmed by SPR (Figs 5 and 6). This finding supports the hypothesis that scFv1F4 is capable of distinct peptide binding modes, initially proposed on the basis that Q and F, two amino-acids with different physico-chemical properties, are equally well tolerated at position 13 . Fig 8 compares the effect on kd of a given replacement in peptides with different sequences. The D11S replacement (top vertical arrows) increased kd 11-fold in the WT context, but had only a marginal effect in 6TAMFQDPFER15 and 6TAMFQDVFER15. Similarly the P12V replacement (right-hand horizontal arrows) increased kd 22 to 23-fold in the WT peptide, but had a weak or no effect in 6TAMFQDPFER15 and 6TAMFQSPFER15. These effects can also be visualized in Fig 5 by comparing the dissociation profiles obtained for each peptide.
Positions 11, 12 and 13 in the peptide sequences are shown in red, blue and green, respectively. The filling is white for the WT residue (D11, P12, Q13) and colored for the modified residue (S11, V12, F13). The ratio kd variant / kd WT, as measured by SPR in HBS (black) or HBS200 (grey), is given next to each arrow, together with the nature of the replacement.
A mutational study of the interaction between TEM1-lactamase and lactamase inhibitor protein (BLIP) suggested a modular architecture of protein–protein binding sites, with mutations showing cooperativity within modules and additivity across modules . The existence of other interdependences within the peptide recognized by scFv1F4, involving replacements that were not investigated, or interactions with stability below the detection limit of the spot experiments, cannot be excluded. Nevertheless the data recorded so far suggest that they are restricted to three of the six essential epitope positions.
ScFv1F4 recognizes alternative motifs with strong affinity
The multiple variants D11S-Q13F and D11S-P12V-Q13F displayed stronger scFv binding compared to single variants (D11S or P12V) in SPR experiments, illustrating a compensating effect of amino-acid changes and confirming the conclusions of spot experiments. The latter clearly indicated a greater tolerance to S at position 11 and I, L or V at position 12 in variants containing F13 instead of the WT Q13 (Fig 4). The ability of the scFv to recognize with similar affinity peptides that contain 11SVF13 and 11DPQ13 is corroborated with CD data showing almost no difference in structural contents. Even if the conformational ensembles are similar, the binding interfaces formed by the two bound peptides should differ due to the distinct nature of their side chains. Differences can be accommodated for instance if these residues are partially accessible to the solvent in the complex, or if the antibody binding site presents some plasticity [46–48]. However the recognition of peptides by the scFv is not ‘fuzzy’, as previously suggested  based on the observation that various sequences were compatible with binding activity in spot and phage display experiments. In the present work we demonstrate that the sequence requirements for high-affinity binding are restricted to a limited set of alternative amino-acid combinations: the D11S and P12V replacements required the concomitant Q13F change for optimal binding. ScFv1F4 is not a natural antibody, but a hybrid fragment formed from the light and heavy chain variable regions of two different monoclonal antibodies. Nevertheless the finding that alternative binding modes display sub-nanomolar affinity (Fig 5, Table 2) challenges the view that antibody selectivity is based on the existence of a unique optimal interface.
In a previous study we have shown that the “non-WT” binding motif was the only one selected from a phage-displayed peptide library . The consensus of 20 clones was S11-P12-FY13. Seven clones contained ST-P-FY, 5 contained ST-VIL-FY and one D-P-F, in accordance with the present finding that these motifs lead to strong scFv1F4 binding. None of the phage-display sequences carried Q13. This selection bias cannot be explained by differences in binding properties as measured by SPR. It could originate from a bias in library composition or from differences in dynamics, which could influence preferential recognition in a competition situation.
The alternative motifs illustrate a case of concerted replacements that ensure optimal activity. Concerted (co-ordinated, co-evolving) amino-acid pairs, identified from sequence alignments of homologous proteins, were found to be closer together in space than random pairs [49,50]. Co-evolving information was then exploited to predict amino-acid structural contacts within protein cores, at protein interfaces and across interfaces [51–54]. The particular case described here, although not dealing with protein evolution, illustrates how individual changes (D11S, P12V) can be deleterious or not depending on a silent replacement (Q13F). Silent replacements would not necessarily co-evolve with other positions of a cluster, which might explain inconsistencies in the experimental validation of computational results .
The relationship between affinity and selectivity is a matter of debate , high affinity being considered in some cases as a condition for specific binding and, in others, as favoring cross-reactivity. Numerous protein binders, and in particular antibodies, have been shown to form stable complexes with two or more ligands, but the assessment of selectivity requires the identification of a significant number of non-binders. Quantitative large-scale permutation studies represent a powerful tool to decipher the subtleties of protein-protein interactions, which cannot be inferred from mere Ala-scan experiments. Our results stress the importance and unpredictable outcome of residue dependencies in binding. We describe a high-affinity binder that is simultaneously selective (restricted tolerance to ligand changes) and permissive towards particular combinations of changes. It is not known if this dual behavior, identified for the interaction between an antibody fragment and a peptide, is an exception or not because few interaction systems have been studied at this level of quantitative detail.
S2 Fig. Permutation patterns of four non-binders.
S1 Table. Dissociation rate constants in HBS and HBS-200, as measured by SPR for 8 peptide-scFv1F4 interactions.
The authors would like to thank PEPperPRINT GmbH for peptide array synthesis, scFv labeling and binding detection, Mireille Baltzinger for scFv1F4-QL34S production and purification, Laure Bellard for scFv1F4 production, Bruno Kieffer for providing us with access to NMR spectrometer and Lionel Chiron for support in Python scripting.
Conceived and designed the experiments: TV LC DA. Performed the experiments: YN DA. Analyzed the data: YN DA. Contributed reagents/materials/analysis tools: TV. Wrote the paper: YN GT TV DA. Prepared the figures: YN LC.
- 1. Clackson T, Wells JA. A hot spot of binding energy in a hormone-receptor interface. Science. 1995; 267(5196): 383–6. pmid:7529940
- 2. Bogan AA, Thorn KS. Anatomy of hot spots in protein interfaces. J Mol Biol. 1998; 280(1): 1–9. Review. pmid:9653027
- 3. Moreira IS, Fernandes PA, Ramos MJ. Hot spot occlusion from bulk water: a comprehensive study of the complex between the lysozyme HEL and the antibody FVD1.3. J Phys Chem B. 2007; 111(10): 2697–706. pmid:17315919
- 4. Lo Conte L, Chothia C, Janin J. The atomic structure of protein-protein recognition sites. J Mol Biol. 1999; 285(5): 2177–98. pmid:9925793
- 5. Bahadur RP, Zacharias M. The interface of protein-protein complexes: analysis of contacts and prediction of interactions. Cell Mol Life Sci. 2008; 65(7–8): 1059–72. Review. pmid:18080088
- 6. De S, Krishnadev O, Srinivasan N, Rekha N. Interaction preferences across protein-protein interfaces of obligatory and non-obligatory components are different. BMC Struct Biol. 2005; 5: 15. pmid:16105176
- 7. Luo Q, Hamer R, Reinert G, Deane CM. Local network patterns in protein-protein interfaces. PLoS One. 2013; 8(3): e57031. pmid:23520460
- 8. Ansari S, Helms V. Statistical analysis of predominantly transient protein-protein interfaces. Proteins. 2005; 61(2): 344–55. pmid:16104020
- 9. Aiello D, Caffrey DR. Evolution of specific protein-protein interaction sites following gene duplication. J Mol Biol. 2012; 423(2): 257–72. pmid:22789570
- 10. Kastritis PL, Bonvin AM. On the binding affinity of macromolecular interactions: daring to ask why proteins interact. J R Soc Interface. 2012; 10(79): 20120835. pmid:23235262
- 11. Kastritis PL, Bonvin AM. Molecular origins of binding affinity: seeking the Archimedean point. Curr Opin Struct Biol. 2013; 23(6): 868–77. pmid:23876790
- 12. Reichmann D, Rahat O, Albeck S, Meged R, Dym O, Schreiber G. The modular architecture of protein-protein binding interfaces. Proc Natl Acad Sci USA 2005; 102: 57–62. pmid:15618400
- 13. Audie J, Swanson J. Advances in the prediction of protein-peptide binding affinities: implications for peptide-based drug discovery. Chem Biol Drug Des. 2013; 81(1): 50–60. Review. pmid:23066895
- 14. Foloppe N1, Hubbard R. Towards predictive ligand design with free-energy based computational methods? Curr Med Chem. 2006; 13(29): 3583–608 pmid:17168725
- 15. Morrow JK, Zhang S. Computational prediction of protein hot spot residues. Curr Pharm Des. 2012; 18(9): 1255–65. Review. pmid:22316154
- 16. Sharabi O1, Shirian J, Shifman JM. Predicting affinity- and specificity-enhancing mutations at protein-protein interfaces. Biochem Soc Trans. 2013; 41(5): 1166–9. pmid:24059503
- 17. Lise S, Archambeau C, Pontil M, Jones DT. Prediction of hot spot residues at protein-protein interfaces by combining machine learning and energy-based methods. BMC Bioinformatics. 2009;10: 365. pmid:19878545
- 18. Verkhivker GM, Bouzida D, Gehlhaar DK, Rejto PA, Freer ST, Rose PW. Complexity and simplicity of ligand-macromolecule interactions: the energy landscape perspective. Curr Opin Struct Biol. 2002; 12(2): 197–203. Review. pmid:11959497
- 19. DeLano WL. Unraveling hot spots in binding interfaces: progress and challenges. Curr Opin Struct Biol. 2002; 12(1): 14–20. Review. pmid:11839484
- 20. Frederick KK, Marlow MS, Valentine KG, Wand AJ. Conformational entropy in molecular recognition by proteins. Nature. 2007; 448(7151): 325–9. pmid:17637663
- 21. Hecht M, Bromberg Y, Rost B. News from the protein mutability landscape. J Mol Biol. 2013; 425(21): 3937–48. pmid:23896297
- 22. Bonsor DA1, Sundberg EJ. Dissecting protein-protein interactions using directed evolution. Biochemistry. 2011; 50(13): 2394–402. pmid:21332192
- 23. Sidhu SS, Kossiakoff AA. Exploring and designing protein function with restricted diversity. Curr Opin Chem Biol. 2007; 11(3): 347–54. pmid:17500026
- 24. Pal G, Kouadio JL, Artis DR, Kossiakoff AA, Sidhu SS: Comprehensive and quantitative mapping of energy landscapes for protein–protein interactions by rapid combinatorial scanning. J Biol Chem 2006; 281: 22378–85 pmid:16762925
- 25. McLaughlin RN Jr, Poelwijk FJ, Raman A, Gosal WS, Ranganathan R. The spatial architecture of protein function and adaptation. Nature. 2012; 491(7422): 138–42. pmid:23041932
- 26. Fowler DM, Araya CL, Fleishman SJ, Kellogg EH, Stephany JJ, Baker D, et al. High-resolution mapping of protein sequence-function relationships. Nat Methods. 2010; 7(9):741–6. pmid:20711194
- 27. Giovane C, Trave G, Briones A, Lutz Y, Wasylyk B, Weiss E. Targetting of the N-terminal domain of the human papillomavirus type 16 E6 oncoprotein with monomeric ScFvs blocks the E6-mediated degradation of cellular p53. J Mol Recognit. 1999; 12(2): 141–52. pmid:10398405
- 28. Vernet T, Choulier L, Nominé Y, Bellard L, Baltzinger M, Travé G, et al.Spot peptide arrays and SPR measurements: throughput and quantification in antibody selectivity studies. J Mol Recognit. 2015. pmid:25960426
- 29. Choulier L, Orfanoudakis G, Robinson P, Laune D, Ben Khalifa M, Granier C, et al. Comparative properties of two peptide-antibody interactions as deduced from epitope delineation. J Immunol Methods. 2002; 259(1–2): 77–86. pmid:11730843
- 30. Hugo N, Lafont V, Beukes M, Altschuh D. Functional aspects of co-variant surface charges in an antibody fragment. Protein Sci. 2002; 11(11): 2697–705. pmid:12381851
- 31. Sreerama N & Woody RW. Estimation of protein secondary structure from circular dichroism spectra: comparison of CONTIN, SELCON, and CDSSTR methods with an expanded reference set. Anal Biochem. 2000; 287: 252–60. pmid:11112271
- 32. Zanier K, ould M'hamed ould Sidi A, Boulade-Ladame C, Rybin V, Chappelle A, Atkinson A, Kieffer B, Travé G. Solution structure analysis of the HPV16 E6 oncoprotein reveals a self-association mechanism required for E6-mediated degradation of p53. Structure. 2012; 20(4): 604–17. pmid:22483108
- 33. London N, Movshovitz-Attias D, Schueler-Furman O. The structural basis of peptide-protein binding strategies. Structure. 2010; 18(2): 188–99. pmid:20159464
- 34. Akram ON, DeGraff DJ, Sheehan JH, Tilley WD, Matusik RJ, Ahn JM, Raj GV. Tailoring peptidomimetics for targeting protein-protein interactions. Mol Cancer Res. 2014; 12(7): 967–78. pmid:24642350
- 35. Otvos L Jr, Wade JD. Current challenges in peptide-based drug discovery. Front Chem. 2014; 2: 62. pmid:25152873
- 36. Gori A, Longhi R, Peri C, Colombo G. Peptides for immunological purposes: design, strategies and applications.Amino Acids. 2013; 45(2): 257–68. pmid:23744401
- 37. Olsson N, Wallin S, James P, Borrebaeck CA, Wingren C. Epitope-specificity of recombinant antibodies reveals promiscuous peptide-binding properties. Protein Sci. 2012; 21(12): 1897–910. pmid:23034898
- 38. Keitel T, Kramer A, Wessner H, Scholz C, Schneider-Mergener J, Höhne W. Crystallographic analysis of anti-p24 (HIV-1) monoclonal antibody cross-reactivity and polyspecificity. Cell. 1997; 91(6): 811–20. pmid:9413990
- 39. Scheerer P, Kramer A, Otte L, Seifert M, Wessner H, Scholz C, Krauss N, Schneider-Mergener J, Höhne W. Structure of an anti-cholera toxin antibody Fab in complex with an epitope-derived D-peptide: a case of polyspecific recognition. J Mol Recognit. 2007; 20(4): 263–74. pmid:17712773
- 40. Manivel V, Bayiroglu F, Siddiqui Z, Salunke DM, Rao KV. The primary antibody repertoire represents a linked network of degenerate antigen specificities. J Immunol. 2002; 169(2): 888–97. pmid:12097393
- 41. Sethi DK, Agarwal A, Manivel V, Rao KV, Salunke DM. Differential epitope positioning within the germline antibody paratope enhances promiscuity in the primary immune response. Immunity. 2006; 24(4): 429–38. pmid:16618601
- 42. Cardoso RM1, Brunel FM, Ferguson S, Zwick M, Burton DR, Dawson PE, Wilson IA. Structural basis of enhanced binding of extended and helically constrained peptide epitopes of the broadly neutralizing HIV-1 antibody 4E10. J Mol Biol. 2007; 365(5): 1533–44. pmid:17125793
- 43. Bird GH, Irimia A, Ofek G, Kwong PD, Wilson IA, Walensky LD. Stapled HIV-1 peptides recapitulate antigenic structures and engage broadly neutralizing antibodies. Nat Struct Mol Biol. 2014; 21(12): 1058–67. pmid:25420104
- 44. Smeenk LE, Timmers-Parohi D, Benschop JJ, Puijk WC, Hiemstra H, van Maarseveen JH, Timmerman P. Reconstructing the discontinuous and conformational β1/β3-loop binding site on hFSH/hCG by using highly constrained multicyclic peptides. Chembiochem. 2015; 16(1): 91–9. pmid:25469830
- 45. Moseri A, Tantry S, Sagi Y, Arshava B, Naider F, Anglister J. An optimally constrained V3 peptide is a better immunogen than its linear homolog or HIV-1 gp120. Virology. 2010; 401(2): 293–304. pmid:20347111
- 46. Lescar J, Stouracova R, Riottot MM, Chitarra V, Brynda J, Fabry M, Horejsi M, Sedlacek J, Bentley GA. Three-dimensional structure of an Fab-peptide complex: structural basis of HIV-1 protease inhibition by a monoclonal antibody. J Mol Biol. 1997; 267(5): 1207–22. pmid:9150407
- 47. Stanfield RL, Takimoto-Kamimura M, Rini JM, Profy AT, Wilson IA. Major antigen-induced domain rearrangements in an antibody. Structure. 1993; 1(2): 83–93. pmid:8069628
- 48. Sundberg EJ, Mariuzza RA. Molecular recognition in antibody-antigen complexes. Adv Protein Chem. 2002; 61: 119–60. pmid:12461823
- 49. Altschuh D, Lesk AM, Bloomer AC, Klug A. Correlation of co-ordinated amino acid substitutions with function in viruses related to tobacco mosaic virus. J Mol Biol. 1987; 193(4): 693–707. pmid:3612789
- 50. Altschuh D, Vernet T, Berti P, Moras D, Nagai K. Coordinated amino acid changes in homologous protein families. Protein Eng. 1988; 2(3): 193–9. pmid:3237684
- 51. Horner DS, Pirovano W, Pesole G.DeJuan Correlated substitution analysis and the prediction of amino acid structural contacts. Brief Bioinform. 2008; 9(1): 46–56. pmid:18000015
- 52. Marks DS, Hopf TA, Sander C. Protein structure prediction from sequence variation. Nat Biotechnol. 2012; 30(11): 1072–80. pmid:23138306
- 53. de Juan D, Pazos F, Valencia A. Emerging methods in protein co-evolution. Nat Rev Genet. 2013; 14(4): 249–61. pmid:23458856
- 54. Jana B, Morcos F, Onuchic JN. From structure to function: the convergence of structure based models and co-evolutionary information. Phys Chem Chem Phys. 2014; 16(14): 6496–507. pmid:24603809
- 55. Dietrich S, Borst N, Schlee S, Schneider D, Janda JO, Sterner R, Merkl R. Experimental assessment of the importance of amino acid positions identified by an entropy-based correlation analysis of multiple-sequence alignments. Biochemistry. 2012; 51(28): 5633–41. pmid:22737967
- 56. Greenspan. Cohen’s Conjecture, Howard’s Hypothesis, and Ptashne’s Ptruth: an exploration of the relationship between affinity and specificity. Trends Immunol. 2010; 31(4): 138–43. pmid:20149744