Ankyrin repeat proteins are elastic materials that unfold and refold sequentially, repeat by repeat, under force. Herein we use atomistic molecular dynamics to compare the mechanical properties of the 7-ankyrin-repeat oncoprotein Gankyrin in isolation and in complex with its binding partner S6-C. We show that the bound S6-C greatly increases the resistance of Gankyrin to mechanical stress. The effect is specific to those repeats of Gankyrin directly in contact with S6-C, and the mechanical ‘hot spots’ of the interaction map to the same repeats as the thermodynamic hot spots. A consequence of stepwise nature of unfolding and the localized nature of ligand binding is that it impacts on all aspects of the protein's mechanical behavior, including the order of repeat unfolding, the diversity of unfolding pathways accessed, the nature of partially unfolded intermediates, the forces required and the work transferred to the system to unfold the whole protein and its parts. Stepwise unfolding thus provides the means to buffer repeat proteins and their binding partners from mechanical stress in the cell. Our results illustrate how ligand binding can control the mechanical response of proteins. The data also point to a cellular mechano-switching mechanism whereby binding between two partner macromolecules is regulated by mechanical stress.
Here we use molecular dynamics simulation to compare the mechanical properties of the 7-ankyrin-repeat oncoprotein Gankyrin in isolation and in complex with binding partner S6-C. Tandem repeat proteins like Gankyrin comprise tandem arrays of small structural motifs that pack linearly to produce elongated architectures. They are elastic, mechanically weak molecules and they unfold and refold repeat by repeat under force. We show that S6-C binding greatly increases the resistance of Gankyrin to mechanical stress. The enhanced mechanical stability is specific to those ankyrin repeats in contact with S6-C, and the localized nature of the effect results in fundamental changes in the way the protein responds to force. Thus, the forced unfolding of isolated Gankryin involves a diverse set of pathways with a preference for a C- to N-terminus unfolding mechanism whereas this diversity is reduced upon complex formation with the central repeats, which are those most tightly bound to the ligand, tending to unfold last. Our study shows how stepwise unfolding can buffer repeat proteins and their binding partners from mechanical stress in the cell. It also points to a mechano-switching mechanism whereby binding between two partner macromolecules is regulated by mechanical stress.
Citation: Settanni G, Serquera D, Marszalek PE, Paci E, Itzhaki LS (2013) Effects of Ligand Binding on the Mechanical Properties of Ankyrin Repeat Protein Gankyrin. PLoS Comput Biol 9(1): e1002864. https://doi.org/10.1371/journal.pcbi.1002864
Editor: Shi-Jie Chen, University of Missouri, United States of America
Received: July 10, 2012; Accepted: November 11, 2012; Published: January 17, 2013
Copyright: © 2013 Settanni et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was funded by the Center for Computational Science of the University of Mainz and by the Max-Planck Graduate Center with the University of Mainz (GS), NIH grant R01-GM079563 (PEM), Medical Research Council of the UK (DS and LSI, including grant G1002329), Medical Research Foundation (LSI) and the Cambridge Gates Trust (DS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Tandem repeat proteins, also known as solenoid proteins, are a special class of proteins comprising tandem arrays of small structural motifs (20–40 residues) that pack in a roughly linear fashion to produce elongated, superhelical architectures, thereby presenting extended surfaces that act as scaffolds for molecular recognition. Examples include ankyrin, tetratricopeptide, HEAT and leucine rich repeats. Their structures are characterized by short-range interactions between residues either within a repeat or in adjacent repeats. As such they contrast with globular proteins, which are stabilized by many sequence-distant interactions that frequently result in complex topologies, and with polyproteins like titin, in which independently folded domains are covalently linked in tandem arrays but without significant non-covalent interactions between the individual domains. It is thought that the lack of sequence-distant contacts affords repeat proteins a high degree of flexibility and elasticity , , and atomic force microscopy (AFM) studies have identified certain unique properties that underlie this spring-like mechanical behavior –, . However, the relationship between spring and scaffold functions of repeat proteins is not understood and requires a determination of the mechanics of these proteins upon ligand binding.
Gankyrin is an oncoprotein that is overexpressed in hepatocellular carcinomas . It belongs to the ankyrin repeat family of proteins, which are involved in numerous protein-protein interactions and which have been postulated to be the spring elements in mechanotransduction . Each ankyrin repeat forms a β-turn followed by two antiparallel α-helices and a loop. Gankyrin binds the S6 ATPase subunit of the 19S regulatory particle of the 26S proteasome and it enhances the degradation of the tumour suppressors pRb and p53. The interaction of Gankyrin with S6 C-terminal domain (S6-C) is typical of repeat protein molecular recognition in that the whole length of Gankyrin is used to create an extended surface for binding  (Figure 1). All but the C-terminal ankyrin repeat of Gankyrin (repeat seven) make contacts with S6-C. The interaction involves complementary charged residues on the two proteins that form several positively and negatively charged patches along the elongated interface. The latter comprises residues from the β-turns and the N-terminal helices of repeats 1–6 of Gankyrin.
Gank is shown in grey and S6-C in blue. The residues of Gank with atoms closer than 5 Å to S6-C are shown in red.
An important process in mechanotransduction is mechano-switching. For example, force has the potential to partially unfold proteins, shutting off or triggering biochemical reactions by disrupting binding motifs or exposing cryptic binding sites. Force modulates a protein's free energy surface; a small force does not necessarily abolish the native minimum but may cause the breakage of non-covalent bonds or, conversely, activate catch bonds that bind more tightly with force . Likewise, binding can affect the mechanical response of proteins to external stress, as shown for Dihydrofolate Reductase , Im9  and protein G , . Here we use atomistic molecular dynamics simulations to compare the mechanics of Gankyrin in isolation and in complex with S6-C. The results show how the ligand affects every aspect of Gankyrin's behavior, including the order of repeat unfolding and the nature of partially unfolded intermediate states, the forces required and the work transferred to the system to unfold the whole protein and its parts. Gankyrin shows a unique behavior that may be prototypical of repeat proteins. In the absence of the S6-C ligand, there are many different mechanical unfolding pathways accessible to Gankyrin with a preference for a C- to N-terminus mechanism. In contrast, in the presence of S6-C the central repeats of Gankyrin, which make the most contact with the ligand, are stabilized and consequently the number of accessible unfolding pathways is reduced. Thus, the outermost repeats of Gankyrin unfold at low forces whereas the central repeats preserve their structure and binding interactions and only rupture at higher forces. The different force regimes under which Gank unfolds when bound to S6-C might have relevance in the mechanical control of protein-protein interactions in the cell and this behavior could be exploited in future to design new protein mechano-switches.
Materials and Methods
Molecular dynamics simulations
Molecular dynamics simulations were performed using the CHARMM program ,  with the charmm19 united atom force field. The effect of the aqueous environment was modeled with the fast analytical continuum treatment of solvation (FACTS) algorithm . An implicit solvent model was preferred to explicit ones because an atomistic representation of even a single hydration layer of an extended protein conformation would require an immense computational effort; moreover, implicit solvents relax instantaneously, which reduces artifacts when the protein is pulled fast. The previous application of this methodology to Gank  gave results that agreed with experimental measurements, which validates our choice of implicit solvation model. Initial atomistic models for Gank and Gank-S6-C complex were obtained from pdb structures 1qym  and 2dwz , respectively. The residues of Gank in both systems were renumbered starting from the first resolved residue, which is Cys4 (i.e., Cys 4 in the pdb file is named Cys 1 in the present manuscript, and the following residues are renamed accordingly). The number of resolved residues in 1qym is 223, whereas 226 residues of Gank were resolved in 2dwz. The three extra residues are at the C-terminus of Gank in the complex and they have been retained in the simulations. Langevin dynamics at 300 K with a friction coefficient of 1 ps−1 and an integration timestep of 2 fs was used. After 100 ps of equilibration at 300 K, the Gank constructs were subjected to steered molecular dynamics, whereby a spring with elastic constant set to 20 pN/Å was attached to the N-terminal main-chain nitrogen and to the C-terminal carbonyl carbon and its equilibrium distance was increased at constant velocity to simulate pulling. This resulted in an equal force being applied to both terminal atoms, along the vector joining them and in the direction of increasing distance. Pulling speeds of 0.05 Å/ps and 0.01 Å/ps were used. At pulling speed of 0.01 Å/ps, twelve independent simulations of isolated Gank and twelve of the Gank-S6-C complex were performed with different initial velocities starting from conformations sampled during equilibration. Each simulation was 80 ns long, which allowed for the complete stretching of Gank. Conformations were saved every 4 ps for analysis. At pulling speed of 0.05 Å/ps 24 and 16 simulations were performed for the Gank-S6-C complex and for the isolated Gank, respectively. Each simulation was 16 ns long and a snapshot was saved every 1 ps for analysis.
Identification of force peaks
The distribution of the force measured along the simulations at 0.01 Å/ps resembles a normal distribution with a long tail in the large force regions (Figure 2A, top). The force distribution for the isolated Gank is narrower and peaks at lower values than does the distribution for the complex Gank-S6-C. The main part of the distribution resembles a Gaussian, possibly due to the large number of force-field parameters contributing to it. Thus, this part of the distribution was fitted to a normal distribution and the deviation between the actual distribution and the fit was determined (Figure 2A bottom). Force peaks in the force-extension profiles (Figure 3) reaching values where the deviation is the largest were retained as unambiguous peaks (i.e. values larger than 200 pN and 250 pN for the Gank and the Gank-S6-C simulations, respectively). Lower force peaks are not easily separable from the noise and were therefore discarded. Similar quasi-normal distributions were obtained at pulling speeds of 0.05 Å/ps, however, the number of force peaks above the noise level was very small and the force peak analysis was therefore not carried out.
(A) Upper panel: Distribution of the force for isolated Gank (black solid line) and Gank-S6-C complex (red solid line). The peaks of the distributions were fitted to a Gaussian (dashed lines). Lower panel: Difference between the actual distribution and the fitted Gaussian, normalized using the error on the actual distributions. Peaks in the difference (at 200 pN and 250 pN, for isolated Gank and Gank-S6-C complex, respectively) were taken as thresholds to identify unambiguously peaks in the force-extension profiles. (B) Cartoon representation of the ankyrin repeats of Gank from N- to C-terminus are colored blue, green, red, cyan, magenta, yellow and dark grey, respectively. The end-to-end vector and its repeat components are indicated with black arrows.
Representative profiles are shown for isolated Gank (A)–(B) and for Gank-S6-C complex (C)–(D). The peaks were fitted to a WLC model with a persistence length of 0.38 nm (dashed lines). (E) Probability distribution of the force peaks on the extension-force plane of uncomplexed Gank (black) and Gank-S6-C complex (red). The distribution was obtained by a kernel density estimation analysis of the peak coordinates from all the sampled force-extension profiles, with a Gaussian bandwidth of 20 Å and 20 pN. The figure was prepared using Octave .
Measurement of work
The work transferred ,  by the external force to unfold the protein was measured by integrating along the simulations, where is the external force and is the change in the vector connecting the two atoms where the force is applied (the N and C termini of the protein). This procedure is equivalent to measuring the area under the force-extension profiles (Figure 3). Because the data were saved at discrete time points, operatively we defined the work transferred by the external force up to time as:(1)where the sum runs along the saved time frames of the simulation, is the elapsed simulation time at frame n, is the average force measured in two successive frames and is the change in the end-to-end vector (connecting the backbone C atom at the C-terminus of Gank to the N atom at the N terminus) between two successive frames. Contributions from the individual ankyrin repeats are readily obtained by decomposing the vector into its repeat components (see also Figure 2B):(2)where is the end-to-end vector of repeat r at frame i and is the corresponding change in two successive frames. The individual repeat contributions of the work are then:(3)A few examples of the behavior of in selected runs are shown in Figure S1. We would like to emphasize that, whereas the assignment of force peaks to particular repeat unfolding events may sometimes be problematic because of the broadness of the peaks and noisy nature of the force extension profiles, the repeat component of the work is in contrast a well-defined and robust quantity that we can use instead to report on the forces needed to unfold the repeats via the integration step in Eq. 3.
Monitoring repeat unfolding
The CαCβ-atom root-mean-square deviation (RMSD) of each ankyrin repeat of Gank from the native x-ray conformation was monitored along the simulations. Typically, the RMSD of a single repeat remains smaller than 5 Å for a certain time and then it displays an abrupt rise due to repeat unfolding, quickly reaching values of 20–25 Å when fully extended. Thus, in each simulation and for each repeat, the unfolding midpoint was set as the time point when the RMSD reached 10 Å. Different unfolding RMSD thresholds (from 7 Å to 15 Å) did not significantly change the average repeat unfolding time and the statistics of the repeat unfolding sequences at either unfolding speeds. On the 0.01 Å/ps pulling speed simulations a contact analysis has been performed. Pairs of residues defined a native contact when the distance between their Cα atoms was smaller than 8 Å in the native structure. Only those pairs more than three residues apart in the sequence were considered. The lifetime of a contact was defined as the last time the contact was observed along the simulations. The identification of non-native contacts is based on the concept of recurrence, defined in S.I. The definition of recurrent contacts translates broadly to those contacts that could be observed consistently in at least a continuous 0.4-ns-long stretch of trajectory. Non-native contacts are recurrent contacts that are not present in the native state.
Kernel density estimation
This technique is used to provide approximations to probability density distributions from a discrete sample of data points. Each point of the set is replaced by a Gaussian function with a fixed width. The total approximated probability density distribution for the data set is then the sum of all the Gaussian functions after normalization. The width of the Gaussian function is chosen on the basis of the expected error on the data values. The technique is used in place of a histogram of the data points, because it provides smooth distributions.
Monitoring the Gank-S6-C interaction
The residues of Gank involved in binding to S6-C were defined as those that had at least one atom at a distance of less than 5 Å from any S6-C atom. The native binding residues were defined as those that were involved in binding in more than 80% of the equilibration trajectory. The residue binding lifetime was defined as the last time residues were observed to be involved in binding along the simulations, averaged over the different runs.
Differences in the stepwise mechanical unfolding of uncomplexed and complexed Gankyrin
Gank (pdbid 1qym ) and S6-C-bound Gank (pdbid 2dwz ) were first equilibrated and then subjected to constant speed steered molecular dynamics simulations using the program CHARMM ,  with the united atom force field (param19) and FACTS  as model for implicit treatment of solvent (see “Materials and Methods” for further details about the simulations). During the pulling of Gank to the fully extended state the force exerted on the protein ends was recorded. In AFM experiments on polyproteins the peaks in the force-extension profiles generally correspond to the unfolding of single monomeric units ,  and the increase in contour length between peaks corresponds to the unfolding length of a monomer; for ankyrin-repeat proteins , , , the peaks in the force-extension profiles generally correspond to the unfolding of single repeats and the increase in contour length between peaks corresponds to the unfolding length of a single repeat (105–120 Å), although contour length extensions of 40–70 Å corresponding to the unfolding of half a repeat have also been observed in the cases of Gank , AnkyrinR , and NI6C ). In the present simulations, only the lower pulling speed produced force-extension profiles in which a sufficient number of clear force peaks could be identified above the noise level (see “Materials and Methods” for further details about identification of force peaks). Representative force-extension profiles at the low pulling speed are shown in Figure 3A–D. The regions of the force-extension profile preceding the peaks were fitted separately to a worm-like chain (WLC) model with fixed persistence length of 3.8 Å, as in our earlier work . The contour length and the breaking force for each peak were also measured. Representative snapshots from one of the simulation of the complexed Gank provide a pictorial description of the unfolding process (Figure 4).
Snapshots were taken corresponding to the indicated peaks in the force-extension profile. The starting structure is shown on the left. The first peaks at extensions below 300 Å, show small forces and correspond to the unfolding of ankyrin repeats 7 and 6. Peaks at extensions larger than 300 Å show larger forces and correspond to the unfolding of the repeats with extensive contacts to SC-6 (i.e. R1–5); central repeats 3 and 4 are the last to unfold in this simulation.
Analysis of the histogram of the contour length extensions shows that, for the uncomplexed Gank, the peaks are separated by distances that are multiples of L = 56.8 Å (Figure 5, top), corresponding to the unfolding length of half an ankyrin repeat , , ; a 2 L periodicity is also apparent. In the case of the Gank-S6-C complex, the 2 L periodicity is more evident and the peaks are most frequently separated by distances that are multiples of 2 L (Figure 5, bottom). Thus, the unfolding events in the uncomplexed Gank frequently involve half of a repeat (usually one of the two helices comprising the ankyrin motif), whereas in the Gank-S6-C complex the unfolding events more frequently involve a whole repeat. Our simulations are therefore in agreement with our previous AFM experiments on uncomplexed Gank.
Differences are computed between all possible pairs of peaks in the same simulation run (i.e., not only between adjacent peaks). Data for isolated Gank and for Gank-S6-C complex are shown in the upper and lower panels, respectively. Peaks in the histogram, corresponding to the most common changes in contour length after a force peak, show marked periodicity. Dashed lines are drawn to highlight the length L = 56.8 Å and 2 L of the periods observed in the data, 2 L corresponding to the contour length of one fully extended ankyrin repeat.
Higher force peaks are observed for complexed than for uncomplexed Gankyrin
As well as the differences in the step size of repeat unfolding described above, differences in the unfolding forces of uncomplexed versus complexed Gank, are also observed (Figure 3E). In uncomplexed Gank the force peaks are typically between 100 pN and 250 pN, and the majority of the peaks are observed at total extensions of less than 400 Å. In the case of the Gank-S6-C complex, the force peaks lie in the range of 200 pN and 400 pN and with total extensions frequently larger than 300 Å. The scarcity of force peaks at total extensions below 300 Å in the complex and above 400 Å in the uncomplexed Gank is not due to a lack of unfolding events in these ranges but rather it is due to the forces peaks lying below the noise level. Furthermore, in uncomplexed Gank, the force is roughly independent of contour length. In contrast, in the case of the Gank-S6-C complex those few peaks observed at total extensions below 300 Å, corresponding to the unfolding of repeats 7 and 6 (see below), have forces similar to those found in uncomplexed Gank, whereas peaks at total extensions above 300 Å show larger forces. Figure 3E shows that for both uncomplexed and complexed Gank the force peaks cluster at values of the extension that are separated approximately by the contour length of a single ankyrin repeat (∼110 Å), as discussed above), although the distribution is not sharp and intermediate values of the extension are also observed.
Larger work is transferred to complexed Gankyrin compared with uncomplexed Gankyrin
The work transferred by the external force to unfold Gank was measured along the simulations (see “Materials and Methods” for a detailed description of this calculation), together with the repeat components of this work (Figure 6). These calculations do not require the identification of the force peaks, and consequently they could be performed at both of the pulling speeds used here. At the pulling speed of 0.01 Å/ps, significantly more work was needed to completely stretch (i.e. after 64 ns of simulations) the complexed Gank (2100±100 kBT) than the uncomplexed Gank (1400±100 kBT). Similarly at 0.05 Å/ps pulling speed, the work transferred for complete stretching was 3200±200 kBT and 2300±100 kBT for complexed and uncomplexed Gank, respectively. These values are of the same order of magnitude as values measured experimentally for other biological macromolecules , . The larger work transferred at the higher pulling speed is expected due to the larger fraction of energy irreversibly dissipated in the faster process. At both pulling speeds, all of the repeats, with the exception of repeat 6, required significantly more work to unfold when Gank was bound to S6-C (t-test at 1%-significance level, Figure 6, and Table S1 and Table S2). Repeat 1 showed the largest change upon complexation, with about twice the work required for unfolding in the complex compared with isolated Gank (Table S1 and Table S2). The unexpected increase in the work required to unfold repeat 7, which does not directly contact S6-C, may be due to the larger length of this repeat (by 3 residues) in the complexed Gank construct compared with the uncomplexed Gank construct.
The sequence of unfolding events is different in complexed versus uncomplexed Gankyrin
The ankyrin repeats of Gank unfolded sequentially, one after the other, as judged by the RMSD from the native structure. Successive repeat unfolding events were evenly spaced in time for both uncomplexed and complexed Gank (Figure 7). The presence of S6-C bound to Gank introduced delays in unfolding, which were more pronounced for the later repeat unfolding events.
Distributions of successive repeat unfolding times (e1 corresponding to the first unfolding event) in isolated Gank (continuous lines) and Gank-S6-C complex (dashed line). Distributions are obtained via kernel density estimation analysis with Gaussian bandwidth of 1.6 ns.
The precise sequence of unfolding of the repeats carries important information about the distribution of mechanical stability across the repeat array. For isolated Gank the most common unfolding sequence was from the C-terminus to the N-terminus although several other unfolding pathways are also observed (Table 1 top and Figure S2 top). Such behavior is expected for repeat proteins because the sequence similarity between the repeats means that they all have similar stabilities, which introduces degeneracy into the unfolding mechanism. The presence of bound S6-C had a dramatic impact on the sequence of unfolding (Figure S2 bottom); for example, repeat 1 (r1) is rarely the last repeat to unfold in complexed Gank. Overall, the key feature of the different behavior is the narrower spectrum of unfolding positions available to repeats r1, r2, r5 and r6 in complexed Gank. A quantitative comparison of the unfolding sequences in the complexed versus uncomplexed Gank can be performed using contingency tables built by counting the number of simulation runs where repeat r1 has a certain position in the unfolding sequence in complexed versus uncomplexed Gank (Table 1). An exact Fisher's test on this contingency table shows that complexed Gank has a significantly different unfolding sequence compared with uncomplexed Gank (confidence level 0.006%). Similarly significant differences (maximum confidence level 2.3%) are obtained using repeats r2, r5 or r6. We built these contingency tables by pooling all the simulations at both pulling speeds as the same kind of test revealed that the unfolding sequences were not significantly dependent on the pulling speed. The change in the sequence of repeat unfolding events and the time delays in the later unfolding events results in an increase in the average unfolding time of repeats r1–r4 in the complex compared with isolated Gank (Tables S3 and S4). In summary, the data show that S6-C binding significantly modulates the forced unfolding of Gank by specifically stabilizing the ligand-contacting repeats, which results in prolonged lifetimes of these repeats and less diversity of unfolding pathways accessible to the protein.
A more detailed analysis of the force-induced unfolding process is obtained by monitoring the lifetime of native contacts in Gank (Figure 8). The lifetime of intra-repeat contacts (Figure 8, the α-helical contacts and the β-turn contacts within the black and the green ellipses, respectively) confirms the directional character of the unfolding process, which proceeds roughly from the C- to the N-terminus, and the larger resistance of the complexed Gank compared with the isolated Gank. A prominent difference between the two sets of simulations is the early loss of inter-repeat contacts between β-turn regions (within the blue ellipses in Figure 8) observed for repeat pairs r2–r3, r3–r4 and r4–r5 in isolated Gank and not in the Gank-S6-C complex. In contrast, inter-repeat helix-helix contacts (red ellipses) tend to be lost just before one of the two repeats of the pair unfolds.
The contact map is subdivided in six panels showing data for the pairs of adjacent repeats (repeat numbers are reported on the left and top side of the panels). The color shade of each contact indicates the time the contact was last consistently observed along the simulations, averaged over the runs (see Materials and Methods). Colored ellipses in the top right panel indicate relevant groups of contacts, which can then be located correspondingly in the other panels: intra-repeat helix contacts (black ellipses), inter-repeat helix contacts (red ellipses), intra-repeat β-turn contacts (green ellipses) and inter-repeat β-turn contacts (blue ellipses).
The non-native contacts observed in the simulations are mostly local in nature, i.e., occurring between residues close in sequence, and they are slightly more frequent in uncomplexed Gank than in the complex (Figure S3). A closer inspection of the data reveals that they involve the force-induced deformations of the repeat array, whereby one repeat rotates with respect to the adjacent repeat around the long axis of the protein.
Long-lasting interactions under mechanical stress map out the binding hot spots
The simulations of the Gank-S6-C complex show how specific features of the interaction between the two proteins are affected by the mechanical stress exerted on Gank. The binding surface in the unperturbed complex comprises residues from repeats r1–r6 of Gank. Most of the contacting residues are in the β-turn and the N-terminal α-helix of each repeat. The loss of contacts between the two proteins during the simulations occurs gradually and follows the sequence of unfolding of the repeats themselves (Figure 9A). The contacts that fail last are those involving residues in repeats r2, r3 and r4, more specifically the β-turn of repeat 4, the β-turn and N-terminal α-helix of repeat 3 and the N-terminal α-helix of repeat r2 (Figure 9B and C and Table 2). The large variability in the lifetime of binding contacts in repeats r2 and r4 is related to the variability in the observed unfolding sequence in the different simulation runs (Figure S2), whereby one of the two repeats is most often the last to unfold. Within repeats 4–6, the lifetimes of the binding contacts involving residues in the β-turn are slightly larger than those in the N-terminal α-helix (Figure 9A). Binding contacts involving residues in repeat 3 tend to survive the unfolding of the repeat, specifically residues Ile76 and Trp71 in the N-terminal α-helix of repeat 3. These interactions appear to have a non-specific hydrophobic character that does not require a perfectly structured repeat to be maintained.
A. Lifetime of binding contacts with S6-C for the residues of Gank (red error bars represent 1 standard error around average). Black and blue boxes represent 1 standard error around the average unfolding time of the corresponding repeats in Gank-S6-C complex and isolated Gank simulations, respectively, as measured using RMSD. Grey and green shadowed regions indicate the beta-turn and N-terminal helix regions, respectively. (B)–(C). Front and upper views of Gank (cartoon representation), colored according to the average time each residue took to lose contact with S6-C (glass surface representation, displayed to mark the position of the binding site in the native conformation). The residues that form long-lasting contacts with S6-C are shown in licorice. The figure was prepared using VMD .
Effects of ligand binding on the mechanical stability of repeat proteins
Ligand binding typically increases the thermodynamic stability of proteins, whereas mechanical stabilization upon ligand binding has been observed for some proteins , , , – but not for others , , . This distinction arises because mechanical unfolding as measured by force microscopy using a constant pulling speed is not an equilibrium process and therefore mechanical stability reflects the height of the energy barrier, i.e. the difference in stability between the native state and the mechanical transition state. Moreover, force-induced unfolding may often be very different from chemical-induced unfolding because of the fundamental difference between the two reaction coordinates and the different nature of the unfolded states in the two reactions. A ligand will enhance the mechanical stability of a protein only if it preferentially stabilizes the native state over the mechanical unfolding transition state, an approach that has been used to rationally tune mechanical stability . The situation is more complicated for repeat proteins, however, because mechanical unfolding proceeds in a stepwise manner and so there is a series of transition states rather than a single one. We show here that ligand binding to a repeat protein has a discrete effect on its mechanical stability that is localized to specific repeats and consequently it dramatically alters the protein's mechanical behavior. The bound ligand increases the resistance of Gank to mechanical stress by delaying its complete unfolding and by requiring larger forces and consequently a larger energy to unfold it. We speculate that the ligand offers a supplemental support frame (or “sink”) to the protein that helps it to withstand the external force by distributing the stress over a larger molecular volume.
The high pulling speeds used in simulations have been shown in some cases to mask the differences between two systems under comparison, due to the high noise level . However, the differences we observed between complexed and uncomplexd Gank are insensitive to changes in the pulling speed and they are significant even at the high pulling speeds of the simulations; we predict therefore that significant differences should be observed experimentally in force-extension profiles measured by AFM. Indeed, our simulations predict up to two-fold difference in the forces observed for complexed versus uncomplexed Gank independent of pulling speed. And since the unfolding force peaks of isolated ankyrins are experimentally detectable , , , so should be the detection of higher force peaks for upon complexation.
Fewer unfolding pathways are accessible to repeat proteins upon ligand binding
We found previously, and confirmed in the present study, that when subjected to an external pulling force the predominant unfolding pathway of Gank is from the C-terminus to the N-terminus , although a variety of other unfolding pathways are also accessible. This behavior is profoundly altered in the presence of the ligand. One consequence of ligand binding is that the N-terminal repeat (r1) rarely unfolds last, which also explains the large change in the r1 component of transferred work: indeed, unfolding of a repeat with a folded neighbor (r2 in this case) requires extra energy due to the inter-repeat interactions. More generally, we observe that ligand binding greatly reduces the number of unfolding routes accessible to Gank, and this can be rationalized as follows: The sequence similarity between the repeats means that they all have similar stabilities and therefore unfolding can proceed through many different pathways. Indeed multiple pathways have also been observed in chemical-induced unfolding of repeat proteins (e.g. ); Ligand binding stabilizes repeats r2–r5 (the mechanical hot spots), making the stability distribution across the repeat array more uneven and resulting in energetic bias and less diversity of unfolding pathways.
Cooperativity in repeat-protein mechanical unfolding changes upon ligand binding
The bound ligand induced a more regular and cooperative mechanical unfolding behavior at the repeat level, in the sense that isolated Gank showed a wide variety of stretching patterns involving both partially unfolded repeats, whereby only one of the two helices is unfolded, and the unfolding of more than one repeat at a time (Figure 5, top panel); in contrast, the S6-C-bound Gank tended to unfold more “sharply” repeat by repeat (Figure 5, lower panel), similar to the consensus ankyrin repeat protein NI3C , . S6-C binding has the effect of reducing the importance of the cross talk between repeats by compensating for the loss of inter-repeat stability upon unfolding with the stabilization of each individual repeat involved in binding (particularly the β-turns regions as shown in Figure 8).
Mapping binding hot spots in tandem repeat proteins
The simulations helped to identify those residues of Gank whose contacts with S6-C are the most resilient to pulling (Figure 9B, C and Table 2). These residues are Arg38, Trp43, Ser46, Ala47, Asp67, Asp68, Ala69, Trp71, Ile76, Ser79, Asn100, Gln101, Asn102, Cys104 (according to our numbering scheme). We can compare these mechanical hot spots with the thermodynamic hot spots investigated by pull-down experiments . Pull-downs were used to test mutations at positions Arg38, Lys113, Asp36, Asp68 and Glu159 (according to our numbering scheme), but only mutation of Arg38 was found to disrupt S6-C binding sufficiently to prevent pull down. The other residues did not appear to make as big a contribution to the interaction, which agrees with our finding that they do not have a large contact lifetime with S6C in our simulations. Contrary to the pull-down results, we found that Asp68 had a high contact lifetime. This may be a consequence of the fact that both neighboring residues Ala69 and Asp67 have large contact lifetimes, rather than its role in binding.
Implications of ligand binding for mechanical signal transduction in repeat proteins
The mechanical unfolding of repeat proteins is characterized by relatively small force peaks; therefore, the force at rupture is determined with a much larger relative error than in less compliant systems, making it more difficult to detect a trend in the height of the force peak as a function of the unfolding event as predicted by . Nevertheless, the simulations do clearly show two distinct force regimes of complexed Gank: up to 300 Å total extension, corresponding most frequently to the unfolding of repeats 7 and 6, the forces were similar to those observed in uncomplexed Gank. Above 300 Å total extension, corresponding most frequently to the unfolding of repeats 1–5, the unfolding forces were much higher than those at lower extensions. This property of repeat proteins might have an important functional role in mechanotransduction, by preventing the complete dissociation of a protein complex below a certain force threshold. A mechano-switch can be thought of as a switch between two shapes of the free energy that rules thermodynamics, kinetics and the modulation of the latter with a force. Repeat proteins composed of units with unfolded contour lengths ranging between ∼100–150 Å, such as ankyrin repeat proteins and HEAT repeat proteins, could function as mechano-switches because their repeats are progressively unfolded only with increasing forces , thereby allowing mechanical stress to modulate the protein structure and its binding properties. Moreover, the elastic behavior of the individual repeats  could contribute to the stability of repeat protein complexes by allowing refolding of individual repeats before complete dissociation of the protein from its ligand. The different force regimes under which Gank unfolds when bound to S6-C might have relevance in the mechanical control of protein-protein interactions in the cell and this behavior could be exploited in future to design new protein mechano-switches.
The characteristic stepwise mechanical unfolding of repeat proteins is at the very heart of their compliance and an essential component of their elastic behavior , . These properties implicate repeat proteins in mechanical signal transduction. For example: repeat-protein stretching and contraction motions to regulate the activity of a bound enzyme ; ankyrin nanosprings to operate the gating of ion channels in hair cells ; HEAT-repeat carrier proteins that wrap around their cargoes to transport them between the cytoplasm and the nucleus . The sensitivity of repeat-protein mechanics to ligand binding, as revealed in the present work, suggests the possibility of a further regulatory mechanism whereby elasticity is dictated by ligand binding.
Work transferred to the system by the pulling force. Representative examples of the time dependence of the repeat components (, see Materials and Methods) of the work transferred by the pulling force from simulations of isolated Gank (two top panels) and Gank-S6-C complex (two lower panels) at 0.01 Å/ps pulling speed.
Unfolding time of each ankyrin repeat of Gank in the various simulation runs at 0.01 Å/ps pulling speed. Data for isolated Gank and the Gank-S6-C complex are shown in the upper and lower boxes, respectively. Data from the different repeats are shown in different colors. The runs are ordered on the x-axis so that similar repeat unfolding sequences are adjacent. The lines connecting the points are drawn only as a guide for the eye. Isolated Gank unfolds most frequently in a sequential manner from the C- to the N-terminal repeat (right part of top box). For Gank in the complex, in contrast, the last repeats to unfold are frequently repeats 2, 3 and 4.
Contact map showing all of the observed recurrent contacts between Cα atoms (see Materials and Methods). Data for isolated Gank and the Gank-S6-C complex at 0.01 Å/ps pulling speed are reported in the lower and upper diagonals, respectively. The color of each contact reports the fraction of the simulation runs where it was observed. Thus, native contacts (the most frequent) appear usually in dark blue. A larger fraction of non-native contacts are observed in isolated Gank between R1 and R2 and between R2 and R3, than those observed in the complex.
Comparison of average repeat components of transferred work (in KbT) at 0.01 Å/ps pulling speed for uncomplexed and complexed Gank.
Comparison of average repeat components of transferred work (in KbT) at 0.05 Å/ps pulling speed for uncomplexed and complexed Gank.
Comparison of average repeat unfolding times (in ns) at 0.01 Å/ps pulling speed for uncomplexed and complexed Gank.
Comparison of average repeat unfolding times (in ns) at 0.05 Å/ps pulling speed for uncomplexed and complexed Gank.
Conceived and designed the experiments: GS DS PEM EP LSI. Performed the experiments: GS. Analyzed the data: GS DS EP LSI. Contributed reagents/materials/analysis tools: GS DS EP. Wrote the paper: GS DS EP LSI.
- 1. Kobe B, Kajava AV (2000) When protein folding is simplified to protein coiling: the continuum of solenoid protein structures. Trends Biochem Sci 25: 509–515.
- 2. Kappel C, Zachariae U, Dolker N, Grubmuller H (2010) An unusual hydrophobic core confers extreme flexibility to HEAT repeat proteins. Biophysical Journal 99: 1596–1603.
- 3. Lee G, Abdi K, Jiang Y, Michaely P, Bennett V, et al. (2006) Nanospring behaviour of ankyrin repeats. Nature 440: 246–249.
- 4. Kim M, Abdi K, Lee G, Rabbi M, Lee W, et al. (2010) Fast and forceful refolding of stretched alpha-helical solenoid proteins. Biophys J 98: 3086–3092.
- 5. Lee W, Zeng XC, Zhou HX, Bennett V, Yang WT, et al. (2010) Full Reconstruction of a Vectorial Protein Folding Pathway by Atomic Force Microscopy and Molecular Dynamics Simulations. J Biol Chem 285: 38167–38172.
- 6. Serquera D, Lee W, Settanni G, Marszalek PE, Paci E, et al. (2010) Mechanical Unfolding of an Ankyrin Repeat Protein. Biophys J 98: 1294–1301.
- 7. Higashitsuji H, Itoh K, Nagao T, Dawson S, Nonoguchi K, et al. (2000) Reduced stability of retinoblastoma protein by gankyrin, an oncogenic ankyrin-repeat protein overexpressed in hepatomas. Nat Med 6: 96–99.
- 8. Howard J, Bechstedt S (2004) Hypothesis: a helix of ankyrin repeats of the NOMPC-TRP ion channel is the gating spring of mechanoreceptors. Curr Biol 14: R224–226.
- 9. Nakamura Y, Nakano K, Umehara T, Kimura M, Hayashizaki Y, et al. (2007) Structure of the oncoprotein gankyrin in complex with S6 ATPase of the 26S proteasome. Structure 15: 179–189.
- 10. Vogel V, Sheetz MP (2009) Cell fate regulation by coupling mechanical cycles to biochemical signaling pathways. Curr Opin Cell Biol 21: 38–46.
- 11. Ainavarapu SR, Li L, Badilla CL, Fernandez JM (2005) Ligand binding modulates the mechanical stability of dihydrofolate reductase. Biophys J 89: 3337–3344.
- 12. Hann E, Kirkpatrick N, Kleanthous C, Smith DA, Radford SE, et al. (2007) The effect of protein complexation on the mechanical stability of Im9. Biophys J 92: L79–81.
- 13. Cao Y, Balamurali MM, Sharma D, Li H (2007) A functional single-molecule binding assay via force spectroscopy. Proc Natl Acad Sci U S A 104: 15677–15681.
- 14. Cao Y, Yoo T, Li H (2008) Single molecule force spectroscopy reveals engineered metal chelation is a general approach to enhance mechanical stability of proteins. Proc Natl Acad Sci U S A 105: 11152–11157.
- 15. Brooks BR, Brooks CL 3rd, Mackerell AD Jr, Nilsson L, Petrella RJ, et al. (2009) CHARMM: the biomolecular simulation program. J Comput Chem 30: 1545–1614.
- 16. Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, et al. (1983) Charmm - a Program for Macromolecular Energy, Minimization, and Dynamics Calculations. J Comput Chem 4: 187–217.
- 17. Haberthur U, Caflisch A (2008) FACTS: Fast analytical continuum treatment of solvation. J Comput Chem 29: 701–715.
- 18. Manjasetty BA, Quedenau C, Sievert V, Bussow K, Niesen F, et al. (2004) X-ray structure of human gankyrin, the product of a gene linked to hepatocellular carcinoma. Proteins 55: 214–217.
- 19. Hummer G, Szabo A (2001) Free energy reconstruction from nonequilibrium single-molecule pulling experiments. Proc Natl Acad Sci U S A 98: 3658–3661.
- 20. Schurr JM, Fujimoto BS (2003) Equalities for the nonequilibrium work transferred from an external potential to a molecular system. Analysis of single-molecule extension experiment. J Phys Chem B 107: 14007–14019.
- 21. Rief M, Gautel M, Oesterhelt F, Fernandez JM, Gaub HE (1997) Reversible unfolding of individual titin immunoglobulin domains by AFM. Science 276: 1109–1112.
- 22. Carrion-Vazquez M, Oberhauser AF, Fowler SB, Marszalek PE, Broedel SE, et al. (1999) Mechanical and chemical unfolding of a single protein: A comparison. Proc Natl Acad Sci U S A 96: 3694–3699.
- 23. Li L, Wetzel S, Pluckthun A, Fernandez JM (2006) Stepwise unfolding of ankyrin repeats in a single protein revealed by atomic force microscopy. Biophys J 90: L30–32.
- 24. Mossa A, de Lorenzo S, Huguet JM, Ritort F (2009) Measurement of work in single-molecule pulling experiments. J Chem Phys 130: 234116.
- 25. Gupta AN, Vincent A, Neupane K, Yu H, Wang F, et al. (2011) Experimental validation of free-energy-landscape reconstruction from non-equilibrium single-molecule force spectroscopy measurements. Nat Phys 7: 631–634.
- 26. Bertz M, Rief M (2009) Ligand binding mechanics of maltose binding protein. J Mol Biol 393: 1097–1105.
- 27. Bertz M, Wilmanns M, Rief M (2009) The titin-telethonin complex is a directed, superstable molecular bond in the muscle Z-disk. Proc Natl Acad Sci U S A 106: 13307–133310.
- 28. Wang CC, Tsong TY, Hsu YH, Marszalek PE (2011) Inhibitor Binding Increases the Mechanical Stability of Staphylococcal Nuclease. Biophys J 100: 1094–1099.
- 29. Junker JP, Hell K, Schlierf M, Neupert W, Rief M (2005) Influence of substrate binding on the mechanical stability of mouse dihydrofolate reductase. Biophys J 89: L46–48.
- 30. Bertz M, Rief M (2008) Mechanical unfoldons as building blocks of maltose-binding protein. J Mol Biol 378: 447–458.
- 31. Lee W, Strumpfer J, Bennett V, Schulten K, Marszalek PE (2012) Mutation of conserved histidines alters tertiary structure and nanomechanics of consensus ankyrin repeats. Journal of Biological Chemistry 287: 19115–19121.
- 32. Lowe AR, Itzhaki LS (2007) Rational redesign of the folding pathway of a modular protein. Proceedings of the National Academy of Sciences of the United States of America 104: 2679–2684.
- 33. Zinober RC, Brockwell DJ, Beddard GS, Blake AW, Olmsted PD, et al. (2002) Mechanically unfolding proteins: The effect of unfolding history and the supramolecular scaffold. Protein Sci 11: 2759–2765.
- 34. Grinthal A, Adamovic I, Weiner B, Karplus M, Kleckner N (2010) PR65, the HEAT-repeat scaffold of phosphatase PP2A, is an elastic connector that links force and catalysis. Proceedings of the National Academy of Sciences of the United States of America 107: 2467–2472.
- 35. Sotomayor M, Corey DP, Schulten K (2005) In search of the hair-cell gating spring elastic properties of ankyrin and cadherin repeats. Structure 13: 669–682.
- 36. Stewart M (2007) Molecular mechanism of the nuclear protein import cycle. Nat Rev Mol Cell Biol 8: 195–208.
- 37. Eaton JW et al.. (2009) GNU/Octave, version 3.2.4. Available: http://www.gnu.org/software/octave/. Accessed 12 May 2012.
- 38. Humphrey W, Dalke A, Schulten K (1996) VMD: visual molecular dynamics. J Mol Graph 14: 33–38, 27–38.