Skip to main content
Advertisement
  • Loading metrics

Topological metrics as evolutionary and dynamical descriptors of conformational landscapes within protein families

  • Nikhil Ramesh,

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Writing – original draft, Writing – review & editing

    Affiliation Department of Physics and Center for Biological Systems, Arizona State University, Tempe, Arizona, United States of America

  • S. Banu Ozkan ,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    Banu.Ozkan@asu.edu (SBO); Eleni.Panagiotou@asu.edu (EP)

    Affiliation Department of Physics and Center for Biological Systems, Arizona State University, Tempe, Arizona, United States of America

  • Eleni Panagiotou

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    Banu.Ozkan@asu.edu (SBO); Eleni.Panagiotou@asu.edu (EP)

    Affiliation School of Mathematical and Statistical Sciences, Arizona State University, Tempe, Arizona, United States of America

?

This is an uncorrected proof.

Abstract

Identifying the key order parameters that connect a protein‘s native structure to its dynamical and evolutionary behavior remains a central challenge. We introduce topological and geometrical metrics—specifically, writhe and Local Topological Energy (LTE)—to investigate these connections. Applying these tools to both present-day and ancestral forms of thioredoxin and β-lactamase, we show that LTE strongly correlates with established dynamical measures such as the Dynamical Flexibility Index (DFI). Remarkably, LTE distributions also track the evolutionary trajectories of these proteins, suggesting that the topological geometry of the native state encodes key aspects of both dynamics and evolution. Through molecular dynamics simulations, we further reveal critical shifts in the topological landscape of proteins, providing a molecular mechanism by which functional evolution proceeds via alterations in conformational dynamics. Extending our analysis to over 100 proteins, we provide the first compelling evidence that topological descriptors derived from static structures can reliably predict dynamical behavior. In general, our findings demonstrate that simple geometrical metrics capture essential features of protein conformational landscapes, offering a powerful new approach to bridging protein structure, dynamics, and evolution.

Author summary

Advances in protein science and artificial intelligence have resulted in a wealth of protein structures, yet a fundamental challenge remains: how does a protein‘s native structure encode its dynamical behavior? Addressing this question is crucial for understanding protein function and advancing protein engineering, particularly for designing nanomachines. We introduce a novel approach, rooted in knot theory, to bridge this gap. We demonstrate that topological metrics, even when applied to static protein structures, reveal key aspects of protein dynamics. This discovery provides a rigorous and quantitative framework for predicting protein flexibility and evolutionary adaptation directly from structure. Our approach has broad implications, from improving AI-based protein modeling to anticipating functional outcomes of mutations, with applications in protein engineering, biotechnology and medicine.

Introduction

The complex dynamics of proteins arise from a vast number of intra- and inter-molecular interactions and degrees of freedom, which are encoded, to an extent, by their structure. Molecular simulations show that structure-encoded dynamics indeed govern function [16]. Methods from mathematical topology offer a novel rigorous framework upon which protein structure complexity can be quantified in a way that is amenable to statistical and data analyses methods [716]. However, due to the mathematical nature of these methods, their physical relevance, beyond classification of native states, has been elusive. In this manuscript we show that the mathematical topology and geometry of the native state of proteins reflects aspects of protein evolution of function and dynamics.

Several mathematical approaches aim at characterizing and quantifying native 3-D structure of proteins by examining the relative positions of the constituent atoms [710,17,18]. Knot theory offers a different approach to understanding protein structure that accounts simultaneously for both relative positions of amino acids and their connectivity at all lengthscales [1928]. With recent advances in topology and knot theory, we can now study all proteins beyond knotted proteins [1216,20,22,2932].

It has been challenging to establish a direct connection between the topological or geometrical complexity of a protein‘s static native structure and the broader topological and dynamical landscape of its conformational ensemble. This difficulty arises due to the lack of a well-defined order parameter or mathematical metric that can both compare static structures, and also accurately characterize the three-dimensional structural variability of those within the ensemble. Addressing this gap is particularly important as the traditional structure–function paradigm of protein activity is increasingly replaced by an ensemble-based model [1,3338]. In the ensemble model, the native state of a protein is not a single static structure but rather a collection of interconverting conformations that the protein explores [23,39]. The protein‘s function is governed by the dynamics of sampling these conformations, rather than being dictated by a single dominant structure [40,41]. This framework aligns well with protein evolution, as it explains how new functions can emerge or existing functions can be modulated for adaptation to new environments, all while maintaining the overall structural framework of the protein [4250]. Computational methods provide a powerful means to assess functional changes and allosteric mechanisms that are inherently favored by the native ensemble [13,3336,51]. One such approach, the dynamic flexibility index (DFI), offers a quantitative measure of these evolutionary mechanisms by linking structural dynamics to functional adaptation. This is achieved by applying linear response theory to quantify the response of each residue to external force perturbations. DFI has been successfully applied in various contexts, including establishing a broad relationship between the flexibility of individual amino acids and their evolutionary conservation [1,2,3335,42,5254]. This computational framework complements the ensemble model of protein dynamics by capturing how variations in flexibility profiles contribute to functional diversity while maintaining the overall native fold [55].

Building on the success of metrics like the dynamic flexibility index (DFI) in linking structural dynamics to functional evolution, we aim to develop topology-based metrics to assess functional evolution in proteins. One of the most intuitive mathematical measures of topological and geometrical complexity in proteins is the Gauss linking integral, which can be applied both to static X-ray crystallographic structures and to conformations sampled from molecular dynamics simulations. The Gauss linking integral is a measure of the number of times two curves wind around each other. When applied to one curve, it is called the writhe and it measures the amount of interwinding a curve does around itself. It is also often useful to compute the intercrossing number and average crossing number, which are coarser versions of these metrics. Previous studies have used the Gauss linking integral to quantify backbone entanglement by analyzing the entire protein backbone, examining the writhe of specific regions, or assessing linking between different structural elements [1216,29,30,56,57]. These entanglement-based metrics, including the linking integral and other novel topological descriptors [58], offer significant advantages for studying protein conformational ensembles. As real-valued continuous functions of a protein‘s three-dimensional coordinates, they provide a mathematically rigorous framework for characterizing topological and geometrical features across different conformations. Prior research has demonstrated that nearly all proteins exhibit non-trivial topological and geometrical signatures, which are not only computationally identifiable but also physically meaningful, as they correlate with protein folding rates (at least for some datasets) [12,15,16,29,30,59,60]. By examining local conformations within structures deposited in the protein data bank, a measure known as local topological free energy was introduced [13]. This measure captures the rarity of specific local conformations in the folded state ensemble and has been shown to identify key mutation sites in proteins across diverse contexts [14,56,57]. Indeed, it has been shown that similar approaches integrated into machine learning has been successful for drug discovery [32].Thus, these topology-based metrics complement existing dynamical approaches like DFI, providing a new perspective on how structural complexity influences protein function and evolution. We stress that the LTE is not a physical energy, but a purely mathematical quantity that is normalized as an energy that depicts mathematically the conformational rarity of a local structure in the protein landscape. In this study, we employ mathematical topology and geometry to investigate how the evolution of the protein conformational ensemble reflects its dynamical evolution. Specifically, we apply the Gauss linking integral to both ancestral and extant native X-ray structures and their conformations sampled from molecular dynamics simulations, mapping these properties onto the evolutionary trees of thioredoxin and β-lactamase.

Thioredoxins are ubiquitous oxidoreductase enzymes found across all domains of life, from archaea to humans [61,62]. Over evolutionary time, they have adapted to environmental changes—such as variations in temperature and ocean acidity—by evolving toward reduced stability and decreased activity. In contrast, β–lactamases are bacterial enzymes that confer antibiotic resistance by hydrolyzing β–lactam antibiotics, including penicillin and cefotaxime (CTX), thereby inactivating them. The evolutionary trajectory of β–lactamases shows a transition from a generalist enzyme capable of degrading multiple antibiotics with moderate efficiency to a more specialized enzyme with enhanced activity against penicillins [34]. Our results indicate that, although the tertiary structure of ancestral and extant proteins remains largely conserved throughout evolution, the local topological free energy (LTE) effectively captures subtle local conformational changes. In the case of thioredoxins, LTE consistently decreases over evolutionary time, reflecting gradual structural adaptations. Notably, the LTE of static X-ray structures alone is sufficient to reconstruct the evolutionary tree of thioredoxins, highlighting its potential as a topology-based metric for evolutionary analysis.

By comparing LTE with dynamic flexibility index (DFI) profiles derived from conformational dynamics simulations, we observe a strong correlation between the LTE of static structures and their corresponding DFI profiles for both thioredoxins and β-lactamases. We also found this correlation to persist with a larger dataset of 100 proteins (S1 Data). This finding reinforces the connection between local geometric features of the native state and the broader dynamical landscape of proteins. Additionally, by leveraging topology and geometry, we examine the evolution of the topological landscape in molecular simulations. Our results reveal that, despite the overall conservation of tertiary structure, the topological landscape of ancestral and extant proteins differs, suggesting that topological features from knot theory (even in the absence of knotting) offer a distinct perspective on protein evolution and dynamics.

Materials and methods

In this section we present the methods employed in this work to analyze the structure of proteins and their dynamics. More precisely, first, we present the definition of the Dynamic Flexibility Index, which examines the response of a protein under perturbation. Second, we present the metrics of topological structural complexity employed in this work, which are based on the Gauss linking integral.

DFI

In this work, to study the dynamics of proteins we used a metric known as the dynamic flexibility index (DFI) [1,52]. The DFI captures the dynamical behavior of proteins by calculating the responses of each residue to perturbations. To calculate these responses DFI utilizes linear response theory (LRT) [2,63,64]. The underlying model for the protein used in DFI calculations is either taken to be an elastic network model (ENM) [65] or is obtained from covariance matrices from all-atom molecular dynamics simulations. The response is calculated from LRT using Eq 1. To ensure an isotropic response this response is averaged over multiple directions.

(1)

Here N refers to the number of residues in the protein, H is the Hessian matrix and it can be computed directly from the atomic coordinates. If the DFI is calculated using ENM, then this Hessian matrix is obtained from the second derivatives of the harmonic potential with respect to the cartesian coordinates. The hessian can also be extracted directly from all-atom molecular dynamics simulation as the inverse of the covariance matrix.

Using Eq 1 we can perturb each residue sequentially to obtain a perturbation response matrix A, given by,

(2)

where is the response at residue number i due to a perturbation at residue j. We can then define the DFI at position i to be,

(3)

The DFI quantifies the relative response to perturbations observed at any position. Residues with low DFI do not exhibit large fluctuations to perturbations and hence are more dynamically stable. These positions also tend to be evolutionarily conserved as they typically play a crucial role in the dynamics of these molecules [1].

Topological parameters

A measure of conformational complexity of two curves in 3-space is the Gauss linking integral [66], which is defined as follows: for two disjoint oriented curves and with arc-length parametrizations and respectively, the linking integral, Lk, is the double integral over and :

(4)

where denotes the derivative of γ and where the integral runs over , which denotes all .

The Gauss linking integral is a measure of the number of times two curves wind around each other and can have both positive and negative values depending on orientations of the curves. When the two curves are closed (thus form a topological link), the Gauss linking integral is an integer-valued topological invariant. When the curves are open, it is a real-valued continuous function of the curve coordinates. The linking integral may be non-zero even for curves that do not visibly interwind. In those cases, it captures aspects of their relative positions related to their orientations and vicinity, which can be interpreted as topological interactions or a potential for interwinding. The Gauss linking integral can be expressed in terms of properties of link diagrams. An oriented link diagram is a projection of a pair of oriented curves to a plane, where double points keep the information of over/under and each crossing is labeled as a positive crossing ( or a negative crossing () based on the relative orientations (see S1 Fig). A positive crossing and a negative crossing are also referred to as a right-handed crossing and a left-handed crossing, respectively. The linking integral is then the average of half the algebraic sum of signs of all crossings in a projection of two curves over all possible projection directions. For polygonal curves, the linking integral can be expressed as a finite sum of signed geometric probabilities that two edges cross in any projection direction.

When taking the absolute value in the integrand, we obtain the intercrossing number (ICN) between two curves, which measures the average number of crossings seen between two curves in any projection direction. Namely, for two disjoint oriented curves and with arc-length parametrizations and respectively, the intercrossing number, ICN, is the double integral over and :

(5)

where denotes the derivative of γ and where the integral runs over , which denotes all .

The ICN is not a topological invariant, even for topological links and is more sensitive on the geometry of the conformations. It can be an advantageous metric when interested in capturing subtle geometric entanglement in complex systems.

The Gauss linking integral can also measure the degree at which a curve interwinds around itself. When applied to one curve, the Gauss linking integral is called the writhe of the curve; for an oriented curve l with an arc-length parametrization γ, the writhe, Wr, is the double integral over l:

(6)

where denotes the derivative of γ and where the integral runs over , which denotes all such that .

The writhe is not a topological invariant, even when the curve is closed (thus forms a knot). It is real valued continuous function of the curve coordinates that measures the number of times a curve winds around itself. Even though a high absolute writhe value may indicate topological complexity, the writhe is sensitive to local geometrical entanglement. The writhe can also be expressed as the average algebraic sum of signs of all crossings in a projection of a curve with itself over all possible projection directions.

The writhe, linking number, intercrossing number can be computed exactly, avoiding numerical integration, using the algorithm described in [67,68].

An important property of the Gauss linking integral is that it can be applied to polygonal curves of any length to characterize 3-dimensional conformations at different length scales. In this manuscript we represent proteins by their Cα atoms and compute the writhe of entire proteins, as well as that of parts of the protein at the length scale of 4 residues, which we call the local writhe. We assume an orientation of the resulting polygonal curves implied by their sequence. See Fig 1A for an example of local writhe distribution along a protein. We also apply the ICN to examine the relative conformations of parts of a protein. In particular, we apply this to proteins to examine the relative conformations of two helices with respect to the rest of the protein.

thumbnail
Fig 1. Local writhe and LTE of a protein.

(A) shows the local writhe computed from the positions of four consecutive Cα atoms along the chain of the human thioredixin (PDB ID: 1ERU). The local writhe values can be positive and negative, highlighted in blue and red, respectively. Examples of local conformations of the Cα backbone corresponding to high positive and negative writhe highlighted. The local Writhe can take values between -0.5 and 0.5. High in absolute value writhe indicates tighter local conformations, while a value of zero indicates a straight conformation. The LTE along the chain for the same protein is shown in (B). The variation of LTE with writhe as computed from a culled PDB sample dataset is shown in (C). The LTE captures the rarity of local writhe value in the the PDB, with high LTE values indicative of rare local conformations.

https://doi.org/10.1371/journal.pcbi.1013985.g001

The local topological free energy in writhe.

The local topological free energy, originally defined in [13], can be used to assign a measure of rarity of local conformations along a protein backbone, relative to local conformations in the PDB. To do this in practice, we use a curated subset of the crystal structures provided in the PDB. Namely, we use the dataset of unbiased, high-quality 3-dimensional structures with less than 60% homology identity from [69]. Then for each residue of a given protein we compare its local writhe value to those of the ensemble and a free energy is assigned to the residue based on the population of that value in the ensemble. More precisely, let denote the maximum occurrence value for the local writhe in the folded ensemble. To any value p of local writhe, we associate a purely topological/geometrical free energy, the local topological free energy (LTE):

(7)

where are the number of occurences of each value.

Fig 1B shows the LTE distribution as a function of local writhe values in the PDB sample used in this study.

The LTE has been used to successfully predict sites of interest in proteins in different contexts, such as mutation sites that may impact the conformational landscape of proteins and binding sites [13,14,56,57].

A summary of the topological/geometrical metrics and abbreviations used in this manuscript is given in Table 1 and further examples are given in the SI.

thumbnail
Table 1. The topological/geometrical metrics used in this manuscript with their abbreviation and specific usage herein.

https://doi.org/10.1371/journal.pcbi.1013985.t001

Results

The evolution of the local topological landscape of proteins

Numerous studies on ancestrally resurrected proteins have demonstrated that, while protein function adapts and evolves over approximately three billion years of evolution, the three-dimensional structure remains largely conserved [1,42]. Indeed, the root-mean-square deviation (RMSD) between ancestral proteins and their corresponding extant homologs is typically less than 1 Å, indicating that these structural metrics alone do not provide meaningful insights into functional evolution. Therefore, applying our topological metrics based on knot theory to ancestral protein structures and their modern counterparts presents a promising framework for assessing functional divergence. In this study, we analyzed the functional evolution of two distinct enzyme families: β-lactamases, which evolved from substrate generalists to substrate-specific enzymes, and thioredoxins, which adapted to low-temperature and high-pH environments.

Fig 2A presents the evolutionary tree of thioredoxin, illustrating the divergence of ancestral lineages leading to modern homologs in E. coli and humans. The reconstructed ancestral proteins (LBCA, LPBCA, LGPCA, AECA, LECA, LAFCA) and modern homologs (E. coli and human) retain a conserved fold despite evolutionary divergence. The LTE profiles, depicted in Fig 2B highlight variations in LTE values across ancestral and extant structures, in particular the regions of α-3 and α-4 helices contributing stability and folding of the thioredoxins. Similarly, we also observed a difference in LTE values between ancestral β-lactamase (GNCA) and its modern homolog (panel A of S2 Fig). When we cluster the LTE profiles of both ancestral and extant thioredoxins in both lineages using principal component analysis, ancestral and extant homologs within each evolutionary branch tend to cluster together (Fig 2C). The clustering of the LTE profiles was performed using Ward‘s linkage method. We retained the top seven eigenvalues for this analysis because they account for over 90% of the variance in the data. Clustering of LTE values of ancestral β-lactamases with extant homolog yielded similar results, clustering ancestor gram positive and gram negative common ancestor (PNCA) and gram negative common ancestor (GNCA) β-lactamases together (panel B of S2 Fig). This clustering suggests a conservation of local energetic landscapes along evolutionary trajectories. To further investigate, we plot the average LTE of each homolog against its estimated time of emergence (Fig 2D). A positive correlation (R = 0.69) reveals that despite the high structural similarity between ancestral and extant thioredoxins, topology/geometry-based structural metric, (LTE) correlates with evolutionary time, effectively capturing functional evolution. This pattern is further reinforced in Fig 2E, which shows a strong correlation (R = 0.71) between LTE and the melting temperature (). Overall, these results suggest that protein evolution favors local structural conformations that are, on average, more prevalent in the native protein structure ensemble, potentially enhancing functional robustness.

thumbnail
Fig 2. Topological evolution of thioredoxin.

(A) shows the E. coli and Human branches of the thioredoxin phylogenetic tree (The PDB Codes are: LBCA - 4BA7, AECA - 3ZIV, LACA - 2YNX, LPBCA - 2YJ7, LGPCA - 2YN1, LECA - 2YOI, LAFCA - 2YPM, E. Coli - 2TRX, Human - 1ERU). In (B) we see the LTE profile colored on the crystal structures of the extant and the most ancient along both the branches. We see in (C) that a principal component analysis (PCA) of the LTE profiles shows that the two branches mostly cluster together. In (D) and (E) we see that LTE has a high correlation with both the evolutionary time of emergence of each variant and the melting temperature . The reported R value corresponds to the Pearson correlation coefficient. The p-values for the correlations between the average LTE and (i) the evolutionary time of emergence and (ii) the melting temperature are 0.0566 and 0.0470, respectively. Evolutionary emergence times for each variant were taken from Ingles et al. [43], and melting temperatures were obtained from Risso et al. [70].

https://doi.org/10.1371/journal.pcbi.1013985.g002

Analysis of how flexibility profiles evolved during the functional divergence of these enzymes revealed that modern homologs exhibit increased flexibility in specific regions compared to their ancestral counterparts [1,34,35]. This variation in flexibility is closely linked to protein stability and function [35]. Specifically, DFI analysis revealed that evolutionary substitutions tend to increase the flexibility of previously rigid hinge sites, while compensatory changes introduce new hinge points by reducing the flexibility of other regions. This “hinge-shift” mechanism fine-tunes protein dynamics, allowing adaptation to new functions and environments [1,71].

Given that LTE follows a similar trend, we investigate further if LTE captures local flexibility in a comparable manner. It is important to note that while DFI measures the resistance of an amino acid position to perturbations at other sites (i.e., how much a position is influenced by Brownian kicks applied elsewhere in the protein), LTE is a purely local topological/geometrical metric through application on (static) protein structures. LTE quantifies the rarity of a specific local topological conformation formed by four consecutive residues in the protein chain relative to the PDB folded state ensemble. Despite this fundamental difference between the two metrics, our comparison of the LTE and DFI values of the combined crystal structures of thioredoxin and β-lactamase ancestors (Fig 3A) shows that positions with higher flexibility tend to have higher LTE. To test whether this correlation holds more generally, we examined a larger set of crystal structures (Fig 3B). To quantify the extent of the correlation we used the Spearman’s rank coefficent betwen the DFI bins and the variance of the LTE for that bin. We found the value to be 1 for 5 bins and even on increasing the bins to 20 the correlation was 0.88 with a p-value of . We observed that regions with higher LTE also tend to be more flexible, as reflected by a sharper peak at high LTE values and an extended high-LTE tail in the violin plot with increasing DFI. TEM-1 and GNCA also show a similar positive correlation between LTE and DFI (S3 Fig). One possible explanation for this correlation is that more flexible regions of the protein are better able to access the rarer writhe conformations associated with high LTE. These findings suggest that LTE—though derived solely from static structural data— is closely linked to the protein’s dynamic landscape and offers meaningful insights into its flexibility and motion [13,14,56].

thumbnail
Fig 3. LTE is positively correlated with DFI.

On binning the percentile DFI profiles obtained using the crystal structures we find that the LTE distrubtion broadens with increasing DFI and the mean LTE per bin appears to be positively correlated with increasing DFI. (A) shows violin plots between the LTE of the protein and the binned percentile DFI values of the combined crystal structures of thioredoxin and β-lactamase ancestors and in (B) we see the same for a larger dataset of 100 proteins listed in S1 Data.

https://doi.org/10.1371/journal.pcbi.1013985.g003

The evolution of the global topological landscape of proteins

Building on our observation that LTE, despite being a purely local structural metric, provides meaningful insights into protein dynamics, we now extend our analysis to global topological descriptors in the conformational ensemble. Specifically, we examine global writhe—a measure of structural entanglement computed from the full 3D conformation—using ensembles of structures sampled from molecular dynamics simulations of both ancestral and extant enzymes. The β-lactamase simulations used in our analysis were obtained from previously published work [53]. This approach allows us to assess whether global topological features, like the writhe of the entire protein, reflect evolutionary changes in protein flexibility and function.

To investigate the role of global topological metrics in protein evolution, we analyzed the global writhe distributions of the ancestral (GNCA) and extant (TEM-1) variants of β-lactamase. We note that since the two proteins have the same length, their writhe distributions are directly comparable. Using conformations sampled from molecular dynamics simulations, we computed the global writhe for both variants. The results, shown in Fig 4, reveal a statistically significant shift in the writhe distribution (P-value = on performing a KL Divergence test to show statistically that the two distributions differ.), Fig 4A), with mean values of 14.53 for GNCA and 14.14 for TEM-1. Additionally, the variance of the distribution has slightly decreased (σ = 0.20 for GNCA and σ = 0.16 for TEM-1), indicating that the conformational landscape has become more restricted. This reduction in writhe and landscape variability aligns with our vibrational density of states analysis [72], which suggests that, as β-lactamases evolved from a generalist enzyme to a specialist in penicillin degradation, the native ensemble of TEM-1 shifted toward a more defined conformational states, shaping its structural dynamics for efficient substrate processing. The decrease in the average Writhe value indicates that the β-lactamase has evolved to a more compact conformation. The compactness of the structure, reflected by the higher Writhe can be either due to the better formation of longer helical structures or to the global tightening of a hinge movement. To explore the extremes of the global writhe distributions for TEM-1, we computed the DFI profiles for conformations belonging to the highest writhe bins of TEM-1 and the lowest writhe bins of GNCA. These profiles represent the extreme cases within these two distributions (Fig 4B). While the overall DFI profiles are similar, the most significant differences are observed in the regions between residues Cα 200–250 and around residues 170–179, which correspond to the functionally critical Ω loop of the β-lactamase. Previous studies using both NMR [73] and MD simulations [74] have highlighted the crucial role of the dynamics of the Ω loop in the efficient degradation of various antibiotics.

thumbnail
Fig 4. Writhe and the dynamical behavior of TEM-1.

On calculating the global writhe of TEM-1 and the Gram-Negative Common Ancestor (GNCA) of β-lactamase from simulation data, we find that the writhe of the ancestral variant GNCA is significantly higher on average than that of TEM-1 and has higher variance as shown in (A). This indicates that the extant protein adopts a more narrow conformational landscape and reflects a possible global tightening of a hinge movement. In (B) we see that the DFI profiles obtained from the low and the high writhe structures reveal key differences between residues 200-250 and also between 170-179 which corresponds to the functionally critical Ω loop of β-lactamase. In (C) we see the DFI profile colored on the structure.

https://doi.org/10.1371/journal.pcbi.1013985.g004

Similarly, in the E. coli branch of thioredoxin, we observe a distinct distribution of global writhe between the LBCA and the extant E. coli (P-value = obtained on performing a similar KL-divergence test as the one reported above), see panel A of S4 Fig. In particular, the variance of writhe is higher in the extant enzyme (σ = 0.16 for LBCA, σ = 0.25 for E. coli), suggesting an increase in conformational space for the modern homolog. Additionally, the average writhe of the extant thioredoxin (6.52) is lower than that of the ancestral form (6.88). These findings are consistent with evolutionary analyses of TEM-1, which suggest a shift in conformational ensembles during evolution. Moreover, when we compare the DFI profiles obtained from the conformational ensemble of the LBCA with higher writhe bins to the ensemble of E. coli with lower writhe bins (where the two distributions do not overlap), we find notable differences in the DFI profiles, particularly in the region, as expected. Interestingly, the largest differences are observed in the regions preceding and .

Previous studies have shown that comparing the differences in DFI scores between ancestral and extant thioredoxins in the human and E. coli branches provides insight into their adaptation to lower temperatures. In bacterial thioredoxins, the increased flexibility in the α3 region (resulting from hinge loss), which primarily contributes to stability, is compensated by a decrease in flexibility in α4, a region critical for folding. This trade-off may explain why modern thioredoxins exhibit reduced stability while preserving their canonical 3D fold [35]. This process, known as hinge shift, involves the relocation of a hinge (a position with low flexibility) from one region of the protein to another [1,35].

The increased flexibility of α3, as observed in [35], corresponds to greater conformational freedom relative to the rest of the protein within its conformational landscape. To examine how this increased flexibility influences the protein’s topological landscape, we analyze the intercrossing number (ICN) of α3 with the rest of the protein in both ancestral and extant variants. The ICN is employed to analyze the conformational complexity of two components with respect to each other (in this case the helix with the rest of the protein), as opposed to single conformational complexity that the writhe captures. We note that we employ the ICN instead of the linking number because the ICN captures subtle geometrical information that can be lost in the linking number (due to sign cancellations), which is more topological in nature.

The results, shown in Fig 5, indicate that the ICN distribution in the extant protein is shifted toward higher mean values 2.16 for E. coli compared to 1.76 for LBCA and 2.01 for the extant human as compared to 1.73 for the ancestral variant. This is also accompanied by an increase in variance from for AECA to for the human variant. We see a similar increase in the variance for the E. coli branch with the variance changing from for LBCA to for the extant E. coli variant. The shift in ICN suggests that α3 has greater topological accessibility, providing insight into its enhanced flexibility in modern thioredoxin. This is further supported by the increased variance, which reflects the ability of α3 to explore a broader range of relative positions. Additionally, α3 appears to adopt conformations that interact more closely with the rest of the protein, as evidenced by the increase in mean ICN. This observation suggests that α3 may form a tighter turn or position itself within a structural “pocket” of the protein. Thus, this topological feature solely analyzed using molecular dynamics simulations based on crystal structures of ancestral and modern thioredoxin provides mechanistic insight about dynamics divergence of thioredoxin during their functional evolution.

thumbnail
Fig 5. Flexibility correlates with the intercrossing number distribution.

The α3 helix is highlighted, and it is observed to become more flexible in extant species. The distribution of the intercrossing number is also shown, with a larger variance observed in more flexible extant species. This increased variance is accompanied by a shift in the mean, where higher flexibility is correlated with a higher mean intercrossing number.

https://doi.org/10.1371/journal.pcbi.1013985.g005

For we find that the variance of ICN decreases with evolution (see S5 Fig). This is in agreement with having decreased flexibility, and thus being constraint to a narrower topological landscape.

Discussion

It has been established via both experiments and simulations that structure-encoded dynamics governs function, yet, the understanding of the exact mechanisms of this connection remains elusive. An exceptional context within which the relation between structure and dynamics can be explored is that of protein evolution. Ancestrally reconstructed proteins have shown that the evolution of a new function and/or adaptation to a new environment is always accomplished while preserving the 3D structure.

In this work, we employ mathematical topology and geometry to rigorously characterize functional evolution within a protein family through extensive sequence diversification while maintaining essentially invariant three-dimensional structures by including ancestral and extant homologs. This design allows us to isolate whether topological metrics computed solely from structure, using only backbone C-α positions, can capture functional divergence that sequence-based analyses detect but traditional structure-based metrics cannot. Our analysis focuses on two well-studied systems: thioredoxin and β–lactamase. Specifically, we examine global structural properties using the Gauss linking integral. To capture local structural features, we employ the local topological free energy (LTE), which quantifies the rarity of a given local writhe value within the protein data bank (PDB) [75] ensemble of folded proteins.

Our results showed that the LTE of the static native state alone can distinguish the ancestral and extant structures and moreover, reconstruct the evolutionary tree. By comparing the LTE of both thioredoxin and β-lactamase proteins with their DFI profile, we find that LTE correlates with the DFI profile of a protein. This is the first time that LTE (a measure of native structure geometrical complexity) is associated with its dynamical response under perturbation.

By employing molecular dynamics simulations for both the ancestral and extant proteins (of both thioredoxin and β-lactamase), we obtain the topological landscape in writhe. Our results show that the extant and ancestral topological landscapes in writhe are different. For all systems, we observe that the DFI profiles for the extremes of the writhe distributions are different at sites important for the evolution of function. In order to better understand which aspects of the conformation contribute to the DFI profile differences, we focus on the and helices in thioredoxin, whose DFI profile indicates a transition to increased (resp. decreased) flexibility. By making use of the intercrossing number, which captures the relative topological/geometrical landscape of the helix with respect to the rest of the protein, we observe a shift in all systems (including the human branch), which indicates a larger (resp. smaller) ensemble of geometrical conformations with increasing (resp. decreasing) flexibility.

These results establish a direct link between the mathematical geometrical and topological complexity of proteins, seen as open curves in 3-space, and their dynamical properties. By uncovering this relationship, we provide a quantitative framework for analyzing protein structure that offers deeper insights into the interplay between structure, dynamics, and function.

Supporting information

S1 Fig. Crossings in diagrams.

The projection of an oriented curve gives rise to crossings of two types: a positive crossing (Left) and a negative crossing (Right).

https://doi.org/10.1371/journal.pcbi.1013985.s001

(PDF)

S2 Fig. LTE analysis of TEM-1.

Panel A shows the LTE profile colored on the structures of TEM-1 and GNCA. In B we see the LTE profiles cluster together based on how they evolve. The clustering was performed using the Ward’s linkage method taking the top three eigenvalues as they accounted for over 90% of the variance.

https://doi.org/10.1371/journal.pcbi.1013985.s002

(PDF)

S3 Fig. β-lactamase LTE vs DFI Profiles.

A shows the correlation of TEM-1 with DFI and B shows the same for GNCA. We find a similar trend in the LTE vs percentile DFI for both TEM-1 and GNCA, i.e., higher LTE implies more flexibility.

https://doi.org/10.1371/journal.pcbi.1013985.s003

(PDF)

S4 Fig. Low vs High Writhe DFI profiles for the E.Coli.

A shows the writhe distribution of extant E. coli and LBCA. We observe the same trend seen in TEM-1, namely that extant species exhibit lower writhe on average compared to their ancestors. In B we perform an analysis analogous to that in the main text and find that the flexibility of the α4 helix obtained from the high-writhe LBCA structures is greater than that obtained from the low-writhe E. coli structures. The simulation data used for this analysis is available at Zenodo (doi: https://doi.org/10.5281/zenodo.17716722).

https://doi.org/10.1371/journal.pcbi.1013985.s004

(PDF)

S5 Fig. Flexibility vs ICN for the α4 helix.

In the main text we had seen that the variance of the intercrossing number of the α3 helix was correlated with its flexibility. We observe a similar correlation between the variance of the ICN distribution with that of the flexibility of the α4 helix.

https://doi.org/10.1371/journal.pcbi.1013985.s005

(PDF)

S6 Fig. Examples of global and local Writhe.

(A) Global Writhe (Wr) of PDB: 1A2P (resp. (B) 1A12, (C) 1A4I). In these examples, Wr increases in absolute value as the length and complexity of the protein increases. (D) Local Writhe values of PDB: 1A2P amino acids 31–34, (E) 1A12 amino acids 110–113, and (F) 1A4I amino acids 185–183. The local Writhe of an amino acid is the Writhe of a polygonal curve of three edges: in practice the Gauss linking integral between the first and third edge (since consecutive edges have zero linking number). This quantity equals the geometric probability that the two straight segments cross in any projection direction (divided by 2). Local writhe satisfies . It measures local orientation and compactness: a tight right-handed (resp. left-handed) turn yields values close to 1 (resp. ), while straighter segments have values near 0.

https://doi.org/10.1371/journal.pcbi.1013985.s006

(PDF)

S7 Fig. Examples of local conformations in the protein sample and their corresponding local Writhe values.

A, B: Similar dihedral angles, different Writhe. C, D: Similar Writhe, different dihedral angles. (axes units are in Å).

https://doi.org/10.1371/journal.pcbi.1013985.s007

(PDF)

S8 Fig. Examples of local conformations in the protein sample with low vs high LTE.

(A) low LTE, (B) high LTE > 2.6 (axes units are in Å).

https://doi.org/10.1371/journal.pcbi.1013985.s008

(PDF)

S9 Fig. Examples of different ICN.

T he intercrossing number (ICN) captures the complexity of the conformation of two chains relative to each other. In the manuscript, we employ the ICN to characterize the complexity of a conformation of a part of a protein relative to the rest of the protein. Here we show examples of the ICN between a region of the protein (green) and the remainder of the chain (red). The conformation in panel A has a higher ICN value compared to panel B, reflecting that the green region in panel A is more surrounded by and wrapped by the rest of the protein for a sample protein (PDB ID: 1V9E).

https://doi.org/10.1371/journal.pcbi.1013985.s009

(PDF)

S1 Data. Enzyme dataset.

List of 100 PDB files used for validating the correlation between LTE and DFI.

https://doi.org/10.1371/journal.pcbi.1013985.s010

(CSV)

References

  1. 1. Campitelli P, Modi T, Kumar S, Ozkan SB. The role of conformational dynamics and allostery in modulating protein evolution. Annu Rev Biophys. 2020;49:267–88. pmid:32075411
  2. 2. Nevin Gerek Z, Kumar S, Banu Ozkan S. Structural dynamics flexibility informs function and evolution at a proteome scale. Evol Appl. 2013;6(3):423–33. pmid:23745135
  3. 3. Bahar I, Lezon TR, Yang L-W, Eyal E. Global dynamics of proteins: bridging between structure and function. Annu Rev Biophys. 2010;39:23–42. pmid:20192781
  4. 4. Bahar I, Cheng MH, Lee JY, Kaya C, Zhang S. Structure-encoded global motions and their role in mediating protein-substrate interactions. Biophys J. 2015;109(6):1101–9. pmid:26143655
  5. 5. Childers MC, Daggett V. Insights from molecular dynamics simulations for computational protein design. Mol Syst Des Eng. 2017;2(1):9–33. pmid:28239489
  6. 6. Karplus M, Kuriyan J. Molecular dynamics and protein function. Proc Natl Acad Sci U S A. 2005;102(19):6679–85. pmid:15870208
  7. 7. Rana MM, Nguyen DD. Geometric Graph Learning to Predict Changes in Binding Free Energy and Protein Thermodynamic Stability upon Mutation. J Phys Chem Lett. 2023;14(49):10870–9. pmid:38032742
  8. 8. Chen D, Liu J, Wei G-W. Multiscale topology-enabled structure-to-sequence transformer for protein-ligand interaction predictions. Nat Mach Intell. 2024;6(7):799–810. pmid:40718138
  9. 9. Tang WS, da Silva GM, Kirveslahti H, Skeens E, Feng B, Sudijono T, et al. A topological data analytic approach for discovering biophysical signatures in protein dynamics. PLoS Comput Biol. 2022;18(5):e1010045. pmid:35500014
  10. 10. Cang Z, Mu L, Opron K, Xia K, Wei GW. A topological analytic approach for protein classification. Mol Based Math, Biol. 2015;3:140–62.
  11. 11. Panagiotou E, Plaxco KW. A topological study of protein folding kinetics. Topology and Geometry of Biopolymers. 2020;746:223–33.
  12. 12. Wang J, Panagiotou E. The protein folding rate and the geometry and topology of the native state. Sci Rep. 2022;12(1):6384. pmid:35430582
  13. 13. Baldwin Q, Panagiotou E. The local topological free energy of proteins. J Theor Biol. 2021;21(21):110854.
  14. 14. Baldwin Q, Sumpter B, Panagiotou E. The Local Topological Free Energy of the SARS-CoV-2 Spike Protein. Polymers (Basel). 2022;14(15):3014. pmid:35893978
  15. 15. Baiesi M, Orlandini E, Trovato A, Seno F. Linking in domain-swapped protein dimers. Sci Rep. 2016;6:33872. pmid:27659606
  16. 16. Baiesi M, Orlandini E, Seno F, Trovato A. Sequence and structural patterns detected in entangled proteins reveal the importance of co-translational folding. Sci Rep. 2019;9(1):8426. pmid:31182755
  17. 17. Liu X, Wee J, Wei GW. Topological machine learning for protein–nucleic acid binding affinity changes upon mutation. Mach Learn: Sci Technol. 2025;6:045035.
  18. 18. Liu X, Huang X, Wei G-W. Topological learning prediction of virus-like particle stoichiometry and stability. Biophys J. 2025;S0006-3495(25)00664-2. pmid:41084239
  19. 19. Jackson SE, Suma A, Micheletti C. How to fold intricately: using theory and experiments to unravel the properties of knotted proteins. 2019.
  20. 20. Niemyska W, Millett KC, Sulkowska JI. GLN: a method to reveal unique properties of lasso type topology in proteins. Sci Rep. 2020;10(1):15186. pmid:32938999
  21. 21. Perlinska AP, Niemyska WH, Gren BA, Bukowicki M, Nowakowski S, Rubach P, et al. AlphaFold predicts novel human proteins with knots. Protein Sci. 2023;32(5):e4631. pmid:36960558
  22. 22. Dabrowski-Tumanski P, Rubach P, Goundaroulis D, Dorier J, Sulkowski P, Millett KC, et al. KnotProt 2.0: a database of proteins with knots and other entangled structures. Nucleic Acids Res. 2019;47(D1):D367–75. pmid:30508159
  23. 23. Onuchic JN, Socci ND, Luthey-Schulten Z, Wolynes PG. Protein folding funnels: the nature of the transition state ensemble. Fold Des. 1996;1(6):441–50. pmid:9080190
  24. 24. Onuchic JN, Luthey-Schulten Z, Wolynes PG. Theory of protein folding: the energy landscape perspective. Annu Rev Phys Chem. 1997;48:545–600. pmid:9348663
  25. 25. Noel JK, Sułkowska JI, Onuchic JN. Slipknotting upon native-like loop formation in a trefoil knot protein. Proc Natl Acad Sci U S A. 2010;107(35):15403–8. pmid:20702769
  26. 26. Fonseka HYY, Javidi A, Oliveira LFL, Micheletti C, Stan G. Unfolding and translocation of knotted proteins by clp biological nanomachines: synergistic contribution of primary sequence and topology revealed by molecular dynamics simulations. J Phys Chem B. 2021;125(27):7335–50. pmid:34110163
  27. 27. Covino R, Skrbić T, Beccara SA, Faccioli P, Micheletti C. The role of non-native interactions in the folding of knotted proteins: insights from molecular dynamics simulations. Biomolecules. 2013;4(1):1–19. pmid:24970203
  28. 28. Flapan E, He A, Wong H. Topological descriptions of protein folding. Proc Natl Acad Sci U S A. 2019;116(19):9360–9. pmid:31000594
  29. 29. Panagiotou E, Plaxco KW. A topological study of protein folding kinetics. Topology of Biopolymers. Providence: AMS. p. 223.
  30. 30. Baiesi M, Orlandini E, Seno F, Trovato A. Exploring the correlation between the folding rates of proteins and the entanglement of their native state. J Phys A: Math Theor. 2017;50:504001.
  31. 31. Dabrowski-Tumanski P, Goundaroulis D, Stasiak A, Rawdon EJ, Sulkowska JI. Theta-curves in proteins. Protein Sci. 2024;33(9):e5133. pmid:39167036
  32. 32. Shen L, Feng H, Li F, Lei F, Wu J, Wei G-W. Knot data analysis using multiscale Gauss link integral. Proc Natl Acad Sci U S A. 2024;121(42):e2408431121. pmid:39392667
  33. 33. Modi T, Ozkan SB. Mutations Utilize Dynamic Allostery to Confer Resistance in TEM-1 β-lactamase. Int J Mol Sci. 2018;19(12):3808. pmid:30501088
  34. 34. Zou T, Risso VA, Gavira JA, Sanchez-Ruiz JM, Ozkan SB. Evolution of conformational dynamics determines the conversion of a promiscuous generalist into a specialist enzyme. Mol Biol Evol. 2015;32(1):132–43. pmid:25312912
  35. 35. Modi T, Huihui J, Ghosh K, Ozkan SB. Ancient thioredoxins evolved to modern-day stability-function requirement by altering native state ensemble. Philos Trans R Soc Lond B Biol Sci. 2018;373(1749):20170184. pmid:29735738
  36. 36. Kim H, Zou T, Modi C, Dörner K, Grunkemeyer TJ, Chen L, et al. A hinge migration mechanism unlocks the evolution of green-to-red photoconversion in GFP-like proteins. Structure. 2015;23(1):34–43. pmid:25565105
  37. 37. Pabis A, Risso VA, Sanchez-Ruiz JM, Kamerlin SC. Cooperativity and flexibility in enzyme evolution. Curr Opin Struct Biol. 2018;48:83–92. pmid:29141202
  38. 38. Petrović D, Risso VA, Kamerlin SCL, Sanchez-Ruiz JM. Conformational dynamics and enzyme evolution. J R Soc Interface. 2018;15(144):20180330. pmid:30021929
  39. 39. Hyeon C, Thirumalai D. Capturing the essence of folding and functions of biomolecules using coarse-grained models. Nat Commun. 2011;2:487. pmid:21952221
  40. 40. Henzler-Wildman K, Kern D. Dynamic personalities of proteins. Nature. 2007;450(7172):964–72. pmid:18075575
  41. 41. Frauenfelder H, McMahon B. Dynamics and function of proteins: the search for general concepts. Proc Natl Acad Sci U S A. 1998;95(9):4795–7. pmid:9560180
  42. 42. Risso VA, Sanchez-Ruiz JM, Ozkan SB. Biotechnological and protein-engineering implications of ancestral protein resurrection. Curr Opin Struct Biol. 2018;51:106–15. pmid:29660672
  43. 43. Ingles-Prieto A, Ibarra-Molero B, Delgado-Delgado A, Perez-Jimenez R, Fernandez JM, Gaucher EA, et al. Conservation of protein structure over four billion years. Structure. 2013;21(9):1690–7. pmid:23932589
  44. 44. Khersonsky O, Roodveldt C, Tawfik DS. Enzyme promiscuity: evolutionary and mechanistic aspects. Curr Opin Chem Biol. 2006;10(5):498–508. pmid:16939713
  45. 45. Khersonsky O, Tawfik DS. Enzyme promiscuity: a mechanistic and evolutionary perspective. Annu Rev Biochem. 2010;79:471–505. pmid:20235827
  46. 46. Tokuriki N, Tawfik DS. Stability effects of mutations and protein evolvability. Curr Opin Struct Biol. 2009;19(5):596–604. pmid:19765975
  47. 47. Harms MJ, Thornton JW. Evolutionary biochemistry: revealing the historical and physical causes of protein properties. Nat Rev Genet. 2013;14(8):559–71. pmid:23864121
  48. 48. Babtie A, Tokuriki N, Hollfelder F. What makes an enzyme promiscuous?. Curr Opin Chem Biol. 2010;14(2):200–7. pmid:20080434
  49. 49. Gerlt JA, Babbitt PC. Enzyme (re)design: lessons from natural evolution and computation. Curr Opin Chem Biol. 2009;13(1):10–8. pmid:19237310
  50. 50. Nguyen V, Wilson C, Hoemberger M, Stiller JB, Agafonov RV, Kutter S, et al. Evolutionary drivers of thermoadaptation in enzyme catalysis. Science. 2017;355(6322):289–94. pmid:28008087
  51. 51. Nguyen DD, Xia K, Wei G-W. Generalized flexibility-rigidity index. J Chem Phys. 2016;144(23):234106. pmid:27334153
  52. 52. Campitelli P, Guo J, Zhou H-X, Ozkan SB. Hinge-Shift Mechanism Modulates Allosteric Regulations in Human Pin1. J Phys Chem B. 2018;122(21):5623–9. pmid:29361231
  53. 53. Kolbaba-Kartchner B, Kazan IC, Mills JH, Ozkan SB. The Role of Rigid Residues in Modulating TEM-1 β-Lactamase Function and Thermostability. Int J Mol Sci. 2021;22(6):2895. pmid:33809335
  54. 54. Luu DD, Ramesh N, Kazan IC, Shah KH, Lahiri G, Mana MD, et al. Evidence that the cold- and menthol-sensing functions of the human TRPM8 channel evolved separately. Sci Adv. 2024;10(25):eadm9228. pmid:38905339
  55. 55. Butler BM, Gerek ZN, Kumar S, Ozkan SB. Conformational dynamics of nonsynonymous variants at protein interfaces reveals disease association. Proteins. 2015;83(3):428–35. pmid:25546381
  56. 56. Sugiyama M, Kosik KS, Panagiotou E. Geometry based prediction of tau protein sites and motifs associated with misfolding and aggregation. Sci Rep. 2025;15(1):10283. pmid:40133414
  57. 57. Sugiyama M, Kosik KS, Panagiotou E. Mathematical topology and geometry-based classification of tauopathies. Sci Rep. 2024;14(1):7560. pmid:38555402
  58. 58. Panagiotou E, Kauffman LH. Vassiliev measures of complexity of open and closed curves in 3-space. Proc R Soc A. 2021;477(2254).
  59. 59. Rustad M, Ghosh K. Why and how does native topology dictate the folding speed of a protein?. J Chem Phys. 2012;137(20):205104. pmid:23206039
  60. 60. Hausrath AC, Goriely A. Repeat protein architectures predicted by a continuum representation of fold space. Protein Sci. 2006;15(4):753–60. pmid:16522802
  61. 61. Arnér ES, Holmgren A. Physiological functions of thioredoxin and thioredoxin reductase. Eur J Biochem. 2000;267(20):6102–9. pmid:11012661
  62. 62. Balsera M, Buchanan BB. Evolution of the thioredoxin system as a step enabling adaptation to oxidative stress. Free Radic Biol Med. 2019;140:28–35. pmid:30862542
  63. 63. Atilgan C, Gerek ZN, Ozkan SB, Atilgan AR. Manipulation of conformational change in proteins by single-residue perturbations. Biophys J. 2010;99(3):933–43. pmid:20682272
  64. 64. Gerek ZN, Ozkan SB. A flexible docking scheme to explore the binding selectivity of PDZ domains. Protein Sci. 2010;19(5):914–28. pmid:20196074
  65. 65. Atilgan AR, Durell SR, Jernigan RL, Demirel MC, Keskin O, Bahar I. Anisotropy of fluctuation dynamics of proteins with an elastic network model. Biophys J. 2001;80(1):505–15. pmid:11159421
  66. 66. Gauss KF. Gauss 1833 Werke. Königlichen Gesellschaft der Wissenschaften zu Göttingen. 1877.
  67. 67. Banchoff T. Self-linking numbers of space polygons. Indiana Univ Math J. 1976;25:1171–88.
  68. 68. Herschberg T, Pifer K, Panagiotou E. A computational package for measuring topological entanglement in polymers, proteins and periodic systems (TEPPP). CPIP. 2022;286:108639.
  69. 69. Wang G, Dunbrack Jr RL. PISCES: a protein sequence culling server. Bioinformatics. 2003;19(12):1589–91. pmid:12912846
  70. 70. Risso VA, Gavira JA, Mejia-Carmona DF, Gaucher EA, Sanchez-Ruiz JM. Hyperstability and substrate promiscuity in laboratory resurrections of Precambrian β-lactamases. J Am Chem Soc. 2013;135(8):2899–902. pmid:23394108
  71. 71. Modi T, Risso VA, Martinez-Rodriguez S, Gavira JA, Mebrat MD, Van Horn WD, et al. Hinge-shift mechanism as a protein design principle for the evolution of β-lactamases from substrate promiscuity to specificity. Nat Commun. 2021;12(1):1852. pmid:33767175
  72. 72. Modi T, Heyden M, Ozkan SB. Vibrational density of states capture the role of dynamic allostery in protein evolution. Biophysical Journal. 2022;121(3):456a.
  73. 73. Rossi M-A, Palzkill T, Almeida FCL, Vila AJ. Slow Protein Dynamics Elicits New Enzymatic Functions by Means of Epistatic Interactions. Mol Biol Evol. 2022;39(10):msac194. pmid:36136729
  74. 74. Knoverek CR, Mallimadugula UL, Singh S, Rennella E, Frederick TE, Yuwen T, et al. Opening of a cryptic pocket in β-lactamase increases penicillinase activity. Proc Natl Acad Sci U S A. 2021;118(47):e2106473118. pmid:34799442
  75. 75. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, et al. The Protein Data Bank. Nucleic Acids Res. 2000;28(1):235–42. pmid:10592235
  76. 76. Stiffler MA, Hekstra DR, Ranganathan R. Evolvability as a function of purifying selection in TEM-1 β-lactamase. Cell. 2015;160(5):882–92. pmid:25723163