Skip to main content
Advertisement
  • Loading metrics

Functional bottlenecks can emerge from non-epistatic underlying traits

  • Anna Ottavia Schulte,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Software, Validation, Writing – original draft, Writing – review & editing

    Affiliation Dipartimento di Fisica, Sapienza Università di Roma, Rome, Italy

  • Samar Alqatari,

    Roles Conceptualization, Methodology, Validation, Writing – original draft, Writing – review & editing

    Affiliation Department of Physics and The James Franck and Enrico Fermi Institutes, The University of Chicago, Chicago, Illinois, United States of America

  • Saverio Rossi,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Writing – original draft, Writing – review & editing

    Affiliation Dipartimento di Fisica, Sapienza Università di Roma, Rome, Italy

  • Francesco Zamponi

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Resources, Software, Supervision, Validation, Writing – original draft, Writing – review & editing

    francesco.zamponi@uniroma1.it

    Affiliation Dipartimento di Fisica, Sapienza Università di Roma, Rome, Italy

?

This is an uncorrected proof.

Abstract

Protein fitness landscapes frequently exhibit epistasis, where the effect of a mutation depends on the genetic context in which it occurs, i.e., the rest of the protein sequence. Epistasis increases landscape complexity, often resulting in multiple fitness peaks. In its simplest form, known as global epistasis, fitness is modeled as a non-linear function of an underlying additive trait. In contrast, more complex epistasis arises from a network of (pairwise or many-body) interactions between residues, which cannot be removed by a single non-linear transformation. Recent studies have explored how global and network epistasis contribute to the emergence of functional bottlenecks - fitness landscape topologies where two broad high-fitness basins, representing distinct phenotypes, are separated by a bottleneck that can only be crossed via one or a few mutational paths. Here, we introduce and analyze a stylized model of global epistasis with an additive underlying trait. We demonstrate that functional bottlenecks arise with high probability if the model is properly calibrated. Furthermore, our results underscore that a proper balance between neutral and non-neutral mutations is needed for the emergence of functional bottlenecks.

Author summary

A central challenge in the study of protein evolution is understanding how interactions between mutations influence evolutionary dynamics. These interactions, collectively known as epistasis, play a key role in shaping protein ‘fitness landscapes’, representing maps between amino acid sequences and their functional performance. However, the impact of epistasis on landscape ruggedness and accessibility remains a subject of debate. Recent experiments have revealed the existence of functional bottlenecks: regions of low fitness that restrict evolutionary transitions between proteins with different functionalities. While these bottlenecks represent a significant constraint on evolutionary outcomes and are often attributed to complex networks of interacting mutations, we demonstrate that they can arise in a much simpler setting. Using a stylized model where fitness depends nonlinearly on an underlying additive trait, we show that functional bottlenecks do not require complex network epistasis. Instead, they can emerge from the inherent variability of the mutational effect distribution. Specifically, these bottlenecks arise when the mutational landscape is dominated by small, nearly neutral effects, but remains punctuated by enough strongly non-neutral mutations to create sharp fitness transitions. Our findings offer a new perspective on the constraints shaping protein evolution and highlight the critical role of mutational effect heterogeneity in determining evolutionary accessibility.

1 Introduction

Understanding how genetic variation translates into differences in reproductive success is a central challenge in evolutionary biology. An organism’s fitness—defined as the number of offspring it produces—depends on how its phenotypic traits interact with the environment. These traits, in turn, are shaped by the organism‘s genotype, suggesting a hierarchical mapping from genotype to phenotype to fitness. By assigning a fitness value to each genotype, one obtains a fitness landscape [15], a conceptual tool that captures the topology of the space of evolutionary possibilities.

Epistasis is a distinctive property of fitness landscapes. Broadly speaking, it is defined as the context-dependence of mutational effects [638]. More precisely, it is defined as follows. Consider a genotype with an associated fitness . The genotype a could be a protein sequence of length L, with representing the amino acid at site i, or a nucleotide sequence, or a sequence of zeros and ones representing the presence/absence of a given mutation, gene, etc. Epistasis occurs when the fitness change due to substituting with at site i depends on the amino acids that are present at other sites in the sequence. As such, epistasis can be of two conceptually distinct origins.

What is sometimes called ‘global’ epistasis is based on introducing the simplest possible non-linearity in the genotype-phenotype-fitness mapping [2,3,5,19,23,24,27,31,32,36,37,39]. More precisely, one assumes the existence of an underlying additive phenotype (or ‘trait’) associated with each genotype . Under this assumption, the variation of phenotype associated with substituting a with b at site i, , is independent of the rest of the genotype. Next, one assumes that the fitness is a non-linear function of , for instance a sigmoid (Fig 2A). Under these assumptions, while the variation of the underlying trait is independent of the rest of the sequence, the variation of fitness depends on it due to the non-linearity of ϕ. Yet, such non-linearity is relatively easy to handle, because one can deduce the function by ‘deconvolution’ of experimental fitness measurements [23,24,27,32,37,4042], and thus describe the fitness function with a limited number of parameters. In concrete examples, few parameters enter in the definition of , and of the order of L parameters enter in the definition of E. Furthermore, such a globally epistatic fitness landscape only features a single maximum, because is an increasing function of E, and E itself has a single maximum, obtained by choosing at each site the value that maximizes the additive contribution [5]. Multi-peaked landscapes can be obtained by choosing a non-monotonic function , as in Fisher’s Geometric Model [2,3,5,19,43].

A more complex behavior is obtained in presence of what is sometimes called ‘network’ epistasis [8,11,12,14,15,20,22,2830,3335,4042,44], which implies that no function can transform fitness into a purely additive function of the genotype [27,32,37]. While a function ϕ can be chosen to make E as close as possible to an additive form, higher-order interactions will still play a role in the presence of network epistasis. More formally, E can always be decomposed as a sum of additive, two-site, three-site, and higher-order interactions [10,14,17,18,21,27,28,32,37], i.e.,

(1)

however, due to network epistasis, no choice of ϕ can eliminate the higher-order terms , , and beyond. Let us focus for simplicity on the case in which only the two-site interactions are present. The number of parameters required to fully specify the function E now scales as the number of pairs, i.e., proportionally to , which makes inferring the fitness function from a few measurements significantly more difficult. Even more importantly, the function in Eq (1) can have many local maxima, corresponding to many fitness peaks, leading to a rough, complex fitness landscape. From Eq (1), any nonlinear function of – that is, any form of global epistasis – can, in principle, be represented as a (possibly infinite) sum of high-order terms on the right-hand side. However, this representation is less practical, as it requires estimating a large number of parameters. A more effective strategy is to first capture as much of the nonlinearity as possible using a single global function. This substantially reduces the residual nonlinearity, making it easier to model the remainder with only low-order local terms.

The relative importance of global and network epistasis, and how to properly fit , has recently been the subject of intense debate [4042]. A particular focus of the debate has been the role played by epistasis in shaping the topology of the fitness landscape in the proximity of functional switches. For instance, Poelwijk et al. [27] studied experimentally the fitness landscape separating two phenotypically distinct variants of the Entacmaea quadricolor fluorescent protein, one fluorescent in the red and the other in the blue, separated by 13 mutations. Assaying all intermediate variants, they observed, in addition to global epistasis, significant network contributions (both pairwise and higher order) to fitness. They also identified a functional bottleneck separating the ‘blue’ and ‘red’ space of fluorescent variants, i.e., a narrow region of accessible evolutionary paths between the two phenotypes, which they attributed to epistatic constraints. These functional bottlenecks illustrate how fitness landscapes can remain navigable despite their ruggedness. While epistasis can constrain viable evolutionary paths, it can also facilitate adaptation by carving out narrow high-fitness pathways that connect otherwise isolated functional states – typically via critical “switch” mutations [4547].

In a subsequent study, Alqatari and Nagel [38] used similar methods to investigate functional switches in an elastic network, considered as a model for allosteric behavior in proteins. Analyzing ensembles of fitness landscapes, they observed different topologies at the threshold for viable evolution, and found that functional bottlenecks – often involving a single critical mutation – emerged generically, similar to the observations in the experimental study.

Many other recent studies investigated small combinatorial fitness landscapes. However, in most cases, these landscapes connect a reference wild type to an evolved variant with increased or decreased fitness with respect to a single functionality [37,48,49]. Experimental studies that investigate the combinatorial landscape separating two reference genotypes with distinct functionality are less widespread [32,5052]. Moreover, in many of these studies functional transitions proceed through promiscuous intermediate variants that are as fit, or even fitter, than the two reference genotypes. Allowing for functional promiscuity represents a less constrained scenario than one where a functional switch must occur between mutually exclusive phenotypes. In the latter case, which is the one we consider here, the requirement for specialization naturally makes the landscape more prone to the formation of bottlenecks. The question of how widespread bottlenecks are – particularly in presence of promiscuity – is not addressed here.

In this work, we aim to identify the minimal conditions under which bottleneck topologies can emerge. Specifically, we investigate whether network epistasis is a necessary condition for the existence of functional bottlenecks in fitness landscapes. To address this, we introduce a simple, stylized model of global epistasis based on a random underlying additive trait and a nonlinear fitness function. Our model is not intended to fit specific empirical data; rather, it follows the spirit of the Fisher’s Geometric Model and related frameworks [2,3,5,19,43] by generating an ensemble of random fitness landscapes with stylized features. Upon calibration, we demonstrate that our model encompasses a wide range of parameters for which the resulting topologies exhibit a functional bottleneck with high probability. This result, by itself, shows that global epistasis alone is sufficient to generate functional bottlenecks.

Furthermore, we studied the role of the distribution of single mutational effects (SMEs). More specifically, we find that when the model is properly calibrated, the distribution of SMEs selected by evolution displays a proper balance between nearly neutral SMEs and enough strongly non-neutral SMEs, which is needed to create sharp fitness transitions. Our stylized model thus sheds light on some fundamental constraints that shape fitness landscapes and possibly evolution.

2 Results

The central result of our work is the introduction of a stylized fitness landscape model, designed to investigate the minimal conditions under which bottleneck topologies emerge. However, to provide the motivation and inspiration for this construction, we begin with a pedagogical discussion of experimental data from Poelwijk et al. [27]. This analysis of empirical data serves to ground the stylized features of the model that follows.

2.1 Experimental data on fitness bottlenecks

Poelwijk et al. [27] analyzed a family of variants of the naturally occurring wild type Entacmea Quadricolor red fluorescent protein eqFP611 [53]. More specifically, from a natural variant of eqFP611, called eqFP578 (with 76% amino acid sequence homology), the TagRFP mutant was engineered via random mutagenesis. TagRFP is 21 mutations away from eqFP578 [54, see S1 Text] and has enhanced fluorescence and stability. From TagRFP, two engineered variants have been derived to have distinct fluorescence (See https://www.fpbase.org/protein/eqfp578 for the complete tree of variants.):

  • mTagBFP2, which is fluorescent in the blue and has 13 mutations and 6 insertions relative to TagRFP [55];
  • mKate2, which is fluorescent in the deep red and has 8 mutations and one insertion relative to TagRFP [56].

The sequences of mTagBFP2 and of mKate2 differ by 13 mutations (12 of which are shown in Fig 2A reproduced from Ref [27]), thus resulting in a total of different combinations of intermediate amino acid substitutions. The experiment of Poelwijk et al. [27] measured the fluorescence of all these intermediates, both in the red and in the blue. Note that there are also 5 insertions in mTagBFP2 with respect to mKate2 that were neglected in the experiment (details were not found in Ref [27]).

We focus on these published data as they characterize a combinatorial landscape bridging two reference genotypes with distinct functionalities, within which a functional bottleneck was explicitly identified. A more recent study, employing a similar experimental design but investigating a different protein, reported the absence of such a bottleneck [50]. This discrepancy highlights the topological variability of fitness landscapes and motivates our search for the minimal conditions that govern the emergence of these constraints.

We downloaded the original data from Ref [27] and represented the genotypes with binary vectors of 13 variables, with being the blue mKate2 and being the red mTagBFP2. In the experiment, E.Coli cells expressing the mutant genotypes were sorted by a microfluidic device able to push them into separate channels according to their fluorescence. From the raw experimental measurement, we assign to each genotype a red fitness, , and a blue fitness, , based on the enrichment in the two channels, each normalized to its corresponding reference value such that (see the S1 Text for details). This normalization provides a consistent reference scale, with each reference variant serving as the baseline functional state for its phenotype, enabling a more direct quantitative comparison between the red and blue fitness values.

Note that a specific measurable trait – fluorescence intensity – is here employed as a proxy for fitness, despite its lack of direct coupling to reproductive success. Instead, fluorescence quantifies the protein‘s functional efficiency. Under the assumption that enhanced biophysical performance may confer a selective advantage within the relevant environment, fluorescence serves as a proxy for fitness. This approach is standard in many empirical studies of fitness landscapes, such as Refs [27,50].

For each measured phenotype (we will use a suffix ‘R’ for red and ‘B’ for blue fluorescence), we can define an underlying trait and a non-linear mapping to fitness . Inverting the non-linear function, we can thus derive the underlying trait from the measured fitness,

(2)

The optimal methodology for inferring the function remains a subject of active debate [24,32,37,4042]; for instance, it has been suggested that the nonlinearity be inferred directly through rank-based statistics [42]. While acknowledging that results can be sensitive to the specific choice of ϕ, for our illustrative purposes we keep the same power-law form utilized in Ref [27] for both phenotypes. Alternative specifications, such as the sigmoid function shown in Fig 2A, may yield quantitatively different results, but the qualitative topology of the landscape remains robust.

We note that when there are two phenotypes only, for simplicity, we can follow Alqatari and Nagel [38] and encode them both in a single trait

(3)

which is positive for the ‘blue’ phenotype and negative for the ‘red’ phenotype. This is only possible when the two phenotypes are exclusive, as in Ref [27], which may not be true in other cases [32,5052]. We also define the reference trait values as

(4)

The structure of the two proteins and the 13 substitutions are illustrated in Fig 2A, together with a schematic illustration of the non-linear mapping between and the two fitnesses.

Fig 2B shows the histogram of across all genotypes, with red and blue lines corresponding to the two reference variants. Note that the reference values are and . Their absolute value slightly differs from one due to the contribution from the other phenotype. As observed in Ref [27], most genotypes are non-functional with . There is a rather sharp peak of blue genotypes around , while red genotypes exhibit a much broader distribution, ranging from down to large negative values of E. This indicates that some intermediates are significantly more fluorescent in the red than the red reference variant itself.

Having defined , we can analyze the distribution of SMEs. For each of the backgrounds , we consider all 13 single mutations and we compute the SME,

(5)

The histogram of the absolute value of the SMEs is reported in Fig 2C. (Note that the SMEs include each mutation and its reverse, hence the histogram would be symmetric by construction, which is why we focus here on .) Although we have limited data, we clearly observe that the distribution is fat-tailed. A good fit is achieved by a Pareto distribution,

(6)

here with . The fit is just an indication, in particular due to the limited amount of data and the limited range of the experimental fitness, which both cutoff the distribution at large . While other distributions could also fit the data, the distribution tails are quite broad, indicating substantial heterogeneity among SMEs (in the S1 text we show similar results for another, independent dataset of SMEs [37]).

Finally, in Fig 2D, we examine the topology of the resulting functional bottleneck. To perform this analysis, we need to first establish a formal definition of a ‘functional’ genotype. As noted in Ref [27] and illustrated in Fig 2B, the threshold for functionality is inherently context-dependent; shifting this threshold can substantially alter the perceived landscape topology [38]. Indeed, viability is not an intrinsic property of a genotype alone but is contingent upon the environment – specifically the selection pressure exerted on the relevant trait – which varies across different experimental conditions. In this work, we consider that the fitness function is growing fast around a given threshold (either on the positive or negative side, see Fig 2A). Hence, genotypes with are considered dysfunctional and evolutionarily disallowed. Because the reference variants are functional, they must of course be located above the functionality threshold, i.e., . However, Fig 2D shows that no path of single mutations exists that connects the two reference variants while always maintaining , hence we need to consider in order to preserve the connection between the reference sequences. (Note that a different result was obtained in Ref [27] when the two phenotypes were not normalized to the reference values. We discuss this point in the S1 Text.) What is, then, the largest value of the functionality threshold such that at least one viable mutational path exists between the two reference variants?

In order to precisely answer this question, following Ref [38], we characterize the space of the paths connecting the two reference variants upon increasing the threshold . We consider the subset of paths that (i) are made of single mutations, (ii) connect the red and blue reference sequences, and (iii) are such that all along the path. We start from , and gradually increase to the maximum value such that at least one path remains. The resulting value, which we call , quantifies how ‘hard’ it is for evolution to find a path that realizes the functionality switch. If , then one can find a path of single mutations, such that all intermediates are equally functional to the reference variants. If instead , in order to connect the two reference variants one has to accept a loss of fitness with respect to the reference fitness, which makes the transition less likely, but still possible.

For the experimental data of Ref [27], this analysis yields a value of approximately , or . The resulting space of paths, depicted in Fig 2D, exhibits the characteristic bottleneck shape also reported in Ref [38], where all paths traverse a single ‘jumper’ genotype with . In Fig 2D, the jumper genotype has a blue phenotype, after which a single mutation brings to the red phenotype. This functional switch is driven by a mutation located roughly at the midpoint. The number of functional intermediates determines the number of evolutionary paths that can reach the critical ‘jumper’ genotype. More paths imply greater evolutionary accessibility, making it easier for evolution to find the key mutation; while fewer paths indicate a more constrained transition requiring a specific mutational sequence. Note that the analysis in Ref [27] found a less pronounced bottleneck, with multiple mutational paths surviving – a scenario that can be reproduced here by adopting a less stringent definition of which allows for multiple paths, see Ref [38].

2.2 Stylized model of global epistasis

The goal of this paper is to determine whether a bottleneck structure like the one observed in Sec. requires network epistasis, or if it can also be explained by a simpler model of global epistasis.

To formulate a simple stylized model, we begin by an important observation. The reference proteins analyzed in Ref [27] are not the product of natural evolution, but rather highly engineered sequences. In engineered proteins, derived either by random mutagenesis as in Sec. or by directed evolution, one typically observes that the fitness improvement is achieved by just a few highly beneficial mutations, see, e.g., [37,57,58]. While this might seem, in principle, a highly specific feature of engineered proteins, a similar scenario has been observed when a natural protein needs to quickly adapt to a novel environment and acquire a new function. A notable example is the SARS-COV-2 Spike protein that displayed a few highly beneficial mutations just after the virus jumped to human hosts (see, e.g., [59]), but similar dynamics has been observed in other viral proteins [60], and in the immune system [32]. In each of these cases, just a few advantageous mutations allow the protein to acquire the new desired function, whether the selection pressure is applied artificially or emerges from a change in the natural environment. A very different dynamics might be at play in the neutral evolution of optimized initial proteins [6164], which might explain why the combinatorial landscape of Ref [50] does not display a bottleneck.

To capture a few minimal ingredients inspired by the results of Sec. and the above discussion, we introduce a stylized model featuring only global epistasis. The model is defined as follows:

  • The genotype is a binary sequence (i.e., ) of length L.
  • An underlying additive trait has random SMEs that are identically and independently distributed according to a symmetric input distribution . After having drawn the values independently, we impose that by shifting their mean, i.e., .
  • The two fitness functions that will be associated with the two ‘colors’ are non-linear functions of the underlying trait, and , as illustrated in Fig 2A. If then the genotype is functional ‘blue’, if it is functional ‘red’, and if , the genotype is non-functional.

The fitness functions grow sharply around , in a way controlled by the parameter β. The choice we will make implicitly, following Ref [38], is to consider the limit , in which the fitness is close to a threshold (or Heaviside) function at (see Fig 2A, dashed curves). Note that the values can be multiplied by an arbitrary constant that can be absorbed into and β, which allows us to fix the overall scale (e.g., the variance) of without loss of generality.

Given the above definitions, both genotypes and have and are thus non-functional. We stress that, contrarily to the analysis of experimental data reported above, these two genotypes will not be the reference sequences in the stylized model. Following Ref [38], we thus introduce a ‘tuning procedure’ to generate two reference variants, the ‘red’ with and the ‘blue’ with , for a chosen ‘tuning’ value .

The procedure starts from the genotype , which is thus considered as a ‘common ancestor’ or ‘wild type’ (for example, in the experiment described in Sect II A, it would correspond to the eqFP578 protein). One then sequentially introduces mutations as follows.

  • With probability p, we perform a ‘greedy’ step. We scan all the values that have not been introduced yet (those for which ), and choose the one that has the maximum impact towards the desired goal, i.e., the largest positive for the blue reference variant or the smallest negative for the red reference variant. We introduce the corresponding mutation by switching .
  • With probability , we perform a ‘random’ step. We choose at random one site such that , and mutate it to .

The tuning procedure is performed independently for the two colors starting from , and continues until for the first time to construct the red reference variant, or to construct the blue reference variant.

This procedure is not designed to be a fully realistic model of molecular evolution. Nonetheless, it is grounded in experimental observations. As noted at the beginning of this section, directed evolution experiments often yield final protein variants carrying a small number of beneficial mutations together with a few neutral ones. More generally, when a protein is challenged to acquire a new function, adaptation often involves selecting a handful of advantageous substitutions on top of incidental neutral changes. Our approach for constructing the reference variants is simply the most straightforward way to reproduce this pattern, without any claim of population-genetic realism. We also believe that this procedure mimics the tuning of elastic networks performed in Ref [38], in which the network is tuned based on a local proxy for the global fitness; we hereby assume that this local proxy can indeed provide the best choice (with probability p) or miss it (with probability ).

The parameters that define the model are the sequence length L, the a priori distribution of SMEs , the probability p and the tuning parameter that controls the reference sequence construction. We note that may be interpreted as describing the a priori distribution of mutational effects – arising, for instance, from underlying biophysical constraints – which represents the potential ‘pool’ of mutations accessible to evolution. We found that various choices of lead to similar outcomes, provided the model is appropriately calibrated. For the sake of parsimony, and recognizing that these are stylized representations, we restrict our analysis to two distributions that capture distinct mutational behaviors.

  1. Gaussian: A Gaussian distribution with unit variance.
  2. Pareto cutoff: Motivated by the analysis of experimental data in Fig 2C, a Pareto distribution, i.e., Eq (6) with , where we fix and . We also include an upper cutoff as in the data, i.e., we set for and adjust the normalization constant accordingly.

For a given choice of parameters, an ‘instance’ of our random model is thus defined by the a priori pool of the L SMEs, , and by the two reference variants and . For a given instance, we can define the number of mutations M separating and (i.e., the number of that differ in the two sequences), and we can then investigate the directed paths that connect them, obtained by introducing these mutations in all possible orders. Following Ref [38] and as discussed above, we can find the maximum value of such that at least one single-mutational path connecting the two reference variants, with all intermediate states having , survives, and we call this value . The closer is to , the easier it is to perform the functional switch.

Given and , we wish to characterize the statistical properties of the quantities M, , and , which characterize the overall topology of the fitness landscape. Another interesting quantity is the distribution of SMEs selected by the tuning procedure to generate the two reference variants (i.e., those that correspond to in each reference genotype), which may be interpreted as the distribution of mutations that are ‘fixed’ by evolution. We will also define . In the following, we will use brackets, , to denote the statistical average of an observable A over an ensemble of random instances generated by the model. Unless otherwise specified, the average is taken over 20000 independent instances of the model for each value of the parameters.

Our model is similar in spirit to Fisher’s geometric model (FGM) as discussed in Refs [2,3,5,19,43], from which it however differs in two important aspects. First, the choice of non-linearity, which reflects here the existence of two distinct fitness functions associated to each color, see also Ref [43] for a similar choice. Second, the fact that we choose two reference variants using the stochastic procedure described above, and we then restrict to the space of intermediates between the selected variants. As we show in the following, these two ingredients, together with a proper calibration of the model, give rise to the desired bottleneck structure.

Before discussing the details of the calibration procedure, a typical example of the space of mutational paths obtained from a calibrated model is shown in Fig 1. The qualitative similarity with Fig 2D, modulo the (irrelevant) change of scale of E, is striking.

thumbnail
Fig 1. Experimental data from Ref [27].

(A) Schematic shape of the fitness functions for the red and blue phenotypes as a function of the underlying trait E. Sigmoid functions have been used for illustrations. The two reference structures with the corresponding mutations are also shown, from Ref [27]. (B) Histogram of the value of E obtained from Eqs (2), (3) for each of the 213 variants. The red and blue lines correspond respectively to the reference values and . (C) Histogram of the absolute value of single mutational effects (SMEs) from Eq (5) for all single mutations that can be obtained from the dataset, shown in log-log (main panel) and lin-log (inset) scales. The black curve is the fit obtained with the Pareto distribution in Eq (6). (D) Topology of the space of paths obtained keeping only genotypes with , and finding the largest possible value of (dashed line) such that the red and blue reference sequences remain connected. The lower panel reports the values of for each functional genotype (blue dots for and red dots for ) as a function of the number of mutations from the blue reference, with the gray lines connecting pairs of genotypes that differ by a single mutation. The upper panel shows the resulting graph of connections.

https://doi.org/10.1371/journal.pcbi.1014000.g001

thumbnail
Fig 2. Same representation as in. Fig 2D, using an instance of the calibrated model with Gaussian P(h) instead of the experimental data.

The lower panel reports the value of for each functional variant as a function of the distance from the blue reference variant (here with and ). The threshold value for which at least a mutational path remains is (dashed lines) and when normalized with respect to the largest of the two reference genotypes it reads .

https://doi.org/10.1371/journal.pcbi.1014000.g002

2.3 Calibration of the model

The first step is to calibrate the parameters in such a way that our stylized model reproduces the basic phenomenology observed in Refs [27,38]. The requirements are the following.

  • The phenotype of the two reference sequences should be comparable to , i.e., , in order for to be a meaningful fitness scale.
  • We want to reproduce a ‘bottlenecked’ structure of the space of evolutionary paths connecting the two reference variants, as shown in Figs 2D and 1 and in Ref [38]. This is characterized by a single ‘jumper’ genotype through which all paths must pass. The mutation responsible for the functionality switch occurs just after (or just before) this genotype. We note that the jumper genotype can be connected by a single mutation to only one, or more than one, genotypes carrying the other phenotype.
  • The ‘jumper’ genotype should occur approximately at half distance along the evolutionary trajectory, as it is observed in experimental data (Fig 2D). Furthermore, we want the largest possible number of functional intermediate variants to survive before and after the jump, thus maximizing the number of functional evolutionary paths connecting the two reference variants.
  • The fitness variation produced by the ‘jumper’ mutation(s) should be as large as possible, in such a way that is as close as possible to . If this is the case, a path can go in a single jump from having fitness close to the red reference, to having fitness close to the blue reference. Otherwise, one would necessarily have to tolerate a decrease in fitness to perform the functional switch. This makes the resulting mutational paths more viable for evolution.
  • Finally, for practical reasons, the number of mutations M connecting the two reference variants should be small enough, otherwise enumerating all the paths becomes computationally very expensive. To avoid this, we impose , a value close to that chosen in Ref [38].

An important remark is in order about the expected value of . Consider for illustration the case in which the tuning procedure reaches the target with a single greedy step for both colors. We can thus expect and . In absence of network epistasis, the largest jumper mutation that can happen along a path has , but this mutation must bring the system from to , hence . Based on this simple argument, the largest possible value of we can expect is such that , as we observe numerically (Fig 3C).

thumbnail
Fig 3. Calibration of the two models with different , a Gaussian distribution or a Pareto distribution with cutoff.

The black symbols correspond to the choice of p after calibration. (A) Value of for which as a function of p. Dashed lines are linear fits. (B) Average value as a function of p. (C) Average of as a function of p. (D) Probability of having a single jumper at , or equivalently that , as a function of p.

https://doi.org/10.1371/journal.pcbi.1014000.g003

To fix the model parameters we proceed as follows. First of all, we fix as the length of a typical protein. We checked that the results are very weakly dependent on L (see the S1 Text). Next, for each p, we progressively increase until the average number of mutations separating the two reference variants is . The resulting is plotted as a function of p for both choices of in Fig 3A, and we find that it is approximately linear in p. The linear fit allows us to fix for all p.

The next step is to calibrate p. For this, we consider the ratio for both colors, which we want to be close to one. The evolution of its statistical mean with p is shown in Fig 3B. For small p, we mostly perform random steps in the training, and the requirement that M is small forces to be small as well (Fig 3A). As a result, it is very likely to ‘overshoot’ the target and end up with . Hence, in order to have , we need a large enough p.

Next, we consider the connectivity threshold . We recall that is obtained, for each instance, in such a way that at least one single-mutational path connecting the two reference variants exists, with all intermediate states having . Because both reference sequences must always be included in the space of allowed genotypes, increasing the value of beyond is not meaningful [38]. This gives an upper bound for the largest value of for each instance. We show in Fig 3C its average value as a function of p, where the average is restricted to instances such that . When p is too large, the tuning procedure requires too many greedy steps and as a consequence tends to decrease, contrarily to what we want in order to have bottleneck topologies with large . This suggests to lower the value of p as much as possible.

However, in Fig 3D we report the probability of having a single jumper at , which we estimate for numerical convenience by the probability that , as a function of p. When p is too small, this probability drops quickly below one. This is because starts increasing, the reference variants are obtained by many random steps, and the bottleneck structure is lost.

Taken together, these results suggest to choose the lowest value of p such that the probability of a single jumper is still very close to one, which also gives both and . The value of is fixed by the requirement that . For both choices of , this optimal value is around , although slightly larger values could be considered as well.

The crucial insight from the calibration is that one needs to tune p in order to achieve a good balance between greedy and random steps. Correspondingly, as we will discuss later, the distribution of the SMEs that are selected by the tuning procedure that generates the reference sequences (that may be interpreted as those ‘fixed by evolution’) is very heterogeneous, featuring a coexistence of large SMEs (generated by greedy steps) and small SMEs (generated by random steps).

2.4 Results for the calibrated model

In the following, we choose and the parameters corresponding to the black dots in Fig 3 as the final calibrated parameters for which we report more detailed results: Gaussian with and for the tuning procedure, and Pareto cutoff with and . Keeping these two models fixed, we generated many () random instances of the model and studied the distribution of the relevant quantities that characterize the topology of the fitness landscape. An example of a ‘good’ topology, closely resembling the one found experimentally, has already been given in Fig 1. We want to quantify the probability of generating such a topology in the stylized model.

We first focus on the properties of the two reference variants, shown in Fig 4 for both models. More specifically, Fig 4A shows the histograms of M, the total number of mutations separating the two reference sequences. We note that, by construction, , and we fixed . We see that the distribution is quite broad, leading to a significant fraction of instances with M as large as 20, and resembles that shown for elastic networks in Ref [38]. However, when we investigate the mutational paths, we restrict the analysis to instances with for computational tractability, as in Ref [38]. Fig 4B shows the histograms of for the blue reference variant (similar results are obtained for the red reference variant due to the inherent symmetry of the model). We recall that, by construction of the training procedure, . We observe that the histogram is peaked slightly above the minimal value, which confirms that the tuning procedure produces and . Fig 4C shows the histograms of the number of greedy steps in the tuning procedure, which is peaked around one, as expected. Finally, Fig 4D shows the histograms of the M SMEs that are selected by the tuning procedure. These mutations characterize a posteriori (after tuning) the space of intermediates between the reference variants, and can be thought of as being those ‘fixed’ by evolution. We observe a strongly bimodal distribution for both input distributions , which reflects the choice of the tuning procedure: mutations selected by the random steps tend to be small, while mutations selected by the greedy steps tend to be much larger (in a way that can be quantified by extreme value statistics, see the S1 Text).

thumbnail
Fig 4. Distribution of some relevant quantities for two different calibrated models, the Gaussian model and the Pareto cutoff model.

In the former, SMEs are extracted from a Gaussian distribution with unit variance and the tuning procedure has , ; in the latter, SMEs are extracted from a distribution with Pareto tails decaying with and a cutoff , and the tuning procedure has , . In each plot the dashed and dot-dashed vertical lines represent the mean and the median of the data, respectively. (A) Distribution of the number of mutations. (B) Distribution of the value of the positive (blue) reference phenotype divided by . (C) Distribution of the number of greedy steps needed to reach the target value when generating the two reference variants. (D) Distribution of SMEs selected by the tuning procedure to generate the two reference variants.

https://doi.org/10.1371/journal.pcbi.1014000.g004

Next, we analyze the statistical properties of the space of paths connecting the reference variants (Fig 5). We recall that in the calibrated models we find with probability one. The space of paths thus contains a single genotype through which all paths must pass: raising would remove this genotype from the set, and disconnect the reference variants. A bottleneck is thus observed. We note that the jumper genotype, in most cases, is connected by a single mutation to only one genotype carrying the other phenotype, as in Fig 1. However, in a few cases, we also observe situations where the jumper genotype is connected to a few genotypes carrying the other phenotype. In Fig 5A and 5D we show the histograms of , separately for instances with given M. We observe that the histogram is almost independent of M (with more variability observed for the Pareto cutoff model), and peaked around as discussed above. Fig 5B, 5E shows the histogram of the location of the jumper, called j, along the mutational path. We observe that the jumper is often located mid-way between the reference variants, leading to a peaked histogram around . The peak (slowly) becomes sharper upon increasing M. The qualitative shape of the possible topologies produced by the model is indicated in Fig 5G. Finally, we can compute the number of paths that remain viable at . The average of the logarithm of this number is shown in Fig 5C, 5F as a function of M. We find that it grows almost linearly at large M, indicating that the number of paths grows exponentially in M. This suggests that there are exponentially many ways to reach the bottleneck, which could thus maintain a good evolutionary accessibility from both reference sequences [38].

thumbnail
Fig 5. Statistical properties of the space of paths for two different calibrated model, the Gaussian model (A-C) and the Pareto cutoff model (D-F), with the same parameters as in Fig 4.

(A), (D) Distribution of values of , separately for several values of M. The mean and the median (dashed and dot-dashed vertical lines, respectively) are computed with respect to the full distribution (with all M values). (B), (E) Probability distribution of the jumper position j along the path, for different values of M. (C), (F) Average of logarithm of the number of paths that remain viable at , as a function of M. (G) Some examples of topologies, each associated to a schematic representation used in panel b.

https://doi.org/10.1371/journal.pcbi.1014000.g005

We conclude that our stylized model, which features a global non-linearity on top of an underlying additive phenotype, with properly calibrated parameters, generates topologies that are very close to those observed experimentally in Ref [27] and Sec. An example is shown in Fig 1, to be compared with the result obtained re-analyzing the experimental data from Ref [27], shown in Fig 2D. The model generates an ensemble of random fitness functions [5], whose topology is likely (with probability one) characterized by a bottleneck, i.e., by a genotype through which all paths connecting the two reference variants must pass. Of course, the bottleneck can be made less pronounced if one allows for less stringent selection, i.e., reduces [38]. The bottleneck is more likely found mid-way along the path. These features are essentially independent of M, i.e., the number of mutations separating the two reference variants, at least in the limited range investigated here.

We note that there is quite a lot of flexibility in the choice of parameters. The initial size L can be varied in a broad interval without affecting much the results (see S1 Text). The tuning parameter has been chosen here in such a way to obtain the desired average of M, but other choices are possible. The value of p, i.e., the probability of making a greedy step when constructing the reference variants, can be varied in a relatively wide range, provided it is neither too small (otherwise is too small and the reference phenotypes end up overshooting the fitness threshold) nor too large (otherwise decreases). Finally, the a priori distribution that characterizes the input distribution of SMEs can be varied essentially at will in a broad range (see also the S1 Text for other choices of the input distribution). The tuning procedure used to construct the reference variants ensures that the SMEs selected a posteriori are broadly distributed, featuring the right balance between almost neutral SMEs and strongly beneficial or deleterious SMEs (Fig 4D), which is a necessary condition for the emergence of a bottleneck. We thus believe that our results are robust and do not depend crucially on the specific choices made during calibration.

3 Discussion

In this work, we have demonstrated that functional bottlenecks in protein fitness landscapes can emerge even in the absence of network epistasis, solely due to a non-linear mapping between an underlying additive trait and fitness. To do so, we constructed and analyzed a simple stylized model of global epistasis, in which two reference variants are constructed from a common ancestor by selecting at random either a very beneficial mutation, or a neutral one, with the goal of acquiring two distinct new phenotypes. We have shown that the emergence of bottlenecks separating the two phenotypes is closely tied to the heterogeneity of the single-mutation effects selected during the construction of the reference variants. When mutational effects are heterogeneously enough, evolutionary paths between distinct phenotypes are constrained, leading to the formation of narrow evolutionary corridors.

Our findings provide a novel perspective on how fitness landscape topologies arise and suggest that strong constraints on evolutionary accessibility can exist even in systems governed by simple, global epistatic interactions [2,3,5,19,23,24,27,31,32,36,37]. In fact, in real evolution experiments, one always has access to a single realization of the process. Our stylized model, following and simplifying the approach of Ref [38], constructs an ensemble of fitness landscapes and allows us to assess to what extent the realizations observed in a given experiment are ‘typical‘ of a reasonable ensemble of possibilities. Moreover, despite being stylized, our model successfully captures key qualitative features observed in experimental data, supporting the idea that broad mutational effect distributions play a central role in shaping evolutionary trajectories.

Our results indicate that functional bottlenecks do not necessarily entail network epistasis. Such bottlenecks may instead arise because proteins, under strong evolutionary pressure to acquire new functions, tend to accumulate a few mutations that confer large benefits for the new function, alongside a handful of random neutral mutations introduced by the inherent stochasticity of evolution. Hence, functional bottlenecks may not necessarily indicate complex, higher-order interactions but could instead be a natural consequence of non-linear selection on an underlying additive trait.

Of course, network epistasis does exist in proteins, as it has been shown by a variety of approaches [8,11,12,14,15,20,22,2830,3335,44]. Our stylized model might serve as a null model to assess the relevance of network epistasis effects in the analysis of future experimental data.

We also stress that in different kinds of evolutionary dynamics, for instance neutral space evolution under weak selection [6163], mutations are selected in a different way and our stylized model would likely not apply. The existence of functional bottlenecks during this kind of evolutionary dynamics is less established and might depend on subtle correlations between large groups of interacting mutations [15,64,65]. A recent study of a combinatorial landscape separating naturally occurring bacterial transcription factor binding sites did not find a bottleneck [50].

Our model is simple enough that it could possibly be solved analytically, using techniques from probability and statistical physics [2,5,19]. For instance, the tuning procedure consists in a random walk in the space of mutations, and preliminary results suggest that the probability of the resulting reference traits and can be written in simple form. Making some analytical progress would eliminate the limitation on the number of mutations M separating the two reference variants. Future work could explore this possibility to extend our approach to larger mutational spaces, incorporate additional biological constraints, and investigate the robustness of these results across different fitness functions and evolutionary pressures, possibly with more than two phenotypes.

It should be noted that numerous techniques for extracting genotype-trait-fitness relationships directly from data – including modern machine learning frameworks – have emerged in what is now a rapidly expanding field (see, e.g., [44,6673]). The present work adopts a fundamentally different approach. Rather than attempting to fit specific empirical datasets, we propose here a simple stylized model, designed to identify relevant qualitative features and universal constraints in a generic way.

By highlighting the role of mutational heterogeneity in the emergence of functional bottlenecks, our study contributes to a deeper understanding of the constraints shaping protein evolution and hopefully opens new directions for theoretical and experimental exploration of evolutionary landscapes.

4 Methods

For convenience, we summarize here in closed form the main steps that have to be followed to generate an instance of our model.

4.1 Genotype representation and underlying additive phenotype

The first step consists in choosing a value of L and considering a genotype as a binary sequence of length L, where each site i can take values . The genotype-dependent underlying additive trait is given by:

(7)

where the single-mutation effects (SMEs) are independent and identically distributed according to the a priori distribution . The two choices we used are a Gaussian distribution with zero mean and unit variance, and a Pareto distribution with cutoff:

(8)

with and the constant A determined by normalization. After sampling, the SMEs are shifted to ensure . These choices fix an additive constant and an overall multiplicative factor in , which can be absorbed by β and in the fitness function, without loss of generality:

4.2 Fitness function and phenotypic classes

The global epistatic fitness function is a non-linear transformation of , defined separately for the blue (B) and red (R) phenotypes:

(9)

where without loss of generality, is the functionality threshold parameter, and controls the sharpness of the fitness transition. Accordingly, genotypes with are functional for the “blue” phenotype, those with are functional for the “red” phenotype, and those with are non-functional.

4.3 Reference variants construction

The red and blue reference variants, and , are independently generated starting from the ancestral genotype by sequentially introducing mutations. At each step, with probability p, the mutation with the largest contribution to E in the desired direction is chosen (greedy step), and with probability , a random site is mutated (random step). The process is iterated until and .

4.4 Mutational paths and bottleneck characterization

The mutational distance M between and is defined as the number of differing sites. All possible evolutionary paths between the reference variants are considered, and a value is determined such that at least one path of single mutations exists, maintaining at all intermediate steps. The distribution of and the likelihood of a single critical ‘jumper’ mutation define the presence and severity of the bottleneck.

Supporting information

S1 Text. This supporting document contains all supplementary figures cited in the main text.

https://doi.org/10.1371/journal.pcbi.1014000.s001

(PDF)

Acknowledgments

We warmly thank Sid Nagel and Martin Weigt for providing very useful feedback and inspiration all along the development of this work, and Emily Hinds for very useful guidance in reproducing the data analysis from Ref [27].

References

  1. 1. Wright S. In: Proceedings of the sixth international congress on genetics. 1932. p. 355–66.
  2. 2. de Visser JAGM, Krug J. Empirical fitness landscapes and the predictability of evolution. Nat Rev Genet. 2014;15(7):480–90. pmid:24913663
  3. 3. Tenaillon O. The utility of Fisher’s geometric model in evolutionary genetics. Annu Rev Ecol Evol Syst. 2014;45:179–201. pmid:26740803
  4. 4. Manrubia S, Cuesta JA, Aguirre J, Ahnert SE, Altenberg L, Cano AV, et al. From genotypes to organisms: state-of-the-art and perspectives of a cornerstone in evolutionary dynamics. Phys Life Rev. 2021;38:55–106. pmid:34088608
  5. 5. Pahujani S, Krug J. 2025. https://arxiv.org/abs/2502.05896
  6. 6. Bateson WH. Science. Science. 1907;26(672):649.
  7. 7. Fisher RA. XV—The Correlation between Relatives on the Supposition of Mendelian Inheritance. Trans R Soc Edinb. 1919;52(2):399–433.
  8. 8. Lunzer M, Golding GB, Dean AM. Pervasive cryptic epistasis in molecular evolution. PLoS Genet. 2010;6(10):e1001162. pmid:20975933
  9. 9. Harms MJ, Thornton JW. Evolutionary biochemistry: revealing the historical and physical causes of protein properties. Nat Rev Genet. 2013;14(8):559–71. pmid:23864121
  10. 10. Weinreich DM, Lan Y, Wylie CS, Heckendorn RB. Should evolutionary geneticists worry about higher-order epistasis?. Curr Opin Genet Dev. 2013;23(6):700–7. pmid:24290990
  11. 11. Olson CA, Wu NC, Sun R. A comprehensive biophysical description of pairwise epistasis throughout an entire protein domain. Curr Biol. 2014;24(22):2643–51. pmid:25455030
  12. 12. Figliuzzi M, Jacquier H, Schug A, Tenaillon O, Weigt M. Coevolutionary landscape inference and the context-dependence of mutations in beta-lactamase TEM-1. Mol Biol Evol. 2016;33(1):268–80. pmid:26446903
  13. 13. Starr TN, Thornton JW. Epistasis in protein evolution. Protein Sci. 2016;25(7):1204–18. pmid:26833806
  14. 14. Poelwijk FJ, Krishna V, Ranganathan R. The Context-Dependence of Mutations: A Linkage of Formalisms. PLoS Comput Biol. 2016;12(6):e1004771. pmid:27337695
  15. 15. Rivoire O, Reynolds KA, Ranganathan R. Evolution-Based Functional Decomposition of Proteins. PLoS Comput Biol. 2016;12(6):e1004817. pmid:27254668
  16. 16. Miton CM, Tokuriki N. How mutational epistasis impairs predictability in protein evolution and design. Protein Sci. 2016;25(7):1260–72. pmid:26757214
  17. 17. Sailer ZR, Harms MJ. High-order epistasis shapes evolutionary trajectories. PLoS Comput Biol. 2017;13(5):e1005541. pmid:28505183
  18. 18. Sailer ZR, Harms MJ. Detecting High-Order Epistasis in Nonlinear Genotype-Phenotype Maps. Genetics. 2017;205(3):1079–88. pmid:28100592
  19. 19. Hwang S, Park S-C, Krug J. Genotypic Complexity of Fisher’s Geometric Model. Genetics. 2017;206(2):1049–79. pmid:28450460
  20. 20. Cocco S, Feinauer C, Figliuzzi M, Monasson R, Weigt M. Inverse statistical physics of protein sequences: a key issues review. Rep Prog Phys. 2018;81(3):032601. pmid:29120346
  21. 21. Domingo J, Diss G, Lehner B. Pairwise and higher-order genetic interactions during the evolution of a tRNA. Nature. 2018;558(7708):117–21. pmid:29849145
  22. 22. Starr TN, Flynn JM, Mishra P, Bolon DNA, Thornton JW. Pervasive contingency and entrenchment in a billion years of Hsp90 evolution. Proc Natl Acad Sci U S A. 2018;115(17):4453–8. pmid:29626131
  23. 23. Otwinowski J. Biophysical Inference of Epistasis and the Effects of Mutations on Protein Stability and Function. Mol Biol Evol. 2018;35(10):2345–54. pmid:30085303
  24. 24. Otwinowski J, McCandlish DM, Plotkin JB. Inferring the shape of global epistasis. Proc Natl Acad Sci U S A. 2018;115(32):E7550–8. pmid:30037990
  25. 25. Domingo J, Baeza-Centurion P, Lehner B. The Causes and Consequences of Genetic Interactions (Epistasis). Annu Rev Genomics Hum Genet. 2019;20:433–60. pmid:31082279
  26. 26. Biswas A, Haldane A, Arnold E, Levy RM. Epistasis and entrenchment of drug resistance in HIV-1 subtype B. Elife. 2019;8:e50524. pmid:31591964
  27. 27. Poelwijk FJ, Socolich M, Ranganathan R. Learning the pattern of epistasis linking genotype and phenotype in a protein. Nat Commun. 2019;10(1):4213. pmid:31527666
  28. 28. Ballal A, Laurendon C, Salmon M, Vardakou M, Cheema J, Defernez M, et al. Sparse Epistatic Patterns in the Evolution of Terpene Synthases. Mol Biol Evol. 2020;37(7):1907–24. pmid:32119077
  29. 29. Lyons DM, Zou Z, Xu H, Zhang J. Idiosyncratic epistasis creates universals in mutational effects and evolutionary trajectories. Nat Ecol Evol. 2020;4(12):1685–93. pmid:32895516
  30. 30. Miton CM, Buda K, Tokuriki N. Epistasis and intramolecular networks in protein evolution. Curr Opin Struct Biol. 2021;69:160–8. pmid:34077895
  31. 31. Reddy G, Desai MM. Global epistasis emerges from a generic model of a complex trait. Elife. 2021;10:e64740. pmid:33779543
  32. 32. Phillips AM, Lawrence KR, Moulana A, Dupic T, Chang J, Johnson MS, et al. Binding affinity landscapes constrain the evolution of broadly neutralizing anti-influenza antibodies. Elife. 2021;10:e71393. pmid:34491198
  33. 33. Bakerlee CW, Nguyen Ba AN, Shulgina Y, Rojas Echenique JI, Desai MM. Idiosyncratic epistasis leads to global fitness–correlated trends. Science. 2022;376(6593):630–5.
  34. 34. Vigué L, Croce G, Petitjean M, Ruppé E, Tenaillon O, Weigt M. Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes. Nat Commun. 2022;13(1):4030. pmid:35821377
  35. 35. Chen JZ, Bisardi M, Lee D, Cotogno S, Zamponi F, Weigt M, et al. Understanding epistatic networks in the B1 β-lactamases through coevolutionary statistical modeling and deep mutational scanning. Nat Commun. 2024;15(1):8441. pmid:39349467
  36. 36. Johnson MS, Reddy G, Desai MM. Epistasis and evolution: recent advances and an outlook for prediction. BMC Biol. 2023;21(1):120. pmid:37226182
  37. 37. Buda K, Miton CM, Tokuriki N. Pervasive epistasis exposes intramolecular networks in adaptive enzyme evolution. Nat Commun. 2023;14(1):8508. pmid:38129396
  38. 38. Alqatari S, Nagel SR. Evolutionary pathways in epistatic mechanical networks. Proc Natl Acad Sci U S A. 2026;123(5):e2505183123. pmid:41615748
  39. 39. Husain K, Murugan A. Physical Constraints on Epistasis. Mol Biol Evol. 2020;37(10):2865–74. pmid:32421772
  40. 40. Park Y, Metzger BPH, Thornton JW. The simplicity of protein sequence-function relationships. Nat Commun. 2024;15(1):7953. pmid:39261454
  41. 41. Dupic T, Phillips AM, Desai MM. bioRxiv. 2024.
  42. 42. Carlson MO, Andrews BL, Simons YB. Distinguishing direct interactions from global epistasis using rank statistics. Proc Natl Acad Sci U S A. 2025;122(39):e2509444122. pmid:40986352
  43. 43. Martin G, Lenormand T. The fitness effect of mutations across environments: Fisher’s geometrical model with multiple optima. Evolution. 2015;69(6):1433–47. pmid:25908434
  44. 44. Russ WP, Figliuzzi M, Stocker C, Barrat-Charlaix P, Socolich M, Kast P, et al. An evolution-based model for designing chorismate mutase enzymes. Science. 2020;369(6502):440–5. pmid:32703877
  45. 45. Weinreich DM, Delaney NF, Depristo MA, Hartl DL. Darwinian evolution can follow only very few mutational paths to fitter proteins. Science. 2006;312(5770):111–4. pmid:16601193
  46. 46. Wu NC, Dai L, Olson CA, Lloyd-Smith JO, Sun R. Adaptation in protein fitness landscapes is facilitated by indirect paths. Elife. 2016;5:e16965. pmid:27391790
  47. 47. Kantroo P, Wagner GP, Machta BB. High fitness paths can connect proteins with low sequence overlap. bioRxiv. 2024;:2024.11.13.623265. pmid:39605533
  48. 48. Papkou A, Garcia-Pastor L, Escudero JA, Wagner A. A rugged yet easily navigable fitness landscape. Science. 2023;382(6673):eadh3860. pmid:37995212
  49. 49. Johnston KE, Almhjell PJ, Watkins-Dulaney EJ, Liu G, Porter NJ, Yang J, et al. A combinatorially complete epistatic fitness landscape in an enzyme active site. Proc Natl Acad Sci U S A. 2024;121(32):e2400439121. pmid:39074291
  50. 50. Westmann CA, Goldbach L, Wagner A. bioRxiv. 2024.
  51. 51. Starr TN, Picton LK, Thornton JW. Alternative evolutionary histories in the sequence space of an ancient protein. Nature. 2017;549(7672):409–13. pmid:28902834
  52. 52. Bendixsen DP, Collet J, Østman B, Hayden EJ. Genotype network intersections promote evolutionary innovation. PLoS Biol. 2019;17(5):e3000300. pmid:31136568
  53. 53. Wiedenmann J, Vallone B, Renzi F, Nienhaus K, Ivanchenko S, Röcker C, et al. Red fluorescent protein eqFP611 and its genetically engineered dimeric variants. J Biomed Opt. 2005;10(1):14003. pmid:15847584
  54. 54. Merzlyak EM, Goedhart J, Shcherbo D, Bulina EM, Shcheglov AS, Fradkov AF, et al. Nature methods. 2007;4:555.
  55. 55. Subach OM, Cranfill PJ, Davidson MW, Verkhusha VV. An enhanced monomeric blue fluorescent protein with the high chemical stability of the chromophore. PLoS One. 2011;6(12):e28674. pmid:22174863
  56. 56. Shcherbo D, Murphy CS, Ermakova GV, Solovieva EA, Chepurnykh TV, Shcheglov AS, et al. Far-red fluorescent tags for protein imaging in living tissues. Biochem J. 2009;418(3):567–74. pmid:19143658
  57. 57. Zhong Z, Wong BG, Ravikumar A, Arzumanyan GA, Khalil AS, Liu CC. Automated Continuous Evolution of Proteins in Vivo. ACS Synth Biol. 2020;9(6):1270–6. pmid:32374988
  58. 58. Fröhlich C, Bunzel HA, Buda K, Mulholland AJ, van der Kamp MW, Johnsen PJ, et al. Epistasis arises from shifting the rate-limiting step during enzyme evolution of a β-lactamase. Nat Catal. 2024;7(5):499–509. pmid:38828429
  59. 59. Rodriguez-Rivas J, Croce G, Muscat M, Weigt M. Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes. Proc Natl Acad Sci U S A. 2022;119(4):e2113118119. pmid:35022216
  60. 60. Hadfield J, Megill C, Bell SM, Huddleston J, Potter B, Callender C, et al. Bioinformatics. Bioinformatics. 2018;34:4121.
  61. 61. Fantini M, Lisi S, De Los Rios P, Cattaneo A, Pastore A. Protein Structural Information and Evolutionary Landscape by In Vitro Evolution. Mol Biol Evol. 2020;37(4):1179–92. pmid:31670785
  62. 62. Stiffler MA, Poelwijk FJ, Brock KP, Stein RR, Riesselman A, Teyra J, et al. Protein Structure from Experimental Evolution. Cell Syst. 2020;10(1):15-24.e5. pmid:31838147
  63. 63. Bisardi M, Rodriguez-Rivas J, Zamponi F, Weigt M. Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution. Mol Biol Evol. 2022;39(1):msab321. pmid:34751386
  64. 64. Di Bari L, Bisardi M, Cotogno S, Weigt M, Zamponi F. Emergent time scales of epistasis in protein evolution. Proc Natl Acad Sci U S A. 2024;121(40):e2406807121. pmid:39325427
  65. 65. Rossi S, Di Bari L, Weigt M, Zamponi F. Fluctuations and the limit of predictability in protein evolution. Rep Prog Phys. 2025;88(7):10.1088/1361-6633/adea92. pmid:40592352
  66. 66. Figliuzzi M, Barrat-Charlaix P, Weigt M. Molecular biology and evolution. Mol Biol Evol. 2018;35(7):1821. pmid:29659984
  67. 67. Wang J, Cao H, Zhang JZH, Qi Y. Computational Protein Design with Deep Learning Neural Networks. Sci Rep. 2018;8(1):6349. pmid:29679026
  68. 68. Tubiana J, Cocco S, Monasson R. Learning protein constitutive motifs from sequence data. Elife. 2019;8:e39397. pmid:30857591
  69. 69. Trinquier J, Uguzzoni G, Pagnani A, Zamponi F, Weigt M. Efficient generative modeling of protein sequences using simple autoregressive models. Nat Commun. 2021;12(1):5800. pmid:34608136
  70. 70. Rives A, Meier J, Sercu T, Goyal S, Lin Z, Liu J, et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc Natl Acad Sci U S A. 2021;118(15):e2016239118. pmid:33876751
  71. 71. Malbranke C, Bikard D, Cocco S, Monasson R, Tubiana J. Machine learning for evolutionary-based and physics-inspired protein design: current and future synergies. Curr Opin Struct Biol. 2023;80:102571. pmid:36947951
  72. 72. Wagner A. Genotype sampling for deep-learning assisted experimental mapping of a combinatorially complete fitness landscape. Bioinformatics. 2024;40(5):btae317. pmid:38745436
  73. 73. Razo-Mejia M, Mani M, Petrov DA. bioRxiv. 2025.