Figures
Abstract
We show that logic computational circuits in gene regulatory networks arise from a fibration symmetry breaking in the network structure. From this idea we implement a constructive procedure that reveals a hierarchy of genetic circuits, ubiquitous across species, that are surprising analogues to the emblematic circuits of solid-state electronics: starting from the transistor and progressing to ring oscillators, current-mirror circuits to toggle switches and flip-flops. These canonical variants serve fundamental operations of synchronization and clocks (in their symmetric states) and memory storage (in their broken symmetry states). These conclusions introduce a theoretically principled strategy to search for computational building blocks in biological networks, and present a systematic route to design synthetic biological circuits.
Author summary
We show that the core functional logic of genetic circuits arises from a fundamental symmetry breaking of the interactions of the biological network. The idea can be put into a hierarchy of symmetric genetic circuits that reveals their logical functions. We do so through a constructive procedure that naturally reveals a series of building blocks, widely present across species. This hierarchy maps to a progression of fundamental units of electronics, starting with the transistor, progressing to ring oscillators and current-mirror circuits and then to synchronized clocks, switches and finally to memory devices such as latches and flip-flops.
Citation: Leifer I, Morone F, Reis SDS, Andrade JS Jr, Sigman M, Makse HA (2020) Circuits with broken fibration symmetries perform core logic computations in biological networks. PLoS Comput Biol 16(6): e1007776. https://doi.org/10.1371/journal.pcbi.1007776
Editor: Mercedes Pascual, University of Chicago, UNITED STATES
Received: October 30, 2019; Accepted: March 6, 2020; Published: June 17, 2020
Copyright: © 2020 Leifer et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data is publicly available without restriction and can be fully accessed from the following websites: 1. Arabidopsis Thaliana Database: http://atrm.cbi.pku.edu.cn/ 2. Mycobacterium Tuberculosis Database: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2600667/bin/msb200863-s2.xls 3. Bacillus subtilis Database: http://subtiwiki.uni-goettingen.de/ 4. Escherichia coli Database: http://regulondb.ccg.unam.mx/ 5. Salmonella SL1344 Database: http://salmonet.org/ 6. Yeast Database: http://cosbi3.ee.ncku.edu.tw/YTRP/Home 7. Mouse Database: https://www.grnpedia.org/trrust/ 8. Human Database #1: https://www.grnpedia.org/trrust/ 9. Human Database #2: https://www.genome.jp/kegg/pathway.html.
Funding: This work was supported by NSF-IIS 1515022. SDSR and JSA thank the Brazilian agencies CAPES, FUNCAP, and CNPq for financial support. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: No authors have competing interests.
Introduction
In all biological networks [1] some simple ‘network motifs’ appear more often than they would by pure chance [2–4]. This regularity has been interpreted as evidence that these motifs are basic building blocks of biological machineries, a proposal that has had a major impact on systems biology [4, 5]. However, mere statistical abundance by itself does not imply that these circuits are core bricks of biological systems. In fact, whether these network motifs may have a functional role in biological computation remains controversial [6–9].
Functional building blocks should offer computational repertoires drawing parallels between biological networks and electronic circuits [10]. Indeed, the idea of using electronic circuitry and devices to mimic aspects of gene regulatory networks has been in circulation almost since the inception of regulatory genetics itself [11]. This idea has been a driving force in synthetic biology [12]; with several demonstrations showing that engineered biological circuits can perform computations [13], such as toggle switches [14–17], logic [18], memory storage [15, 19, 20], pulse generators and oscillators [14, 21–23].
In a previous work [24], we have shown that the building blocks of gene regulatory networks can be identified by the fibration symmetries of these networks. In the present paper, we first demonstrate the functionality of these symmetric building blocks in terms of synchronized biological clocks. We then show that the breaking of the fibration symmetries of the network identifies additional building blocks with core logic computational functions. We do so through a constructive procedure based on symmetry breaking that naturally reveals a hierarchy of genetic building blocks widely present across biological networks and species. This hierarchy maps to a progression of fundamental units of electronics, starting with the transistor, and progressing to ring oscillators and current-mirror circuits and then to memory devices such as toggle switches and flip-flops. We show that, while symmetric circuits work as synchronized oscillators, the breaking of these symmetries plays a fundamental role by switching the functionality of circuits from synchronized clocks to memory units.
Thus, our constructive theoretical framework identifies 1) building blocks of genetic networks in a unified way from fundamental symmetry and broken symmetry principles, which 2) assure that they perform core logic computations, 3) suggests a natural mapping onto the foundational circuits of solid state electronics [12, 13], and 4) endows the mathematical notion of symmetry fibration [24] with biological significance.
Results
Feed-forward loops do not synchronize
We start our analysis by considering the dynamics of the most abundant network motif in transcriptional regulatory networks, the so-called feed-forward loop (FFL) introduced by Alon and coworkers [3, 4, 25]. A cFFL motif consists of three genes (X, Y and Z; c refers to coherent where all regulators are activators) where the transcription factor expressed by gene X positively regulates the transcription of Y and Z, and, in turn Y regulates Z (Fig 1A). Fig 1A shows an example of cFFL motif in E. coli with X = cpxR, Y = baeR, and Z = spy. Numerical and analytic solutions for the expression levels of the genes in the cFFL (Fig 1B) demonstrate that the FFL does not reach synchronization (unless for very specific setting of parameters) nor oscillations in expression levels. This is consistent with previous research which has interpreted the functionality of the FFL as signal delay [3, 25, 26].
A: FFL network representation. B: Numerical solution of FFL dynamics. The expression levels of genes Y and Z do not synchronize. The oscillation pattern presented is due to the square-wave behavior of gene X expression levels. We use α = 2.0, γx = 0.12, γy = 0.7, kx = 0.5, ky = 0.1, y0 = 0.7 and z0 = 0.0. C: Input tree representation of FFL. The input trees of genes X, Y, and Z are not isomorphic, as a consequence, their expression levels do not synchronize. D: Base representation of FFL. The base is the same as the original circuits since there are no symmetries. E: SAT-FFF network representation. As an example, we consider genes cpxR = X, baeR = Y, and spy = Z from the gene regulatory network of E. coli. The addition of the autoregulation leads to a symmetry between the expression levels of genes Y and Z. F: The numerical solution of the SAT-FFF dynamics shows the synchronization of the expression levels of genes Y and Z. We use α = 0.06, γx = 0.775, γy = 0.775, kx = 0.5, ky = 0.1, y0 = 0.85 and z0 = 0.0. Again, the oscillation is due to the wave-like pattern of X. G: As a result, genes Y and Z have isomorphic input trees. However, the input tree of the external regulator cpxR is not isomorphic, despite the fact that it directly regulates the fiber. H: Since Y and Z synchronize, gene Z can be collapsed into Y, resulting in a simpler base representation.
We illustrate this result by presenting an analytical solution of the FFL [2–4]. We use a discrete time, continuous state variable model with a logic Boolean interaction function in the spirit of the Glass and Kauffman model of biochemical networks [27]. The dynamics of the expression levels yt and zt of genes Y and Z, respectively, as a function of time t in the cFFL is given by the following difference equations [4]: (1) where xt is the expression level of gene X, α is the degradation rate of the gene, γx and γy are the strength of the interaction representing the maximum expression rate of genes X and Y, respectively, and the thresholds kx and ky are the dissociation constant between the transcription factor and biding site. The expression level is measured in terms of abundance of gene product, e.g., mRNA concentration. The Heaviside step functions θ(xt − kx) and θ(yt − ky) represent the activator regulation from gene X and Y, respectively. They represent the Boolean logic approximation of Hill input functions in the limit of strong cooperativity [4, 27]. We consider an AND gate for the combined interaction of transcription factors of genes X and Y onto the binding sites of gene Z [4]. Analogous results shown in S1 File Section I can be obtained with an OR gate and with ODE continuum models.
In S1 File Sec. I A, we show that the expression levels of the genes Y and Z do not synchronize, in other words, limt → ∞(yt − zt) ≠ 0. For example, Fig 1B shows a particular set of parameters which results in a non-synchronized state. Such state is obtained under initial condition y0 > ky and α < γx. Specifically, we use the parameters: α = 0.2, γx = 0.12, γy = 0.7, kx = 0.5 and ky = 0.1. For this combination, yt and zt do not synchronize since yt saturates at yt → γx/α = 0.6 when t → ∞, and zt saturates at zt → γxγy/α = 0.42, for t → ∞. In this figure, we set xt equal to a square wave and then monitor the expression levels of yt and xt. When x < kx, both yt and zt decay exponentially to zero. On the other hand, when x > kx, both variables evolve to saturate again at yt = γx/α and zt = γxγy/α, in agreement with the analytical solution.
Feed-forward fibers synchronize via a symmetry fibration
In the FFL, gene Z receives input from X and Y, while gene Y, instead, only from X, and therefore the inputs are not symmetric (Fig 1C). But as it turns out, a search of motifs in biological networks [24] shows that the FFL circuit in Fig 1A regularly appears in conjunction with an autoregulation (AR) loop [28] at Y = baeR (Fig 1E). This minimal inclusion in the FFL symmetrizes the circuit and we show, next, that this symmetry results in a circuit with a first and minimal form of function: synchronization in gene expression. This can be formalized by analyzing invariances in the input tree [24], which represents all the paths that converge to a given gene (Fig 1C and 1G). The mathematical principle of symmetry fibration in dynamical systems introduced in [24, 29, 30] predicts this, since symmetries Y ↔ Z that leave invariant the tree of inputs (but not necessarily the outputs as imposed by automorphisms) are necessary and sufficient to achieve synchronization. When two genes have isomorphic input trees, their expression levels are synchronized, see [24] for details.
For instance, by means of its autoregulation, baeR forms a symmetry fibration with gene spy, that can be formalized and characterized by an isomorphism between the input trees of these genes (Fig 1G). When two input trees, like those of baeR and spy, are isomorphic, the expression levels of these two genes are synchronized. These genes are said to belong to the same ‘fiber’ [24, 30]. Genes in the same fiber are redundant, and can be collapsed into the ‘base’ (see Fig 1H) by a symmetry fibration [24] (the FFL, instead, cannot be reduced since it has no symmetry, see Fig 1D). Numerical simulations and analytical solutions (Fig 1F, S1 File Section II A and Section II B) confirm that the addition of the AR loop to the FFL—leading to a circuit that we call the Feed-Forward Fiber (FFF)—changes its functionality qualitatively, leading to synchronization of genes Y and Z into coherent co-expression. This prediction is confirmed with experimental co-expression profiles in Ref. [24].
The FFF with activator regulations in Fig 1E has a simple dynamics converging to a synchronized fixed point: all interactions are satisfied meaning that the Heaviside step functions evaluate to 1 and we call this circuit SAT-FFF. Instead, when the autoregulation is a repressor, the loop behaves as a logical NOT gate. When expression is high it inhibits itself shifting to low state. Instead, if it is low it promotes itself to shift to a high state. Hence, the activity of this gene oscillates indefinitely [14, 27]. This is the simplest expression of frustration [27, 31], a core concept in physics which refers to a system which is always in tension and thus never reaches a stable fixed configuration.
A biological transistor as a core building block
To understand the computation rationale of symmetric and frustrated circuits made of repressors, we map them to electronic analogues. We begin the analogy with the simplest circuit of a single gene with a feedback loop with repression (AR loop, Fig 2D). The dynamics for the expression level yt is described by the discrete time model with Boolean interaction [27]: (2) Here, α is the degradation rate of the Y gene product, γy is the maximum expression rate of gene Y, and ky is the dissociation constant.
A: A pnp transistor allows current flow if the voltage applied to its base is lower than the voltage at its emitter (VB < VE). Since it has a high (low) output for a lower (high) input, it is logically represented by a NOT gate. The yellow box shows the mapping between the pnp transistor and the biological repressor. B: A repressor regulation link plays the role of the pnp transistor since the rate of expression of a gene is repressed by gene Y if ky < yt. C: By connecting the base of the transistor to its collector, one forms a one stage ring oscillator. D: This connection is translated to the biological analogue as a repressor autoregulation at gene Y. In this way, the rate expression of gene Y is able to oscillate, depending on the adjustment of parameters α, ky and γy (see S1 File Section III). E: Widlar current-mirror circuit and F: its biological analogue (UNSAT-FFF). By mirroring the ring oscillator, the Widlar mirror circuit allows synchronization and oscillations (see S1 File Section III). G: Phase diagram of oscillations of the UNSAT-FFF. An oscillatory phase is defined by the condition γy/ky > (γx/α)−1 (see S1 File Section III). For instance, on the right side, we plot the solution of the discrete dynamics for a set of parameters satisfying such condition. Specifically, α = 0.205, γx = 0.454, γy = 0.454, kx = 0.5, and ky = 1.0.
The Heaviside step function θ(ky − yt) reflects the repressor autoregulation in the Boolean logic approximation. We will show that this genetic repressor interaction, shown as the stub in Fig 2B, is the genetic analogue of a solid-state transistor shown in Fig 2A.
A transistor is typically made up of three semiconductors, a base sandwiched between an emitter and a collector (Fig 2A). The current flows between the emitter and collector only if voltage applied to the base is lower than at the emitter (VB < VE) and thus the transistor acts as a switch and inverter. In the genetic circuit, the expression yt drives the rate of expression of gene Y, like the voltage drives current around an electric circuit. Simply comparing Eq (2) to the pnp transistor in Fig 2A leads to the analogy in which the expression yt is an analogue for the base potential VB of a transistor, ky an analogue for VE, γy an analogue for the emitter current IE, αyt an analogue for IB, and Δyt an analogue for IC. Then, Eq (2) provides the genetic equivalent of the equation for a transistor’s collector current IC = IE − IB (Fig 2C and 2D).
The repressor AR genetic circuit of Fig 2D becomes a one-stage ring oscillator (Fig 2C) where the collector of the transistor connects to the base forming the minimal signal feedback loop. As shown in Fig 2D, an example of the AR is the gene Y = trpR from the E. coli transcriptional network. The repressor AR loop can be extended to the FFF by symmetrizing it, adding gene Z, such that it synchronizes with Y to express an enzyme that catalyzes a biochemical reaction (Fig 2F). The circuit is completed with the external regulator X which keeps the symmetry between Y and Z. The resulting circuit is called UNSAT-FFF (since it is frustrated).
The UNSAT-FFF maps to the so-called Widlar current-mirror electronic circuit shown in Fig 2E, a popular building block of integrated circuits used since the foundations of the semiconductor industry (1967 US patent [32, 33]). It serves two key functions as we show below: mirror synchronization of yt = zt and oscillatory activity (Fig 2G and S1 File Section III).
UNSAT-FFF solution oscillates and synchronize
Next, we show analytically and numerically that the UNSAT-FFF circuit has an oscillatory solution plus synchronization of genes Y and Z. S1 File Section III A shows that the prediction of oscillatory behavior can be obtained also from a model of gene expression in the continuum time-domain using a first-order ODE model with time delay due to the process of transcription and translation. Below, we focus on the discrete dynamics.
The UNSAT-FFF circuit is obtained by the addition of a repressor AR loop to the FFL: in Fig 2F, gene Y acts as a repressor regulator on the gene Z and on itself. There are many variants of this circuit depending on the combinations of activator and repressor regulators. Here, we use X gene as an activator and Y gene as a repressor. Different variants yields analogous results and will be discussed elsewhere. The important ingredient is the existence of frustration in the interactions. For instance if gene X is high, then it will make genes Y and Z high too. However, this configuration does not satisfy the repressor autoregulation bond neither the repression from Y → Z. Thus, two bonds are unsatisfied and this circuit unsatisfied: UNSAT-FFF. The discrete-time dynamics of the expression levels of genes yt and zt are given by: (3) where γx and γy are the strength of the interaction (maximum expression rate) of genes X and Y, respectively, and kx and ky are the dissociation constant, respectively. Similarly to the SAT-FFF case, synchronization between y and z occurs for the UNSAT-FFF (see S1 File Section II). However, the impact of the repressor feedback loop on the dynamical behavior of this circuit is more profound, since it leads to oscillations. Thus, while both, SAT-FFF and UNSAT-FFF, lead to synchronization of Y and Z, the former synchronizes into a fixed point and the later into an oscillatory limit cycle.
We set λ = γxγy/ky α, and β = 1 − α, so that we rewrite Eq (3) for the rescaled variables ψt = yt/ky and ζt = zt/ky as (4) Now, we set xt = x constant in time for simplicity. For x < kx, the solutions exponentially decay as ψt = ψ0e−t/τ and ζt = ζ0e−t/τ, where ψ0 is the initial condition. For x > kx, Eq (4) defines an iterative map which satisfies the following recursive equation: (5) This iterative map results in different solutions depending on the value of λ.
We consider first the case where the initial condition is ψ0 > 1. Thus, the solution of Eq (4) is ψt = ψ0e−t/τ, where τ−1 = −log(1 − α). This solution is correct as long as ψt > 1, but ceases to be valid at a certain time t* such that ψt < 1, which is given by t* = ⌈τ log ψ0⌉.
Next, we consider the case ψ0 < 1. In this case the solution is given by ψt = ψ0e−t/τ + λ(1 − e−t/τ), which is always valid for λ < 1. Thus, when λ < 1 the system does not oscillate but it converges monotonically to a fixed point ψ∞ = λ. However when λ > 1, this solution ceases to be valid at the time such that ψt > 1. Therefore, the solution ψt oscillates in time for λ > 1. For the case of ψ0 > 1, the explicit solution is given by the general analytical expression which is plotted in Fig 2G, right: (6) The general solution with initial condition ψ(t0) < 1 can be written in a similar way.
Thus, the main condition for oscillations in the circuits is λ > 1, and if λ < 1, there is no oscillatory behavior, and the solution ψt converges monotonically to λ. Therefore, the oscillatory phase is separated from the non-oscillatory phase by the condition: (7) which is depicted in the phase diagram in Fig 2G, left.
Thus, the repressor autoregulation at gene Y converts the circuit into a synchronized clock, a primary building block in any logic computational device. S1 File Section V and S2 File show all FFFs found across gene regulatory networks of the studied species spanning A. thaliana, M. tuberculosis, B. subtilis, E. coli, salmonella, yeast, mouse and humans. We also show in Table 1 the count of circuits across species and their associated Z-scores showing that these circuits are statistically significant. The algorithm to find these fibers in biological networks is explained in [24] and S1 File Section IV and it is available at https://github.com/makselab/FiberCodes.
A broad class of logic and dynamic repertoires in the class of fibers
The procedure to build more complex fibers can be systematically extended through an algebra of circuits that adds external regulators and loops to grow the base of symmetric circuits (Fig 3 and S1 File Section IV A). In this space, the AR is the core loop unit referred to as |n = 1, ℓ = 0〉 in the nomenclature of [24] since it has n = 1 loop and ℓ = 0 external regulator (Fig 3B), and the FFF is |n = 1, ℓ = 1〉 (Fig 3C).
The addition of autoregulation loops and feedback loops results on a hierarchy of circuits of increasing complexity. For example, turning the A: repressor link into a B: repressor autoregulation results on an input tree that feeds its own expression levels with Qt = 1 and branching ratio n = 1 and it is equivalent to its own base. C: The addition of an external regulator (ℓ = 1) creates the UNSAT-FFF where genes Y and Z synchronize and oscillate. This can be verified in its corresponding input tree and logic circuit. D: The addition of a second feedback loop results in an input tree that follows the Fibonacci sequence Qt = 1, 2, 3, 5, 8, …. Here, gene X is not part of the base. The branching structure of the input tree implies that the Fibonacci Fiber can oscillate and synchronize, but is unable to store static information. Examples are shown from all studied species. E: The second autoregulation at X results in a symmetric input tree with Qt = 2Qt−1 and branching ratio n = 2. This fiber collapses into a base with two autoregulators. Examples of n = 2 Fiber are two fibers from the regulatory networks of B. subtilis. In this figure, we present activator links (black), repressor links (red), and interactions with unknown functionality (grey).
From this starting point, one can grow the number of regulator genes, |1, ℓ〉 with ℓ > 1. This does not affect the complexity because all the relevant dynamics remain constrained to the sole loop in the FFF circuit. Likewise, there are a number of circuits that can correspond to |1, 1〉. However, only certain modifications conserve the topological class identified by |1, 1〉. For instance, changing the sign of the edges is allowed as long as the edges of each gene are the same, but removing the edge X→Z will break the symmetry of the fiber, so it is not allowed. Adding a second node downstream of Y will conserve the topological class but only if it interacts with X. This situation changes as soon as the fiber feeds information back to the external world. This is the case of the circuits in Fig 3D, where the gene Y now regulates its own regulator gene X. The inclusion of this second feedback loop results in the coexistence of two time-scales in the network. This, in turn, increases the diversity of trajectories and delays in the network; a dynamic complexity that is measured by the divergence of the input tree, which is captured by the sequence Qt representing the number of source genes with paths of length t − 1 to the target gene, see Fig 3 and [24]. Simply put, the input tree is a rooted tree with a gene at the root (Q1 = 1), and Qt represents the number of genes in the t−th layer of the input tree. Then, the divergence of the input tree is captured by its branching ratio measured by , see [24] for details.
A quantitative analysis of this measure yields exactly the golden ratio for the circuits in Fig 3D, revealing that the input tree is a Fibonacci sequence Qt = Qt−1 + Qt−2 updating the current state two steps backwards, see [24]. We have called this class of circuits, Fibonacci Fibers, in [24]. For example, the repressor interactions between genes X = uxuR, Y = exuR, and Z = lgoR from the E. coli network function exactly as a Fibonacci Fiber (Fig 3D). Other typical examples of Fibonacci Fibers in the transcriptional networks across species are also shown in Fig 3D. S1 File Section V and S2 File show all found fibers, and Table 1 the counts for all fibers and their associated Z-scores showing that these circuits are statistically significant. The important component of these circuits is the delay in the feedback loop through the regulator from Y → X and back to Y captured by the Qt−2 term in the Fibonacci sequence. This circuit has been synthetically implemented by Stricker et al. [22] using a hybrid promoter that drives the transcription of genes araC and lacI forming a dual-feedback circuit. The functionality of this circuit has been demonstrated to be robust oscillations due to the negative feedback loop [22]. We will show next that a symmetry breaking in this Fibonacci circuit forms the base of the JK flip-flop, which is the universal storage device of computer memories [33].
The complexity of the Fibonacci Fiber with feedback to the regulator is 1.6180…, which is lower than the number of loops in the circuit (two). The intuition of what this reveals is that, in this circuit the regulator X is still not part of the base (see Fig 3D), since it does not receive input from itself and hence it is not within the symmetry of the fiber. This, in turn, indicates naturally that the next element in the hierarchy of fibers results from the inclusion of an AR loop in X. This creates a fully symmetric circuit (Fig 3E) with a core |2, ℓ = 0〉 that feeds the reporter/enzyme Z. In this case, the complexity of the circuit is exactly two (the number of autoregulators within the fiber). Examples are shown in Fig 3E and Supplementary Materials.
Broken symmetry circuits as memory storage devices
All symmetric circuits shown in Fig 3 [24] work as clocks with varying levels of sophistication and robustness given by their time-scales or loops (see S1 File Sections III C and IV A). Additionally, the more complex Fibonacci Fibers store memory dynamically by integrating sequences of its two immediate past states, according to the input tree, which computes temporal convolutions. However, this is only a short-term memory, stored dynamically and continually erased. This raises the necessity of understanding how these canonical biological circuits can perform controlled memory set and reset, a fundamental component of all computing devices [33].
We show next that static memory storage requires breaking the fibration symmetry of each circuit creating structures analogous to ‘flip-flops’ [33] in electronics that use a bistable toggle switch [14, 15] to store a bit of binary information into computer memory. As we show below, in genetic networks, symmetry breaking endows the circuits with the ability to remember.
Next, we extend the constructive process described above to include symmetry breaking. We do so by mimicking an evolutionary process where circuits ‘grow’ by a ‘duplication’ event (analogous to gene duplication in evolution) that conserves the base of the original circuit (Fig 4, first and second row). Then, breaking the symmetry creates a new functionality. We begin this with the simplest case of AR gene Y: |n = 1, ℓ = 0〉. This gene is duplicated by ‘opening up’ the AR loop into two mutually repressed replica genes, Y and Y’ (Fig 4, second row). This creates a bistable toggle switch as studied in synthetic genetic circuits [14, 15] that is topologically equivalent to the core AR loop |1, 0〉 since both have the same input tree and base (Fig 4, first row).
AR (first column): The replica symmetry duplication of the AR symmetric circuit results in a network that is analogous to the SR flip-flop circuit. The symmetry between Y and Y’ is broken by the inputs S and R, such that S ≠ R, resulting into a pair logical outputs Q and . FFF (second column): Following the same strategy, we replicate the FFF through a replica symmetry duplication. Note that this operation adds a second level of logic gates to the SR flip-flop. In order to have consistent logic operations, we add an input clock gene CLK in addition to S and R. The resulting circuit is analogous to the Clocked SR flip-flop logic circuit. Fibonacci (third column: The replica symmetry duplication of the Fibonacci Fiber results in a logic circuit which is analogous to the JK flip-flop. For each symmetry breaking class, we show two examples of circuits from the human regulatory network. The external regulator genes, S and R (J and K), provide inputs which are logically processed by the circuit, accordingly to the type of interaction links between the genes, activators (black arrows) or repressors (red flat links). The outputs of the circuits (green genes) regulate the expression levels of other genes (in red) without affecting the circuit functionality. Here, grey arrows correspond to interactions with unknown functionality.
The symmetry of the replica circuit is then explicitly broken by including different inputs, S and R, to regulate genes Y and Y’, respectively (Fig 4, third row). S and R represent either genes or effectors that break the symmetry of the circuit. Symmetry is now broken, and the genetic circuit becomes analogous to a Set-Reset (SR) flip-flop [33], the simplest canonical circuit to store a bit of binary information in electronic computer memory (see Fig 4, fourth row).
The symmetry is explicitly broken by applying set-reset inputs S ≠ R. Specifically, when S = 1 and R = 0, the circuit stores a bit of information. But here is the interesting fact: this state is kept in memory even when S = R = 1. That is, the circuit remains in the broken symmetry state even if the inputs are now equal and symmetric. In other words, the symmetry is now ‘spontaneously’ broken [34], since a specific state is selected even without external bias, thus allowing the circuit to remember its state.
Thus, this genetic network is a toggle switch as studied in synthetic genetic circuits [14, 15] analogous to the SR flip-flop logic circuit shown in Fig 4, fourth row. When the controlling inputs are S = 0 and R = 1, the SR flip-flop stores one bit of information resulting in Q = 1 and (, and the output Q is the logic conversion of not Y). The SR flip-flop retains this logical state even when the controlling inputs change. In other words, when S = 1 and R = 1, the feedback (the repressor interactions between Y and Y’) maintains the outputs Q and to its previous state. This state changes only when we reset the circuits with S = 1 and R = 0. In this last case, the SR flip-flop stores Q = 0 and , which is also remembered when both inputs are high (S = 1 and R = 1, S1 File Section IV B).
This spontaneous symmetry breaking [34] has analogy with a ferromagnetic material. When the temperature is high enough, the ferromagnet does not show any magnetic property. Moreover, even lowering the temperature, in absence of any polarizing field, the material does not magnetize. On the contrary, if an external magnetic field is applied to the ferromagnet, like a magnet put in contact to a needle for enough time, and then removed, then the needle becomes magnetic itself, in that it remembers the direction of the previously applied magnetic field, thus breaking, spontaneously, the rotational symmetry. Biological realizations of this replica symmetry breaking process are shown in Fig 4, last row. Full list of symmetry broken circuits in gene regulatory networks across species appears in S1 File Section V and S2 File. Table 2 shows the Z-scores of these circuits indicating their significance. The algorithm to identify these broken symmetry flip-flops in biological networks is developed in S1 File Section VII and is available at https://github.com/makselab/CircuitFinder.
Extending this duplication and symmetry breaking process to the FFF (Fig 4, second column), one can replicate the X and Y genes from the FFF base to create a circuit isomorphic to the Clocked SR flip-flop, another computer memory building block [33] (see S1 File Section IV B). The symmetry is broken by regulators or inducers of genes X and X’ acting as S (set) and R (reset) of memory, and it is restored when S = R, leaving the system ‘magnetized’. The hierarchy continues by replicating the Fibonacci Fiber and consequent breaking of symmetry when the inputs J and K are different (Fig 4 third column, see S1 File Section IV B). This structure is isomorphic (i.e., has the same base) to the JK flip-flop in electronics, which is the most widely used of all flip-flop designs [33]. In its symmetric state, the JK flip-flop is isomorphic to the symmetric Fibonacci base. In the symmetry broken phase, it acts as a memory device which presents two possibilities. A ‘chiral’ symmetry (where Y feeds X and Y’ feeds X’) and a ‘parity’ symmetry (left-right reflection, where Y feeds X’ and Y’ feeds X). This last one is the one realized in biological circuits, see Fig 4, third column. Examples of JK flip-flops are abundant in gene regulatory networks in human and mouse. They are shown in Fig 4, last row and full list in S1 File Section V, S2 File, and Z-scores in Table 2.
Discussion
In summary, fibration symmetries and broken symmetries in gene regulatory networks reveal the functions of synchronization, clocks and memory through electronic analogues of transistors, ring oscillators, current-mirror circuits, and flip-flops. They result in a hierarchy of building blocks with progressively more complex dynamics obtained by iterating a procedure of replication and symmetry breaking. Beyond the circuits discussed here, the biological hierarchy can be extended to any number m of loops of length d and autoregulators in the fiber n, to form ever more sophisticated circuits whose complexity is expressed in generalized Fibonacci sequences Qt = nQt−1 + mQt−d.
Gene regulatory structures are a mixture of combinational logic circuits, like FFF, and sequential logic circuits, like FF. They provide the network with a structure analogous to a programmable logic device or chip where the ‘register’ is a set of flip-flop circuits tied up together acting as the memory clock of the genetic network that feeds the combinatorial logic circuits made of simpler feed-forward circuit of low symmetry. Complex biological circuitry can then be seen as an emergent process guided by the laws of symmetry that determine biological functions analogous to electronic components. The discovery of these building blocks and building rules of logic computation will allow to: (1) systematically design synthetic genetic circuits following biological symmetry, and (2) systematically map the structure and function of all biological networks, from the symmetries of the connectome [35] to genetic [24], protein and metabolic networks, following a first principle theoretical approach.
Supporting information
S1 File. Supplementary materials.
Detailed description of all analytical solutions mentioned in the main text, of the data acquisition and treatment, and detailed description of the proposed algorithm to find fibers.
https://doi.org/10.1371/journal.pcbi.1007776.s001
(PDF)
S2 File. List of symmetry and broken symmetry circuits.
In S2 File we present a list of circuits found in different species.
https://doi.org/10.1371/journal.pcbi.1007776.s002
(PDF)
References
- 1. Hartwell LH, Hopfield JJ, Leibler S, Murray AW. From molecular to modular cell biology. Nature. 1999;402: C47–C52. pmid:10591225
- 2. Milo R, Shen-Orr SS, Itzkovitz S, Kashtan N, Chklovskii D, Alon U. Network motifs: simple building blocks of complex networks. Science. 2002;298: 824–827. pmid:12399590
- 3. Shen-Orr SS, Milo R, Mangan S, Alon U. Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genet. 2002;31: 64–68. pmid:11967538
- 4.
Alon U. An Introduction to Systems Biology: Design Principles of Biological Circuits. Boca Raton: CRC Press; 2006.
- 5.
Klipp E, Liebermeister W, Wierling C, Kowald A, Herwig R. Systems Biology: a textbook. Weinheim: Wiley-VCH; 2016.
- 6. Ingram PJ, Stumpf MP, Stark J. Network motifs: structure does not determine function. BMC Genomic. 2006;7: 108.
- 7. Payne JL, Wagner A. Function does not follow form in gene regulatory circuits. Sci Rep. 2015;5: 13015. pmid:26290154
- 8. Macía J, Widder S, Solé R. Specialized or flexible feed-forward loop motifs: a question of topology. BMC Syst Biol. 2009;3: 84. pmid:19719842
- 9. Ahnert SE, Fink TMA. Form and function in gene regulatory networks: the structure of network motifs determines fundamental properties of their dynamical state space. J Royal Soc Interface. 2016;13: 20160179.
- 10. Tyson JJ, Chen KC, Novak B. Sniffers, buzzers, toggles and blinkers: dynamics of regulatory and signaling pathways in the cell. Curr Opin Cell Biol. 2003;15(2): 221–31. pmid:12648679
- 11. Monod J, Jacob F. General conclusions: teleonomic mechanisms in cellular metabolism, growth and differentiation. Cold Spring Harb Symp Quant Biol. 1961;26: 389–401.
- 12. Teo JY, Woo SS, Sarpeshkar R. Synthetic Biology: A Unifying View and Review Using Analog Circuits. IEEE Trans. on Biomed. Circuits and Syst. 2015;9: 453–474.
- 13. Dalchau N, Szé G, Hernansaiz-Ballesteros R, Barnes CP, Cardelli L, Phillips A, et al. Computing with biological switches and clocks. Natural Computing. 2018;17: 761–779. pmid:30524215
- 14. Atkinson MR, Savageau MA, Myers JT, Ninfa AJ. Development of genetic circuitry exhibiting toggle switch or oscillatory behavior in Escherichia coli. Cell. 2003;113: 597–607. pmid:12787501
- 15. Gardner TS, Cantor CR, Collins JJ. Construction of a genetic toggle switch in Escherichia coli. Nature. 2000;403: 339–342. pmid:10659857
- 16. Kramer BP, Fussenegger M. Hysteresis in a synthetic mammalian gene network. Proc Natl Acad Sci USA. 2005;102: 9517–9522. pmid:15972812
- 17. Kramer BP, Viretta AU, Baba MD-E, Aube D, Weber W, Fussenegger M. An engineered epigenetic transgene switch in mammalian cells. Nature Biotech. 2004;22: 867–870.
- 18. Guet CC, Elowitz MB, Hsing W, Leibler S. Combinatorial synthesis of genetic networks. Science. 2002;296: 1466–1470. pmid:12029133
- 19. Ajo-Franklin CM, Drubin DA, Eskin JA, Gee EP, Landgraf D, Phillips I, et al. Rational design of memory in eukaryotic cells. Genes Dev. 2007;21: 2271–2276. pmid:17875664
- 20. Ham TS, Lee SK, Keasling JD, Arkin AP. Design and construction of a double inversion recombination switch for heritable sequential genetic memory. PLoS One. 2008;3: e2815. pmid:18665232
- 21. Elowitz MB, Leibler S. A synthetic oscillatory network of transcriptional regulators. Nature. 2000;403: 335–338. pmid:10659856
- 22. Stricker J, Cookson S, Bennett MR, Mather WH, Tsimring LS, Hasty J. A fast, robust and tunable synthetic gene oscillator. Nature. 2008;456: 516–519. pmid:18971928
- 23. Tigges M, Marquez-Lago TT, Stelling J, Fussenegger M. A tunable synthetic mammalian oscillator. Nature. 2009;457: 309–312. pmid:19148099
- 24. Morone F, Leifer I, Makse HA. Fibration symmetries uncover the building blocks of biological networks. Proc Natl Acad Sci USA. 2020;117: 8306–8314. pmid:32234788
- 25. Mangan S, Alon U. Structure and function of the feed-forward loop network motif. Proc Natl Acad Sci USA. 2003;100: 11980–11985. pmid:14530388
- 26. Mangan S, Zaslaver A, Alon U. The coherent feedforward loop serves as a sign-sensitive delay element in transcription networks. J Mol Biol. 2003;334: 197–204. pmid:14607112
- 27. Glass L, Kauffman SA. The logical analysis of continuous, non-linear biochemical control networks. J Theor Biol. 1973;38: 103–129.
- 28. Mangan S, Zaslaver A, Alon U. Negative autoregulation increases the input dynamic-range of the arabinose system of Escherichia coli. BMC Sys Biol. 2011;5: 111.
- 29. Golubitsky M, Stewart I. Nonlinear dynamics of networks: the groupoid formalism. Bull Am Math Soc. 2006;43: 305–364.
- 30. Boldi P, Vigna S. Fibrations of graphs. Discrete Mathematics. 2001;243: 21–66.
- 31. Anderson PW. The concept of frustration in spin glasses. J of the Less-Common Metals. 1978;62: 291–294.
- 32. Widlar RJ, Some circuit design techniques for linear integrated circuits. IEEE Trans Circuit Theory. 1965;4: 586–590. See also Widlar RJ: US Patent Number 3,320,439; Filed May 26, 1965; Granted May 16, 1967: Low-value current source for integrated circuits and Widlar RJ. Design techniques for monolithic operational amplifiers. IEEE Solid-State Circuits. 1969;4: 184-191.
- 33.
Horowitz P, Hill W. The Arts of Electronics. 3rd ed. New York:Cambridge University Press; 2015.
- 34.
Weinberg S. The Quantum Theory of Fields. Cambridge: Cambridge University Press; 2005.
- 35. Morone F, Makse HA. Symmetry group factorization reveals the structure-function relation in the Caenorhabditis elegans connectome. Nat Commun. 2019;10: 4961.