## Figures

## Abstract

The timing of DNA synthesis, mitosis and cell division is regulated by a complex network of biochemical reactions that control the activities of a family of cyclin-dependent kinases. The temporal dynamics of this reaction network is typically modeled by nonlinear differential equations describing the rates of the component reactions. This approach provides exquisite details about molecular regulatory processes but is hampered by the need to estimate realistic values for the many kinetic constants that determine the reaction rates. It is difficult to estimate these kinetic constants from available experimental data. To avoid this problem, modelers often resort to ‘qualitative’ modeling strategies, such as Boolean switching networks, but these models describe only the coarsest features of cell cycle regulation. In this paper we describe a hybrid approach that combines the best features of continuous differential equations and discrete Boolean networks. Cyclin abundances are tracked by piecewise linear differential equations for cyclin synthesis and degradation. Cyclin synthesis is regulated by transcription factors whose activities are represented by discrete variables (0 or 1) and likewise for the activities of the ubiquitin-ligating enzyme complexes that govern cyclin degradation. The discrete variables change according to a predetermined sequence, with the times between transitions determined in part by cyclin accumulation and degradation and as well by exponentially distributed random variables. The model is evaluated in terms of flow cytometry measurements of cyclin proteins in asynchronous populations of human cell lines. The few kinetic constants in the model are easily estimated from the experimental data. Using this hybrid approach, modelers can quickly create quantitatively accurate, computational models of protein regulatory networks in cells.

## Author Summary

The physiological behaviors of cells (growth and division, differentiation, movement, death, etc.) are controlled by complex networks of interacting genes and proteins, and a fundamental goal of computational cell biology is to develop dynamical models of these regulatory networks that are realistic, accurate and predictive. Historically, these models have divided along two basic lines: deterministic or stochastic, and continuous or discrete; with scattered efforts to develop hybrid approaches that bridge these divides. Using the cell cycle control system in eukaryotes as an example, we propose a hybrid approach that combines a continuous representation of slowly changing protein concentrations with a discrete representation of components that switch rapidly between ‘on’ and ‘off’ states, and that combines the deterministic causality of network interactions with the stochastic uncertainty of random events. The hybrid approach can be easily tailored to the available knowledge of control systems, and it provides both qualitative and quantitative results that can be compared to experimental data to test the accuracy and predictive power of the model.

**Citation: **Singhania R, Sramkoski RM, Jacobberger JW, Tyson JJ (2011) A Hybrid Model of Mammalian Cell Cycle Regulation. PLoS Comput Biol 7(2):
e1001077.
doi:10.1371/journal.pcbi.1001077

**Editor: **Daniel A. Beard, Medical College of Wisconsin, United States of America

**Received: **June 13, 2010; **Accepted: **January 7, 2011; **Published: ** February 10, 2011

**Copyright: ** © 2011 Singhania et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

**Funding: **This work was supported by NIH Subaward No. 5710002511 to J.J.T. through the Integrative Cancer Biology Program at MIT (Prime Award No. U54-CA112967); by NSF award DMS-0342283 to J.J.T.; by NIH award R01-CA73413 to J.W.J., and P30-CA43703 for Core Facilities at CWRU. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

**Competing interests: ** The authors have declared that no competing interests exist.

## Introduction

The cell division cycle is the fundamental physiological process by which cells grow, replicate, and divide into two daughter cells that receive all the information (genes) and machinery (proteins, organelles, etc.) necessary to repeat the process under suitable conditions [1]. This cycle of growth and division underlies all biological expansion, development and reproduction. It is highly regulated to promote genetic fidelity and meet the demands of an organism for new cells. Altered systems of cell cycle control are root causes of many severe health problems, such as cancer and birth defects.

In eukaryotic cells, the processes of DNA replication and nuclear/cell division occur sequentially in distinct phases (S and M) separated by two gaps (G1 and G2). Mitosis (M phase) is further subdivided into stages: prophase (chromatin condensation, spindle formation, and nuclear envelope breakdown), prometaphase (chromosome attachment and congression), metaphase (chromosome residence at the mid-plane of the spindle), anaphase (sister chromatid separation and movement to opposite poles of the spindle), telophase (re-formation of the nuclear envelopes), and cytokinesis (cell division). G1 phase is subdivided into uncommitted and committed sub-phases, often referred to as G1-pm (postmitotic interval) and G1-ps (pre S phase interval), separated by the ‘restriction point’ [2]. In this paper, we shall refer to the sub-phases G1-pm and G1-ps as ‘G1a’ and ‘G1b’ respectively.

Progression through the correct sequence of cell-cycle events is governed by a set of cyclin-dependent kinases (Cdk's), whose activities rise and fall during the cell cycle as determined by a complex molecular regulatory network. For example, cyclin synthesis and degradation are controlled, respectively, by transcription factors and ubiquitin-ligating complexes whose activities are, in turn, regulated by cyclin/Cdk complexes.

Current models of the Cdk control system can be classified as either continuous or discrete. Continuous models track the changes of protein concentrations, *C _{j}*(

*t*) for

*j*= 1, 2, …,

*N*, by solving a set of nonlinear ordinary differential equations (ODEs) of the form:(1)where ρ

*is the rate of the*

_{r}*r*

^{th}reaction and ν

*is the stoichiometric coefficient of species*

_{ir}*i*in reaction

*r*. To each rate term is associated one or more kinetic constants that determine exactly how fast the reaction proceeds under specific conditions. These kinetic constants must be estimated from experimental data, and often there is insufficient kinetic data to determine their values. Nonetheless, continuous models, based on rate equations, have been used successfully to account for the properties of cell proliferation in a variety of cell types: yeast [3], [4], [5], fruit fly [6], frog egg [7], [8], and cultured mammalian cells [9], [10], [11]. They have also proved successful in predicting novel cell-cycle characteristics [12], [13].

Discrete models, on the contrary, represent the state of each regulatory protein as *B _{j}*(

*τ*) = 0 or 1 (inactive or active), and the state variables update from one discrete time step to the next (

*τ*= 0, 1, 2, … = ticks of a metronome) according to the rule:(2)where Ψ

*(…) is a Boolean function (i.e., it equates to either 0 or 1) determined by the topology of the reaction network. For Boolean networks (BNs) there is no notion of reaction ‘rate’ and, hence, no need to estimate kinetic constants. BN models of the Cdk regulatory network have been proposed for yeast cells [14], [15] and for mammalian cells [16]. They have been used to study notions of ‘robustness’ of the cell cycle, but they have not been compared in detail to quantitative properties of cell cycle progression, and they have not been used as predictive tools.*

_{j}In this paper we propose to combine the strengths of both continuous and discrete modeling, while avoiding the weaknesses of each. Our ‘hybrid’ model is inspired by the work of Li et al. [14], who proposed a BN for cell cycle controls. Their model employs 11 state variables that move around in a space of 2^{11} = 2048 possible states. Quite remarkably they found that 1764 of these states converge quickly onto a ‘super highway’ of 13 consecutive states that represent a typical cell cycle trajectory (G1b—S—G2—M—G1a). The results of Li et al. indicate that the cell cycle control network is ‘robustly designed’ in the sense that even quite large perturbations away from the usual sequence of cell cycle states are quickly restored to the super highway. In the model of Li et al., G1a is a stable steady state; they do not address the signals that drive cells past the restriction point (the G1a-to-G1b transition).

Despite their intuitive appeal, Boolean models have severe limitations. First of all, metronomic time in BN's is unrelated to clock time in the laboratory, so Boolean models cannot be compared to even the most basic observations of time spent by cells in the four phases of the division cycle [1]. Also, these models do not incorporate cell size, so they cannot address the evident importance of cell growth in driving events of the cell cycle [17], [18], [19]. Lastly, cyclins are treated as either absent or present (0 or 1), so Boolean models cannot simulate the continuous accumulation and removal of cyclin molecules at different stages of the cell cycle [20].

Our goal is to retain the elegance of the Boolean representation of the switching network, while introducing continuous variables for cell size, cell age, and cyclin composition, in order to create a model that can be compared in quantitative detail to experimental measurements with a minimal number of kinetic parameters that must be estimated from the data. To this end, we keep the cyclin regulators as Boolean variables but model the cyclins themselves as continuous concentrations that increase and decrease due to synthesis and degradation. Next, we replace the Boolean model's metronome with real clock time to account for realistic rates of cyclin synthesis and degradation, and for stochastic variability in the time spent in each Boolean state of the model. Finally, we introduced a cell size variable, *M*(t), which affects progression through late G1 phase. *M*(t) increases exponentially with time as the cell grows and decreases by a factor of ∼2 when the cell divides. (The assumption of exponential growth is not crucial; similar results are obtained assuming linear growth between cell birth and division.)

Since the pioneering work of Leon Glass [21], [22], hybrid (discrete-continuous) models have been employed by systems biologists in a variety of forms and contexts [23], [24], [25]. Engineers have been modeling hybrid control systems for many years [26], [27], [28], and they have created powerful simulation packages for such systems [29]: SIMULINK [28], SHIFT [30], [31] and CHARON [32], to name a few. We have not used these simulation packages because our model can be solved analytically.

## Results

### Hybrid modeling approach

The modeling approach we are proposing is hybrid in two senses. First, we employ both continuous and discrete variables, and second we allow for both deterministic and stochastic processes. Concerning the components of the control system, we track cyclin levels as continuous concentration variables, but we use discrete Boolean variables to represent the activities (‘on’ or ‘off’) of the regulatory proteins (transcription factors and ubiquitinating enzymes) that control cyclin synthesis and degradation. This distinction is equivalent to a presumed ‘separation of time scales’: the activities of the regulatory proteins change rapidly between 0 and 1, while the concentrations of cyclins change more slowly due to synthesis and degradation. The Boolean variables, we assume, proceed from one state to the next according to a fixed sequence corresponding roughly to the super highway of Li et al. [14]. The time spent in each state, however, is not a ‘tick’ of the metronome but rather the sum of a deterministic execution time (which may be 0) plus a random, exponentially distributed waiting time. In this sense, the model combines deterministic and stochastic processes.

In its present version, our model is not fully autonomous. The discrete variables do not update according to Boolean functions of the current state of the network. Rather, they go through a fixed sequence of states predetermined by the Boolean network model of Li et al. [14]. The discrete variables determine the rates of synthesis and degradation of the continuous variables (the cyclins), and the cyclins feedback on the discrete variables by determining how much time is spent in some of the Boolean states. This strategy keeps the model simple and is appropriate for the cases, considered in this paper, of unperturbed cycling of ‘wild type’ cells, which travel serenely along the super highway of Li et al. To consider more complicated cases, of mutant cells that travel a different route through discrete state space or of cells that are perturbed by drugs or radiation, we will have to elaborate on this basic model with additional rules governing the interactions of the discrete and continuous variables. We are currently working on alternative strategies to adapt this basic modeling paradigm to more complex situations.

Our model (Fig. 1) tracks three cyclin species (A, B and E), two transcription factors (‘TFE’ and ‘TFB’) and two different E3 ubiquitin-ligase complexes (APC-C and SCF). TFE drives the synthesis of cyclins E and A early in the cell cycle (comparable to the E2F family of transcription factors) [33], and TFB drives the synthesis of cyclins B and A late in the cell cycle (comparable to FoxM1 and Myc) [34], [35]. The Anaphase Promoting Complex—Cyclosome (APC-C) is active during M phase and early G1, when it combines with Cdc20 and Cdh1 to label cyclins A and B for degradation by proteasomes. We make a further distinction between Cdc20 activity on cyclin A (Cdc20A, active throughout mitosis) from Cdc20 activity on cyclin B (Cdc20B, activated at anaphase). The SCF labels cyclin E for degradation via ubiquitination, but only when cyclin E is phosphorylated [36], which we assume is correlated primarily with cyclin A/Cdk2 activity [37].

(A) The synthesis and degradation of cyclin proteins is regulated by transcription factors (TFE and TFB) and by ubiquitination machinery (SCF, Cdc20 and Cdh1). (B) Three successive cell cycles are simulated as explained in the Methods. Upper panel: gray curve, 30·*M*(*t*); blue curve, [CycE]·*M*(*t*); the gold line and the pink line indicate the time periods when TFE = 1 and SCF = 1, respectively. Lower panel: green curve, [CycA]·*M*(*t*); red curve, [CycB]·*M*(*t*); the colored bars indicate the time periods when the Boolean variables are active, according to the legend in the inset.

In our model, the two transcription factors and the four ubiquitination factors are each represented by a Boolean variable, *B*_{TFE}, etc. For each cyclin component we write an ordinary differential equation, d[CycX]/d*t* = *k*_{sx}−*k*_{dx}[CycX], where the rate ‘constants’ for synthesis and degradation, *k*_{sx} and *k*_{dx}, depend on the Boolean variables (see Table 1). Hence, each cyclin concentration is governed by a piecewise linear ODE. The parameters in the model (, , etc.) are assigned numerical values (Table 1), chosen to fit observations of how fast cyclins accumulate and disappear during different phases of the cell cycle.

Next, we must assign rules for updating the Boolean variables in the model. We assume that the Boolean variables follow a strict sequence of states (see Table 1) that corresponds roughly to the super highway discovered by Li et al. [14]. This sequence of states conforms to current ideas of how the mammalian cell cycle is regulated. Newborn cells are said to be in ‘G1a’ state, because they are not yet committed to a new round of DNA synthesis and mitosis. The transcription factors, TFE and TFB, are silent, and Cdh1/APC-C is active, so the levels of cyclins A, B and E are low in newborn cells. For a mammalian cell to leave the G1a state and commit to a new round of DNA replication and division, it must receive a specific set of extracellular signals (growth factors, matrix binding factors, etc.), which up-regulate the activity of TFE. We assume that these ‘proliferation signals’ are present and that our (simulated) cell spends only a few hours in G1a before transiting into G1b. In our model, the time spent in G1a is an exponentially distributed random variable with mean = 2 h. When the cell passes the ‘restriction point’ and enters G1b, TFE is activated and CycE begins to accumulate. Among other chores, Cdk2/CycE inactivates Cdh1/APC-C, allowing Cdk2/CycA dimers to accumulate. In our model, the transition from early G1b to late G1b is weakly size dependent, because the condition for this transition is that [CycE]*Mass exceeds a certain threshold (*θ*_{E}). Because this transition depends on cell mass, those cells that are larger than average tend to make the transition sooner, and cells that are smaller than average tend to make the transition later. This effect allows the cell population to achieve a stable size distribution. In the late G1b state, CycA/Cdk2 level rises to a certain threshold (*θ*_{A}), when it triggers entry into S phase. Cdk2/CycA also promotes the degradation of cyclin E by SCF during S phase. We assume that DNA synthesis requires at least 7 h.

Cyclin B begins to accumulate in late G1 and S, after Cdh1 is inactivated, but the major accumulation of cyclin B protein occurs in G2 phase, after DNA synthesis is completed and TFB is activated. The G2—M transition is delayed until enough Cdk1/CycB dimer accumulates ([CycB]>*θ*_{B}′) to promote entry into prophase and the appearance Cdc20A/APC-C, which begins the process of cyclin A degradation [38], [39], [40]. Cdc20B/APC-C is activated at the metaphase—anaphase transition, where it promotes three crucial tasks: (1) separation of sister chromatids by the mitotic spindle, (2) partial degradation of cyclin B, and (3) re-activation of Cdh1. Cdh1/APC-C degrades Cdc20 [41], and then finishes the job of cyclin B degradation (telophase). When [CycB] drops below the threshold *θ*_{B}″, the cell finishes telophase and divides into two newborn daughter cells in G1 phase (unreplicated chromosomes) with low levels of cyclins A, B and E.

We assume that cell division is symmetric, with some variability; i.e., the mass of the two daughter cells at birth are *δM*_{div} and (1−*δ*)*M*_{div}, where *M*_{div} = mass of mother cell at division, and *δ* is a Gaussian-distributed random variable with mean = 0.5 and standard deviation = 0.0167. In all simulations reported here we assume that cells grow exponentially between birth and division. However, we have also simulated linear growth, and the results are not significantly different.

We introduce stochastic effects into the model by assuming that the time spent in each state of the Boolean subsystem, as it moves along the super highway, has a random component () as well as a deterministic component (): . From Table 1, we see that for *i* = 1, 6, 7, 8, and h. For the remaining cases (*i* = 2, 3, 5, 9), is however long it takes for the cyclin variable to reach its threshold. The stochastic component for each transition is a random number chosen from an exponential distribution with mean = *λ _{i}*. The random time delay is calculated from a uniform random deviate,

*r*, by the formula = . The values chosen for the

*λ*'s are given in Table 1.

_{i}In the Methods section, we describe how we simulate the progression of a single cell through its DNA replication/division cycle. Because the model's differential equations are piecewise linear, they can be solved analytically, and an entire ‘cell cycle trajectory’ can be determined by computing a few random numbers and solving some algebraic equations. A typical result of such simulations, over three cell cycles, is illustrated in Fig. 1B. Not surprisingly, the accumulation and loss of the cyclins correlate with the activities of the cyclin regulators. At the beginning of each cycle, the cell starts in State 1 (G1a phase in Table 1), with low levels of all cyclin because TFE and TFB are off and Cdh1 is on. When the cell leaves G1a, TFE turns on and cyclin E rises rapidly, but cyclin A increases only modestly, because Cdh1 is still active in early G1b. Cdh1 turns off when cyclin E level crosses *θ*_{E}, allowing cyclin A to increase dramatically in late G1b and drive the cell into S phase (State 4). Cyclin B increases modestly in late G1 and S phase, because Cdh1 is off but TFB has not yet turned on. Cyclin E is degraded in S phase, because SCF is now active. When the cell finishes DNA synthesis, TFB turns on, causing further increase of cyclins A and B. When cyclin B level rises above its first threshold, *θ*_{B}′, the cell enters prophase (State 6) and then prometaphase-metaphase (State 7). During State 7, cyclin A level drops precipitously because Cdc20A is turned on. After the replicated chromosomes are fully aligned on the mitotic spindle, Cdc20B turns on (State 8) and cyclin B is partially degraded. Cdc20B activates Cdh1 (State 9) and cyclin B is degraded even faster. When cyclin B level drops below its second threshold, *θ*_{B}″, the cell divides and returns to G1a (State 1).

### Cyclin distributions in an asynchronous culture

Our first test for the hybrid model is to simulate flow cytometry measurements of the DNA content and cyclin levels in an asynchronous population of RKO (colon carcinoma) cells [42]. In the data set, a typical scatter plot has about 65000 data points, each point displaying the measurements of two observables in a single cell chosen at random from the cell cycle (Fig. 2). When the data are plotted in this way, they form a cloudy tube of points through a projection of the state space (say, cyclin B versus cyclin A). Because there will be some cells from every phase of the cell cycle, the tube closes on itself. If the system were completely deterministic and the measurements were absolutely precise, the data points would be a simple closed curve (a ‘limit cycle’) in the state space. The data actually present a fuzzy trajectory that snakes through state space before closing on itself. The indeterminacy of the points comes (presumably) from two sources: intrinsic noise in the molecular regulatory system (modeled by the random waiting times, ) and extrinsic measurement errors, which we shall introduce momentarily. Our strategy for simulating flow-cytometry data is explained in more detail in the Methods section.

(A,C,E) Flow cytometry data from Yan et al [42]. DNA = 190 corresponds to G1 and DNA = 380 corresponds to G2/M. (B,D,F) Our simulations. We are plotting the total amount of cyclin A and cyclin B per cell, i.e., [CycA]·*M*(*t*) and [CycB]·*M*(*t*). DNA = 1 in G0/G1 phase; = 2 in G2/M phase. Some ‘instrumental noise’ has been added to the calculated levels of cyclins and DNA, as described in the Methods. The arrows in (A, B) indicate the rate of cyclin B accumulation in S phase in the measurements and in the model. The arrows in (C, D) indicate the cyclin A level at the onset of DNA synthesis, compared to the maximum expression level of ∼600 AU.

In Fig. 2 we compare our simulated flow-cytometry scatter plots with experimental results of Yan et al. [42]. We color-code each cell in the simulated plot according to which Boolean State (Table 1) the cell is in at the time of fixation. In Fig. 3 we plot cyclin E fluctuations, as predicted by our model, along with a projection of the cell cycle trajectory in a subspace spanned by the three cyclin variables (A, B and E).

(A) Scatter plots. (B) Stochastic limit cycle in the state space of cyclins A, B and E. We provide two different perspectives of this three dimensional figure to help visualize how the cyclin levels go up and down. In addition, we have added golden-colored balls to help guide the eye along the cell cycle trajectory. Each ball represents the average of the cyclin levels of all the cells binned over a hundredth of the φ* _{i}* interval [0,1], where φ

*refers to the fraction of the cell cycle completed by cell*

_{i}*i*(as described in the Methods section). Finally, it may help to recognize that Fig. 2E is a projection of the data on the CycA-CycB plane, and Fig. 3B is a projection on the CycA-CycE plane.

### Contact inhibition of cultured cells

As a further test of the utility of this modeling approach, we have used our hybrid model to simulate an exponentially growing population of an immortalized Human Umbilical Vein Endothelial cell line (HUVEC). In the experiment (Fig. 4A; see Methods), a culture is seeded with 5×10^{4} cells on ‘Day 0’ and allowed to grow. At Day 6, it reaches confluence and cell number plateaued at a constant level.

(A) Growth curve for the HUVEC population over 10 days, showing the base-10 logarithm of the cell count for both experimental data and our simulation (with *N*_{0} = 11000 and *N*_{1} = 500). (B) Daily distribution of cells across the phases of the cell cycle, from experimental data. (C) Model simulation of the phase distributions.

To apply the hybrid model to this data, we had to devise a way to model contact inhibition, which arrests cells in a stable quiescent state. To this end, we assume that the transition probability, *p*, for exiting State 1 is a function of the number of cells alive at that time, *N*:(3)For 0<*N*_{1}≪*N*_{0}, *p* is a sigmoidal function of *N* that drops abruptly from *p*_{0} to 0 for *N*>*N*_{0}. For each cell in this simulation, we set *λ*_{1} (the mean for the random time spent in G1a) to 1/*p*, and we choose *p*_{0} = 0.5 h^{−1} to conform to the value of *λ*_{1} in Table 1. As the population size *N* increases, the time spent in G1a phase increases until cells eventually arrest in State 1, and the growth curve, *N*(*t*), levels off. In this case, State 1 in our model corresponds to a quiescent state (G0) in which cells are alive but not proliferating.

To make the simulation more tractable, we start off with 500 cells (instead of 50,000 cells) and follow the lineage of each initial cell until Day 10. Every 24 hours, we compute the number of cells alive at that point of time and plot the results in Fig. 4A, along with the experimental data (scaled down by a factor of 100). The parameter values, *N*_{0} = 11,000 and *N*_{1} = 500, are chosen to fit the simulation to the observed growth curve. From the model we can also compute the percentage of cells in G0/G1, S and G2/M phases on each day (Fig. 4C), and the results compare favorably with the experimental observations (Fig. 4B). Lastly, we also simulate the patterns of cyclin A2 and cyclin B1 expression on each day for the growing population of HUVEC cells (see Supporting Fig. S1).

## Discussion

We have constructed a simple, effective model of the cyclin-dependent kinase control system in mammalian cells and used the model to simulate faithfully the accumulation and degradation of cyclin proteins during asynchronous proliferation of RKO (colon carcinoma) cells. The model is inspired by the work of Li et al. [14], who proposed a robust Boolean model of cell cycle regulation in budding yeast. Our goal was to retain the elegance of the Boolean representation of the switching network, while introducing continuous variables for cell size, cell age, and cyclin composition, in order to create a model that could be compared in quantitative detail to experimental measurements.

We have shown that this model can accurately simulate flow-cytometric measurements of cyclin abundances in asynchronous populations of growing-dividing mammalian cells. The parameters in the model that allow for a quantitative description of the experimental measurements are easily estimated from the data itself. Now that the model is parameterized and validated for wild-type cells, we are currently extending it to handle the behavior of cell populations perturbed by drugs and by genetic interference. In some cases, only modest extensions of the model are required; in other cases, a more thorough overhaul of the way the discrete and continuous variables interact with each other is necessary.

We have chosen parameter values in our model to capture the major features of cyclin fluctuations as measured by flow cytometry during the somatic division cycle of mammalian cells. We have used a human tumor cell line to calibrate our model. Between cell lines and normal human cultured cells, there are differences in the expressions of A and B cyclins [43]; however, when the levels of cyclin B1 were rigorously compared for HeLa, K562, and RKO cells, both the patterns and magnitudes of expression are remarkably similar, apparently dependent to some degree on the rate of population growth [44]. In addition, the patterns of expression of cyclins A2 and B1 are similar for these human tumor cell lines and stimulated normal human circulating lymphocytes (Supporting Fig. S2). Overall, the simulation outputs have satisfying similarity both in pattern and magnitude to the real data for RKO cells, and our simulated expression patterns of cyclins A, B and E for the tumor cell line are quite similar to the simulated expression patterns in HUVEC cells (see Supporting Fig. S1).

However, there remain some inconsistencies between our mathematical simulations and our experimental observations that point out where future modifications to the model are needed. For example, in the model DNA synthesis starts when cyclin A has accumulated to ∼8% of its maximum level (see arrow in Fig. 2D; 50/600≈8%), whereas in our measurements DNA synthesis starts when cyclin A is ∼5% of its maximum level (arrow in Fig. 2C). This discrepancy is tempered by the fact that we are not confident of the quantitative accuracy of cyclin A expression levels below ∼4% of its maximum level in Fig. 2C. Where we place the minimum expression level of cyclin A in Fig. 2D affects our estimate of the cyclin A level at onset of DNA synthesis (50 AU at present). By lowering the minimum expression level of cyclin A below 10 AU in Fig. 2D (e.g., by lowering *k*′* _{sa}*), we could line up the two arrows in Figs. 2C and D. Nonetheless, we observe (Supporting Fig. S3) that cyclin A expression correlates highly with BrdU incorporation, suggesting that significant accumulation of cyclin A begins simultaneously with the onset of DNA synthesis, whereas in our model cyclin A production begins in mid-G1 phase. This discrepancy could be minimized by lowering the cyclin A threshold (

*θ*) in the model.

_{A}The simulation (Fig. 2B) captures the observed accummulation of cyclin B in late G1 (when Cdh1 turns off), but the simulated rise in cyclin B during S phase appears to be faster than the observed rise [45] (compare the arrows in Figs. 2A and B). The simulation does capture the rapid accumulation of cyclin B observed in G2. Finally, while we did not calibrate the cyclin E expression parameters to any specific dataset, the pattern of expression in Fig. 3A is quite similar to expected expression patterns for normal human somatic cells and some human tumor cell lines [46].

We believe that our hybrid approach will be generally useful for modeling macromolecular regulatory networks in cells, because it combines the qualitative appeal of Boolean models with the quantitative realism of reaction kinetic models.

## Methods

### Simulations

We simulate a flow cytometry experiment with our hybrid model in two steps.

*Step 1: Creating complete ‘life histories’ for thousands of cells.* At the start of the simulation, we specify initial conditions at the beginning of the cycle (State 1) for a progenitor cell. We used the following initial values of the state variables: [CycA] = [CycB] = [CycE] = 1 and *M* = 3. Our strategy is to follow this cell through its cycle until it divides into two daughters. We then choose one of the two daughters at random and repeat the process, continuing for 32500 iterations. We discard the first 500 cells, and keep a sample of 32000 cells that have completed a replication-division cycle according to our model. In the second step, we create a simulated sample of 32000 cells chosen at random phases of the cell cycle, to represent the cells that were assayed by the flow cytometer.

Let us consider cell *i* (1<*i*<32500) at the time of its birth, *t _{i}*

_{0}. By definition, this cell is in State 1, and we assume that we know its birth mass,

*M*(

*t*

_{i}_{0}), and its starting concentrations of cyclins A, B and E. Denote the starting concentrations as [CycA(

*t*

_{i}_{0})], [CycB(

*t*

_{i}_{0})], [CycE(

*t*

_{i}_{0})]. In the ensuing discussion, unless it is necessary for clarity, we drop the

*i*subscript, it being understood that we are talking about a representative cell in the population. We will follow this cell until it divides to produce a daughter cell with known concentrations of cyclins.

According to Table 1, a cell in State 1 has no special conditions to satisfy before moving to State 2. Hence the residence time in State 1 is a random number chosen from an exponential distribution with mean *λ*_{1} = 2 h. The cell enters State 2 at *t*_{1} = *t*_{0}+. Assuming exponential growth, its size at this time is *M*(*t*_{1}) = *M*(*t*_{0}) exp{γ(*t*_{1}−*t*_{0})} = *M*(*t*_{0}) exp{γ*A*_{1}}, where γ is the specific growth rate of the culture and *A*_{1} = *t*_{1}−*t*_{0} is the age of the cell when it exits State 1. To illustrate how cyclin concentrations are computed at *t* = *t*_{1}, let us consider cyclin A as an example. During the interval *t*_{0}<*t*<*t*_{1}, [CycA] satisfies a linear ODE with effective rate constants *k*_{sa1} = *k*′_{sa} = 5 and *k*_{da1} = *k*′_{da}+*k*″′_{da} = 1.4, because *B*_{TFE} = *B*_{TFB} = *B*_{Cdc20A} = 0 and *B*_{Cdh1} = 1 for a cell in State 1. We can compute the concentration of cyclin A at any time during this interval from(4)Setting *t* = *t*_{1} in this equation gives the number we seek. In this fashion, we start tabulating the following information for each simulated cell:

Notice that, at *t* = *t*_{1} when the cell enters State 2, the transcription factor (TFE) for cyclins E and A turns on, and these cyclins start to accumulate. The cell cannot leave State 2 until cyclin E accumulates to a sufficiently high level: [CycE](*t*)·*M*(*t*) = *θ*_{E}, according to Table 1. When this condition is satisfied, the cell leaves State 2 and enters State 3. The size dependence on this transition is a way to couple cell growth to the DNA replication-division cycle. According to the parameter settings in Table 1, there is no stochastic component to the transition out of State 2.

We continue in this fashion until the cell leaves State 9 and returns to State 1, when cyclin B is degraded at the end of mitosis. This is the signal for cell division. The age of the cell at division is *A*_{9} = *t*_{9}−*t*_{0}, and the mass of the cell at division is *M*(*t*_{9}) = *M*(*t*_{0}) exp(γ·*A*_{9}). The mass of the daughter cell at the beginning of her life history is *M*_{daughter}(*t*_{0}) = *δ*·*M*_{mother}(*t*_{9}), where *δ* is a random number sampled from a normal distribution of mean 0.5 and standard deviation 0.0167 to allow for asymmetries of cell division.

Notice that simulating the life history of a single cell only requires generating about a dozen random numbers and performing a handful of algebraic calculations. At no point do we need to solve differential equations numerically. Hence we can quickly calculate the life histories of tens of thousands of cells.

*Step 2: Finding the DNA and cyclin levels of each cell in an asynchronous sample*. In the flow cytometry experiments of Yan et al. [42], a random sample of cells is taken from an asynchronous population, the cells are fixed and stained, and then run one-by-one through laser beams where fluorescence measurements are made. So each data point consists of measurements of light scatter (related to cell size) and fluorescence proportional to DNA and cyclin content for a single cell taken at some random point in the cell cycle. To simulate this experiment we must assign to each of our 32000 simulated cells a number *φ _{i}* selected randomly from the interval [0,1], where

*φ*refers to the fraction of the cell cycle completed by cell

_{i}*i*when it was fixed and stained for measurement. Because each mother cell divides into two daughter cells, the density of cells at birth,

*φ*= 0, is twice the density of cells at division,

*φ*= 1. The ‘ideal’ probability density for an asynchronous population of cells expanding exponentially in number is(5)According to the ‘transformation method’ [47, Chapter 7.2], we compute

*φ*as(6)where

*r*is a random number chosen from a uniform distribution on [0,1]. In this way, we generate 32000 fractions,

*φ*.

_{i}If *φ _{i}* is the cell-cycle location of the

*i*

^{th}cell when it is selected for the flow cytometry measurements, then its age at the time of selection is

*a*=

_{i}*φ*·

_{i}*A*

_{i}_{9}, where

*A*

_{i}_{9}is the age of the

*i*

^{th}cell at division. Given a value for

*a*, we then find the state

_{i}*n*( = 1, 2, … or 9) of the

*i*

^{th}cell at the time of its selection:(7)where

*t*

_{i}_{,n}(as defined above) is the time at which the

*i*

^{th}cell left state

*n*to enter state

*n*+1.

Once we know the state *n* of the cell, we can compute the concentration of each cyclin in the cell at its exact age *a _{i}* by analogy to Eq. [4]:(8)where

*k*

_{sa,n}and

*k*

_{da,n}are the synthesis and degradation rate constants for cyclin A in state

*n*. This is a straightforward calculation because in Step 1 we stored the values of

*t*and [CycA(

_{n}*t*)] for every state of each cell. We can also calculate the mass of cell

_{n}*i*at the time of its selection:(9)where

*M*(

*t*

_{i}_{0}) is the mass at birth of cell

*i*and γ is the specific growth rate of the culture. Because the flow cytometer measures the total amount of fluorescence proportional to all cyclin A molecules in the

*i*

^{th}cell, we take as our measurable the product of [CycA(

*a*)] times

_{i}*M*(

*a*).

_{i}Lastly we determine the DNA content of cell *i* at age *a _{i}* according to:

- DNA = 1 for
*t*_{i}_{0}≤*t*_{i}_{0}+*a*<_{i}*t*_{i}_{3}= entry of*i*^{th}cell into S phase - DNA = 1+(
*t*_{i}_{0}+*a*−_{i}*t*_{i}_{3})/(*t*_{i}_{4}−*t*_{i}_{3}) for*t*_{i}_{3}≤*t*_{i}_{0}+*a*<_{i}*t*_{i}_{4}= exit of*i*^{th}cell from S phase - DNA = 2 for
*t*_{i}_{4}≤*t*_{i}_{0}+*a*<_{i}*t*_{i}_{9}

Now we have simulated values for the measurable quantities of each cell at the time point in the cell cycle when it was selected for analysis. Before plotting these numbers, we should take into account experimental errors, such as probe quality, fixation, staining and measurement. We do so by multiplying each measurable quantity (DNA content and cyclin levels) by a random number chosen from a Gaussian distribution with mean 1 and standard deviation = 0.03 for DNA measurements and 0.15 for cyclin measurements. These choices give scatter to the simulated data that is comparable to the scatter in the experimental data.

Source codes for the hybrid model are provided in the Supporting Text S1.

### Cells, culture, and fixation

Culture and fixation of RKO cells were described in [42]. The immortalized HUVEC cells [48] at passage 93 were seeded at 2.5×10^{3} cells/cm^{2} in 10 ml EGM-2 media with 2% fetal bovine serum (Lonza, Basel). Duplicate plates were prepared for each time point at days 1, 2, 3, 4, 5, 6, 7, 10, and 15. Cells were fed every other day by replacing half the volume of used media. At the indicated times, cells were trypsinized, washed, and cell counts performed with a Guava Personal Cytometer (Millipore, Billerica, MA). Fixation was as previously described [49]; briefly, cells were treated with 0.125% formaldehyde (Polysciences, Warrington, PA) for 10 min at 37°C, washed, then dehydrated with 90% Methanol. Cells were fixed in aliquots of 1×10^{6} cells (days 1–3) or 2×10^{6} (days 4–15). Fixed cell samples were stored at −20°C until staining for cytometry.

### Immunofluorescence staining, antibodies, flow cytometry

Staining and cytometry for RKO cells were described in [42]. Briefly, cells were trypsinized, fixed with 90% MeOH, washed with phosphate buffered saline, then stained with monoclonal antibodies reactive with cyclin B1, cyclin A, phospho-S10-histone H3, and with 4′,6-diamidino-2-phenylindole (DAPI). For a detailed, updated version of antibodies, staining, and cytometry for cyclins A2 and B1, phospho-S10-histone H3, and DNA content, see Jacobberger et al. (38).

### Data pre-processing

Data pre-processing was performed with WinList (Verity Software House, Topsham, ME). Doublet discrimination (peak versus area DAPI plot) was used to limit the analysis to singlet cells; non-specific binding was used to remove background fluorescence from the total fluorescence related to cyclin A2 and B1 staining. The phycoerythrin channel (cyclin A2) was compensated for spectral overlap from FITC or Alexa Fluor 488. For simplification, very large 2C G1 HUVEC cells and any cells cycling at 4C→8C were removed from the analysis. These were present at low frequency. Data were written as text files then transferred to Microsoft Excel.

## Supporting Information

### Figure S1.

Patterns of Cyclin A and Cyclin B expression in simulated populations of HUVECs growing toward confluence over days 0–10.

doi:10.1371/journal.pcbi.1001077.s001

(0.75 MB PDF)

### Figure S2.

Pattern of Cyclin A expression in stimulated human circulating lymphocytes.

doi:10.1371/journal.pcbi.1001077.s002

(0.05 MB PDF)

### Figure S3.

Correlation of Cyclin A expression with BrdU labeling.

doi:10.1371/journal.pcbi.1001077.s003

(0.33 MB PDF)

## Acknowledgments

We thank Keith E. Schultz for collecting the data in Fig. 4, as well as Kathy Chen and Tongli Zhang for helpful suggestions. The cyclin A2 antibody was a gift from Vincent Shankey, Beckman-Coulter (Miami).

## Author Contributions

Conceived and designed the experiments: RMS JWJ. Performed the experiments: RMS. Analyzed the data: RS JWJ JJT. Wrote the paper: RS RMS JWJ JJT. Conceived modeling approach: JJT. Implemented the model and ran all simulations: RS. Designed the model: JWJ.

## References

- 1.
Mitchison JM (1971) The Biology of the Cell Cycle. Cambridge UK: Cambridge University Press. 313 p.
- 2. Zetterberg A, Larsson O, Wiman KG (1995) What is the restriction point? Curr Opin Cell Biol 7: 835–842.
- 3. Chen KC, Csikasz-Nagy A, Gyorffy B, Val J, Novak B, et al. (2000) Kinetic analysis of a molecular model of the budding yeast cell cycle. Mol Biol Cell 11: 369–391.
- 4. Novak B, Pataki Z, Ciliberto A, Tyson JJ (2001) Mathematical model of the cell division cycle of fission yeast. Chaos 11: 277–286.
- 5. Chen KC, Calzone L, Csikasz-Nagy A, Cross FR, Novak B, et al. (2004) Integrative analysis of cell cycle control in budding yeast. Mol Biol Cell 15: 3841–3862.
- 6. Calzone L, Thieffry D, Tyson JJ, Novak B (2007) Dynamical modeling of syncytial mitotic cycles in Drosophila embryos. Mol Syst Biol 3: 131–141.
- 7.
Novak B, Tyson JJ (1993) Numerical analysis of a comprehensive model of M-phase control in
*Xenopus*oocyte extracts and intact embryos. J Cell Sci 106: 1153–1168. - 8. Pomerening JR, Kim SY, Ferrell JE Jr (2005) Systems-level dissection of the cell-cycle oscillator: bypassing positive feedback produces damped oscillations. Cell 122: 565–578.
- 9. Aguda BD, Tang Y (1999) The kinetic origins of the restriction point in the mammalian cell cycle. Cell Prolif 32: 321–335.
- 10. Qu Z, Weiss JN, MacLellan WR (2003) Regulation of the mammalian cell cycle: a model of the G1-to-S transition. Am J Physiol Cell Physiol 284: C349–C364.
- 11. Novak B, Tyson JJ (2004) A model for restriction point control of the mammalian cell cycle. J Theor Biol 230: 563–579.
- 12.
Sha W, Moore J, Chen K, Lassaletta AD, Yi C-S, et al. (2003) Hysteresis drives cell-cycle transitions in
*Xenopus laevis*egg extracts. Proc Natl Acad Sci USA 100: 975–980. - 13. Pomerening JR, Sontag ED, Ferrell JE Jr (2003) Building a cell cycle oscillator: hysteresis and bistability in the activation of Cdc2. Nature Cell Biol 5: 346–351.
- 14. Li F, Long T, Lu Y, Ouyang Q, Tang C (2004) The yeast cell-cycle network is robustly designed. Proc Natl Acad Sci USA 101: 4781–4786.
- 15. Davidich MI, Bornholdt S (2008) Boolean network model predicts cell cycle sequence of fission yeast. PLoS One 3: e1672.
- 16. Faure A, Naldi A, Chaouiya C, Thieffry D (2006) Dynamical analysis of a generic Boolean model for the control of mammalian cell cycle. Bioinformatics 22: e124–131.
- 17.
Fantes PA, Nurse P (1981) Division timing: controls, models and mechanisms. In: John PCL, editor. The Cell Cycle. Cambridge UK: Cambridge University Press. pp. 11–33.
- 18. Tyson JJ (1985) The coordination of cell growth and division – intentional or incidental? Bioessays 2: 72–77.
- 19. Tyson JJ (1987) Size control of cell division. J Theor Biol 126: 381–391.
- 20. Darzynkiewicz Z, Crissman H, Jacobberger JW (2004) Cytometry of the cell cycle: cycling through history. Cytometry A 58: 21–32.
- 21. Glass L, Kauffman SA (1973) The logical analysis of continuous, non-linear biochemical control networks. J Theor Biol 39: 103–129.
- 22. Glass L, Pasternack J (1978) Stable oscillations in mathematical models of biological control systems. J Math Biol 6: 207–223.
- 23. Matsuno H, Inouye ST, Okitsu Y, Fujii Y, Miyano S (2006) A new regulatory interaction suggested by simulations for circadian genetic control mechanism in mammals. J Bioinform Comput Biol 4: 139–153.
- 24. Bosl WJ (2007) Systems biology by the rules: hybrid intelligent systems for pathway modeling and discovery. BMC Syst Biol 1: 13.
- 25. Li C, Nagasaki M, Ueno K, Miyano S (2009) Simulation-based model checking approach to cell fate specification during Caenorhabditis elegans vulval development by hybrid functional Petri net with extension. BMC Syst Biol 3: 42.
- 26.
Alur R, Dang T, Esposito JM, Fierro RB, Hur Y, et al. (2001) Hierarchical Hybrid Modeling of Embedded Systems. In: Henzinger TA, Kirsch CM, editors. Embedded Software: Proceedings of the First International Workshop. Berlin: Springer. pp. 14–31.
- 27.
Fishwick PA (2007) Handbook of dynamic system modeling. Boca Raton: Chapman & Hall/CRC.
- 28.
Klee H, Allen R (2011) Simulation of dynamic systems with MATLAB and Simulink. Boca Raton, FL: CRC Press. 840 p.
- 29.
Mosterman P (1999) An Overview of Hybrid Simulation Phenomena and Their Support by Simulation Packages. In: Vaandrager F, van Schuppen J, editors. Hybrid Systems: Computation and Control. Berlin: Springer. pp. 165–177.
- 30.
Deshpande A, Gollu A, Varaiya P (1997) SHIFT: A Formalism and a Programming Language for Dynamic Networks of Hybrid Automata. In: Antsaklis P, Kohn W, Nerode A, Sastry S, editors. Hybrid Systems IV. Berlin: Springer. pp. 113–133.
- 31. Deshpande A, Gollu A, Semenzato L (1998) The SHIFT programming language and run-time system for dynamic networks of hybrid systems. IEEE Trans Automat Contr 43: 584–587.
- 32.
Alur R, Grosu R, Hur Y, Kumar V, Lee I (2000) Modular Specification of Hybrid Systems in CHARON. In: Lynch N, Krogh BH, editors. Hybrid Systems: Computation and Control. Berlin: Springer. pp. 6–19.
- 33. Trimarchi JM, Lees JA (2002) Sibling viralry in the E2F family. Nat Rev Mol Cell Biol 3: 11–20.
- 34. Laoukili J, Kooistra MR, Bras A, Kauw J, Kerkhoven RM, et al. (2005) FoxM1 is required for execution of the mitotic programme and chromosome stability. Nat Cell Biol 7: 126–136.
- 35. Wierstra I, Alves J (2007) FOXM1, a typical proliferation-associated transcription factor. Biol Chem 388: 1257–1274.
- 36. Cardozo T, Pagano M (2004) The SCF ubiquitin ligase: insights into a molecular machine. Nat Rev Mol Cell Biol 5: 739–751.
- 37. Welcker M, Singer J, Loeb KR, Grim J, Bloecher A, et al. (2003) Multisite phosphorylation by Cdk2 and GSK3 controls cyclin E degradation. Mol Cell 12: 381–392.
- 38. Harper JW, Burton JL, Solomon MJ (2002) The anaphase-promoting complex: it is not just for mitosis any more. Genes Dev 16: 2179–2206.
- 39. Peters JM (2002) The anaphase-promoting complex proteolysis in mitosis and beyond. Mol Cell 9: 931–943.
- 40. Geley S, Kramer E, Gieffers C, Gannon J, Peters J-M, et al. (2001) Anaphase-promoting complex/cyclosome-dependent proteolysis of human cyclin A starts at the beginning of mitosis and is not subject to the spindle assembly checkpoint. J Cell Biol 153: 137–148.
- 41. Pfleger CM, Kirschner MW (2000) The KEN box: an APC recognition signal distinct from the D box targeted by Cdh1. Genes Dev 14: 655–665.
- 42. Yan T, Desai AB, Jacobberger JW, Sramkoski RM, Loh T, et al. (2004) CHK1 and CHK2 are differentially involved in mismatch repair-mediated 6-thioguanine-induced cell cycle checkpoint responses. Mol Cancer Ther 3: 1147–1157.
- 43. Gong J, Ardelt B, Traganos F, Darzynkiewicz Z (1994) Unscheduled expression of cyclin B1 and cyclin E in several leukemic and solid tumor cell lines. Cancer Res 54: 4285–4288.
- 44. Frisa PS, Jacobberger JW (2009) Cell cycle-related cyclin b1 quantification. PLoS One 4: e7064.
- 45. Jacobberger JW, Sramkoski RM, Wormsley SB, Bolton WE (1999) Estimation of kinetic cell-cycle-related gene expression in G1 and G2 phases from immunofluorescence flow cytometry data. Cytometry 35: 284–289.
- 46. Darzynkiewicz Z, Gong J, Juan G, Ardelt B, Traganos F (1996) Cytometry of cyclin proteins. Cytometry 25: 1–13.
- 47.
Press WH, Teukolsky SA, Vetterling WT, Flannery BP (1992) Numerical recipes in C. The art of scientific computing. Cambridge: Cambridge University Press. pp. 707–752.
- 48. Freedman DA, Folkman J (2005) CDK2 translational down-regulation during endothelial senescence. Exp Cell Res 307: 118–130.
- 49. Schimenti KJ, Jacobberger JW (1992) Fixation of mammalian cells for flow cytometric evaluation of DNA content and nuclear immunofluorescence. Cytometry 13: 48–59.